Abstract
The Environmental Monitoring and Assessment Program (EMAP) collects data that are used to statistically assess the environmental condition of large geographic regions. These data are then posted on the EMAP web site so that anyone can use them. Databases used for the statistical analyses, "analytical" databases, differ in design from the "general-use" databases used by a secondary audience. Their scope is usually restricted in time, in geographic extent, and in type and content of data, often being limited to a single scientific discipline. Their structure may be more horizontal than vertical, so that statistical programs can import the data easily. Their design is strongly influenced by the nature of the scientific analysis because the goal is to create a good computing environment for that analysis. We illustrate these aspects of design with an analytical database for estuaries in the U.S. mid-Atlantic region.
Similar content being viewed by others
References
Baldick, R., Clements, K.A., Pinjo-Dzigal, Z. and Davis, P.W.: 1997, ‘Implementing Nonquadratic Objective Functions for State Estimation and Bad Data Rejection’ IEEE Power Engineering Review 17, 67.
Buffum, H.W.: 1996. ‘Strategic uses of SAS Data Step Programming and SQL Passthrough to Query Oracle Databases’ Proceedings of the 21st Annual SAS Users Group International Conference.SAS Institute, Inc., Cary, NC, USA.
Buffum, H. and Hale, S.: 1998, MAIA-Estuaries 1997-1998. Data Format and Transfer ManualAtlantic Ecology Division, NHEERL, U.S. Environmental Protection Agency. Narragansett, RI, USA.
CBP: 1999, Chesapeake Bay Program web site, [http://www.chesapeakebay.net].
CENR: 1997, ‘Integrating the Nations Environmental Monitoring and Research Networks and Programs: A Proposed Framework’ Committee on Environment and Natural Resources, Ntl. Sci. and Tech. Council, Washington, DC, USA.
Chambers, J.: 1999, ‘Computing with Data: Concepts and Challenges’ The American Statistician 53, 73–84.
Edwards, D.: 1998, ‘Data Quality Control/Quality Assurance’ in: Data and Information Management in the Ecological Sciences: A Resource GuideMichener, W.K., Porter, J.H. and Stafford, S.G. (eds.), LTER Network Office, Univ. of New Mexico, Albuquerque, NM, USA.
EIMS: 1999, Environmental Information Management System web site, [http://www.epa.gov/eims].
EMAP: 1999, Environmental Monitoring and Assessment Program web site, [http://www.epa.gov/emap].
Farrey, P.M., Mooney-Seus, M.L. and Tausig, H.C.: 1999, Out of the Fog: Furthering the Establishment of an Electronic Environmental Information Exchange for the Gulf of Maine, Report 99–1, New England Aquarium, Boston, MA, USA.
Fayyad, U., Haussler, D. and Stolorz, P.: 1996, ‘Mining Scientific Data’ Communications of the ACM 39, 51–57.
Flierl, G.: 1990, ‘Scientific Database Workshop Position Paper’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Flournoy, N.: 1990, ‘Database Statistics’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
French, J.C., Jones, A.K and Pfaltz, J.L. (eds.): 1990, ‘Scientific Database Management’ Report of the Invitational NSF Workshop on Scientific Database Management, March 1990. TR 90–21, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Glymour, C., Madigan, D., Pregibon, D. and Smyth, P.: 1997. ‘Statistical Themes and Lessons for Data Mining’ Data Mining and Knowledge Discovery 1, 11–28.
Gross, K.L., Pake, C.E., Allen, E., Bledsoe, C., Colwell, R., Dayton, P., Dethier, M., Helly, J., Holt, R., Morin, N., Michener, W., Pickett, S.T.A. and Stafford, S.: 1995, Final Report of the Ecological Society of America Committee on the Future of Long-term Ecological Data (FLED)[http://www.sdsc.edu/~ESA/FLED/FLED.html].
Hale, S.S., Bahner, L.H. and Paul, J.F.: in press, ‘Finding Common Ground in Managing Data Used for Regional Environmental Assessments’ Environmental Monitoring and Assessment.
Hale, S.S., Hughes, M.M, Paul, J.F., McAskill, R.S., Rego, S.A., Bender, D.R, Dodge, N.J., Richter, T.L and Copeland, J.L.: 1998, ‘Managing Scientific Data: The EMAP Approach’ Environmental Monitoring and Assessment 51, 429–440.
Hand, D.J.: 1998, ‘Data Mining: Statistics and More?’ The American Statistician 52, 112–118.
Jenne, R.: 1990, ‘Data Management for Climate and Global Change’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K and Pfaltz, J.L. (eds.), TR 90-22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Jones, K.B., Riiters, K.J., Wickham, J.D., Tankersley Jr., R.D., O'Neill, R.V., Chaloud, D.J, Smith, E.R. and Neale, A.C.: 1997, An Ecological Assessment of the United States Mid-Atlantic Region: A Landscape AtlasEPA/600/R-97/130, U.S. Environmental Protection Agency, Office of Research and Development, Washington, DC, USA.
MAIA: 1999. Mid-Atlantic Integrated Assessment web site, [http://www.epa.gov/maia].
Michener, W.K.: 1998, ‘Ecological Metadata’ in: Data and Information Management in the Ecological Sciences: A Resource GuideMichener,W.K., Porter, J.H. and Stafford, S.G. (eds.), LTER Network Office, Univ. of New Mexico, Albuquerque, NM, USA.
NRC: 1995, Finding the Forest in the Trees: The Challenge of Combining Diverse Environmental DataNational Research Council. National Academy Press, Washington, DC, USA.
NRC: 1997, Bits of Power: Issues in Global Access to Scientific DataNational Research Council. National Academy Press, Washington, DC, USA.
Pfaltz, J.L.: 1990, ‘Differences Between Commercial and Scientific Databases’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Porter, J.H.:1998, ‘Scientific Databases for Environmental Research’ in: Data and Information Management in the Ecological Sciences: A Resource GuideMichener, W.K., Porter, J.H. and Stafford, S.G. (eds.), LTER Network Office, Univ. of New Mexico, Albuquerque, NM, USA.
Robbins, R.J.:1990, ‘Types of Scientific Databases’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K. and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Shoshani, A.:1990, ‘On the Importance of Metadata Management for Scientific Applications’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
STORET: 1999, STORET web site, [http:www.epa.gov/OWOW/STORET].
USEPA: 1997, EMAP Research StrategyU.S. Environmental Protection Agency, Office of Research and Development, NHEERL, Research Triangle Park, NC, USA.
USEPA: 1998, Condition of the Mid-Atlantic EstuariesEPA 600-R-98-147, Office of Research and Development, U.S. Environmental Protection Agency, Washington, DC, USA.
USEPA: (in prep.), Mid-Atlantic Highlands State of the StreamsOffice of Research and Development, U.S. Environmental Protection Agency, Washington, DC, USA.
Withee, G.W.: 1990, ‘Workshop on Scientific Data Bases’ in: Scientific Database Management, Panel Reports and Supporting MaterialFrench, J.C., Jones, A.K. and Pfaltz, J.L. (eds.), TR 90–22, Dept. of Computer Science, Univ. of Virginia, Charlottesville, VA, USA.
Zhang, B.M., Wang, S.Y. and Xiang, N.D.: 1992, ‘A Linear Recursive Bad Data Identification Method with Real-time Application to Power System State Estimation’ IEEE Transactions on Power Systems 7, 1378–1385.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Hale, S.S., Buffum, H.W. Designing Environmental Databases for Statistical Analyses. Environ Monit Assess 64, 55–68 (2000). https://doi.org/10.1023/A:1006438401496
Issue Date:
DOI: https://doi.org/10.1023/A:1006438401496