Data Package Best Practices

Overview

This website contains current "Best Practice" recommendations for ecological and environmental science data packages. These recommendations are directed towards the following goals:

  • Minimize heterogeneity of EML-described data packages to simplify development and re-use of software
  • Maximize interoperability to facilitate data synthesis
  • Provide guidance and clarification on
    • the use of Ecological Metadata Language (EML)
    • designing a data package
    • preparing a data product for synthesis

To participate in the “Best Practices for Data Packages” project, see “About this site.”

EML Metadata Concepts

The recommendations for EML metadata apply to all data packages. This section subsumes V3 of the static PDF document “Best Practices for Dataset Metadata in Ecological Metadata Language (EML),” last updated in 2017. The most recent versioned, citable release will be made available in full as a PDF.

Best Practices for Dataset Metadata in Ecological Metadata Language (EML)

Data Package Design

Considerations for a well-designed data package, including special cases based on data type, format, or acquisition method. Examples include images, documents, and raw data stored in other repositories.

Recommendations for data package design

Data Products for Synthesis Research

Recommendations for community-developed data products. These data packages are derived from raw data and reformatted to meet specific data harmonization standards. Many of these data products have extensive related code bases, which the recommendations can take into account and link to.

Recommendations for data products from specific scientific domains