EDI Dataset Preparation Guides
This website contains a series of documents about preparing and publishing datasets for the environmental sciences and similar contexts. Topics include community-developed metadata standards, serialization and markup formatting guidelines, best practices for content in ecological synthesis datasets, and more. This documentation is maintained by the Environmental Data Initiative (EDI) and all content has been developed and written in collaboration with EDI’s community of scientists, data managers, and repository users.
The purpose of these guides is to:
- Provide guidance for creating published data products that are findable, accessible, interoperable, and reusable (FAIR, Wilkinson et al. 2016)
- Augment the schema documentation for the Ecological Metadata Language (EML, Jones et al. 2019)
- Promote the development and adoption of community-standard data harmonization practices, which will lead to improved data integration and synthesis science outcomes
To contribute to these documents or participate in the associated working groups, see the About page or the repository README. For more on the history of this effort, and access to earlier editions of the guides, see the History page.
The guides
Date | Title | Description | Categories | |
---|---|---|---|---|
Feb 19, 2021 | Data Package Design for Special Cases | Community-developed considerations for creating well-designed datasets that include data specialized by type, format, or acquisition method. Examples are images, code, documents, and raw data in other repositories. | published | |
Jan 14, 2025 | Best Practices for Dataset Metadata in Ecological Metadata Language | These recommendations for creating EML metadata documents can be applied to most research datasets published by the environmental sciences community. | draft | |
Jan 14, 2025 | Ecological community survey data (ecocomDP) | This guide covers how to format community survey and biodiversity data to the “ecocomDP” harmonized data model. The ecocomDP standard and associated tools are a community-supported project involving the LTER Network, NEON, and others. | draft | |
Nov 1, 2017 | Best Practices for Dataset Metadata in Ecological Metadata Language (EML) | These recommendations for creating EML metadata documents can be applied to most research datasets published by the environmental sciences community. | archived |