A package of oftware tools and metadata for COSORE, a reproducibility-oriented community database for continuous soil respiration data.
To download the COSORE database, click on the Releases tab above.
Only free use data accepted.
The package, and the process of contributing data, should be as focused and simple as possible (but no simpler).
All contributors will be included on a database definition paper.
The database is completely open for reuse. We request that users cite the database definition paper, and strongly encourage them to (i) cite all dataset primary publications, and (ii) involve data contributors as co-authors when possible.
This database is not designed for, and should not be treated as, a permanent data repository. COSORE is a community database, but not an institutionally-backed repository like Figshare, DataONE, ESS-DIVE, etc. We recommend depositing your data in one of these first, and providing its DOI in metadata.
This database is comprised of a collection of datasets, each converted to a standard format and units. A dataset is one or more files of continuous (automated) soil respiration data, with accompanying metadata, with all measurements taken at a single site and with constant treatment assignments.
As much as possible, metadata are kept to a minimum. There are five metadata files, but only two of them absolutely need to be filled out:
- Site name
- Longitude
- Latitude
- Elevation [optional]
- Site timezone
- Site timezone name
- IGBP cover type
- Instrument name
- File format
- Timestamp format and timezone
- Primary publication DOI or URL [optional]
- Other publications DOI or URL [optional]
- Data DOI or URL [optional]
- Acknowledgment text [optional]
Information on arbitrary number (>=1) of contributors. The first contributor listed is assumed to be the point of contact for the dataset.
- First name, family name, email
- ORCID (https://orcid.org) [optional]
- Role (https://www.casrai.org/credit.html) [optional]
For each multiplexer port, can define:
- Treatment ("None" by default)
- Species [optional]
- Chamber area [optional]
- Collar depth [optional]
This maps between dataset fields and standardized database fields.
It include an optional compute-on-columns capability (e.g. to change units or combine columns). See ?map_columns for more information.
This file contains ancillary data: stand structure, carbon cycle, disturbance, etc. [all optional]
When asked (via csr_build() to build the synthesis dataset, the cosore R package
- Scans its
inst/extdatafolder for metadata on all installed datasets - Parses the metadata, and then based on
File_formatcalls the appropriate function to parse the raw data (which for size reasons are not currently included in the repository) - This is done via a drake pipeline, so we only rebuild datasets when needed
- Currently a
listis returned, in which each entry is an individual dataset, itself a list comprised of dataset objects (the description, contributors, etc., tables) - User-side convenience functions generate data frames ready (or readier) for analysis
- Reports are generated for the overall database and each individual dataset.
- The easiest way to get the data is on the Releases page.
- Structured/standardized continuous IRGA data
- Raw LI-8100A data
- Other data, e.g. long term survey measurements