Common Errors in Ecological Data Sharing

Objectives: (1) to identify common errors in data organization and metadata completeness that would preclude a “reader” from being able to interpret and re-use the data for a new purpose; and (2) to develop a set of best practices derived from these common errors that would guide researchers in crea...

Full description

Bibliographic Details
Main Authors: Robert B. Cook, William K. Michener, Karina E. Kervin
Format: Article
Language:English
Published: UMass Chan Medical School, Lamar Soutter Library 2013-04-01
Series:Journal of eScience Librarianship
Subjects:
Online Access:http://escholarship.umassmed.edu/jeslib/vol2/iss2/1/
_version_ 1797970900193116160
author Robert B. Cook
William K. Michener
Karina E. Kervin
author_facet Robert B. Cook
William K. Michener
Karina E. Kervin
author_sort Robert B. Cook
collection DOAJ
description Objectives: (1) to identify common errors in data organization and metadata completeness that would preclude a “reader” from being able to interpret and re-use the data for a new purpose; and (2) to develop a set of best practices derived from these common errors that would guide researchers in creating more usable data products that could be readily shared, interpreted, and used.Methods: We used directed qualitative content analysis to assess and categorize data and metadata errors identified by peer reviewers of data papers published in the Ecological Society of America’s (ESA) Ecological Archives. Descriptive statistics provided the relative frequency of the errors identified during the peer review process.Results: There were seven overarching error categories: Collection & Organization, Assure, Description, Preserve, Discover, Integrate, and Analyze/Visualize. These categories represent errors researchers regularly make at each stage of the Data Life Cycle. Collection & Organization and Description errors were some of the most common errors, both of which occurred in over 90% of the papers.Conclusions: Publishing data for sharing and reuse is error prone, and each stage of the Data Life Cycle presents opportunities for mistakes. The most common errors occurred when the researcher did not provide adequate metadata to enable others to interpret and potentially re-use the data. Fortunately, there are ways to minimize these mistakes through carefully recording all details about study context, data collection, QA/ QC, and analytical procedures from the beginning of a research project and then including this descriptive information in the metadata.
first_indexed 2024-04-11T03:23:11Z
format Article
id doaj.art-ff1238c4a26f4015808d06c86d73c2cb
institution Directory Open Access Journal
issn 2161-3974
language English
last_indexed 2024-04-11T03:23:11Z
publishDate 2013-04-01
publisher UMass Chan Medical School, Lamar Soutter Library
record_format Article
series Journal of eScience Librarianship
spelling doaj.art-ff1238c4a26f4015808d06c86d73c2cb2023-01-02T08:19:47ZengUMass Chan Medical School, Lamar Soutter LibraryJournal of eScience Librarianship2161-39742013-04-0122316http://dx.doi.org/10.7191/jeslib.2013.1024Common Errors in Ecological Data SharingRobert B. CookWilliam K. MichenerKarina E. KervinObjectives: (1) to identify common errors in data organization and metadata completeness that would preclude a “reader” from being able to interpret and re-use the data for a new purpose; and (2) to develop a set of best practices derived from these common errors that would guide researchers in creating more usable data products that could be readily shared, interpreted, and used.Methods: We used directed qualitative content analysis to assess and categorize data and metadata errors identified by peer reviewers of data papers published in the Ecological Society of America’s (ESA) Ecological Archives. Descriptive statistics provided the relative frequency of the errors identified during the peer review process.Results: There were seven overarching error categories: Collection & Organization, Assure, Description, Preserve, Discover, Integrate, and Analyze/Visualize. These categories represent errors researchers regularly make at each stage of the Data Life Cycle. Collection & Organization and Description errors were some of the most common errors, both of which occurred in over 90% of the papers.Conclusions: Publishing data for sharing and reuse is error prone, and each stage of the Data Life Cycle presents opportunities for mistakes. The most common errors occurred when the researcher did not provide adequate metadata to enable others to interpret and potentially re-use the data. Fortunately, there are ways to minimize these mistakes through carefully recording all details about study context, data collection, QA/ QC, and analytical procedures from the beginning of a research project and then including this descriptive information in the metadata.http://escholarship.umassmed.edu/jeslib/vol2/iss2/1/EcologyData publicationData managementData sharingData reuseBest practices
spellingShingle Robert B. Cook
William K. Michener
Karina E. Kervin
Common Errors in Ecological Data Sharing
Journal of eScience Librarianship
Ecology
Data publication
Data management
Data sharing
Data reuse
Best practices
title Common Errors in Ecological Data Sharing
title_full Common Errors in Ecological Data Sharing
title_fullStr Common Errors in Ecological Data Sharing
title_full_unstemmed Common Errors in Ecological Data Sharing
title_short Common Errors in Ecological Data Sharing
title_sort common errors in ecological data sharing
topic Ecology
Data publication
Data management
Data sharing
Data reuse
Best practices
url http://escholarship.umassmed.edu/jeslib/vol2/iss2/1/
work_keys_str_mv AT robertbcook commonerrorsinecologicaldatasharing
AT williamkmichener commonerrorsinecologicaldatasharing
AT karinaekervin commonerrorsinecologicaldatasharing