Modeling community standards for metadata as templates makes data FAIR
Abstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2022-11-01
|
Series: | Scientific Data |
Online Access: | https://doi.org/10.1038/s41597-022-01815-3 |
_version_ | 1828197926635044864 |
---|---|
author | Mark A. Musen Martin J. O’Connor Erik Schultes Marcos Martínez-Romero Josef Hardi John Graybeal |
author_facet | Mark A. Musen Martin J. O’Connor Erik Schultes Marcos Martínez-Romero Josef Hardi John Graybeal |
author_sort | Mark A. Musen |
collection | DOAJ |
description | Abstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be “rich” and to adhere to “domain-relevant” community standards. Scientific communities should be able to define their own machine-actionable templates for metadata that encode these “rich,” discipline-specific elements. We have explored this template-based approach in the context of two software systems. One system is the CEDAR Workbench, which investigators use to author new metadata. The other is the FAIRware Workbench, which evaluates the metadata of archived datasets for their adherence to community standards. Benefits accrue when templates for metadata become central elements in an ecosystem of tools to manage online datasets—both because the templates serve as a community reference for what constitutes FAIR data, and because they embody that perspective in a form that can be distributed among a variety of software applications to assist with data stewardship and data sharing. |
first_indexed | 2024-04-12T10:28:49Z |
format | Article |
id | doaj.art-019fb43bfea2491093b368553bc23e99 |
institution | Directory Open Access Journal |
issn | 2052-4463 |
language | English |
last_indexed | 2024-04-12T10:28:49Z |
publishDate | 2022-11-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Data |
spelling | doaj.art-019fb43bfea2491093b368553bc23e992022-12-22T03:36:55ZengNature PortfolioScientific Data2052-44632022-11-019111510.1038/s41597-022-01815-3Modeling community standards for metadata as templates makes data FAIRMark A. Musen0Martin J. O’Connor1Erik Schultes2Marcos Martínez-Romero3Josef Hardi4John Graybeal5Stanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityGO FAIR FoundationStanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityAbstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be “rich” and to adhere to “domain-relevant” community standards. Scientific communities should be able to define their own machine-actionable templates for metadata that encode these “rich,” discipline-specific elements. We have explored this template-based approach in the context of two software systems. One system is the CEDAR Workbench, which investigators use to author new metadata. The other is the FAIRware Workbench, which evaluates the metadata of archived datasets for their adherence to community standards. Benefits accrue when templates for metadata become central elements in an ecosystem of tools to manage online datasets—both because the templates serve as a community reference for what constitutes FAIR data, and because they embody that perspective in a form that can be distributed among a variety of software applications to assist with data stewardship and data sharing.https://doi.org/10.1038/s41597-022-01815-3 |
spellingShingle | Mark A. Musen Martin J. O’Connor Erik Schultes Marcos Martínez-Romero Josef Hardi John Graybeal Modeling community standards for metadata as templates makes data FAIR Scientific Data |
title | Modeling community standards for metadata as templates makes data FAIR |
title_full | Modeling community standards for metadata as templates makes data FAIR |
title_fullStr | Modeling community standards for metadata as templates makes data FAIR |
title_full_unstemmed | Modeling community standards for metadata as templates makes data FAIR |
title_short | Modeling community standards for metadata as templates makes data FAIR |
title_sort | modeling community standards for metadata as templates makes data fair |
url | https://doi.org/10.1038/s41597-022-01815-3 |
work_keys_str_mv | AT markamusen modelingcommunitystandardsformetadataastemplatesmakesdatafair AT martinjoconnor modelingcommunitystandardsformetadataastemplatesmakesdatafair AT erikschultes modelingcommunitystandardsformetadataastemplatesmakesdatafair AT marcosmartinezromero modelingcommunitystandardsformetadataastemplatesmakesdatafair AT josefhardi modelingcommunitystandardsformetadataastemplatesmakesdatafair AT johngraybeal modelingcommunitystandardsformetadataastemplatesmakesdatafair |