Modeling community standards for metadata as templates makes data FAIR

Abstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be...

Full description

Bibliographic Details
Main Authors: Mark A. Musen, Martin J. O’Connor, Erik Schultes, Marcos Martínez-Romero, Josef Hardi, John Graybeal
Format: Article
Language:English
Published: Nature Portfolio 2022-11-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-022-01815-3
_version_ 1811230341141102592
author Mark A. Musen
Martin J. O’Connor
Erik Schultes
Marcos Martínez-Romero
Josef Hardi
John Graybeal
author_facet Mark A. Musen
Martin J. O’Connor
Erik Schultes
Marcos Martínez-Romero
Josef Hardi
John Graybeal
author_sort Mark A. Musen
collection DOAJ
description Abstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be “rich” and to adhere to “domain-relevant” community standards. Scientific communities should be able to define their own machine-actionable templates for metadata that encode these “rich,” discipline-specific elements. We have explored this template-based approach in the context of two software systems. One system is the CEDAR Workbench, which investigators use to author new metadata. The other is the FAIRware Workbench, which evaluates the metadata of archived datasets for their adherence to community standards. Benefits accrue when templates for metadata become central elements in an ecosystem of tools to manage online datasets—both because the templates serve as a community reference for what constitutes FAIR data, and because they embody that perspective in a form that can be distributed among a variety of software applications to assist with data stewardship and data sharing.
first_indexed 2024-04-12T10:28:49Z
format Article
id doaj.art-019fb43bfea2491093b368553bc23e99
institution Directory Open Access Journal
issn 2052-4463
language English
last_indexed 2024-04-12T10:28:49Z
publishDate 2022-11-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj.art-019fb43bfea2491093b368553bc23e992022-12-22T03:36:55ZengNature PortfolioScientific Data2052-44632022-11-019111510.1038/s41597-022-01815-3Modeling community standards for metadata as templates makes data FAIRMark A. Musen0Martin J. O’Connor1Erik Schultes2Marcos Martínez-Romero3Josef Hardi4John Graybeal5Stanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityGO FAIR FoundationStanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityStanford Center for Biomedical Informatics Research, Stanford UniversityAbstract It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be “rich” and to adhere to “domain-relevant” community standards. Scientific communities should be able to define their own machine-actionable templates for metadata that encode these “rich,” discipline-specific elements. We have explored this template-based approach in the context of two software systems. One system is the CEDAR Workbench, which investigators use to author new metadata. The other is the FAIRware Workbench, which evaluates the metadata of archived datasets for their adherence to community standards. Benefits accrue when templates for metadata become central elements in an ecosystem of tools to manage online datasets—both because the templates serve as a community reference for what constitutes FAIR data, and because they embody that perspective in a form that can be distributed among a variety of software applications to assist with data stewardship and data sharing.https://doi.org/10.1038/s41597-022-01815-3
spellingShingle Mark A. Musen
Martin J. O’Connor
Erik Schultes
Marcos Martínez-Romero
Josef Hardi
John Graybeal
Modeling community standards for metadata as templates makes data FAIR
Scientific Data
title Modeling community standards for metadata as templates makes data FAIR
title_full Modeling community standards for metadata as templates makes data FAIR
title_fullStr Modeling community standards for metadata as templates makes data FAIR
title_full_unstemmed Modeling community standards for metadata as templates makes data FAIR
title_short Modeling community standards for metadata as templates makes data FAIR
title_sort modeling community standards for metadata as templates makes data fair
url https://doi.org/10.1038/s41597-022-01815-3
work_keys_str_mv AT markamusen modelingcommunitystandardsformetadataastemplatesmakesdatafair
AT martinjoconnor modelingcommunitystandardsformetadataastemplatesmakesdatafair
AT erikschultes modelingcommunitystandardsformetadataastemplatesmakesdatafair
AT marcosmartinezromero modelingcommunitystandardsformetadataastemplatesmakesdatafair
AT josefhardi modelingcommunitystandardsformetadataastemplatesmakesdatafair
AT johngraybeal modelingcommunitystandardsformetadataastemplatesmakesdatafair