Maximizing the utility of public data

The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction...

Full description

Bibliographic Details
Main Authors: Mahmoud Ahmed, Hyun Joon Kim, Deok Ryong Kim
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-03-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/full
_version_ 1797855249299406848
author Mahmoud Ahmed
Hyun Joon Kim
Deok Ryong Kim
author_facet Mahmoud Ahmed
Hyun Joon Kim
Deok Ryong Kim
author_sort Mahmoud Ahmed
collection DOAJ
description The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse.
first_indexed 2024-04-09T20:20:38Z
format Article
id doaj.art-2be9fa9db81041b3a30823b95c04d372
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-04-09T20:20:38Z
publishDate 2023-03-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-2be9fa9db81041b3a30823b95c04d3722023-03-31T04:55:32ZengFrontiers Media S.A.Frontiers in Genetics1664-80212023-03-011410.3389/fgene.2023.11066311106631Maximizing the utility of public dataMahmoud Ahmed0Hyun Joon Kim1Deok Ryong Kim2Department of Biochemistry and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaDepartment of Anatomy and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaDepartment of Biochemistry and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaThe human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse.https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/fullpublic-datadata-reusedata-analysisdata-sharingreproducible-research
spellingShingle Mahmoud Ahmed
Hyun Joon Kim
Deok Ryong Kim
Maximizing the utility of public data
Frontiers in Genetics
public-data
data-reuse
data-analysis
data-sharing
reproducible-research
title Maximizing the utility of public data
title_full Maximizing the utility of public data
title_fullStr Maximizing the utility of public data
title_full_unstemmed Maximizing the utility of public data
title_short Maximizing the utility of public data
title_sort maximizing the utility of public data
topic public-data
data-reuse
data-analysis
data-sharing
reproducible-research
url https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/full
work_keys_str_mv AT mahmoudahmed maximizingtheutilityofpublicdata
AT hyunjoonkim maximizingtheutilityofpublicdata
AT deokryongkim maximizingtheutilityofpublicdata