Maximizing the utility of public data
The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2023-03-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/full |
_version_ | 1797855249299406848 |
---|---|
author | Mahmoud Ahmed Hyun Joon Kim Deok Ryong Kim |
author_facet | Mahmoud Ahmed Hyun Joon Kim Deok Ryong Kim |
author_sort | Mahmoud Ahmed |
collection | DOAJ |
description | The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse. |
first_indexed | 2024-04-09T20:20:38Z |
format | Article |
id | doaj.art-2be9fa9db81041b3a30823b95c04d372 |
institution | Directory Open Access Journal |
issn | 1664-8021 |
language | English |
last_indexed | 2024-04-09T20:20:38Z |
publishDate | 2023-03-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Genetics |
spelling | doaj.art-2be9fa9db81041b3a30823b95c04d3722023-03-31T04:55:32ZengFrontiers Media S.A.Frontiers in Genetics1664-80212023-03-011410.3389/fgene.2023.11066311106631Maximizing the utility of public dataMahmoud Ahmed0Hyun Joon Kim1Deok Ryong Kim2Department of Biochemistry and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaDepartment of Anatomy and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaDepartment of Biochemistry and Convergence Medical Sciences, Institute of Health Sciences, College of Medicine, Gyeongsang National University, Jinju, Republic of KoreaThe human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse.https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/fullpublic-datadata-reusedata-analysisdata-sharingreproducible-research |
spellingShingle | Mahmoud Ahmed Hyun Joon Kim Deok Ryong Kim Maximizing the utility of public data Frontiers in Genetics public-data data-reuse data-analysis data-sharing reproducible-research |
title | Maximizing the utility of public data |
title_full | Maximizing the utility of public data |
title_fullStr | Maximizing the utility of public data |
title_full_unstemmed | Maximizing the utility of public data |
title_short | Maximizing the utility of public data |
title_sort | maximizing the utility of public data |
topic | public-data data-reuse data-analysis data-sharing reproducible-research |
url | https://www.frontiersin.org/articles/10.3389/fgene.2023.1106631/full |
work_keys_str_mv | AT mahmoudahmed maximizingtheutilityofpublicdata AT hyunjoonkim maximizingtheutilityofpublicdata AT deokryongkim maximizingtheutilityofpublicdata |