Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2022-09-01
|
Series: | Bioinformatics and Biology Insights |
Online Access: | https://doi.org/10.1177/11779322221125604 |
_version_ | 1798000481442725888 |
---|---|
author | Tiago JS Lopes Ricardo Rios Tatiane Nogueira |
author_facet | Tiago JS Lopes Ricardo Rios Tatiane Nogueira |
author_sort | Tiago JS Lopes |
collection | DOAJ |
description | Hemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is increasing so rapidly, even the most avid reader will find it difficult to follow it closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework is able to automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available for the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypothesis based on patterns that are imperceptible to human researchers. |
first_indexed | 2024-04-11T11:20:58Z |
format | Article |
id | doaj.art-c79544f396dc4f85b1ae383babf875a5 |
institution | Directory Open Access Journal |
issn | 1177-9322 |
language | English |
last_indexed | 2024-04-11T11:20:58Z |
publishDate | 2022-09-01 |
publisher | SAGE Publishing |
record_format | Article |
series | Bioinformatics and Biology Insights |
spelling | doaj.art-c79544f396dc4f85b1ae383babf875a52022-12-22T04:27:05ZengSAGE PublishingBioinformatics and Biology Insights1177-93222022-09-011610.1177/11779322221125604Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 DecadesTiago JS Lopes0Ricardo Rios1Tatiane Nogueira2Department of Regenerative Medicine, National Center for Child Health and Development Research Institute, Tokyo, JapanDepartment of Computer Science, Federal University of Bahia, Salvador, BrazilDepartment of Computer Science, Federal University of Bahia, Salvador, BrazilHemophilia is an inherited blood coagulation disorder caused by mutations on the coagulation factors VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is increasing so rapidly, even the most avid reader will find it difficult to follow it closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework is able to automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available for the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypothesis based on patterns that are imperceptible to human researchers.https://doi.org/10.1177/11779322221125604 |
spellingShingle | Tiago JS Lopes Ricardo Rios Tatiane Nogueira Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades Bioinformatics and Biology Insights |
title | Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades |
title_full | Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades |
title_fullStr | Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades |
title_full_unstemmed | Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades |
title_short | Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades |
title_sort | computational analyses reveal fundamental properties of the hemophilia literature in the last 6 decades |
url | https://doi.org/10.1177/11779322221125604 |
work_keys_str_mv | AT tiagojslopes computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades AT ricardorios computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades AT tatianenogueira computationalanalysesrevealfundamentalpropertiesofthehemophilialiteratureinthelast6decades |