Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades

Bibliographic Details
Main Authors: Tiago JS Lopes, Ricardo Rios, Tatiane Nogueira
Format: Article
Language: English
Published: SAGE Publishing 2022-09-01
Series: Bioinformatics and Biology Insights
Online Access: https://doi.org/10.1177/11779322221125604
author Tiago JS Lopes
Ricardo Rios
Tatiane Nogueira
collection DOAJ
description Hemophilia is an inherited blood coagulation disorder caused by mutations in the coagulation factor VIII or IX genes. Although it is a relatively rare disease, the research community is actively working on this topic, producing almost 6000 manuscripts in the last 5 years. Given that the scientific literature is growing so rapidly, even the most avid reader will find it difficult to follow closely. In this study, we used sophisticated computational techniques to map the hemophilia literature of the last 60 years. We created a network structure to represent authorship collaborations, where the nodes are the researchers and 2 nodes are connected if they co-authored a manuscript. We accurately identified author clusters, namely, researchers who have collaborated systematically for several years, and used text mining techniques to automatically synthesize their research specialties. Overall, this study serves as a historical appreciation of the effort of thousands of hemophilia researchers and demonstrates that a computational framework can automatically identify collaboration networks and their research specialties. Importantly, we made all datasets and source code available to the community, and we anticipate that the methods introduced here will pave the way for the development of systems that generate compelling hypotheses based on patterns that are imperceptible to human researchers.
first_indexed 2024-04-11T11:20:58Z
format Article
id doaj.art-c79544f396dc4f85b1ae383babf875a5
institution Directory Open Access Journal
issn 1177-9322
language English
last_indexed 2024-04-11T11:20:58Z
publishDate 2022-09-01
publisher SAGE Publishing
record_format Article
series Bioinformatics and Biology Insights
spelling doaj.art-c79544f396dc4f85b1ae383babf875a5
2022-12-22T04:27:05Z
eng
SAGE Publishing
Bioinformatics and Biology Insights
1177-9322
2022-09-01
16
10.1177/11779322221125604
Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
Tiago JS Lopes (0)
Ricardo Rios (1)
Tatiane Nogueira (2)
(0) Department of Regenerative Medicine, National Center for Child Health and Development Research Institute, Tokyo, Japan
(1) Department of Computer Science, Federal University of Bahia, Salvador, Brazil
(2) Department of Computer Science, Federal University of Bahia, Salvador, Brazil
https://doi.org/10.1177/11779322221125604
title Computational Analyses Reveal Fundamental Properties of the Hemophilia Literature in the Last 6 Decades
url https://doi.org/10.1177/11779322221125604