Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm
Thalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Fu...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2024-04-01
|
Series: | Frontiers in Hematology |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frhem.2024.1341225/full |
_version_ | 1797224040095547392 |
---|---|
author | Yaoshui Long Wenxue Bai |
author_facet | Yaoshui Long Wenxue Bai |
author_sort | Yaoshui Long |
collection | DOAJ |
description | Thalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Furthermore, the economic and time costs of genetic testing for thalassemia prevent many thalassemia patients from being diagnosed in time. To solve this problem, we performed least absolute shrinkage and selection operator (LASSO) regression to analyze the correlation between thalassemia and blood routine indicators containing mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), and red blood cell (RBC). We then built a nomogram to predict the occurrence of thalassemia, and receiver operating characteristic (ROC) curve was used to verify the prediction efficiency of this model. In total, we obtained 7,621 cases, including 847 thalassemia patients and 6,774 non-thalassemia. Among the 847 thalassemia patients, with a positivity rate of 67.2%, 569 cases were positive for α-thalassemia, and with a rate of 31.5%, 267 cases were positive for β-thalassemia. The remaining 11 cases were positive for both α- and β-thalassemia. Based on machine learning algorithm, we screened four optimal indicators, namely, MCV, MCH, RBC, and MCHC. The AUC value of MCV, MCH, RBC, and MCHC were 0.907, 0.906, 0.796, and 0.795, respectively. Moreover, the AUC value of the prediction model was 0.911. In summary, a novel and effective machine learning model was built to predict thalassemia, which functioned accurately, and may provide new insights for the early screening of thalassemia in the future. |
first_indexed | 2024-04-24T13:46:47Z |
format | Article |
id | doaj.art-0b4d9a6376554eae9b73dbb95b789968 |
institution | Directory Open Access Journal |
issn | 2813-3935 |
language | English |
last_indexed | 2024-04-24T13:46:47Z |
publishDate | 2024-04-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Hematology |
spelling | doaj.art-0b4d9a6376554eae9b73dbb95b7899682024-04-04T05:10:16ZengFrontiers Media S.A.Frontiers in Hematology2813-39352024-04-01310.3389/frhem.2024.13412251341225Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithmYaoshui LongWenxue BaiThalassemia is one of the inherited hemoglobin disorders worldwide, resulting in ineffective erythropoiesis, chronic hemolytic anemia, compensatory hemopoietic expansion, hypercoagulability, etc., and when a mother carries the thalassemia gene, the child is more likely to have severe thalassemia. Furthermore, the economic and time costs of genetic testing for thalassemia prevent many thalassemia patients from being diagnosed in time. To solve this problem, we performed least absolute shrinkage and selection operator (LASSO) regression to analyze the correlation between thalassemia and blood routine indicators containing mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), and red blood cell (RBC). We then built a nomogram to predict the occurrence of thalassemia, and receiver operating characteristic (ROC) curve was used to verify the prediction efficiency of this model. In total, we obtained 7,621 cases, including 847 thalassemia patients and 6,774 non-thalassemia. Among the 847 thalassemia patients, with a positivity rate of 67.2%, 569 cases were positive for α-thalassemia, and with a rate of 31.5%, 267 cases were positive for β-thalassemia. The remaining 11 cases were positive for both α- and β-thalassemia. Based on machine learning algorithm, we screened four optimal indicators, namely, MCV, MCH, RBC, and MCHC. The AUC value of MCV, MCH, RBC, and MCHC were 0.907, 0.906, 0.796, and 0.795, respectively. Moreover, the AUC value of the prediction model was 0.911. In summary, a novel and effective machine learning model was built to predict thalassemia, which functioned accurately, and may provide new insights for the early screening of thalassemia in the future.https://www.frontiersin.org/articles/10.3389/frhem.2024.1341225/fullthalassemiamachine learningblood routine indicatorspregnancyprediction |
spellingShingle | Yaoshui Long Wenxue Bai Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm Frontiers in Hematology thalassemia machine learning blood routine indicators pregnancy prediction |
title | Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
title_full | Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
title_fullStr | Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
title_full_unstemmed | Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
title_short | Constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
title_sort | constructing a novel clinical indicator model to predict the occurrence of thalassemia in pregnancy through machine learning algorithm |
topic | thalassemia machine learning blood routine indicators pregnancy prediction |
url | https://www.frontiersin.org/articles/10.3389/frhem.2024.1341225/full |
work_keys_str_mv | AT yaoshuilong constructinganovelclinicalindicatormodeltopredicttheoccurrenceofthalassemiainpregnancythroughmachinelearningalgorithm AT wenxuebai constructinganovelclinicalindicatormodeltopredicttheoccurrenceofthalassemiainpregnancythroughmachinelearningalgorithm |