A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications
Abstract Background A significant proportion of septic patients with acute lung injury (ALI) are recognized late due to the absence of an efficient diagnostic test, leading to the postponed treatments and consequently higher mortality. Identifying diagnostic biomarkers may improve screening to ident...
Main Authors: | , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-09-01
|
Series: | Journal of Translational Medicine |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12967-023-04499-4 |
_version_ | 1827708676661575680 |
---|---|
author | Yongxin Zheng Jinping Wang Zhaoyi Ling Jiamei Zhang Yuan Zeng Ke Wang Yu Zhang Lingbo Nong Ling Sang Yonghao Xu Xiaoqing Liu Yimin Li Yongbo Huang |
author_facet | Yongxin Zheng Jinping Wang Zhaoyi Ling Jiamei Zhang Yuan Zeng Ke Wang Yu Zhang Lingbo Nong Ling Sang Yonghao Xu Xiaoqing Liu Yimin Li Yongbo Huang |
author_sort | Yongxin Zheng |
collection | DOAJ |
description | Abstract Background A significant proportion of septic patients with acute lung injury (ALI) are recognized late due to the absence of an efficient diagnostic test, leading to the postponed treatments and consequently higher mortality. Identifying diagnostic biomarkers may improve screening to identify septic patients at high risk of ALI earlier and provide the potential effective therapeutic drugs. Machine learning represents a powerful approach for making sense of complex gene expression data to find robust ALI diagnostic biomarkers. Methods The datasets were obtained from GEO and ArrayExpress databases. Following quality control and normalization, the datasets (GSE66890, GSE10474 and GSE32707) were merged as the training set, and four machine learning feature selection methods (Elastic net, SVM, random forest and XGBoost) were applied to construct the diagnostic model. The other datasets were considered as the validation sets. To further evaluate the performance and predictive value of diagnostic model, nomogram, Decision Curve Analysis (DCA) and Clinical Impact Curve (CIC) were constructed. Finally, the potential small molecular compounds interacting with selected features were explored from the CTD database. Results The results of GSEA showed that immune response and metabolism might play an important role in the pathogenesis of sepsis-induced ALI. Then, 52 genes were identified as putative biomarkers by consensus feature selection from all four methods. Among them, 5 genes (ARHGDIB, ALDH1A1, TACR3, TREM1 and PI3) were selected by all methods and used to predict ALI diagnosis with high accuracy. The external datasets (E-MTAB-5273 and E-MTAB-5274) demonstrated that the diagnostic model had great accuracy with AUC value of 0.725 and 0.833, respectively. In addition, the nomogram, DCA and CIC showed that the diagnostic model had great performance and predictive value. Finally, the small molecular compounds (Curcumin, Tretinoin, Acetaminophen, Estradiol and Dexamethasone) were screened as the potential therapeutic agents for sepsis-induced ALI. Conclusion This consensus of multiple machine learning algorithms identified 5 genes that were able to distinguish ALI from septic patients. The diagnostic model could identify septic patients at high risk of ALI, and provide potential therapeutic targets for sepsis-induced ALI. |
first_indexed | 2024-03-10T17:07:55Z |
format | Article |
id | doaj.art-7dadff0584ed42aaa1e4c470a7d5c993 |
institution | Directory Open Access Journal |
issn | 1479-5876 |
language | English |
last_indexed | 2024-03-10T17:07:55Z |
publishDate | 2023-09-01 |
publisher | BMC |
record_format | Article |
series | Journal of Translational Medicine |
spelling | doaj.art-7dadff0584ed42aaa1e4c470a7d5c9932023-11-20T10:44:33ZengBMCJournal of Translational Medicine1479-58762023-09-0121111610.1186/s12967-023-04499-4A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implicationsYongxin Zheng0Jinping Wang1Zhaoyi Ling2Jiamei Zhang3Yuan Zeng4Ke Wang5Yu Zhang6Lingbo Nong7Ling Sang8Yonghao Xu9Xiaoqing Liu10Yimin Li11Yongbo Huang12Department of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Cardiovascular Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityDepartment of Critical Care Medicine, The First Affiliated Hospital of Guangzhou Medical UniversityAbstract Background A significant proportion of septic patients with acute lung injury (ALI) are recognized late due to the absence of an efficient diagnostic test, leading to the postponed treatments and consequently higher mortality. Identifying diagnostic biomarkers may improve screening to identify septic patients at high risk of ALI earlier and provide the potential effective therapeutic drugs. Machine learning represents a powerful approach for making sense of complex gene expression data to find robust ALI diagnostic biomarkers. Methods The datasets were obtained from GEO and ArrayExpress databases. Following quality control and normalization, the datasets (GSE66890, GSE10474 and GSE32707) were merged as the training set, and four machine learning feature selection methods (Elastic net, SVM, random forest and XGBoost) were applied to construct the diagnostic model. The other datasets were considered as the validation sets. To further evaluate the performance and predictive value of diagnostic model, nomogram, Decision Curve Analysis (DCA) and Clinical Impact Curve (CIC) were constructed. Finally, the potential small molecular compounds interacting with selected features were explored from the CTD database. Results The results of GSEA showed that immune response and metabolism might play an important role in the pathogenesis of sepsis-induced ALI. Then, 52 genes were identified as putative biomarkers by consensus feature selection from all four methods. Among them, 5 genes (ARHGDIB, ALDH1A1, TACR3, TREM1 and PI3) were selected by all methods and used to predict ALI diagnosis with high accuracy. The external datasets (E-MTAB-5273 and E-MTAB-5274) demonstrated that the diagnostic model had great accuracy with AUC value of 0.725 and 0.833, respectively. In addition, the nomogram, DCA and CIC showed that the diagnostic model had great performance and predictive value. Finally, the small molecular compounds (Curcumin, Tretinoin, Acetaminophen, Estradiol and Dexamethasone) were screened as the potential therapeutic agents for sepsis-induced ALI. Conclusion This consensus of multiple machine learning algorithms identified 5 genes that were able to distinguish ALI from septic patients. The diagnostic model could identify septic patients at high risk of ALI, and provide potential therapeutic targets for sepsis-induced ALI.https://doi.org/10.1186/s12967-023-04499-4SepsisAcute lung injuryAcute respiratory distress syndromeMachine learningTranscriptome |
spellingShingle | Yongxin Zheng Jinping Wang Zhaoyi Ling Jiamei Zhang Yuan Zeng Ke Wang Yu Zhang Lingbo Nong Ling Sang Yonghao Xu Xiaoqing Liu Yimin Li Yongbo Huang A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications Journal of Translational Medicine Sepsis Acute lung injury Acute respiratory distress syndrome Machine learning Transcriptome |
title | A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
title_full | A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
title_fullStr | A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
title_full_unstemmed | A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
title_short | A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
title_sort | diagnostic model for sepsis induced acute lung injury using a consensus machine learning approach and its therapeutic implications |
topic | Sepsis Acute lung injury Acute respiratory distress syndrome Machine learning Transcriptome |
url | https://doi.org/10.1186/s12967-023-04499-4 |
work_keys_str_mv | AT yongxinzheng adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT jinpingwang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT zhaoyiling adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT jiameizhang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yuanzeng adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT kewang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yuzhang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT lingbonong adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT lingsang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yonghaoxu adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT xiaoqingliu adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yiminli adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yongbohuang adiagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yongxinzheng diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT jinpingwang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT zhaoyiling diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT jiameizhang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yuanzeng diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT kewang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yuzhang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT lingbonong diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT lingsang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yonghaoxu diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT xiaoqingliu diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yiminli diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications AT yongbohuang diagnosticmodelforsepsisinducedacutelunginjuryusingaconsensusmachinelearningapproachanditstherapeuticimplications |