Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability

BackgroundAs Systemic Sclerosis (SSc) is a connective tissue ailment that impacts various bodily systems. The study aims to clarify the molecular subtypes of SSc, with the ultimate objective of establishing a diagnostic model that can inform clinical treatment decisions.MethodsFive microarray datase...

Full description

Bibliographic Details
Main Authors: Qi Wang, Chen-Long Li, Li Wu, Jing-Yi Hu, Qi Yu, Sheng-Xiao Zhang, Pei-Feng He
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-10-01
Series:Frontiers in Immunology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fimmu.2023.1257802/full
_version_ 1797668154261897216
author Qi Wang
Qi Wang
Chen-Long Li
Chen-Long Li
Li Wu
Li Wu
Jing-Yi Hu
Qi Yu
Qi Yu
Sheng-Xiao Zhang
Pei-Feng He
Pei-Feng He
author_facet Qi Wang
Qi Wang
Chen-Long Li
Chen-Long Li
Li Wu
Li Wu
Jing-Yi Hu
Qi Yu
Qi Yu
Sheng-Xiao Zhang
Pei-Feng He
Pei-Feng He
author_sort Qi Wang
collection DOAJ
description BackgroundAs Systemic Sclerosis (SSc) is a connective tissue ailment that impacts various bodily systems. The study aims to clarify the molecular subtypes of SSc, with the ultimate objective of establishing a diagnostic model that can inform clinical treatment decisions.MethodsFive microarray datasets of SSc were retrieved from the GEO database. To eliminate batch effects, the combat algorithm was applied. Immune cell infiltration was evaluated using the xCell algorithm. The ConsensusClusterPlus algorithm was utilized to identify SSc subtypes. Limma was used to determine differential expression genes (DEGs). GSEA was used to determine pathway enrichment. A support vector machine (SVM), Random Forest(RF), Boruta and LASSO algorithm have been used to select the feature gene. Diagnostic models were developed using SVM, RF, and Logistic Regression (LR). A ROC curve was used to evaluate the performance of the model. The compound-gene relationship was obtained from the Comparative Toxicogenomics Database (CTD).ResultsThe identification of three immune subtypes in SSc samples was based on the expression profiles of immune cells. The utilization of 19 key intersectional DEGs among subtypes facilitated the classification of SSc patients into three robust subtypes (gene_ClusterA-C). Gene_ClusterA exhibited significant enrichment of B cells, while gene_ClusterC showed significant enrichment of monocytes. Moderate activation of various immune cells was observed in gene_ClusterB. We identified 8 feature genes. The SVM model demonstrating superior diagnostic performance. Furthermore, correlation analysis revealed a robust association between the feature genes and immune cells. Eight pertinent compounds, namely methotrexate, resveratrol, paclitaxel, trichloroethylene, formaldehyde, silicon dioxide, benzene, and tetrachloroethylene, were identified from the CTD.ConclusionThe present study has effectively devised an innovative molecular subtyping methodology for patients with SSc and a diagnostic model based on machine learning to aid in clinical treatment. The study has identified potential molecular targets for therapy, thereby offering novel perspectives for the treatment and investigation of SSc.
first_indexed 2024-03-11T20:25:04Z
format Article
id doaj.art-35cfbaa322d84738b28dd131a0aa004b
institution Directory Open Access Journal
issn 1664-3224
language English
last_indexed 2024-03-11T20:25:04Z
publishDate 2023-10-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Immunology
spelling doaj.art-35cfbaa322d84738b28dd131a0aa004b2023-10-02T17:09:04ZengFrontiers Media S.A.Frontiers in Immunology1664-32242023-10-011410.3389/fimmu.2023.12578021257802Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capabilityQi Wang0Qi Wang1Chen-Long Li2Chen-Long Li3Li Wu4Li Wu5Jing-Yi Hu6Qi Yu7Qi Yu8Sheng-Xiao Zhang9Pei-Feng He10Pei-Feng He11School of Basic Medical Sciences, Shanxi Medical University, Taiyuan, ChinaShanxi Key Laboratory of Big Data for Clinical Decision Research, Taiyuan, ChinaSchool of Basic Medical Sciences, Shanxi Medical University, Taiyuan, ChinaShanxi Key Laboratory of Big Data for Clinical Decision Research, Taiyuan, ChinaSchool of Basic Medical Sciences, Shanxi Medical University, Taiyuan, ChinaDepartment of Anesthesiology , Shanxi Provincial People’s Hospital (Fifth Hospital) of Shanxi Medical University, Taiyuan, ChinaSchool of Management, Shanxi Medical University, Taiyuan, ChinaShanxi Key Laboratory of Big Data for Clinical Decision Research, Taiyuan, ChinaSchool of Management, Shanxi Medical University, Taiyuan, ChinaDepartment of Rheumatology, Second Hospital of Shanxi Medical University, Taiyuan, ChinaShanxi Key Laboratory of Big Data for Clinical Decision Research, Taiyuan, ChinaSchool of Management, Shanxi Medical University, Taiyuan, ChinaBackgroundAs Systemic Sclerosis (SSc) is a connective tissue ailment that impacts various bodily systems. The study aims to clarify the molecular subtypes of SSc, with the ultimate objective of establishing a diagnostic model that can inform clinical treatment decisions.MethodsFive microarray datasets of SSc were retrieved from the GEO database. To eliminate batch effects, the combat algorithm was applied. Immune cell infiltration was evaluated using the xCell algorithm. The ConsensusClusterPlus algorithm was utilized to identify SSc subtypes. Limma was used to determine differential expression genes (DEGs). GSEA was used to determine pathway enrichment. A support vector machine (SVM), Random Forest(RF), Boruta and LASSO algorithm have been used to select the feature gene. Diagnostic models were developed using SVM, RF, and Logistic Regression (LR). A ROC curve was used to evaluate the performance of the model. The compound-gene relationship was obtained from the Comparative Toxicogenomics Database (CTD).ResultsThe identification of three immune subtypes in SSc samples was based on the expression profiles of immune cells. The utilization of 19 key intersectional DEGs among subtypes facilitated the classification of SSc patients into three robust subtypes (gene_ClusterA-C). Gene_ClusterA exhibited significant enrichment of B cells, while gene_ClusterC showed significant enrichment of monocytes. Moderate activation of various immune cells was observed in gene_ClusterB. We identified 8 feature genes. The SVM model demonstrating superior diagnostic performance. Furthermore, correlation analysis revealed a robust association between the feature genes and immune cells. Eight pertinent compounds, namely methotrexate, resveratrol, paclitaxel, trichloroethylene, formaldehyde, silicon dioxide, benzene, and tetrachloroethylene, were identified from the CTD.ConclusionThe present study has effectively devised an innovative molecular subtyping methodology for patients with SSc and a diagnostic model based on machine learning to aid in clinical treatment. The study has identified potential molecular targets for therapy, thereby offering novel perspectives for the treatment and investigation of SSc.https://www.frontiersin.org/articles/10.3389/fimmu.2023.1257802/fullsystemic sclerosisunsupervised machine learningmolecular subtypesimmune microenvironmentdiagnostic
spellingShingle Qi Wang
Qi Wang
Chen-Long Li
Chen-Long Li
Li Wu
Li Wu
Jing-Yi Hu
Qi Yu
Qi Yu
Sheng-Xiao Zhang
Pei-Feng He
Pei-Feng He
Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
Frontiers in Immunology
systemic sclerosis
unsupervised machine learning
molecular subtypes
immune microenvironment
diagnostic
title Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
title_full Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
title_fullStr Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
title_full_unstemmed Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
title_short Distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
title_sort distinct molecular subtypes of systemic sclerosis and gene signature with diagnostic capability
topic systemic sclerosis
unsupervised machine learning
molecular subtypes
immune microenvironment
diagnostic
url https://www.frontiersin.org/articles/10.3389/fimmu.2023.1257802/full
work_keys_str_mv AT qiwang distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT qiwang distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT chenlongli distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT chenlongli distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT liwu distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT liwu distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT jingyihu distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT qiyu distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT qiyu distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT shengxiaozhang distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT peifenghe distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability
AT peifenghe distinctmolecularsubtypesofsystemicsclerosisandgenesignaturewithdiagnosticcapability