Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes
Colon adenocarcinoma is the most common type of colorectal cancer. The prognosis of advanced colorectal cancer patients who received treatment is still very poor. Therefore, identifying new biomarkers for prognosis prediction has important significance for improving treatment strategies. However, th...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2023-02-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fgene.2023.1148470/full |
_version_ | 1797897113572474880 |
---|---|
author | Dongmei Ai Mingmei Wang Qingchuan Zhang Longwei Cheng Yishu Wang Xiuqin Liu Li C. Xia |
author_facet | Dongmei Ai Mingmei Wang Qingchuan Zhang Longwei Cheng Yishu Wang Xiuqin Liu Li C. Xia |
author_sort | Dongmei Ai |
collection | DOAJ |
description | Colon adenocarcinoma is the most common type of colorectal cancer. The prognosis of advanced colorectal cancer patients who received treatment is still very poor. Therefore, identifying new biomarkers for prognosis prediction has important significance for improving treatment strategies. However, the power of biomarker analyses was limited by the used sample size of individual database. In this study, we combined Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) databases to expand the number of healthy tissue samples. We screened differentially expressed genes between the GTEx healthy samples and TCGA tumor samples. Subsequently, we applied least absolute shrinkage and selection operator (LASSO) regression and multivariate Cox analysis to identify nine prognosis-related immune genes: ANGPTL4, IDO1, NOX1, CXCL3, LTB4R, IL1RL2, CD72, NOS2, and NUDT6. We computed the risk scores of samples based on the expression levels of these genes and divided patients into high- and low-risk groups according to this risk score. Survival analysis results showed a significant difference in survival rate between the two risk groups. The high-risk group had a significantly lower overall survival rate and poorer prognosis. We found the receiver operating characteristic based on the risk score was showed to accurately predict patients’ prognosis. These prognosis-related immune genes may be potential biomarkers for colorectal cancer diagnosis and treatment. Our open-source code is freely available from GitHub at https://github.com/gutmicrobes/Prognosis-model.git. |
first_indexed | 2024-04-10T07:53:43Z |
format | Article |
id | doaj.art-f90abe0e49904f5f8fee7cb858f62871 |
institution | Directory Open Access Journal |
issn | 1664-8021 |
language | English |
last_indexed | 2024-04-10T07:53:43Z |
publishDate | 2023-02-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Genetics |
spelling | doaj.art-f90abe0e49904f5f8fee7cb858f628712023-02-23T07:51:22ZengFrontiers Media S.A.Frontiers in Genetics1664-80212023-02-011410.3389/fgene.2023.11484701148470Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genesDongmei Ai0Mingmei Wang1Qingchuan Zhang2Longwei Cheng3Yishu Wang4Xiuqin Liu5Li C. Xia6School of Mathematics and Physics, University of Science and Technology Beijing, Beijing, ChinaSchool of Mathematics and Physics, University of Science and Technology Beijing, Beijing, ChinaNational Engineering Laboratory for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing, ChinaSchool of Mathematics and Physics, University of Science and Technology Beijing, Beijing, ChinaSchool of Mathematics and Physics, University of Science and Technology Beijing, Beijing, ChinaSchool of Mathematics and Physics, University of Science and Technology Beijing, Beijing, ChinaSchool of Mathematics, South China University of Technology, Guangzhou, ChinaColon adenocarcinoma is the most common type of colorectal cancer. The prognosis of advanced colorectal cancer patients who received treatment is still very poor. Therefore, identifying new biomarkers for prognosis prediction has important significance for improving treatment strategies. However, the power of biomarker analyses was limited by the used sample size of individual database. In this study, we combined Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) databases to expand the number of healthy tissue samples. We screened differentially expressed genes between the GTEx healthy samples and TCGA tumor samples. Subsequently, we applied least absolute shrinkage and selection operator (LASSO) regression and multivariate Cox analysis to identify nine prognosis-related immune genes: ANGPTL4, IDO1, NOX1, CXCL3, LTB4R, IL1RL2, CD72, NOS2, and NUDT6. We computed the risk scores of samples based on the expression levels of these genes and divided patients into high- and low-risk groups according to this risk score. Survival analysis results showed a significant difference in survival rate between the two risk groups. The high-risk group had a significantly lower overall survival rate and poorer prognosis. We found the receiver operating characteristic based on the risk score was showed to accurately predict patients’ prognosis. These prognosis-related immune genes may be potential biomarkers for colorectal cancer diagnosis and treatment. Our open-source code is freely available from GitHub at https://github.com/gutmicrobes/Prognosis-model.git.https://www.frontiersin.org/articles/10.3389/fgene.2023.1148470/fullLASSOmultivariate cox analysisprognosisimmune genecolorectal cancer |
spellingShingle | Dongmei Ai Mingmei Wang Qingchuan Zhang Longwei Cheng Yishu Wang Xiuqin Liu Li C. Xia Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes Frontiers in Genetics LASSO multivariate cox analysis prognosis immune gene colorectal cancer |
title | Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes |
title_full | Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes |
title_fullStr | Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes |
title_full_unstemmed | Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes |
title_short | Regularized survival learning and cross-database analysis enabled identification of colorectal cancer prognosis-related immune genes |
title_sort | regularized survival learning and cross database analysis enabled identification of colorectal cancer prognosis related immune genes |
topic | LASSO multivariate cox analysis prognosis immune gene colorectal cancer |
url | https://www.frontiersin.org/articles/10.3389/fgene.2023.1148470/full |
work_keys_str_mv | AT dongmeiai regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT mingmeiwang regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT qingchuanzhang regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT longweicheng regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT yishuwang regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT xiuqinliu regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes AT licxia regularizedsurvivallearningandcrossdatabaseanalysisenabledidentificationofcolorectalcancerprognosisrelatedimmunegenes |