Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis
Background: The long-term prognosis for patients with osteosarcoma (OS) metastasis remains unfavourable, highlighting the urgent need for research that explores potential biomarkers using innovative methodologies. Methods: This study explored potential biomarkers for OS metastasis by analysing data...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2024-04-01
|
Series: | Heliyon |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2405844024053532 |
_version_ | 1827288748074729472 |
---|---|
author | Guangyuan Liu Shaochun Wang Jinhui Liu Jiangli Zhang Xiqing Pan Xiao Fan Tingting Shao Yi Sun |
author_facet | Guangyuan Liu Shaochun Wang Jinhui Liu Jiangli Zhang Xiqing Pan Xiao Fan Tingting Shao Yi Sun |
author_sort | Guangyuan Liu |
collection | DOAJ |
description | Background: The long-term prognosis for patients with osteosarcoma (OS) metastasis remains unfavourable, highlighting the urgent need for research that explores potential biomarkers using innovative methodologies. Methods: This study explored potential biomarkers for OS metastasis by analysing data from the Cancer Genome Atlas Program (TCGA) and Gene Expression Omnibus (GEO) databases. The synthetic minority oversampling technique (SMOTE) was employed to tackle class imbalances, while genes were selected using four feature selection algorithms (Monte Carlo feature selection [MCFS], Borota, minimum-redundancy maximum-relevance [mRMR], and light gradient-boosting machine [LightGBM]) based on the gene expression matrix. Four machine learning (ML) algorithms (support vector machine [SVM], extreme gradient boosting [XGBoost], random forest [RF], and k-nearest neighbours [kNN]) were utilized to determine the optimal number of genes for building the model. Interpretable machine learning (IML) was applied to construct prediction networks, revealing potential relationships among the selected genes. Additionally, enrichment analysis, survival analysis, and immune infiltration were performed on the featured genes. Results: In DS1, DS2, and DS3, the IML algorithm identified 53, 45, and 46 features, respectively. Using the merged gene set, we obtained a total of 79 interpretable prediction rules for OS metastasis. We subsequently conducted an in-depth investigation on 39 crucial molecules associated with predicting OS metastasis, elucidating their roles within the tumour microenvironment. Importantly, we found that certain genes act as both predictors and differentially expressed genes. Finally, our study unveiled statistically significant differences in survival between the high and low expression groups of TRIP4, S100A9, SELL and SLC11A1, and there was a certain correlation between these genes and 22 various immune cells. Conclusions: The biomarkers discovered in this study hold significant implications for personalized therapies, potentially enhancing the clinical prognosis of patients with OS. |
first_indexed | 2024-04-24T11:36:57Z |
format | Article |
id | doaj.art-0f1f5f5ec62e4389a6483c63270f62a8 |
institution | Directory Open Access Journal |
issn | 2405-8440 |
language | English |
last_indexed | 2024-04-24T11:36:57Z |
publishDate | 2024-04-01 |
publisher | Elsevier |
record_format | Article |
series | Heliyon |
spelling | doaj.art-0f1f5f5ec62e4389a6483c63270f62a82024-04-10T04:29:18ZengElsevierHeliyon2405-84402024-04-01107e29322Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasisGuangyuan Liu0Shaochun Wang1Jinhui Liu2Jiangli Zhang3Xiqing Pan4Xiao Fan5Tingting Shao6Yi Sun7The First Department of Orthopedic Surgery, Third Hospital of Shijiazhuang, Tiyu South Avenue No.15, Shijiazhuang, Hebei Province, ChinaDepartment of Oncology, Shijiazhuang People's Hospital, No.365, Jian Hua Nan Da Jie, Shijiazhuang, Hebei Province, ChinaThe First Department of Orthopedic Surgery, Third Hospital of Shijiazhuang, Tiyu South Avenue No.15, Shijiazhuang, Hebei Province, ChinaThe First Department of Orthopedic Surgery, Third Hospital of Shijiazhuang, Tiyu South Avenue No.15, Shijiazhuang, Hebei Province, ChinaThe First Department of Orthopedic Surgery, Third Hospital of Shijiazhuang, Tiyu South Avenue No.15, Shijiazhuang, Hebei Province, ChinaThe First Department of Orthopedic Surgery, Third Hospital of Shijiazhuang, Tiyu South Avenue No.15, Shijiazhuang, Hebei Province, ChinaDepartment of Pediatrics, Peking University First Hospital, 8 Xishku Street, Xicheng District, Beijing, ChinaDepartment of Surgery, Shijiazhuang People's Hospital, No.365, Jian Hua Nan Da Jie, Shijiazhuang, Hebei Province, China; Corresponding authorNo.365, Jian Hua Nan Da Jie, Shijiazhuang, Hebei Province, 050011, ChinaBackground: The long-term prognosis for patients with osteosarcoma (OS) metastasis remains unfavourable, highlighting the urgent need for research that explores potential biomarkers using innovative methodologies. Methods: This study explored potential biomarkers for OS metastasis by analysing data from the Cancer Genome Atlas Program (TCGA) and Gene Expression Omnibus (GEO) databases. The synthetic minority oversampling technique (SMOTE) was employed to tackle class imbalances, while genes were selected using four feature selection algorithms (Monte Carlo feature selection [MCFS], Borota, minimum-redundancy maximum-relevance [mRMR], and light gradient-boosting machine [LightGBM]) based on the gene expression matrix. Four machine learning (ML) algorithms (support vector machine [SVM], extreme gradient boosting [XGBoost], random forest [RF], and k-nearest neighbours [kNN]) were utilized to determine the optimal number of genes for building the model. Interpretable machine learning (IML) was applied to construct prediction networks, revealing potential relationships among the selected genes. Additionally, enrichment analysis, survival analysis, and immune infiltration were performed on the featured genes. Results: In DS1, DS2, and DS3, the IML algorithm identified 53, 45, and 46 features, respectively. Using the merged gene set, we obtained a total of 79 interpretable prediction rules for OS metastasis. We subsequently conducted an in-depth investigation on 39 crucial molecules associated with predicting OS metastasis, elucidating their roles within the tumour microenvironment. Importantly, we found that certain genes act as both predictors and differentially expressed genes. Finally, our study unveiled statistically significant differences in survival between the high and low expression groups of TRIP4, S100A9, SELL and SLC11A1, and there was a certain correlation between these genes and 22 various immune cells. Conclusions: The biomarkers discovered in this study hold significant implications for personalized therapies, potentially enhancing the clinical prognosis of patients with OS.http://www.sciencedirect.com/science/article/pii/S2405844024053532OsteosarcomaMachine learningFeature selectionBiomarkersTumour microenvironment |
spellingShingle | Guangyuan Liu Shaochun Wang Jinhui Liu Jiangli Zhang Xiqing Pan Xiao Fan Tingting Shao Yi Sun Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis Heliyon Osteosarcoma Machine learning Feature selection Biomarkers Tumour microenvironment |
title | Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
title_full | Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
title_fullStr | Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
title_full_unstemmed | Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
title_short | Using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
title_sort | using machine learning methods to study the tumour microenvironment and its biomarkers in osteosarcoma metastasis |
topic | Osteosarcoma Machine learning Feature selection Biomarkers Tumour microenvironment |
url | http://www.sciencedirect.com/science/article/pii/S2405844024053532 |
work_keys_str_mv | AT guangyuanliu usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT shaochunwang usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT jinhuiliu usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT jianglizhang usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT xiqingpan usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT xiaofan usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT tingtingshao usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis AT yisun usingmachinelearningmethodstostudythetumourmicroenvironmentanditsbiomarkersinosteosarcomametastasis |