Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics

Determining the aggressiveness of gliomas, termed grading, is a critical step toward treatment optimization to increase the survival rate and decrease treatment toxicity for patients. Streamlined grading using molecular information has the potential to facilitate decision making in the clinic and ai...

Full description

Bibliographic Details
Main Authors: Erdal Tasci, Ying Zhuge, Harpreet Kaur, Kevin Camphausen, Andra Valentina Krauze
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/23/22/14155
_version_ 1797465087088263168
author Erdal Tasci
Ying Zhuge
Harpreet Kaur
Kevin Camphausen
Andra Valentina Krauze
author_facet Erdal Tasci
Ying Zhuge
Harpreet Kaur
Kevin Camphausen
Andra Valentina Krauze
author_sort Erdal Tasci
collection DOAJ
description Determining the aggressiveness of gliomas, termed grading, is a critical step toward treatment optimization to increase the survival rate and decrease treatment toxicity for patients. Streamlined grading using molecular information has the potential to facilitate decision making in the clinic and aid in treatment planning. In recent years, molecular markers have increasingly gained importance in the classification of tumors. In this study, we propose a novel hierarchical voting-based methodology for improving the performance results of the feature selection stage and machine learning models for glioma grading with clinical and molecular predictors. To identify the best scheme for the given soft-voting-based ensemble learning model selections, we utilized publicly available TCGA and CGGA datasets and employed four dimensionality reduction methods to carry out a voting-based ensemble feature selection and five supervised models, with a total of sixteen combination sets. We also compared our proposed feature selection method with the LASSO feature selection method in isolation. The computational results indicate that the proposed method achieves 87.606% and 79.668% accuracy rates on TCGA and CGGA datasets, respectively, outperforming the LASSO feature selection method.
first_indexed 2024-03-09T18:16:26Z
format Article
id doaj.art-285dc3740f85492085b1ca3f2d0d56bc
institution Directory Open Access Journal
issn 1661-6596
1422-0067
language English
last_indexed 2024-03-09T18:16:26Z
publishDate 2022-11-01
publisher MDPI AG
record_format Article
series International Journal of Molecular Sciences
spelling doaj.art-285dc3740f85492085b1ca3f2d0d56bc2023-11-24T08:39:28ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672022-11-0123221415510.3390/ijms232214155Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular CharacteristicsErdal Tasci0Ying Zhuge1Harpreet Kaur2Kevin Camphausen3Andra Valentina Krauze4Radiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 10, Bethesda, MD 20892, USARadiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 10, Bethesda, MD 20892, USARadiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 10, Bethesda, MD 20892, USARadiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 10, Bethesda, MD 20892, USARadiation Oncology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 10, Bethesda, MD 20892, USADetermining the aggressiveness of gliomas, termed grading, is a critical step toward treatment optimization to increase the survival rate and decrease treatment toxicity for patients. Streamlined grading using molecular information has the potential to facilitate decision making in the clinic and aid in treatment planning. In recent years, molecular markers have increasingly gained importance in the classification of tumors. In this study, we propose a novel hierarchical voting-based methodology for improving the performance results of the feature selection stage and machine learning models for glioma grading with clinical and molecular predictors. To identify the best scheme for the given soft-voting-based ensemble learning model selections, we utilized publicly available TCGA and CGGA datasets and employed four dimensionality reduction methods to carry out a voting-based ensemble feature selection and five supervised models, with a total of sixteen combination sets. We also compared our proposed feature selection method with the LASSO feature selection method in isolation. The computational results indicate that the proposed method achieves 87.606% and 79.668% accuracy rates on TCGA and CGGA datasets, respectively, outperforming the LASSO feature selection method.https://www.mdpi.com/1422-0067/23/22/14155diagnosticbrain tumorgliomagradingmolecular dataoncology
spellingShingle Erdal Tasci
Ying Zhuge
Harpreet Kaur
Kevin Camphausen
Andra Valentina Krauze
Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
International Journal of Molecular Sciences
diagnostic
brain tumor
glioma
grading
molecular data
oncology
title Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
title_full Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
title_fullStr Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
title_full_unstemmed Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
title_short Hierarchical Voting-Based Feature Selection and Ensemble Learning Model Scheme for Glioma Grading with Clinical and Molecular Characteristics
title_sort hierarchical voting based feature selection and ensemble learning model scheme for glioma grading with clinical and molecular characteristics
topic diagnostic
brain tumor
glioma
grading
molecular data
oncology
url https://www.mdpi.com/1422-0067/23/22/14155
work_keys_str_mv AT erdaltasci hierarchicalvotingbasedfeatureselectionandensemblelearningmodelschemeforgliomagradingwithclinicalandmolecularcharacteristics
AT yingzhuge hierarchicalvotingbasedfeatureselectionandensemblelearningmodelschemeforgliomagradingwithclinicalandmolecularcharacteristics
AT harpreetkaur hierarchicalvotingbasedfeatureselectionandensemblelearningmodelschemeforgliomagradingwithclinicalandmolecularcharacteristics
AT kevincamphausen hierarchicalvotingbasedfeatureselectionandensemblelearningmodelschemeforgliomagradingwithclinicalandmolecularcharacteristics
AT andravalentinakrauze hierarchicalvotingbasedfeatureselectionandensemblelearningmodelschemeforgliomagradingwithclinicalandmolecularcharacteristics