Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology

Ovarian cancer stands in the sixth position among the most commonly occurring cancers in the world. Because of the high rate of recurrence, this gynecological malignancy seems to be a vital reason behind cancer-related death among women as tumor recurrence stands as an obstacle in ovarian cancer tre...

Full description

Bibliographic Details
Main Authors: S. Sujamol, E. R. Vimina, U. Krishnakumar
Format: Article
Language:English
Published: Taylor & Francis Group 2021-02-01
Series:Applied Artificial Intelligence
Online Access:http://dx.doi.org/10.1080/08839514.2020.1854988
_version_ 1797684877540196352
author S. Sujamol
E. R. Vimina
U. Krishnakumar
author_facet S. Sujamol
E. R. Vimina
U. Krishnakumar
author_sort S. Sujamol
collection DOAJ
description Ovarian cancer stands in the sixth position among the most commonly occurring cancers in the world. Because of the high rate of recurrence, this gynecological malignancy seems to be a vital reason behind cancer-related death among women as tumor recurrence stands as an obstacle in ovarian cancer treatment. It is crucial to find those recurrence causing factors in order to plan suitable therapies with high prognostic results. Hence, in this work, a multistage feature selection methodology is proposed to identify key MiRNAs and clinical features for improving the accuracy of ovarian cancer recurrence prediction. MiRNA expression profiles of ovarian cancer patients and their corresponding clinical data were downloaded from the TCGA cancer repository. From 588 MiRNAs, 6 key MiRNAs were selected using the Inheritable Bi-objective Combinatorial Genetic Algorithm (IBCGA) followed by factor analysis. The biological importance of the resultant MiRNAs in cancer and cellular pathways were studied using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Further, recurrence prediction was performed using the obtained MiRNA expression profiles and clinical factors, chosen using correlation analysis. The proposed approach using the selected features yielded a prediction accuracy of 91.86% using the XGBoost classifier while the same without feature selection was 76.59%. Compared to previous similar works, this model provides a better result in terms of accuracy and reveals influential MiRNAs in ovarian cancer.
first_indexed 2024-03-12T00:36:06Z
format Article
id doaj.art-5e36b3c01d614f1194d628fed96376d8
institution Directory Open Access Journal
issn 0883-9514
1087-6545
language English
last_indexed 2024-03-12T00:36:06Z
publishDate 2021-02-01
publisher Taylor & Francis Group
record_format Article
series Applied Artificial Intelligence
spelling doaj.art-5e36b3c01d614f1194d628fed96376d82023-09-15T09:33:58ZengTaylor & Francis GroupApplied Artificial Intelligence0883-95141087-65452021-02-0135320622610.1080/08839514.2020.18549881854988Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection MethodologyS. Sujamol0E. R. Vimina1U. Krishnakumar2School of Arts and Sciences, Amrita Vishwa VidyapeethamSchool of Arts and Sciences, Amrita Vishwa VidyapeethamSchool of Arts and Sciences, Amrita Vishwa VidyapeethamOvarian cancer stands in the sixth position among the most commonly occurring cancers in the world. Because of the high rate of recurrence, this gynecological malignancy seems to be a vital reason behind cancer-related death among women as tumor recurrence stands as an obstacle in ovarian cancer treatment. It is crucial to find those recurrence causing factors in order to plan suitable therapies with high prognostic results. Hence, in this work, a multistage feature selection methodology is proposed to identify key MiRNAs and clinical features for improving the accuracy of ovarian cancer recurrence prediction. MiRNA expression profiles of ovarian cancer patients and their corresponding clinical data were downloaded from the TCGA cancer repository. From 588 MiRNAs, 6 key MiRNAs were selected using the Inheritable Bi-objective Combinatorial Genetic Algorithm (IBCGA) followed by factor analysis. The biological importance of the resultant MiRNAs in cancer and cellular pathways were studied using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Further, recurrence prediction was performed using the obtained MiRNA expression profiles and clinical factors, chosen using correlation analysis. The proposed approach using the selected features yielded a prediction accuracy of 91.86% using the XGBoost classifier while the same without feature selection was 76.59%. Compared to previous similar works, this model provides a better result in terms of accuracy and reveals influential MiRNAs in ovarian cancer.http://dx.doi.org/10.1080/08839514.2020.1854988
spellingShingle S. Sujamol
E. R. Vimina
U. Krishnakumar
Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
Applied Artificial Intelligence
title Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
title_full Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
title_fullStr Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
title_full_unstemmed Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
title_short Improving Recurrence Prediction Accuracy of Ovarian Cancer Using Multi-phase Feature Selection Methodology
title_sort improving recurrence prediction accuracy of ovarian cancer using multi phase feature selection methodology
url http://dx.doi.org/10.1080/08839514.2020.1854988
work_keys_str_mv AT ssujamol improvingrecurrencepredictionaccuracyofovariancancerusingmultiphasefeatureselectionmethodology
AT ervimina improvingrecurrencepredictionaccuracyofovariancancerusingmultiphasefeatureselectionmethodology
AT ukrishnakumar improvingrecurrencepredictionaccuracyofovariancancerusingmultiphasefeatureselectionmethodology