PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma

Background Prostate adenocarcinoma (PRAD) is a common cancer diagnosis among men globally, yet large gaps in our knowledge persist with respect to the molecular bases of its progression and aggression. It is mostly indolent and slow-growing, but aggressive prostate cancers need to be recognized earl...

Full description

Bibliographic Details
Main Authors: Alex Stanley Balraj MTech, Sangeetha Muthamilselvan MTech, Rachanaa Raja BTech, Ashok Palaniappan PhD
Format: Article
Language:English
Published: SAGE Publishing 2024-01-01
Series:Technology in Cancer Research & Treatment
Online Access:https://doi.org/10.1177/15330338231222389
_version_ 1797296240890740736
author Alex Stanley Balraj MTech
Sangeetha Muthamilselvan MTech
Rachanaa Raja BTech
Ashok Palaniappan PhD
author_facet Alex Stanley Balraj MTech
Sangeetha Muthamilselvan MTech
Rachanaa Raja BTech
Ashok Palaniappan PhD
author_sort Alex Stanley Balraj MTech
collection DOAJ
description Background Prostate adenocarcinoma (PRAD) is a common cancer diagnosis among men globally, yet large gaps in our knowledge persist with respect to the molecular bases of its progression and aggression. It is mostly indolent and slow-growing, but aggressive prostate cancers need to be recognized early for optimising treatment, with a view to reducing mortality. Methods Based on TCGA transcriptomic data pertaining to PRAD and the associated clinical metadata, we determined the sample Gleason grade, and used it to execute: (i) Gleason-grade wise linear modeling, followed by five contrasts against controls and ten contrasts between grades; and (ii) Gleason-grade wise network modeling via weighted gene correlation network analysis (WGCNA). Candidate biomarkers were obtained from the above analysis and the consensus found. The consensus biomarkers were used as the feature space to train ML models for classifying a sample as benign, indolent or aggressive. Results The statistical modeling yielded 77 Gleason grade-salient genes while the WGCNA algorithm yielded 1003 trait-specific key genes in grade-wise significant modules. Consensus analysis of the two approaches identified two genes in Grade-1 (SLC43A1 and PHGR1), 26 genes in Grade-4 (including LOC100128675, PPP1R3C, NECAB1, UBXN10, SERPINA5, CLU, RASL12, DGKG, FHL1, NCAM1, and CEND1), and seven genes in Grade-5 (CBX2, DPYS, FAM72B, SHCBP1, TMEM132A, TPX2, UBE2C). A RandomForest model trained and optimized on these 35 biomarkers for the ternary classification problem yielded a balanced accuracy ∼ 86% on external validation. Conclusions The consensus of multiple parallel computational strategies has unmasked candidate Gleason grade-specific biomarkers. PRADclass, a validated AI model featurizing these biomarkers achieved good performance, and could be trialed to predict the differentiation of prostate cancers. PRADclass is available for academic use at: https://apalania.shinyapps.io/pradclass (online) and https://github.com/apalania/pradclass (command-line interface).
first_indexed 2024-03-07T22:00:35Z
format Article
id doaj.art-e1138c6b25d842a78003951638d83c65
institution Directory Open Access Journal
issn 1533-0338
language English
last_indexed 2024-03-07T22:00:35Z
publishDate 2024-01-01
publisher SAGE Publishing
record_format Article
series Technology in Cancer Research & Treatment
spelling doaj.art-e1138c6b25d842a78003951638d83c652024-02-24T05:03:24ZengSAGE PublishingTechnology in Cancer Research & Treatment1533-03382024-01-012310.1177/15330338231222389PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate AdenocarcinomaAlex Stanley Balraj MTech0Sangeetha Muthamilselvan MTech1Rachanaa Raja BTech2Ashok Palaniappan PhD3 Department of Bioinformatics, School of Chemical and Biotechnology, , Thanjavur, India Department of Bioinformatics, School of Chemical and Biotechnology, , Thanjavur, India Department of Pharmaceutical Technology, UCE, , Trichy, India Department of Bioinformatics, School of Chemical and Biotechnology, , Thanjavur, IndiaBackground Prostate adenocarcinoma (PRAD) is a common cancer diagnosis among men globally, yet large gaps in our knowledge persist with respect to the molecular bases of its progression and aggression. It is mostly indolent and slow-growing, but aggressive prostate cancers need to be recognized early for optimising treatment, with a view to reducing mortality. Methods Based on TCGA transcriptomic data pertaining to PRAD and the associated clinical metadata, we determined the sample Gleason grade, and used it to execute: (i) Gleason-grade wise linear modeling, followed by five contrasts against controls and ten contrasts between grades; and (ii) Gleason-grade wise network modeling via weighted gene correlation network analysis (WGCNA). Candidate biomarkers were obtained from the above analysis and the consensus found. The consensus biomarkers were used as the feature space to train ML models for classifying a sample as benign, indolent or aggressive. Results The statistical modeling yielded 77 Gleason grade-salient genes while the WGCNA algorithm yielded 1003 trait-specific key genes in grade-wise significant modules. Consensus analysis of the two approaches identified two genes in Grade-1 (SLC43A1 and PHGR1), 26 genes in Grade-4 (including LOC100128675, PPP1R3C, NECAB1, UBXN10, SERPINA5, CLU, RASL12, DGKG, FHL1, NCAM1, and CEND1), and seven genes in Grade-5 (CBX2, DPYS, FAM72B, SHCBP1, TMEM132A, TPX2, UBE2C). A RandomForest model trained and optimized on these 35 biomarkers for the ternary classification problem yielded a balanced accuracy ∼ 86% on external validation. Conclusions The consensus of multiple parallel computational strategies has unmasked candidate Gleason grade-specific biomarkers. PRADclass, a validated AI model featurizing these biomarkers achieved good performance, and could be trialed to predict the differentiation of prostate cancers. PRADclass is available for academic use at: https://apalania.shinyapps.io/pradclass (online) and https://github.com/apalania/pradclass (command-line interface).https://doi.org/10.1177/15330338231222389
spellingShingle Alex Stanley Balraj MTech
Sangeetha Muthamilselvan MTech
Rachanaa Raja BTech
Ashok Palaniappan PhD
PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
Technology in Cancer Research & Treatment
title PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
title_full PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
title_fullStr PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
title_full_unstemmed PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
title_short PRADclass: Hybrid Gleason Grade-Informed Computational Strategy Identifies Consensus Biomarker Features Predictive of Aggressive Prostate Adenocarcinoma
title_sort pradclass hybrid gleason grade informed computational strategy identifies consensus biomarker features predictive of aggressive prostate adenocarcinoma
url https://doi.org/10.1177/15330338231222389
work_keys_str_mv AT alexstanleybalrajmtech pradclasshybridgleasongradeinformedcomputationalstrategyidentifiesconsensusbiomarkerfeaturespredictiveofaggressiveprostateadenocarcinoma
AT sangeethamuthamilselvanmtech pradclasshybridgleasongradeinformedcomputationalstrategyidentifiesconsensusbiomarkerfeaturespredictiveofaggressiveprostateadenocarcinoma
AT rachanaarajabtech pradclasshybridgleasongradeinformedcomputationalstrategyidentifiesconsensusbiomarkerfeaturespredictiveofaggressiveprostateadenocarcinoma
AT ashokpalaniappanphd pradclasshybridgleasongradeinformedcomputationalstrategyidentifiesconsensusbiomarkerfeaturespredictiveofaggressiveprostateadenocarcinoma