Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing

Abstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introdu...

Full description

Bibliographic Details
Main Authors: Nicolas Lynn, Tamir Tuller
Format: Article
Language:English
Published: Nature Portfolio 2024-03-01
Series:npj Systems Biology and Applications
Online Access:https://doi.org/10.1038/s41540-024-00351-7
_version_ 1797266696437760000
author Nicolas Lynn
Tamir Tuller
author_facet Nicolas Lynn
Tamir Tuller
author_sort Nicolas Lynn
collection DOAJ
description Abstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introduce Oncosplice, a tool that scores mutations based on models of proteomes generated using aberrant splicing predictions. Oncosplice leverages a highly accurate neural network that predicts splice sites within arbitrary mRNA sequences, a greedy transcript constructor that considers alternate arrangements of splicing blueprints, and an algorithm that grades the functional divergence between proteins based on evolutionary conservation. By applying this tool to 12M somatic mutations we identify 8K deleterious variants that are significantly depleted within the healthy population; we demonstrate the tool’s ability to identify clinically validated pathogenic variants with a positive predictive value of 94%; we show strong enrichment of predicted deleterious mutations across pan-cancer drivers. We also achieve improved patient survival estimation using a proposed set of novel cancer-involved genes. Ultimately, this pipeline enables accelerated insight-gathering of sequence-specific consequences for a class of understudied mutations and provides an efficient way of filtering through massive variant datasets – functionalities with immediate experimental and clinical applications.
first_indexed 2024-04-25T01:04:48Z
format Article
id doaj.art-4ce35cd2201a476b9e2677f7a5fcfba0
institution Directory Open Access Journal
issn 2056-7189
language English
last_indexed 2024-04-25T01:04:48Z
publishDate 2024-03-01
publisher Nature Portfolio
record_format Article
series npj Systems Biology and Applications
spelling doaj.art-4ce35cd2201a476b9e2677f7a5fcfba02024-03-10T12:15:47ZengNature Portfolionpj Systems Biology and Applications2056-71892024-03-0110111510.1038/s41540-024-00351-7Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicingNicolas Lynn0Tamir Tuller1Department of Biomedical Engineering, the Engineering Faculty, Tel Aviv UniversityDepartment of Biomedical Engineering, the Engineering Faculty, Tel Aviv UniversityAbstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introduce Oncosplice, a tool that scores mutations based on models of proteomes generated using aberrant splicing predictions. Oncosplice leverages a highly accurate neural network that predicts splice sites within arbitrary mRNA sequences, a greedy transcript constructor that considers alternate arrangements of splicing blueprints, and an algorithm that grades the functional divergence between proteins based on evolutionary conservation. By applying this tool to 12M somatic mutations we identify 8K deleterious variants that are significantly depleted within the healthy population; we demonstrate the tool’s ability to identify clinically validated pathogenic variants with a positive predictive value of 94%; we show strong enrichment of predicted deleterious mutations across pan-cancer drivers. We also achieve improved patient survival estimation using a proposed set of novel cancer-involved genes. Ultimately, this pipeline enables accelerated insight-gathering of sequence-specific consequences for a class of understudied mutations and provides an efficient way of filtering through massive variant datasets – functionalities with immediate experimental and clinical applications.https://doi.org/10.1038/s41540-024-00351-7
spellingShingle Nicolas Lynn
Tamir Tuller
Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
npj Systems Biology and Applications
title Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
title_full Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
title_fullStr Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
title_full_unstemmed Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
title_short Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
title_sort detecting and understanding meaningful cancerous mutations based on computational models of mrna splicing
url https://doi.org/10.1038/s41540-024-00351-7
work_keys_str_mv AT nicolaslynn detectingandunderstandingmeaningfulcancerousmutationsbasedoncomputationalmodelsofmrnasplicing
AT tamirtuller detectingandunderstandingmeaningfulcancerousmutationsbasedoncomputationalmodelsofmrnasplicing