Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing
Abstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introdu...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2024-03-01
|
Series: | npj Systems Biology and Applications |
Online Access: | https://doi.org/10.1038/s41540-024-00351-7 |
_version_ | 1797266696437760000 |
---|---|
author | Nicolas Lynn Tamir Tuller |
author_facet | Nicolas Lynn Tamir Tuller |
author_sort | Nicolas Lynn |
collection | DOAJ |
description | Abstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introduce Oncosplice, a tool that scores mutations based on models of proteomes generated using aberrant splicing predictions. Oncosplice leverages a highly accurate neural network that predicts splice sites within arbitrary mRNA sequences, a greedy transcript constructor that considers alternate arrangements of splicing blueprints, and an algorithm that grades the functional divergence between proteins based on evolutionary conservation. By applying this tool to 12M somatic mutations we identify 8K deleterious variants that are significantly depleted within the healthy population; we demonstrate the tool’s ability to identify clinically validated pathogenic variants with a positive predictive value of 94%; we show strong enrichment of predicted deleterious mutations across pan-cancer drivers. We also achieve improved patient survival estimation using a proposed set of novel cancer-involved genes. Ultimately, this pipeline enables accelerated insight-gathering of sequence-specific consequences for a class of understudied mutations and provides an efficient way of filtering through massive variant datasets – functionalities with immediate experimental and clinical applications. |
first_indexed | 2024-04-25T01:04:48Z |
format | Article |
id | doaj.art-4ce35cd2201a476b9e2677f7a5fcfba0 |
institution | Directory Open Access Journal |
issn | 2056-7189 |
language | English |
last_indexed | 2024-04-25T01:04:48Z |
publishDate | 2024-03-01 |
publisher | Nature Portfolio |
record_format | Article |
series | npj Systems Biology and Applications |
spelling | doaj.art-4ce35cd2201a476b9e2677f7a5fcfba02024-03-10T12:15:47ZengNature Portfolionpj Systems Biology and Applications2056-71892024-03-0110111510.1038/s41540-024-00351-7Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicingNicolas Lynn0Tamir Tuller1Department of Biomedical Engineering, the Engineering Faculty, Tel Aviv UniversityDepartment of Biomedical Engineering, the Engineering Faculty, Tel Aviv UniversityAbstract Cancer research has long relied on non-silent mutations. Yet, it has become overwhelmingly clear that silent mutations can affect gene expression and cancer cell fitness. One fundamental mechanism that apparently silent mutations can severely disrupt is alternative splicing. Here we introduce Oncosplice, a tool that scores mutations based on models of proteomes generated using aberrant splicing predictions. Oncosplice leverages a highly accurate neural network that predicts splice sites within arbitrary mRNA sequences, a greedy transcript constructor that considers alternate arrangements of splicing blueprints, and an algorithm that grades the functional divergence between proteins based on evolutionary conservation. By applying this tool to 12M somatic mutations we identify 8K deleterious variants that are significantly depleted within the healthy population; we demonstrate the tool’s ability to identify clinically validated pathogenic variants with a positive predictive value of 94%; we show strong enrichment of predicted deleterious mutations across pan-cancer drivers. We also achieve improved patient survival estimation using a proposed set of novel cancer-involved genes. Ultimately, this pipeline enables accelerated insight-gathering of sequence-specific consequences for a class of understudied mutations and provides an efficient way of filtering through massive variant datasets – functionalities with immediate experimental and clinical applications.https://doi.org/10.1038/s41540-024-00351-7 |
spellingShingle | Nicolas Lynn Tamir Tuller Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing npj Systems Biology and Applications |
title | Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing |
title_full | Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing |
title_fullStr | Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing |
title_full_unstemmed | Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing |
title_short | Detecting and understanding meaningful cancerous mutations based on computational models of mRNA splicing |
title_sort | detecting and understanding meaningful cancerous mutations based on computational models of mrna splicing |
url | https://doi.org/10.1038/s41540-024-00351-7 |
work_keys_str_mv | AT nicolaslynn detectingandunderstandingmeaningfulcancerousmutationsbasedoncomputationalmodelsofmrnasplicing AT tamirtuller detectingandunderstandingmeaningfulcancerousmutationsbasedoncomputationalmodelsofmrnasplicing |