Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival

Cancer develops by accumulation of somatic driver mutations, which impact cellular function. Mutations in non-coding regulatory regions can now be studied genome-wide and further characterized by correlation with gene expression and clinical outcome to identify driver candidates. Using a new two-sta...

Full description

Bibliographic Details
Main Authors: Hornshøj, Henrik, Nielsen, Morten Muhlig, Sinnott-Armstrong, Nicholas A., Świtnicki, Michał P., Juul, Malene, Madsen, Tobias, Sallari, Richard, Kellis, Manolis, Ørntoft, Torben, Hobolth, Asger, Pedersen, Jakob Skou
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:English
Published: Springer Science and Business Media LLC 2020
Online Access:https://hdl.handle.net/1721.1/126059
_version_ 1810975665419190272
author Hornshøj, Henrik
Nielsen, Morten Muhlig
Sinnott-Armstrong, Nicholas A.
Świtnicki, Michał P.
Juul, Malene
Madsen, Tobias
Sallari, Richard
Kellis, Manolis
Ørntoft, Torben
Hobolth, Asger
Pedersen, Jakob Skou
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Hornshøj, Henrik
Nielsen, Morten Muhlig
Sinnott-Armstrong, Nicholas A.
Świtnicki, Michał P.
Juul, Malene
Madsen, Tobias
Sallari, Richard
Kellis, Manolis
Ørntoft, Torben
Hobolth, Asger
Pedersen, Jakob Skou
author_sort Hornshøj, Henrik
collection MIT
description Cancer develops by accumulation of somatic driver mutations, which impact cellular function. Mutations in non-coding regulatory regions can now be studied genome-wide and further characterized by correlation with gene expression and clinical outcome to identify driver candidates. Using a new two-stage procedure, called ncDriver, we first screened 507 ICGC whole-genomes from 10 cancer types for non-coding elements, in which mutations are both recurrent and have elevated conservation or cancer specificity. This identified 160 significant non-coding elements, including the TERT promoter, a well-known non-coding driver element, as well as elements associated with known cancer genes and regulatory genes (e.g., PAX5, TOX3, PCF11, MAPRE3). However, in some significant elements, mutations appear to stem from localized mutational processes rather than recurrent positive selection in some cases. To further characterize the driver potential of the identified elements and shortlist candidates, we identified elements where presence of mutations correlated significantly with expression levels (e.g., TERT and CDH10) and survival (e.g., CDH9 and CDH10) in an independent set of 505 TCGA whole-genome samples. In a larger pan-cancer set of 4128 TCGA exomes with expression profiling, we identified mutational correlation with expression for additional elements (e.g., near GATA3, CDC6, ZNF217, and CTCF transcription factor binding sites). Survival analysis further pointed to MIR122, a known marker of poor prognosis in liver cancer. In conclusion, the screen for significant mutation patterns coupled with correlative mutational analysis identified new individual driver candidates and suggest that some non-coding mutations recurrently affect expression and play a role in cancer development.
first_indexed 2024-09-23T08:42:59Z
format Article
id mit-1721.1/126059
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T08:42:59Z
publishDate 2020
publisher Springer Science and Business Media LLC
record_format dspace
spelling mit-1721.1/1260592022-09-23T14:04:40Z Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival Hornshøj, Henrik Nielsen, Morten Muhlig Sinnott-Armstrong, Nicholas A. Świtnicki, Michał P. Juul, Malene Madsen, Tobias Sallari, Richard Kellis, Manolis Ørntoft, Torben Hobolth, Asger Pedersen, Jakob Skou Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Cancer develops by accumulation of somatic driver mutations, which impact cellular function. Mutations in non-coding regulatory regions can now be studied genome-wide and further characterized by correlation with gene expression and clinical outcome to identify driver candidates. Using a new two-stage procedure, called ncDriver, we first screened 507 ICGC whole-genomes from 10 cancer types for non-coding elements, in which mutations are both recurrent and have elevated conservation or cancer specificity. This identified 160 significant non-coding elements, including the TERT promoter, a well-known non-coding driver element, as well as elements associated with known cancer genes and regulatory genes (e.g., PAX5, TOX3, PCF11, MAPRE3). However, in some significant elements, mutations appear to stem from localized mutational processes rather than recurrent positive selection in some cases. To further characterize the driver potential of the identified elements and shortlist candidates, we identified elements where presence of mutations correlated significantly with expression levels (e.g., TERT and CDH10) and survival (e.g., CDH9 and CDH10) in an independent set of 505 TCGA whole-genome samples. In a larger pan-cancer set of 4128 TCGA exomes with expression profiling, we identified mutational correlation with expression for additional elements (e.g., near GATA3, CDC6, ZNF217, and CTCF transcription factor binding sites). Survival analysis further pointed to MIR122, a known marker of poor prognosis in liver cancer. In conclusion, the screen for significant mutation patterns coupled with correlative mutational analysis identified new individual driver candidates and suggest that some non-coding mutations recurrently affect expression and play a role in cancer development. 2020-07-07T14:04:09Z 2020-07-07T14:04:09Z 2018-01 2017-11 2019-07-18T13:20:26Z Article http://purl.org/eprint/type/JournalArticle 2056-7944 https://hdl.handle.net/1721.1/126059 Hornshøj, Henrik et al. "Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival." NPJ Genomic Medicine 3, 1 (January 2018): 1 © 2018 The Author(s) en http://dx.doi.org/10.1038/s41525-017-0040-5 NPJ Genomic Medicine Creative Commons Attribution 4.0 International license https://creativecommons.org/licenses/by/4.0/ application/pdf Springer Science and Business Media LLC Nature
spellingShingle Hornshøj, Henrik
Nielsen, Morten Muhlig
Sinnott-Armstrong, Nicholas A.
Świtnicki, Michał P.
Juul, Malene
Madsen, Tobias
Sallari, Richard
Kellis, Manolis
Ørntoft, Torben
Hobolth, Asger
Pedersen, Jakob Skou
Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title_full Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title_fullStr Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title_full_unstemmed Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title_short Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival
title_sort pan cancer screen for mutations in non coding elements with conservation and cancer specificity reveals correlations with expression and survival
url https://hdl.handle.net/1721.1/126059
work_keys_str_mv AT hornshøjhenrik pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT nielsenmortenmuhlig pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT sinnottarmstrongnicholasa pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT switnickimichałp pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT juulmalene pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT madsentobias pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT sallaririchard pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT kellismanolis pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT ørntofttorben pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT hobolthasger pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival
AT pedersenjakobskou pancancerscreenformutationsinnoncodingelementswithconservationandcancerspecificityrevealscorrelationswithexpressionandsurvival