Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data

Driver genes are directly responsible for oncogenesis and identifying them is essential in order to fully understand the mechanisms of cancer. However, it is difficult to delineate them from the larger pool of genes that are deregulated in cancer (ie, passenger genes). In order to address this probl...

Full description

Bibliographic Details
Main Authors: Sigrid Rouam, Lance D. Miller, R. Krishna Murthy Karuturi
Format: Article
Language:English
Published: SAGE Publishing 2014-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.4137/CIN.S18302
_version_ 1818551216296165376
author Sigrid Rouam
Lance D. Miller
R. Krishna Murthy Karuturi
author_facet Sigrid Rouam
Lance D. Miller
R. Krishna Murthy Karuturi
author_sort Sigrid Rouam
collection DOAJ
description Driver genes are directly responsible for oncogenesis and identifying them is essential in order to fully understand the mechanisms of cancer. However, it is difficult to delineate them from the larger pool of genes that are deregulated in cancer (ie, passenger genes). In order to address this problem, we developed an approach called TRIAngulating Gene Expression (TRIAGE through clinico-genomic intersects). Here, we present a refinement of this approach incorporating a new scoring methodology to identify putative driver genes that are deregulated in cancer. TRIAGE triangulates - or integrates -three levels of information: gene expression, gene location, and patient survival. First, TRIAGE identifies regions of deregulated expression (ie, expression footprints) by deriving a newly established measure called the Local Singular Value Decomposition (LSVD) score for each locus. Driver genes are then distinguished from passenger genes using dual survival analyses. Incorporating measurements of gene expression and weighting them according to the LSVD weight of each tumor, these analyses are performed using the genes located in significant expression footprints. Here, we first use simulated data to characterize the newly established LSVD score. We then present the results of our application of this refined version of TRIAGE to gene expression data from five cancer types. This refined version of TRIAGE not only allowed us to identify known prominent driver genes, such as MMP1, IL8 , and COL1A2 , but it also led us to identify several novel ones. These results illustrate that TRIAGE complements existing tools, allows for the identification of genes that drive cancer and could perhaps elucidate potential future targets of novel anticancer therapeutics.
first_indexed 2024-12-12T08:56:55Z
format Article
id doaj.art-e0e1352f7a7b4558825115f5c56af7c7
institution Directory Open Access Journal
issn 1176-9351
language English
last_indexed 2024-12-12T08:56:55Z
publishDate 2014-01-01
publisher SAGE Publishing
record_format Article
series Cancer Informatics
spelling doaj.art-e0e1352f7a7b4558825115f5c56af7c72022-12-22T00:29:57ZengSAGE PublishingCancer Informatics1176-93512014-01-0113s610.4137/CIN.S18302Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival DataSigrid Rouam0Lance D. Miller1R. Krishna Murthy Karuturi2Procter and Gamble International Operations SA Singapore Branch, Statistics Asia, Singapore.Department of Cancer Biology, Wake Forest University School of Medicine, Winston-Salem, NC, USA.Computational Sciences, The Jackson Laboratory, Bar Harbor, ME, USA.Driver genes are directly responsible for oncogenesis and identifying them is essential in order to fully understand the mechanisms of cancer. However, it is difficult to delineate them from the larger pool of genes that are deregulated in cancer (ie, passenger genes). In order to address this problem, we developed an approach called TRIAngulating Gene Expression (TRIAGE through clinico-genomic intersects). Here, we present a refinement of this approach incorporating a new scoring methodology to identify putative driver genes that are deregulated in cancer. TRIAGE triangulates - or integrates -three levels of information: gene expression, gene location, and patient survival. First, TRIAGE identifies regions of deregulated expression (ie, expression footprints) by deriving a newly established measure called the Local Singular Value Decomposition (LSVD) score for each locus. Driver genes are then distinguished from passenger genes using dual survival analyses. Incorporating measurements of gene expression and weighting them according to the LSVD weight of each tumor, these analyses are performed using the genes located in significant expression footprints. Here, we first use simulated data to characterize the newly established LSVD score. We then present the results of our application of this refined version of TRIAGE to gene expression data from five cancer types. This refined version of TRIAGE not only allowed us to identify known prominent driver genes, such as MMP1, IL8 , and COL1A2 , but it also led us to identify several novel ones. These results illustrate that TRIAGE complements existing tools, allows for the identification of genes that drive cancer and could perhaps elucidate potential future targets of novel anticancer therapeutics.https://doi.org/10.4137/CIN.S18302
spellingShingle Sigrid Rouam
Lance D. Miller
R. Krishna Murthy Karuturi
Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
Cancer Informatics
title Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
title_full Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
title_fullStr Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
title_full_unstemmed Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
title_short Identifying Driver Genes in Cancer by Triangulating Gene Expression, Gene Location, and Survival Data
title_sort identifying driver genes in cancer by triangulating gene expression gene location and survival data
url https://doi.org/10.4137/CIN.S18302
work_keys_str_mv AT sigridrouam identifyingdrivergenesincancerbytriangulatinggeneexpressiongenelocationandsurvivaldata
AT lancedmiller identifyingdrivergenesincancerbytriangulatinggeneexpressiongenelocationandsurvivaldata
AT rkrishnamurthykaruturi identifyingdrivergenesincancerbytriangulatinggeneexpressiongenelocationandsurvivaldata