Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks

Abstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and p...

Full description

Bibliographic Details
Main Authors: Minsu Kim, Jennifer E. Huffman, Amy Justice, Ian Goethert, Greeshma Agasthya, VA Million Veteran Program, Ioana Danciu
Format: Article
Language:English
Published: BMC 2022-07-01
Series:BMC Medical Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12920-022-01298-6
_version_ 1818518338906619904
author Minsu Kim
Jennifer E. Huffman
Amy Justice
Ian Goethert
Greeshma Agasthya
VA Million Veteran Program
Ioana Danciu
author_facet Minsu Kim
Jennifer E. Huffman
Amy Justice
Ian Goethert
Greeshma Agasthya
VA Million Veteran Program
Ioana Danciu
author_sort Minsu Kim
collection DOAJ
description Abstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies.
first_indexed 2024-12-11T01:08:57Z
format Article
id doaj.art-a212cc619aab486085c607e0384701f3
institution Directory Open Access Journal
issn 1755-8794
language English
last_indexed 2024-12-11T01:08:57Z
publishDate 2022-07-01
publisher BMC
record_format Article
series BMC Medical Genomics
spelling doaj.art-a212cc619aab486085c607e0384701f32022-12-22T01:26:06ZengBMCBMC Medical Genomics1755-87942022-07-0115111010.1186/s12920-022-01298-6Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networksMinsu Kim0Jennifer E. Huffman1Amy Justice2Ian Goethert3Greeshma Agasthya4VA Million Veteran ProgramIoana Danciu5Computer Science and Mathematics Division, Oak Ridge National LaboratoryCenter for Population Genomics, MAVERIC, VA Boston Healthcare SystemDepartment of Veterans Affairs Connecticut Healthcare SystemInformation Technology Services Division, Oak Ridge National LaboratoryComputational Sciences and Engineering Division, Oak Ridge National LaboratoryAdvanced Computing for Health Sciences Group, Oak Ridge National LaboratoryAbstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies.https://doi.org/10.1186/s12920-022-01298-6Genome-wide Association StudyNetwork Representation LearningMachine Learning
spellingShingle Minsu Kim
Jennifer E. Huffman
Amy Justice
Ian Goethert
Greeshma Agasthya
VA Million Veteran Program
Ioana Danciu
Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
BMC Medical Genomics
Genome-wide Association Study
Network Representation Learning
Machine Learning
title Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
title_full Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
title_fullStr Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
title_full_unstemmed Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
title_short Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
title_sort identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
topic Genome-wide Association Study
Network Representation Learning
Machine Learning
url https://doi.org/10.1186/s12920-022-01298-6
work_keys_str_mv AT minsukim identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT jenniferehuffman identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT amyjustice identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT iangoethert identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT greeshmaagasthya identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT vamillionveteranprogram identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks
AT ioanadanciu identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks