Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks
Abstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and p...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2022-07-01
|
Series: | BMC Medical Genomics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12920-022-01298-6 |
_version_ | 1818518338906619904 |
---|---|
author | Minsu Kim Jennifer E. Huffman Amy Justice Ian Goethert Greeshma Agasthya VA Million Veteran Program Ioana Danciu |
author_facet | Minsu Kim Jennifer E. Huffman Amy Justice Ian Goethert Greeshma Agasthya VA Million Veteran Program Ioana Danciu |
author_sort | Minsu Kim |
collection | DOAJ |
description | Abstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies. |
first_indexed | 2024-12-11T01:08:57Z |
format | Article |
id | doaj.art-a212cc619aab486085c607e0384701f3 |
institution | Directory Open Access Journal |
issn | 1755-8794 |
language | English |
last_indexed | 2024-12-11T01:08:57Z |
publishDate | 2022-07-01 |
publisher | BMC |
record_format | Article |
series | BMC Medical Genomics |
spelling | doaj.art-a212cc619aab486085c607e0384701f32022-12-22T01:26:06ZengBMCBMC Medical Genomics1755-87942022-07-0115111010.1186/s12920-022-01298-6Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networksMinsu Kim0Jennifer E. Huffman1Amy Justice2Ian Goethert3Greeshma Agasthya4VA Million Veteran ProgramIoana Danciu5Computer Science and Mathematics Division, Oak Ridge National LaboratoryCenter for Population Genomics, MAVERIC, VA Boston Healthcare SystemDepartment of Veterans Affairs Connecticut Healthcare SystemInformation Technology Services Division, Oak Ridge National LaboratoryComputational Sciences and Engineering Division, Oak Ridge National LaboratoryAdvanced Computing for Health Sciences Group, Oak Ridge National LaboratoryAbstract Background Genome-wide Association Studies (GWAS) aims to uncover the link between genomic variation and phenotype. They have been actively applied in cancer biology to investigate associations between variations and cancer phenotypes, such as susceptibility to certain types of cancer and predisposed responsiveness to specific treatments. Since GWAS primarily focuses on finding associations between individual genomic variations and cancer phenotypes, there are limitations in understanding the mechanisms by which cancer phenotypes are cooperatively affected by more than one genomic variation. Results This paper proposes a network representation learning approach to learn associations among genomic variations using a prostate cancer cohort. The learned associations are encoded into representations that can be used to identify functional modules of genomic variations within genes associated with early- and late-onset prostate cancer. The proposed method was applied to a prostate cancer cohort provided by the Veterans Administration’s Million Veteran Program to identify candidates for functional modules associated with early-onset prostate cancer. The cohort included 33,159 prostate cancer patients, 3181 early-onset patients, and 29,978 late-onset patients. The reproducibility of the proposed approach clearly showed that the proposed approach can improve the model performance in terms of robustness. Conclusions To our knowledge, this is the first attempt to use a network representation learning approach to learn associations among genomic variations within genes. Associations learned in this way can lead to an understanding of the underlying mechanisms of how genomic variations cooperatively affect each cancer phenotype. This method can reveal unknown knowledge in the field of cancer biology and can be utilized to design more advanced cancer-targeted therapies.https://doi.org/10.1186/s12920-022-01298-6Genome-wide Association StudyNetwork Representation LearningMachine Learning |
spellingShingle | Minsu Kim Jennifer E. Huffman Amy Justice Ian Goethert Greeshma Agasthya VA Million Veteran Program Ioana Danciu Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks BMC Medical Genomics Genome-wide Association Study Network Representation Learning Machine Learning |
title | Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
title_full | Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
title_fullStr | Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
title_full_unstemmed | Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
title_short | Identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
title_sort | identifying intragenic functional modules of genomic variations associated with cancer phenotypes by learning representation of association networks |
topic | Genome-wide Association Study Network Representation Learning Machine Learning |
url | https://doi.org/10.1186/s12920-022-01298-6 |
work_keys_str_mv | AT minsukim identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT jenniferehuffman identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT amyjustice identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT iangoethert identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT greeshmaagasthya identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT vamillionveteranprogram identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks AT ioanadanciu identifyingintragenicfunctionalmodulesofgenomicvariationsassociatedwithcancerphenotypesbylearningrepresentationofassociationnetworks |