Systems biology approach to identify gene network signatures for colorectal cancer

In this work, we integrated prior knowledge from GWAS studies or gene signatures (curated gene sets from MSigDB and/or GeneSigDB), gene set enrichment analysis (GSEA), and gene/protein network modeling together to identify gene network signatures from microarray data. We demonstrated how to apply th...

Full description

Bibliographic Details
Main Authors: Madhankumar eSonachalam, Jeffrey eShen, Hui eHuang, Xiaogang eWu
Format: Article
Language:English
Published: Frontiers Media S.A. 2012-05-01
Series:Frontiers in Genetics
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fgene.2012.00080/full
_version_ 1819046720140476416
author Madhankumar eSonachalam
Jeffrey eShen
Hui eHuang
Xiaogang eWu
author_facet Madhankumar eSonachalam
Jeffrey eShen
Hui eHuang
Xiaogang eWu
author_sort Madhankumar eSonachalam
collection DOAJ
description In this work, we integrated prior knowledge from GWAS studies or gene signatures (curated gene sets from MSigDB and/or GeneSigDB), gene set enrichment analysis (GSEA), and gene/protein network modeling together to identify gene network signatures from microarray data. We demonstrated how to apply this approach into discovering gene network signatures for colorectal cancer (CRC) from microarray datasets at three levels - genome, transcriptome, and proteome. First, we use GSEA to analyze the microarray data through enriching differential genes in different CRC-related gene sets from two publicly-available up-to-date gene set databases - Molecular Signatures Database (MSigDB) and Gene Signatures Database (GeneSigDB). Second, we compare the enriched gene sets through enrichment score (ES), false-discovery rate (FDR) and nominal p-value. Third, we construct an integrated protein-protein interaction (PPI) network through connecting these enriched genes by using a human annotated and predicted protein interaction (HAPPI) database, with a confidence score (CS) labeled for each interaction. Finally, we map differential expression values onto the constructed network to build a comprehensive network model containing visualized genome, transcriptome, and proteome data. The results show that although MSigDB is more suitable for GSEA analysis than GeneSigDB, the integrated PPI network connecting the enriched genes from both MSigDB and GeneSigDB can provide more complete view for discovering gene signatures. We also find several important sub-network signatures for colorectal cancer, such as TP53 sub-network, PCNA sub-network and IL8 sub-network, corresponding to apoptosis, DNA repair, and immune response respectively.
first_indexed 2024-12-21T10:48:57Z
format Article
id doaj.art-f8f8a5508bc444d2b829e2fc2c84473c
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-12-21T10:48:57Z
publishDate 2012-05-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-f8f8a5508bc444d2b829e2fc2c84473c2022-12-21T19:06:44ZengFrontiers Media S.A.Frontiers in Genetics1664-80212012-05-01310.3389/fgene.2012.0008021698Systems biology approach to identify gene network signatures for colorectal cancerMadhankumar eSonachalam0Jeffrey eShen1Hui eHuang2Xiaogang eWu3Indiana University-Purdue University IndianapolisIndiana University-Purdue University IndianapolisIndiana University-Purdue University IndianapolisIndiana University-Purdue University IndianapolisIn this work, we integrated prior knowledge from GWAS studies or gene signatures (curated gene sets from MSigDB and/or GeneSigDB), gene set enrichment analysis (GSEA), and gene/protein network modeling together to identify gene network signatures from microarray data. We demonstrated how to apply this approach into discovering gene network signatures for colorectal cancer (CRC) from microarray datasets at three levels - genome, transcriptome, and proteome. First, we use GSEA to analyze the microarray data through enriching differential genes in different CRC-related gene sets from two publicly-available up-to-date gene set databases - Molecular Signatures Database (MSigDB) and Gene Signatures Database (GeneSigDB). Second, we compare the enriched gene sets through enrichment score (ES), false-discovery rate (FDR) and nominal p-value. Third, we construct an integrated protein-protein interaction (PPI) network through connecting these enriched genes by using a human annotated and predicted protein interaction (HAPPI) database, with a confidence score (CS) labeled for each interaction. Finally, we map differential expression values onto the constructed network to build a comprehensive network model containing visualized genome, transcriptome, and proteome data. The results show that although MSigDB is more suitable for GSEA analysis than GeneSigDB, the integrated PPI network connecting the enriched genes from both MSigDB and GeneSigDB can provide more complete view for discovering gene signatures. We also find several important sub-network signatures for colorectal cancer, such as TP53 sub-network, PCNA sub-network and IL8 sub-network, corresponding to apoptosis, DNA repair, and immune response respectively.http://journal.frontiersin.org/Journal/10.3389/fgene.2012.00080/fullMicroarray Analysiscolorectal cancernetwork biologygene set enrichment analysisgene expression signatures
spellingShingle Madhankumar eSonachalam
Jeffrey eShen
Hui eHuang
Xiaogang eWu
Systems biology approach to identify gene network signatures for colorectal cancer
Frontiers in Genetics
Microarray Analysis
colorectal cancer
network biology
gene set enrichment analysis
gene expression signatures
title Systems biology approach to identify gene network signatures for colorectal cancer
title_full Systems biology approach to identify gene network signatures for colorectal cancer
title_fullStr Systems biology approach to identify gene network signatures for colorectal cancer
title_full_unstemmed Systems biology approach to identify gene network signatures for colorectal cancer
title_short Systems biology approach to identify gene network signatures for colorectal cancer
title_sort systems biology approach to identify gene network signatures for colorectal cancer
topic Microarray Analysis
colorectal cancer
network biology
gene set enrichment analysis
gene expression signatures
url http://journal.frontiersin.org/Journal/10.3389/fgene.2012.00080/full
work_keys_str_mv AT madhankumaresonachalam systemsbiologyapproachtoidentifygenenetworksignaturesforcolorectalcancer
AT jeffreyeshen systemsbiologyapproachtoidentifygenenetworksignaturesforcolorectalcancer
AT huiehuang systemsbiologyapproachtoidentifygenenetworksignaturesforcolorectalcancer
AT xiaogangewu systemsbiologyapproachtoidentifygenenetworksignaturesforcolorectalcancer