Learning transcriptional regulatory relationships using sparse graphical models.
Understanding the organization and function of transcriptional regulatory networks by analyzing high-throughput gene expression profiles is a key problem in computational biology. The challenges in this work are 1) the lack of complete knowledge of the regulatory relationship between the regulators...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3346750?pdf=render |
_version_ | 1818035508951908352 |
---|---|
author | Xiang Zhang Wei Cheng Jennifer Listgarten Carl Kadie Shunping Huang Wei Wang David Heckerman |
author_facet | Xiang Zhang Wei Cheng Jennifer Listgarten Carl Kadie Shunping Huang Wei Wang David Heckerman |
author_sort | Xiang Zhang |
collection | DOAJ |
description | Understanding the organization and function of transcriptional regulatory networks by analyzing high-throughput gene expression profiles is a key problem in computational biology. The challenges in this work are 1) the lack of complete knowledge of the regulatory relationship between the regulators and the associated genes, 2) the potential for spurious associations due to confounding factors, and 3) the number of parameters to learn is usually larger than the number of available microarray experiments. We present a sparse (L1 regularized) graphical model to address these challenges. Our model incorporates known transcription factors and introduces hidden variables to represent possible unknown transcription and confounding factors. The expression level of a gene is modeled as a linear combination of the expression levels of known transcription factors and hidden factors. Using gene expression data covering 39,296 oligonucleotide probes from 1109 human liver samples, we demonstrate that our model better predicts out-of-sample data than a model with no hidden variables. We also show that some of the gene sets associated with hidden variables are strongly correlated with Gene Ontology categories. The software including source code is available at http://grnl1.codeplex.com. |
first_indexed | 2024-12-10T06:56:11Z |
format | Article |
id | doaj.art-6c3b8ada5bb84cd2af120c56aa23b8ef |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-10T06:56:11Z |
publishDate | 2012-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-6c3b8ada5bb84cd2af120c56aa23b8ef2022-12-22T01:58:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0175e3576210.1371/journal.pone.0035762Learning transcriptional regulatory relationships using sparse graphical models.Xiang ZhangWei ChengJennifer ListgartenCarl KadieShunping HuangWei WangDavid HeckermanUnderstanding the organization and function of transcriptional regulatory networks by analyzing high-throughput gene expression profiles is a key problem in computational biology. The challenges in this work are 1) the lack of complete knowledge of the regulatory relationship between the regulators and the associated genes, 2) the potential for spurious associations due to confounding factors, and 3) the number of parameters to learn is usually larger than the number of available microarray experiments. We present a sparse (L1 regularized) graphical model to address these challenges. Our model incorporates known transcription factors and introduces hidden variables to represent possible unknown transcription and confounding factors. The expression level of a gene is modeled as a linear combination of the expression levels of known transcription factors and hidden factors. Using gene expression data covering 39,296 oligonucleotide probes from 1109 human liver samples, we demonstrate that our model better predicts out-of-sample data than a model with no hidden variables. We also show that some of the gene sets associated with hidden variables are strongly correlated with Gene Ontology categories. The software including source code is available at http://grnl1.codeplex.com.http://europepmc.org/articles/PMC3346750?pdf=render |
spellingShingle | Xiang Zhang Wei Cheng Jennifer Listgarten Carl Kadie Shunping Huang Wei Wang David Heckerman Learning transcriptional regulatory relationships using sparse graphical models. PLoS ONE |
title | Learning transcriptional regulatory relationships using sparse graphical models. |
title_full | Learning transcriptional regulatory relationships using sparse graphical models. |
title_fullStr | Learning transcriptional regulatory relationships using sparse graphical models. |
title_full_unstemmed | Learning transcriptional regulatory relationships using sparse graphical models. |
title_short | Learning transcriptional regulatory relationships using sparse graphical models. |
title_sort | learning transcriptional regulatory relationships using sparse graphical models |
url | http://europepmc.org/articles/PMC3346750?pdf=render |
work_keys_str_mv | AT xiangzhang learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT weicheng learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT jenniferlistgarten learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT carlkadie learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT shunpinghuang learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT weiwang learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels AT davidheckerman learningtranscriptionalregulatoryrelationshipsusingsparsegraphicalmodels |