High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints

An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal ke...

Full description

Bibliographic Details
Main Authors: Guo, Yuchun, Mahony, Shaun
Other Authors: Massachusetts Institute of Technology. Computational and Systems Biology Program
Format: Article
Language:en_US
Published: Public Library of Science 2012
Online Access:http://hdl.handle.net/1721.1/74577
https://orcid.org/0000-0003-2357-1546
_version_ 1811074876163751936
author Guo, Yuchun
Mahony, Shaun
author2 Massachusetts Institute of Technology. Computational and Systems Biology Program
author_facet Massachusetts Institute of Technology. Computational and Systems Biology Program
Guo, Yuchun
Mahony, Shaun
author_sort Guo, Yuchun
collection MIT
description An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal key aspects of this genomic regulatory language. To discover novel transcription factor spatial binding constraints in vivo, we developed a new integrative computational method, genome wide event finding and motif discovery (GEM). GEM resolves ChIP data into explanatory motifs and binding events at high spatial resolution by linking binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. GEM analysis of 63 transcription factors in 214 ENCODE human ChIP-Seq experiments recovers more known factor motifs than other contemporary methods, and discovers six new motifs for factors with unknown binding specificity. GEM's adaptive learning of binding-event read distributions allows it to further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of closely spaced binding events of the same factor. In a systematic analysis of in vivo sequence-specific transcription factor binding using GEM, we have found hundreds of spatial binding constraints between factors. GEM found 37 examples of factor binding constraints in mouse ES cells, including strong distance-specific constraints between Klf4 and other key regulatory factors. In human ENCODE data, GEM found 390 examples of spatially constrained pair-wise binding, including such novel pairs as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4A/FOXA1. The discovery of new factor-factor spatial constraints in ChIP data is significant because it proposes testable models for regulatory factor interactions that will help elucidate genome function and the implementation of combinatorial control.
first_indexed 2024-09-23T09:56:45Z
format Article
id mit-1721.1/74577
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T09:56:45Z
publishDate 2012
publisher Public Library of Science
record_format dspace
spelling mit-1721.1/745772022-09-30T17:51:39Z High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints Guo, Yuchun Mahony, Shaun Massachusetts Institute of Technology. Computational and Systems Biology Program Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Guo, Yuchun Mahony, Shaun An essential component of genome function is the syntax of genomic regulatory elements that determine how diverse transcription factors interact to orchestrate a program of regulatory control. A precise characterization of in vivo spacing constraints between key transcription factors would reveal key aspects of this genomic regulatory language. To discover novel transcription factor spatial binding constraints in vivo, we developed a new integrative computational method, genome wide event finding and motif discovery (GEM). GEM resolves ChIP data into explanatory motifs and binding events at high spatial resolution by linking binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. GEM analysis of 63 transcription factors in 214 ENCODE human ChIP-Seq experiments recovers more known factor motifs than other contemporary methods, and discovers six new motifs for factors with unknown binding specificity. GEM's adaptive learning of binding-event read distributions allows it to further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of closely spaced binding events of the same factor. In a systematic analysis of in vivo sequence-specific transcription factor binding using GEM, we have found hundreds of spatial binding constraints between factors. GEM found 37 examples of factor binding constraints in mouse ES cells, including strong distance-specific constraints between Klf4 and other key regulatory factors. In human ENCODE data, GEM found 390 examples of spatially constrained pair-wise binding, including such novel pairs as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4A/FOXA1. The discovery of new factor-factor spatial constraints in ChIP data is significant because it proposes testable models for regulatory factor interactions that will help elucidate genome function and the implementation of combinatorial control. 2012-11-06T17:25:14Z 2012-11-06T17:25:14Z 2012-08 2012-03 Article http://purl.org/eprint/type/JournalArticle 1553-734X 1553-7358 http://hdl.handle.net/1721.1/74577 Guo, Yuchun, Shaun Mahony, and David K. Gifford. “High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints.” Ed. Stein Aerts. PLoS Computational Biology 8.8 (2012): e1002638. https://orcid.org/0000-0003-2357-1546 en_US http://dx.doi.org/10.1371/journal.pcbi.1002638 PLoS Computational Biology Creative Commons Attribution http://creativecommons.org/licenses/by/2.5/ application/pdf Public Library of Science PLoS
spellingShingle Guo, Yuchun
Mahony, Shaun
High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title_full High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title_fullStr High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title_full_unstemmed High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title_short High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints
title_sort high resolution genome wide binding event finding and motif discovery reveals transcription factor spatial binding constraints
url http://hdl.handle.net/1721.1/74577
https://orcid.org/0000-0003-2357-1546
work_keys_str_mv AT guoyuchun highresolutiongenomewidebindingeventfindingandmotifdiscoveryrevealstranscriptionfactorspatialbindingconstraints
AT mahonyshaun highresolutiongenomewidebindingeventfindingandmotifdiscoveryrevealstranscriptionfactorspatialbindingconstraints