BayesPeak: Bayesian analysis of ChIP-seq data

Abstract Background High-throughput sequencing technology has become popular and widely used to study protein and DNA interactions. Chromatin immunoprecipitation, followed by sequencing of the resulting samples, produces large amounts of data that can b...

Full description

Bibliographic Details
Main Authors:	Stark Rory, Spyrou Christiana, Lynch Andy G, Tavaré Simon
Format:	Article
Language:	English
Published:	BMC 2009-09-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/10/299

_version_	1818525783574970368
author	Stark Rory Spyrou Christiana Lynch Andy G Tavaré Simon
author_facet	Stark Rory Spyrou Christiana Lynch Andy G Tavaré Simon
author_sort	Stark Rory
collection	DOAJ
description	<p>Abstract</p> <p>Background</p> <p>High-throughput sequencing technology has become popular and widely used to study protein and DNA interactions. Chromatin immunoprecipitation, followed by sequencing of the resulting samples, produces large amounts of data that can be used to map genomic features such as transcription factor binding sites and histone modifications.</p> <p>Methods</p> <p>Our proposed statistical algorithm, BayesPeak, uses a fully Bayesian hidden Markov model to detect enriched locations in the genome. The structure accommodates the natural features of the Solexa/Illumina sequencing data and allows for overdispersion in the abundance of reads in different regions. Moreover, a control sample can be incorporated in the analysis to account for experimental and sequence biases. Markov chain Monte Carlo algorithms are applied to estimate the posterior distributions of the model parameters, and posterior probabilities are used to detect the sites of interest.</p> <p>Conclusion</p> <p>We have presented a flexible approach for identifying peaks from ChIP-seq reads, suitable for use on both transcription factor binding and histone modification data. Our method estimates probabilities of enrichment that can be used in downstream analysis. The method is assessed using experimentally verified data and is shown to provide high-confidence calls with low false positive rates.</p>
first_indexed	2024-12-11T06:13:38Z
format	Article
id	doaj.art-a16b67fd622344cda05d20623f6df8b5
institution	Directory Open Access Journal
issn	1471-2105
language	English
last_indexed	2024-12-11T06:13:38Z
publishDate	2009-09-01
publisher	BMC
record_format	Article
series	BMC Bioinformatics
spelling	doaj.art-a16b67fd622344cda05d20623f6df8b52022-12-22T01:18:03ZengBMCBMC Bioinformatics1471-21052009-09-0110129910.1186/1471-2105-10-299BayesPeak: Bayesian analysis of ChIP-seq dataStark RorySpyrou ChristianaLynch Andy GTavaré Simon<p>Abstract</p> <p>Background</p> <p>High-throughput sequencing technology has become popular and widely used to study protein and DNA interactions. Chromatin immunoprecipitation, followed by sequencing of the resulting samples, produces large amounts of data that can be used to map genomic features such as transcription factor binding sites and histone modifications.</p> <p>Methods</p> <p>Our proposed statistical algorithm, BayesPeak, uses a fully Bayesian hidden Markov model to detect enriched locations in the genome. The structure accommodates the natural features of the Solexa/Illumina sequencing data and allows for overdispersion in the abundance of reads in different regions. Moreover, a control sample can be incorporated in the analysis to account for experimental and sequence biases. Markov chain Monte Carlo algorithms are applied to estimate the posterior distributions of the model parameters, and posterior probabilities are used to detect the sites of interest.</p> <p>Conclusion</p> <p>We have presented a flexible approach for identifying peaks from ChIP-seq reads, suitable for use on both transcription factor binding and histone modification data. Our method estimates probabilities of enrichment that can be used in downstream analysis. The method is assessed using experimentally verified data and is shown to provide high-confidence calls with low false positive rates.</p>http://www.biomedcentral.com/1471-2105/10/299
spellingShingle	Stark Rory Spyrou Christiana Lynch Andy G Tavaré Simon BayesPeak: Bayesian analysis of ChIP-seq data BMC Bioinformatics
title	BayesPeak: Bayesian analysis of ChIP-seq data
title_full	BayesPeak: Bayesian analysis of ChIP-seq data
title_fullStr	BayesPeak: Bayesian analysis of ChIP-seq data
title_full_unstemmed	BayesPeak: Bayesian analysis of ChIP-seq data
title_short	BayesPeak: Bayesian analysis of ChIP-seq data
title_sort	bayespeak bayesian analysis of chip seq data
url	http://www.biomedcentral.com/1471-2105/10/299
work_keys_str_mv	AT starkrory bayespeakbayesiananalysisofchipseqdata AT spyrouchristiana bayespeakbayesiananalysisofchipseqdata AT lynchandyg bayespeakbayesiananalysisofchipseqdata AT tavaresimon bayespeakbayesiananalysisofchipseqdata

BayesPeak: Bayesian analysis of ChIP-seq data

Similar Items