Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation

Motivation: Gene expression profiling using RNA-seq is a powerful technique for screening RNA species’ landscapes and their dynamics in an unbiased way. While several advanced methods exist for differential expression analysis of RNA-seq data, proper tools to anal.yze RNA-seq time-course have not be...

Full description

Bibliographic Details
Main Authors: Aiho, Tarmo, Chen, Zhi, Salo, Verna, Tripathi, Subhash, Lahesmaa, Riitta, Lahdesmaki, Harri, Butty, Vincent L G, Burge, Christopher B
Other Authors: Massachusetts Institute of Technology. Department of Biology
Format: Article
Language:en_US
Published: Oxford University Press 2015
Online Access:http://hdl.handle.net/1721.1/96263
_version_ 1811080304578789376
author Aiho, Tarmo
Chen, Zhi
Salo, Verna
Tripathi, Subhash
Lahesmaa, Riitta
Lahdesmaki, Harri
Butty, Vincent L G
Burge, Christopher B
author2 Massachusetts Institute of Technology. Department of Biology
author_facet Massachusetts Institute of Technology. Department of Biology
Aiho, Tarmo
Chen, Zhi
Salo, Verna
Tripathi, Subhash
Lahesmaa, Riitta
Lahdesmaki, Harri
Butty, Vincent L G
Burge, Christopher B
author_sort Aiho, Tarmo
collection MIT
description Motivation: Gene expression profiling using RNA-seq is a powerful technique for screening RNA species’ landscapes and their dynamics in an unbiased way. While several advanced methods exist for differential expression analysis of RNA-seq data, proper tools to anal.yze RNA-seq time-course have not been proposed. Results: In this study, we use RNA-seq to measure gene expression during the early human T helper 17 (Th17) cell differentiation and T-cell activation (Th0). To quantify Th17-specific gene expression dynamics, we present a novel statistical methodology, DyNB, for analyzing time-course RNA-seq data. We use non-parametric Gaussian processes to model temporal correlation in gene expression and combine that with negative binomial likelihood for the count data. To account for experiment-specific biases in gene expression dynamics, such as differences in cell differentiation efficiencies, we propose a method to rescale the dynamics between replicated measurements. We develop an MCMC sampling method to make inference of differential expression dynamics between conditions. DyNB identifies several known and novel genes involved in Th17 differentiation. Analysis of differentiation efficiencies revealed consistent patterns in gene expression dynamics between different cultures. We use qRT-PCR to validate differential expression and differentiation efficiencies for selected genes. Comparison of the results with those obtained via traditional timepoint-wise analysis shows that time-course analysis together with time rescaling between cultures identifies differentially expressed genes which would not otherwise be detected. Availability: An implementation of the proposed computational methods will be available at http://research.ics.aalto.fi/csb/software/
first_indexed 2024-09-23T11:29:00Z
format Article
id mit-1721.1/96263
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T11:29:00Z
publishDate 2015
publisher Oxford University Press
record_format dspace
spelling mit-1721.1/962632022-10-01T03:56:26Z Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation Aiho, Tarmo Chen, Zhi Salo, Verna Tripathi, Subhash Lahesmaa, Riitta Lahdesmaki, Harri Butty, Vincent L G Burge, Christopher B Massachusetts Institute of Technology. Department of Biology Butty, Vincent Burge, Christopher B. Motivation: Gene expression profiling using RNA-seq is a powerful technique for screening RNA species’ landscapes and their dynamics in an unbiased way. While several advanced methods exist for differential expression analysis of RNA-seq data, proper tools to anal.yze RNA-seq time-course have not been proposed. Results: In this study, we use RNA-seq to measure gene expression during the early human T helper 17 (Th17) cell differentiation and T-cell activation (Th0). To quantify Th17-specific gene expression dynamics, we present a novel statistical methodology, DyNB, for analyzing time-course RNA-seq data. We use non-parametric Gaussian processes to model temporal correlation in gene expression and combine that with negative binomial likelihood for the count data. To account for experiment-specific biases in gene expression dynamics, such as differences in cell differentiation efficiencies, we propose a method to rescale the dynamics between replicated measurements. We develop an MCMC sampling method to make inference of differential expression dynamics between conditions. DyNB identifies several known and novel genes involved in Th17 differentiation. Analysis of differentiation efficiencies revealed consistent patterns in gene expression dynamics between different cultures. We use qRT-PCR to validate differential expression and differentiation efficiencies for selected genes. Comparison of the results with those obtained via traditional timepoint-wise analysis shows that time-course analysis together with time rescaling between cultures identifies differentially expressed genes which would not otherwise be detected. Availability: An implementation of the proposed computational methods will be available at http://research.ics.aalto.fi/csb/software/ Academy of Finland (Centre of Excellence in Moleculary Systems Immunology and Physiology Research (2012-2017) Grant 135320) Seventh Framework Programme (European Commission) (Grant EC-FP7-SYBILLA-201106) EU ERASysBio ERA-NET Sigrid Juslius Foundation FICS Graduate School 2015-03-30T19:32:49Z 2015-03-30T19:32:49Z 2014-06 Article http://purl.org/eprint/type/JournalArticle 1367-4803 1460-2059 http://hdl.handle.net/1721.1/96263 Aijo, T., V. Butty, Z. Chen, V. Salo, S. Tripathi, C. B. Burge, R. Lahesmaa, and H. Lahdesmaki. “Methods for Time Series Analysis of RNA-Seq Data with Application to Human Th17 Cell Differentiation.” Bioinformatics 30, no. 12 (June 15, 2014): i113–i120. en_US http://dx.doi.org/10.1093/bioinformatics/btu274 Bioinformatics Creative Commons Attribution http://creativecommons.org/licenses/by/3.0/ application/pdf Oxford University Press Bioinformatics
spellingShingle Aiho, Tarmo
Chen, Zhi
Salo, Verna
Tripathi, Subhash
Lahesmaa, Riitta
Lahdesmaki, Harri
Butty, Vincent L G
Burge, Christopher B
Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title_full Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title_fullStr Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title_full_unstemmed Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title_short Methods for time series analysis of RNA-seq data with application to human Th17 cell differentiation
title_sort methods for time series analysis of rna seq data with application to human th17 cell differentiation
url http://hdl.handle.net/1721.1/96263
work_keys_str_mv AT aihotarmo methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT chenzhi methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT saloverna methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT tripathisubhash methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT lahesmaariitta methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT lahdesmakiharri methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT buttyvincentlg methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation
AT burgechristopherb methodsfortimeseriesanalysisofrnaseqdatawithapplicationtohumanth17celldifferentiation