High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression

Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015.

Bibliographic Details
Main Author: Koppstein, David N. P. (David Neal Pira)
Other Authors: David P. Bartel.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2015
Subjects:
Online Access:http://hdl.handle.net/1721.1/98629
_version_ 1826193523727138816
author Koppstein, David N. P. (David Neal Pira)
author2 David P. Bartel.
author_facet David P. Bartel.
Koppstein, David N. P. (David Neal Pira)
author_sort Koppstein, David N. P. (David Neal Pira)
collection MIT
description Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015.
first_indexed 2024-09-23T09:40:29Z
format Thesis
id mit-1721.1/98629
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T09:40:29Z
publishDate 2015
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/986292019-04-10T14:54:43Z High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression Koppstein, David N. P. (David Neal Pira) David P. Bartel. Massachusetts Institute of Technology. Department of Biology. Massachusetts Institute of Technology. Department of Biology. Biology. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015. Cataloged from PDF version of thesis. Includes bibliographical references. Next-generation sequencing techniques are unparalleled in their resolution and dynamic range, but are limited by read depletion at transcript ends. Protocols that specifically target these ends overcome this limitation and enable the study of biological phenomena that would otherwise prove refractory to RNA-Seq. Here, we use two such techniques to study heterogeneous sequences at the 5' ends of influenza transcripts and alternative polyadenylation at the 3' ends of vertebrate transcripts. The 5' ends of influenza mRNAs include heterogeneous sequences derived from host RNAs. In a process termed cap snatching, the viral polymerase cleaves host RNAs ~10-13 nucleotides downstream of their caps and uses the resulting fragments to prime viral transcription. High-throughput 5' rapid amplification of cDNA ends resulted in 54 million chimeric reads containing host-derived leaders. These sequences provided evidence for stuttering during transcription initiation and an influence of the viral template on the extent of realignment. Accounting for realignment suggested a common preference by the polymerase irrespective of the viral template, and suggested that a single base pair is sufficient to prime transcription. Mapping trimmed leaders to annotated transcription start sites (TSSs) revealed that the most abundant leaders correspond to small nuclear RNAs, consistent with cap snatching of nascent transcripts. The 3' ends of mRNAs are generally appended with a poly(A) tail, but alternative polyadenylation sites may vary depending on cellular context. 3P-Seq is a method that specifically captures alternative polyadenylation sites without relying on oligo(dT) priming, which may cause artifacts. Applying 3P-Seq to eukaryotic model organisms improved their gene annotations and provided insight into targeting by microRNAs, a class of ~21-23 nucleotide RNAs that mediate mRNA destabilization. The isoform ratios of transcripts containing miR-155 sites were predictive of the extent to which these transcripts would respond to miR-155 transfection. Conversely, knocking out miR-22 in mice specifically upregulated isoforms containing miR-22 sites, suggesting that microRNAs reciprocally affect the 3'-UTR landscape. Lastly, analysis of other datasets derived from zebrafish embryos revealed broad lengthening of 3'-UTR isoforms during development and noncanonical polyadenylation during the maternal-to-zygotic transition. by David N.P. Koppstein. Ph. D. 2015-09-17T19:00:48Z 2015-09-17T19:00:48Z 2015 2015 Thesis http://hdl.handle.net/1721.1/98629 920672621 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 259 pages application/pdf Massachusetts Institute of Technology
spellingShingle Biology.
Koppstein, David N. P. (David Neal Pira)
High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title_full High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title_fullStr High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title_full_unstemmed High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title_short High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
title_sort high throughput sequencing of rna 5 and 3 termini yields insights into viral and vertebrate gene expression
topic Biology.
url http://hdl.handle.net/1721.1/98629
work_keys_str_mv AT koppsteindavidnpdavidnealpira highthroughputsequencingofrna5and3terminiyieldsinsightsintoviralandvertebrategeneexpression