High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/98629 |
_version_ | 1826193523727138816 |
---|---|
author | Koppstein, David N. P. (David Neal Pira) |
author2 | David P. Bartel. |
author_facet | David P. Bartel. Koppstein, David N. P. (David Neal Pira) |
author_sort | Koppstein, David N. P. (David Neal Pira) |
collection | MIT |
description | Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015. |
first_indexed | 2024-09-23T09:40:29Z |
format | Thesis |
id | mit-1721.1/98629 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T09:40:29Z |
publishDate | 2015 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/986292019-04-10T14:54:43Z High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression Koppstein, David N. P. (David Neal Pira) David P. Bartel. Massachusetts Institute of Technology. Department of Biology. Massachusetts Institute of Technology. Department of Biology. Biology. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Biology, 2015. Cataloged from PDF version of thesis. Includes bibliographical references. Next-generation sequencing techniques are unparalleled in their resolution and dynamic range, but are limited by read depletion at transcript ends. Protocols that specifically target these ends overcome this limitation and enable the study of biological phenomena that would otherwise prove refractory to RNA-Seq. Here, we use two such techniques to study heterogeneous sequences at the 5' ends of influenza transcripts and alternative polyadenylation at the 3' ends of vertebrate transcripts. The 5' ends of influenza mRNAs include heterogeneous sequences derived from host RNAs. In a process termed cap snatching, the viral polymerase cleaves host RNAs ~10-13 nucleotides downstream of their caps and uses the resulting fragments to prime viral transcription. High-throughput 5' rapid amplification of cDNA ends resulted in 54 million chimeric reads containing host-derived leaders. These sequences provided evidence for stuttering during transcription initiation and an influence of the viral template on the extent of realignment. Accounting for realignment suggested a common preference by the polymerase irrespective of the viral template, and suggested that a single base pair is sufficient to prime transcription. Mapping trimmed leaders to annotated transcription start sites (TSSs) revealed that the most abundant leaders correspond to small nuclear RNAs, consistent with cap snatching of nascent transcripts. The 3' ends of mRNAs are generally appended with a poly(A) tail, but alternative polyadenylation sites may vary depending on cellular context. 3P-Seq is a method that specifically captures alternative polyadenylation sites without relying on oligo(dT) priming, which may cause artifacts. Applying 3P-Seq to eukaryotic model organisms improved their gene annotations and provided insight into targeting by microRNAs, a class of ~21-23 nucleotide RNAs that mediate mRNA destabilization. The isoform ratios of transcripts containing miR-155 sites were predictive of the extent to which these transcripts would respond to miR-155 transfection. Conversely, knocking out miR-22 in mice specifically upregulated isoforms containing miR-22 sites, suggesting that microRNAs reciprocally affect the 3'-UTR landscape. Lastly, analysis of other datasets derived from zebrafish embryos revealed broad lengthening of 3'-UTR isoforms during development and noncanonical polyadenylation during the maternal-to-zygotic transition. by David N.P. Koppstein. Ph. D. 2015-09-17T19:00:48Z 2015-09-17T19:00:48Z 2015 2015 Thesis http://hdl.handle.net/1721.1/98629 920672621 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 259 pages application/pdf Massachusetts Institute of Technology |
spellingShingle | Biology. Koppstein, David N. P. (David Neal Pira) High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title | High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title_full | High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title_fullStr | High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title_full_unstemmed | High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title_short | High-throughput sequencing of RNA 5'- and 3'-termini yields insights into viral and vertebrate gene expression |
title_sort | high throughput sequencing of rna 5 and 3 termini yields insights into viral and vertebrate gene expression |
topic | Biology. |
url | http://hdl.handle.net/1721.1/98629 |
work_keys_str_mv | AT koppsteindavidnpdavidnealpira highthroughputsequencingofrna5and3terminiyieldsinsightsintoviralandvertebrategeneexpression |