The mechanism and function of pervasive noncoding transcription in the mammalian genome

Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2014.

Bibliographic Details
Main Author: Wu, Xuebing, Ph. D. Massachusetts Institute of Technology
Other Authors: Phillip A. Sharp and Christopher B. Burge.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2015
Subjects:
Online Access:http://hdl.handle.net/1721.1/93042
_version_ 1826212746378608640
author Wu, Xuebing, Ph. D. Massachusetts Institute of Technology
author2 Phillip A. Sharp and Christopher B. Burge.
author_facet Phillip A. Sharp and Christopher B. Burge.
Wu, Xuebing, Ph. D. Massachusetts Institute of Technology
author_sort Wu, Xuebing, Ph. D. Massachusetts Institute of Technology
collection MIT
description Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2014.
first_indexed 2024-09-23T15:37:34Z
format Thesis
id mit-1721.1/93042
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T15:37:34Z
publishDate 2015
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/930422019-04-10T13:59:34Z The mechanism and function of pervasive noncoding transcription in the mammalian genome Wu, Xuebing, Ph. D. Massachusetts Institute of Technology Phillip A. Sharp and Christopher B. Burge. Massachusetts Institute of Technology. Computational and Systems Biology Program. Massachusetts Institute of Technology. Computational and Systems Biology Program. Computational and Systems Biology Program. Thesis: Ph. D., Massachusetts Institute of Technology, Computational and Systems Biology Program, 2014. Cataloged from PDF version of thesis. Includes bibliographical references The vast majority of the mammalian genome does not encode proteins. Only 2% of the genome is exonic, yet recent deep survey of human transcripitome suggested that 75% of the genome is transcribed, including half of the intergenic regions. Such pervasive transcription typically leads to short-lived, low-copy number noncoding RNAs (ncRNAs). We are starting to understand the biogenesis and mechanisms regulating the noncoding transcription. However, it is still unclear what's the functional impact of pervasive transcription and the ncRNAs at the level of the'genome, the cell, and the organism. A large fraction of ncRNAs in cells is generated by divergent transcription that occurs at the majority of mammalian gene promoters. RNA polymerases transcribe divergently on opposite strands, producing precursor mRNAs (pre-mRNAs) on one side and promoter upstream antisense RNAs (uaRNAs) on the other side. Like typical products of pervasive transcription, uaRNAs are relatively short and unstable as compared to pre-mRNAs, suggesting there are mechanisms suppressing uaRNA transcription and enforcing promoter directionality. We describe the Ul-PAS axis, a mechanism that enhances gene transcription but suppresses noncoding transcription. Two RNA processing signals, the Ul signal, or 5' splice site sequences recognized by Ul snRNP during splicing, and polyadenylation signal (PAS), differentially mark the two sides of gene transcription start site (TSS), ensuring the generation of full-length mRNA but inducing early termination of uaRNAs. The Ul-PAS axis also suppresses pervasive transcription on the antisense strand of genes, as well as intergenic transcription. Transcription is a mutagenic process that could accelerate evolution. We uncover a link between pervasive transcription and genome evolution. Specifically, transcription-induced mutational bias in germ cells could strengthen the Ul-PAS axis, which in turn enhances transcription, thus forming a positive feedback loop, which eventually drives new gene origination, and facilitates genome rearrangements. Tools to directly interfere with transcription with specificity are necessary to understand the function of noncoding transcription, especially when the RNA product is rapidly degraded or nonfunctional. The newly emerged CRISPR-Cas9 system provides the opportunity to target any desired locus. We comprehensively characterize the binding specificity of Cas9 in the mouse genome. We find that Cas9 specificity varies dramatically but in a predictable manner, depending on the seed sequence and chromatin accessibility. Our results will facilitate Cas9 target design and enable genome manipulation with high precision. by Xuebing Wu. Ph. D. 2015-01-20T17:57:09Z 2015-01-20T17:57:09Z 2014 2014 Thesis http://hdl.handle.net/1721.1/93042 899262974 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 211 pages application/pdf Massachusetts Institute of Technology
spellingShingle Computational and Systems Biology Program.
Wu, Xuebing, Ph. D. Massachusetts Institute of Technology
The mechanism and function of pervasive noncoding transcription in the mammalian genome
title The mechanism and function of pervasive noncoding transcription in the mammalian genome
title_full The mechanism and function of pervasive noncoding transcription in the mammalian genome
title_fullStr The mechanism and function of pervasive noncoding transcription in the mammalian genome
title_full_unstemmed The mechanism and function of pervasive noncoding transcription in the mammalian genome
title_short The mechanism and function of pervasive noncoding transcription in the mammalian genome
title_sort mechanism and function of pervasive noncoding transcription in the mammalian genome
topic Computational and Systems Biology Program.
url http://hdl.handle.net/1721.1/93042
work_keys_str_mv AT wuxuebingphdmassachusettsinstituteoftechnology themechanismandfunctionofpervasivenoncodingtranscriptioninthemammaliangenome
AT wuxuebingphdmassachusettsinstituteoftechnology mechanismandfunctionofpervasivenoncodingtranscriptioninthemammaliangenome