A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.

Repetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity...

Full description

Bibliographic Details
Main Authors: Tianyuan Wang, Janine H Santos, Jian Feng, David C Fargo, Li Shen, Gonzalo Riadi, Elizabeth Keeley, Zachary S Rosh, Eric J Nestler, Richard P Woychik
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4945064?pdf=render
_version_ 1828382178063417344
author Tianyuan Wang
Janine H Santos
Jian Feng
David C Fargo
Li Shen
Gonzalo Riadi
Elizabeth Keeley
Zachary S Rosh
Eric J Nestler
Richard P Woychik
author_facet Tianyuan Wang
Janine H Santos
Jian Feng
David C Fargo
Li Shen
Gonzalo Riadi
Elizabeth Keeley
Zachary S Rosh
Eric J Nestler
Richard P Woychik
author_sort Tianyuan Wang
collection DOAJ
description Repetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity/type 2 diabetes. To determine the frequency of FTs genome-wide, we developed a TopHat-Fusion-based analytical pipeline to identify FTs with high specificity. We applied it to an RNA-seq dataset from the nucleus accumbens (NAc) of mice repeatedly exposed to cocaine. Cocaine was previously shown to increase the expression of certain REs in this brain region. Using this pipeline that can be applied to single- or paired-end reads, we identified 438 genes expressing 813 different FTs in the NAc. Although all types of studied repeats were present in FTs, simple sequence repeats were underrepresented. Most importantly, reverse-transcription and quantitative PCR validated the expression of selected FTs in an independent cohort of animals, which also revealed that some FTs are the prominent isoforms expressed in the NAc by some genes. In other RNA-seq datasets, developmental expression as well as tissue specificity of some FTs differed from their corresponding non-fusion counterparts. Finally, in silico analysis predicted changes in the structure of proteins encoded by some FTs, potentially resulting in gain or loss of function. Collectively, these results indicate the robustness of our pipeline in detecting these new isoforms of genes, which we believe provides a valuable tool to aid in better understanding the broad role of REs in mammalian cellular biology.
first_indexed 2024-12-10T04:27:52Z
format Article
id doaj.art-15d56a31a03a4d48a3ac2c70ab092a7d
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-10T04:27:52Z
publishDate 2016-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-15d56a31a03a4d48a3ac2c70ab092a7d2022-12-22T02:02:14ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01117e015902810.1371/journal.pone.0159028A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.Tianyuan WangJanine H SantosJian FengDavid C FargoLi ShenGonzalo RiadiElizabeth KeeleyZachary S RoshEric J NestlerRichard P WoychikRepetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity/type 2 diabetes. To determine the frequency of FTs genome-wide, we developed a TopHat-Fusion-based analytical pipeline to identify FTs with high specificity. We applied it to an RNA-seq dataset from the nucleus accumbens (NAc) of mice repeatedly exposed to cocaine. Cocaine was previously shown to increase the expression of certain REs in this brain region. Using this pipeline that can be applied to single- or paired-end reads, we identified 438 genes expressing 813 different FTs in the NAc. Although all types of studied repeats were present in FTs, simple sequence repeats were underrepresented. Most importantly, reverse-transcription and quantitative PCR validated the expression of selected FTs in an independent cohort of animals, which also revealed that some FTs are the prominent isoforms expressed in the NAc by some genes. In other RNA-seq datasets, developmental expression as well as tissue specificity of some FTs differed from their corresponding non-fusion counterparts. Finally, in silico analysis predicted changes in the structure of proteins encoded by some FTs, potentially resulting in gain or loss of function. Collectively, these results indicate the robustness of our pipeline in detecting these new isoforms of genes, which we believe provides a valuable tool to aid in better understanding the broad role of REs in mammalian cellular biology.http://europepmc.org/articles/PMC4945064?pdf=render
spellingShingle Tianyuan Wang
Janine H Santos
Jian Feng
David C Fargo
Li Shen
Gonzalo Riadi
Elizabeth Keeley
Zachary S Rosh
Eric J Nestler
Richard P Woychik
A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
PLoS ONE
title A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
title_full A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
title_fullStr A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
title_full_unstemmed A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
title_short A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
title_sort novel analytical strategy to identify fusion transcripts between repetitive elements and protein coding exons using rna seq
url http://europepmc.org/articles/PMC4945064?pdf=render
work_keys_str_mv AT tianyuanwang anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT janinehsantos anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT jianfeng anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT davidcfargo anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT lishen anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT gonzaloriadi anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT elizabethkeeley anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT zacharysrosh anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT ericjnestler anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT richardpwoychik anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT tianyuanwang novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT janinehsantos novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT jianfeng novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT davidcfargo novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT lishen novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT gonzaloriadi novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT elizabethkeeley novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT zacharysrosh novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT ericjnestler novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq
AT richardpwoychik novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq