A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.
Repetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity...
Main Authors: | , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2016-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4945064?pdf=render |
_version_ | 1828382178063417344 |
---|---|
author | Tianyuan Wang Janine H Santos Jian Feng David C Fargo Li Shen Gonzalo Riadi Elizabeth Keeley Zachary S Rosh Eric J Nestler Richard P Woychik |
author_facet | Tianyuan Wang Janine H Santos Jian Feng David C Fargo Li Shen Gonzalo Riadi Elizabeth Keeley Zachary S Rosh Eric J Nestler Richard P Woychik |
author_sort | Tianyuan Wang |
collection | DOAJ |
description | Repetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity/type 2 diabetes. To determine the frequency of FTs genome-wide, we developed a TopHat-Fusion-based analytical pipeline to identify FTs with high specificity. We applied it to an RNA-seq dataset from the nucleus accumbens (NAc) of mice repeatedly exposed to cocaine. Cocaine was previously shown to increase the expression of certain REs in this brain region. Using this pipeline that can be applied to single- or paired-end reads, we identified 438 genes expressing 813 different FTs in the NAc. Although all types of studied repeats were present in FTs, simple sequence repeats were underrepresented. Most importantly, reverse-transcription and quantitative PCR validated the expression of selected FTs in an independent cohort of animals, which also revealed that some FTs are the prominent isoforms expressed in the NAc by some genes. In other RNA-seq datasets, developmental expression as well as tissue specificity of some FTs differed from their corresponding non-fusion counterparts. Finally, in silico analysis predicted changes in the structure of proteins encoded by some FTs, potentially resulting in gain or loss of function. Collectively, these results indicate the robustness of our pipeline in detecting these new isoforms of genes, which we believe provides a valuable tool to aid in better understanding the broad role of REs in mammalian cellular biology. |
first_indexed | 2024-12-10T04:27:52Z |
format | Article |
id | doaj.art-15d56a31a03a4d48a3ac2c70ab092a7d |
institution | Directory Open Access Journal |
issn | 1932-6203 |
language | English |
last_indexed | 2024-12-10T04:27:52Z |
publishDate | 2016-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS ONE |
spelling | doaj.art-15d56a31a03a4d48a3ac2c70ab092a7d2022-12-22T02:02:14ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01117e015902810.1371/journal.pone.0159028A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq.Tianyuan WangJanine H SantosJian FengDavid C FargoLi ShenGonzalo RiadiElizabeth KeeleyZachary S RoshEric J NestlerRichard P WoychikRepetitive elements (REs) comprise 40-60% of the mammalian genome and have been shown to epigenetically influence the expression of genes through the formation of fusion transcript (FTs). We previously showed that an intracisternal A particle forms an FT with the agouti gene in mice, causing obesity/type 2 diabetes. To determine the frequency of FTs genome-wide, we developed a TopHat-Fusion-based analytical pipeline to identify FTs with high specificity. We applied it to an RNA-seq dataset from the nucleus accumbens (NAc) of mice repeatedly exposed to cocaine. Cocaine was previously shown to increase the expression of certain REs in this brain region. Using this pipeline that can be applied to single- or paired-end reads, we identified 438 genes expressing 813 different FTs in the NAc. Although all types of studied repeats were present in FTs, simple sequence repeats were underrepresented. Most importantly, reverse-transcription and quantitative PCR validated the expression of selected FTs in an independent cohort of animals, which also revealed that some FTs are the prominent isoforms expressed in the NAc by some genes. In other RNA-seq datasets, developmental expression as well as tissue specificity of some FTs differed from their corresponding non-fusion counterparts. Finally, in silico analysis predicted changes in the structure of proteins encoded by some FTs, potentially resulting in gain or loss of function. Collectively, these results indicate the robustness of our pipeline in detecting these new isoforms of genes, which we believe provides a valuable tool to aid in better understanding the broad role of REs in mammalian cellular biology.http://europepmc.org/articles/PMC4945064?pdf=render |
spellingShingle | Tianyuan Wang Janine H Santos Jian Feng David C Fargo Li Shen Gonzalo Riadi Elizabeth Keeley Zachary S Rosh Eric J Nestler Richard P Woychik A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. PLoS ONE |
title | A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. |
title_full | A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. |
title_fullStr | A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. |
title_full_unstemmed | A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. |
title_short | A Novel Analytical Strategy to Identify Fusion Transcripts between Repetitive Elements and Protein Coding-Exons Using RNA-Seq. |
title_sort | novel analytical strategy to identify fusion transcripts between repetitive elements and protein coding exons using rna seq |
url | http://europepmc.org/articles/PMC4945064?pdf=render |
work_keys_str_mv | AT tianyuanwang anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT janinehsantos anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT jianfeng anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT davidcfargo anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT lishen anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT gonzaloriadi anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT elizabethkeeley anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT zacharysrosh anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT ericjnestler anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT richardpwoychik anovelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT tianyuanwang novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT janinehsantos novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT jianfeng novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT davidcfargo novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT lishen novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT gonzaloriadi novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT elizabethkeeley novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT zacharysrosh novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT ericjnestler novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq AT richardpwoychik novelanalyticalstrategytoidentifyfusiontranscriptsbetweenrepetitiveelementsandproteincodingexonsusingrnaseq |