RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’

Abstract Background The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in highly heterozygous species such as grapevine (Vitis vinifera L.). This work is an attempt to utilize a de novo-assembled transcriptome of the V....

Full description

Bibliographic Details
Main Authors: Zachary N. Harris, Laszlo G. Kovacs, Jason P. Londo
Format: Article
Language:English
Published: BMC 2017-12-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-017-4346-6
_version_ 1811229946716094464
author Zachary N. Harris
Laszlo G. Kovacs
Jason P. Londo
author_facet Zachary N. Harris
Laszlo G. Kovacs
Jason P. Londo
author_sort Zachary N. Harris
collection DOAJ
description Abstract Background The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in highly heterozygous species such as grapevine (Vitis vinifera L.). This work is an attempt to utilize a de novo-assembled transcriptome of the V. vinifera cultivar ‘Riesling’ to improve annotation of the grapevine reference genome sequence. Results Here we show that the transcriptome assembly of a single V. vinifera cultivar is insufficient for a complete genome annotation of the grapevine reference genome constructed from V. vinifera PN40024. Further, we provide evidence that the gene models we identified cannot be completely anchored to the previously published V. vinifera PN40024 gene models. In addition to these findings, we present a computational pipeline for the de novo identification of lncRNAs. Our results demonstrate that, in grapevine, lncRNAs are significantly different from protein coding transcripts in such metrics as length, GC-content, minimum free energy, and length-corrected minimum free energy. Conclusions In grapevine, high-level heterozygosity necessitates that transcriptome characterization be based on cultivar-specific reference genome sequences. Our results strengthen the hypothesis that lncRNAs have thermodynamically different properties than protein-coding RNAs. The analyses of both coding and non-coding RNAs will be instrumental in uncovering inter-cultivar variation in wild and cultivated grapevine species.
first_indexed 2024-04-12T10:21:46Z
format Article
id doaj.art-af676f2a07c947ac9fb37195f5588e36
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-12T10:21:46Z
publishDate 2017-12-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-af676f2a07c947ac9fb37195f5588e362022-12-22T03:37:05ZengBMCBMC Genomics1471-21642017-12-0118111210.1186/s12864-017-4346-6RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’Zachary N. Harris0Laszlo G. Kovacs1Jason P. Londo2Missouri State University, Biology DepartmentMissouri State University, Biology DepartmentUnited States Department of Agriculture, Agricultural Research Service, Grape Genetics Research UnitAbstract Background The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in highly heterozygous species such as grapevine (Vitis vinifera L.). This work is an attempt to utilize a de novo-assembled transcriptome of the V. vinifera cultivar ‘Riesling’ to improve annotation of the grapevine reference genome sequence. Results Here we show that the transcriptome assembly of a single V. vinifera cultivar is insufficient for a complete genome annotation of the grapevine reference genome constructed from V. vinifera PN40024. Further, we provide evidence that the gene models we identified cannot be completely anchored to the previously published V. vinifera PN40024 gene models. In addition to these findings, we present a computational pipeline for the de novo identification of lncRNAs. Our results demonstrate that, in grapevine, lncRNAs are significantly different from protein coding transcripts in such metrics as length, GC-content, minimum free energy, and length-corrected minimum free energy. Conclusions In grapevine, high-level heterozygosity necessitates that transcriptome characterization be based on cultivar-specific reference genome sequences. Our results strengthen the hypothesis that lncRNAs have thermodynamically different properties than protein-coding RNAs. The analyses of both coding and non-coding RNAs will be instrumental in uncovering inter-cultivar variation in wild and cultivated grapevine species.http://link.springer.com/article/10.1186/s12864-017-4346-6TranscriptomeGenome re-annotationRNA-seqlncRNAMinimum free energyRiesling
spellingShingle Zachary N. Harris
Laszlo G. Kovacs
Jason P. Londo
RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
BMC Genomics
Transcriptome
Genome re-annotation
RNA-seq
lncRNA
Minimum free energy
Riesling
title RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
title_full RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
title_fullStr RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
title_full_unstemmed RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
title_short RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar ‘Riesling’
title_sort rna seq based genome annotation and identification of long noncoding rnas in the grapevine cultivar riesling
topic Transcriptome
Genome re-annotation
RNA-seq
lncRNA
Minimum free energy
Riesling
url http://link.springer.com/article/10.1186/s12864-017-4346-6
work_keys_str_mv AT zacharynharris rnaseqbasedgenomeannotationandidentificationoflongnoncodingrnasinthegrapevinecultivarriesling
AT laszlogkovacs rnaseqbasedgenomeannotationandidentificationoflongnoncodingrnasinthegrapevinecultivarriesling
AT jasonplondo rnaseqbasedgenomeannotationandidentificationoflongnoncodingrnasinthegrapevinecultivarriesling