VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data

<p>Abstract</p> <p>Background</p> <p>The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), glo...

Full description

Bibliographic Details
Main Authors: Peterson Elena S, McCue Lee Ann, Schrimpe-Rutledge Alexandra C, Jensen Jeffrey L, Walker Hyunjoo, Kobold Markus A, Webb Samantha R, Payne Samuel H, Ansong Charles, Adkins Joshua N, Cannon William R, Webb-Robertson Bobbie-Jo M
Format: Article
Language:English
Published: BMC 2012-04-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/13/131
_version_ 1828215524266344448
author Peterson Elena S
McCue Lee Ann
Schrimpe-Rutledge Alexandra C
Jensen Jeffrey L
Walker Hyunjoo
Kobold Markus A
Webb Samantha R
Payne Samuel H
Ansong Charles
Adkins Joshua N
Cannon William R
Webb-Robertson Bobbie-Jo M
author_facet Peterson Elena S
McCue Lee Ann
Schrimpe-Rutledge Alexandra C
Jensen Jeffrey L
Walker Hyunjoo
Kobold Markus A
Webb Samantha R
Payne Samuel H
Ansong Charles
Adkins Joshua N
Cannon William R
Webb-Robertson Bobbie-Jo M
author_sort Peterson Elena S
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates.</p> <p>Results</p> <p>VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (<it>Yersinia pestis </it>Pestoides F and <it>Synechococcus </it>sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data.</p> <p>Conclusions</p> <p>VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at <url>https://www.biopilot.org/docs/Software/Vespa.php</url>.</p>
first_indexed 2024-04-12T15:21:44Z
format Article
id doaj.art-6b65bd95a56b46d3a3601d83b8dde054
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-12T15:21:44Z
publishDate 2012-04-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-6b65bd95a56b46d3a3601d83b8dde0542022-12-22T03:27:25ZengBMCBMC Genomics1471-21642012-04-0113113110.1186/1471-2164-13-131VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic dataPeterson Elena SMcCue Lee AnnSchrimpe-Rutledge Alexandra CJensen Jeffrey LWalker HyunjooKobold Markus AWebb Samantha RPayne Samuel HAnsong CharlesAdkins Joshua NCannon William RWebb-Robertson Bobbie-Jo M<p>Abstract</p> <p>Background</p> <p>The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates.</p> <p>Results</p> <p>VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (<it>Yersinia pestis </it>Pestoides F and <it>Synechococcus </it>sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data.</p> <p>Conclusions</p> <p>VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at <url>https://www.biopilot.org/docs/Software/Vespa.php</url>.</p>http://www.biomedcentral.com/1471-2164/13/131
spellingShingle Peterson Elena S
McCue Lee Ann
Schrimpe-Rutledge Alexandra C
Jensen Jeffrey L
Walker Hyunjoo
Kobold Markus A
Webb Samantha R
Payne Samuel H
Ansong Charles
Adkins Joshua N
Cannon William R
Webb-Robertson Bobbie-Jo M
VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
BMC Genomics
title VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
title_full VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
title_fullStr VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
title_full_unstemmed VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
title_short VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
title_sort vespa software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
url http://www.biomedcentral.com/1471-2164/13/131
work_keys_str_mv AT petersonelenas vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT mccueleeann vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT schrimperutledgealexandrac vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT jensenjeffreyl vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT walkerhyunjoo vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT koboldmarkusa vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT webbsamanthar vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT paynesamuelh vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT ansongcharles vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT adkinsjoshuan vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT cannonwilliamr vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata
AT webbrobertsonbobbiejom vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata