Comparative omics-driven genome annotation refinement: application across Yersiniae.

Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequence...

Full description

Bibliographic Details
Main Authors: Alexandra C Schrimpe-Rutledge, Marcus B Jones, Sadhana Chauhan, Samuel O Purvine, James A Sanford, Matthew E Monroe, Heather M Brewer, Samuel H Payne, Charles Ansong, Bryan C Frank, Richard D Smith, Scott N Peterson, Vladimir L Motin, Joshua N Adkins
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3313959?pdf=render
_version_ 1819183170784854016
author Alexandra C Schrimpe-Rutledge
Marcus B Jones
Sadhana Chauhan
Samuel O Purvine
James A Sanford
Matthew E Monroe
Heather M Brewer
Samuel H Payne
Charles Ansong
Bryan C Frank
Richard D Smith
Scott N Peterson
Vladimir L Motin
Joshua N Adkins
author_facet Alexandra C Schrimpe-Rutledge
Marcus B Jones
Sadhana Chauhan
Samuel O Purvine
James A Sanford
Matthew E Monroe
Heather M Brewer
Samuel H Payne
Charles Ansong
Bryan C Frank
Richard D Smith
Scott N Peterson
Vladimir L Motin
Joshua N Adkins
author_sort Alexandra C Schrimpe-Rutledge
collection DOAJ
description Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.
first_indexed 2024-12-22T22:57:46Z
format Article
id doaj.art-4391ffc536ab451d94a0a9f703d959d8
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-22T22:57:46Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-4391ffc536ab451d94a0a9f703d959d82022-12-21T18:09:46ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0173e3390310.1371/journal.pone.0033903Comparative omics-driven genome annotation refinement: application across Yersiniae.Alexandra C Schrimpe-RutledgeMarcus B JonesSadhana ChauhanSamuel O PurvineJames A SanfordMatthew E MonroeHeather M BrewerSamuel H PayneCharles AnsongBryan C FrankRichard D SmithScott N PetersonVladimir L MotinJoshua N AdkinsGenome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.http://europepmc.org/articles/PMC3313959?pdf=render
spellingShingle Alexandra C Schrimpe-Rutledge
Marcus B Jones
Sadhana Chauhan
Samuel O Purvine
James A Sanford
Matthew E Monroe
Heather M Brewer
Samuel H Payne
Charles Ansong
Bryan C Frank
Richard D Smith
Scott N Peterson
Vladimir L Motin
Joshua N Adkins
Comparative omics-driven genome annotation refinement: application across Yersiniae.
PLoS ONE
title Comparative omics-driven genome annotation refinement: application across Yersiniae.
title_full Comparative omics-driven genome annotation refinement: application across Yersiniae.
title_fullStr Comparative omics-driven genome annotation refinement: application across Yersiniae.
title_full_unstemmed Comparative omics-driven genome annotation refinement: application across Yersiniae.
title_short Comparative omics-driven genome annotation refinement: application across Yersiniae.
title_sort comparative omics driven genome annotation refinement application across yersiniae
url http://europepmc.org/articles/PMC3313959?pdf=render
work_keys_str_mv AT alexandracschrimperutledge comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT marcusbjones comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT sadhanachauhan comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT samuelopurvine comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT jamesasanford comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT matthewemonroe comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT heathermbrewer comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT samuelhpayne comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT charlesansong comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT bryancfrank comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT richarddsmith comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT scottnpeterson comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT vladimirlmotin comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae
AT joshuanadkins comparativeomicsdrivengenomeannotationrefinementapplicationacrossyersiniae