CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb

<p>Abstract</p> <p>Background</p> <p>Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complement...

Full description

Bibliographic Details
Main Authors: Mahadevan Padmanabhan, King John F, Seto Donald
Format: Article
Language:English
Published: BMC 2009-08-01
Series:BMC Research Notes
Online Access:http://www.biomedcentral.com/1756-0500-2-168
_version_ 1811287224514248704
author Mahadevan Padmanabhan
King John F
Seto Donald
author_facet Mahadevan Padmanabhan
King John F
Seto Donald
author_sort Mahadevan Padmanabhan
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG) is an <it>in silico </it>genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms.</p> <p>Findings</p> <p>CGUG is available at <url>http://binf.gmu.edu/geneorder.html</url> as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an <it>in silico </it>display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with <it>Chlamydophila </it>and <it>Francisella </it>genomes.</p> <p>Conclusion</p> <p>CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins.</p>
first_indexed 2024-04-13T03:14:58Z
format Article
id doaj.art-191302300c56430d9e7f065a9643ba74
institution Directory Open Access Journal
issn 1756-0500
language English
last_indexed 2024-04-13T03:14:58Z
publishDate 2009-08-01
publisher BMC
record_format Article
series BMC Research Notes
spelling doaj.art-191302300c56430d9e7f065a9643ba742022-12-22T03:04:56ZengBMCBMC Research Notes1756-05002009-08-012116810.1186/1756-0500-2-168CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 MbMahadevan PadmanabhanKing John FSeto Donald<p>Abstract</p> <p>Background</p> <p>Viruses and small-genome bacteria (~2 megabases and smaller) comprise a considerable population in the biosphere and are of interest to many researchers. These genomes are now sequenced at an unprecedented rate and require complementary computational tools to analyze. "CoreGenesUniqueGenes" (CGUG) is an <it>in silico </it>genome data mining tool that determines a "core" set of genes from two to five organisms with genomes in this size range. Core and unique genes may reflect similar niches and needs, and may be used in classifying organisms.</p> <p>Findings</p> <p>CGUG is available at <url>http://binf.gmu.edu/geneorder.html</url> as a web-based on-the-fly tool that performs iterative BLASTP analyses using a reference genome and up to four query genomes to provide a table of genes common to these genomes. The result is an <it>in silico </it>display of genomes and their proteomes, allowing for further analysis. CGUG can be used for "genome annotation by homology", as demonstrated with <it>Chlamydophila </it>and <it>Francisella </it>genomes.</p> <p>Conclusion</p> <p>CGUG is used to reanalyze the ICTV-based classifications of bacteriophages, to reconfirm long-standing relationships and to explore new classifications. These genomes have been problematic in the past, due largely to horizontal gene transfers. CGUG is validated as a tool for reannotating small genome bacteria using more up-to-date annotations by similarity or homology. These serve as an entry point for wet-bench experiments to confirm the functions of these "hypothetical" and "unknown" proteins.</p>http://www.biomedcentral.com/1756-0500-2-168
spellingShingle Mahadevan Padmanabhan
King John F
Seto Donald
CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
BMC Research Notes
title CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
title_full CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
title_fullStr CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
title_full_unstemmed CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
title_short CGUG: <it>in silico </it>proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to <it>ca</it>. 1.9 Mb
title_sort cgug it in silico it proteome and genome parsing tool for the determination of core and unique genes in the analysis of genomes up to it ca it 1 9 mb
url http://www.biomedcentral.com/1756-0500-2-168
work_keys_str_mv AT mahadevanpadmanabhan cgugitinsilicoitproteomeandgenomeparsingtoolforthedeterminationofcoreanduniquegenesintheanalysisofgenomesuptoitcait19mb
AT kingjohnf cgugitinsilicoitproteomeandgenomeparsingtoolforthedeterminationofcoreanduniquegenesintheanalysisofgenomesuptoitcait19mb
AT setodonald cgugitinsilicoitproteomeandgenomeparsingtoolforthedeterminationofcoreanduniquegenesintheanalysisofgenomesuptoitcait19mb