COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.

Recent advances in sequencing technologies have resulted in an unprecedented increase in the number of metagenomes that are being sequenced world-wide. Given their volume, functional annotation of metagenomic sequence datasets requires specialized computational tools/techniques. In spite of having h...

Full description

Bibliographic Details
Main Authors: Tungadri Bose, Mohammed Monzoorul Haque, Cvsk Reddy, Sharmila S Mande
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4641738?pdf=render
_version_ 1818411485858103296
author Tungadri Bose
Mohammed Monzoorul Haque
Cvsk Reddy
Sharmila S Mande
author_facet Tungadri Bose
Mohammed Monzoorul Haque
Cvsk Reddy
Sharmila S Mande
author_sort Tungadri Bose
collection DOAJ
description Recent advances in sequencing technologies have resulted in an unprecedented increase in the number of metagenomes that are being sequenced world-wide. Given their volume, functional annotation of metagenomic sequence datasets requires specialized computational tools/techniques. In spite of having high accuracy, existing stand-alone functional annotation tools necessitate end-users to perform compute-intensive homology searches of metagenomic datasets against "multiple" databases prior to functional analysis. Although, web-based functional annotation servers address to some extent the problem of availability of compute resources, uploading and analyzing huge volumes of sequence data on a shared public web-service has its own set of limitations. In this study, we present COGNIZER, a comprehensive stand-alone annotation framework which enables end-users to functionally annotate sequences constituting metagenomic datasets. The COGNIZER framework provides multiple workflow options. A subset of these options employs a novel directed-search strategy which helps in reducing the overall compute requirements for end-users. The COGNIZER framework includes a cross-mapping database that enables end-users to simultaneously derive/infer KEGG, Pfam, GO, and SEED subsystem information from the COG annotations.Validation experiments performed with real-world metagenomes and metatranscriptomes, generated using diverse sequencing technologies, indicate that the novel directed-search strategy employed in COGNIZER helps in reducing the compute requirements without significant loss in annotation accuracy. A comparison of COGNIZER's results with pre-computed benchmark values indicate the reliability of the cross-mapping database employed in COGNIZER.The COGNIZER framework is capable of comprehensively annotating any metagenomic or metatranscriptomic dataset from varied sequencing platforms in functional terms. Multiple search options in COGNIZER provide end-users the flexibility of choosing a homology search protocol based on available compute resources. The cross-mapping database in COGNIZER is of high utility since it enables end-users to directly infer/derive KEGG, Pfam, GO, and SEED subsystem annotations from COG categorizations. Furthermore, availability of COGNIZER as a stand-alone scalable implementation is expected to make it a valuable annotation tool in the field of metagenomic research.A Linux implementation of COGNIZER is freely available for download from the following links: http://metagenomics.atc.tcs.com/cognizer, https://metagenomics.atc.tcs.com/function/cognizer.
first_indexed 2024-12-14T10:32:10Z
format Article
id doaj.art-9fd3a9a5556f4fe980483e2442dcadfa
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-14T10:32:10Z
publishDate 2015-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-9fd3a9a5556f4fe980483e2442dcadfa2022-12-21T23:06:06ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011011e014210210.1371/journal.pone.0142102COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.Tungadri BoseMohammed Monzoorul HaqueCvsk ReddySharmila S MandeRecent advances in sequencing technologies have resulted in an unprecedented increase in the number of metagenomes that are being sequenced world-wide. Given their volume, functional annotation of metagenomic sequence datasets requires specialized computational tools/techniques. In spite of having high accuracy, existing stand-alone functional annotation tools necessitate end-users to perform compute-intensive homology searches of metagenomic datasets against "multiple" databases prior to functional analysis. Although, web-based functional annotation servers address to some extent the problem of availability of compute resources, uploading and analyzing huge volumes of sequence data on a shared public web-service has its own set of limitations. In this study, we present COGNIZER, a comprehensive stand-alone annotation framework which enables end-users to functionally annotate sequences constituting metagenomic datasets. The COGNIZER framework provides multiple workflow options. A subset of these options employs a novel directed-search strategy which helps in reducing the overall compute requirements for end-users. The COGNIZER framework includes a cross-mapping database that enables end-users to simultaneously derive/infer KEGG, Pfam, GO, and SEED subsystem information from the COG annotations.Validation experiments performed with real-world metagenomes and metatranscriptomes, generated using diverse sequencing technologies, indicate that the novel directed-search strategy employed in COGNIZER helps in reducing the compute requirements without significant loss in annotation accuracy. A comparison of COGNIZER's results with pre-computed benchmark values indicate the reliability of the cross-mapping database employed in COGNIZER.The COGNIZER framework is capable of comprehensively annotating any metagenomic or metatranscriptomic dataset from varied sequencing platforms in functional terms. Multiple search options in COGNIZER provide end-users the flexibility of choosing a homology search protocol based on available compute resources. The cross-mapping database in COGNIZER is of high utility since it enables end-users to directly infer/derive KEGG, Pfam, GO, and SEED subsystem annotations from COG categorizations. Furthermore, availability of COGNIZER as a stand-alone scalable implementation is expected to make it a valuable annotation tool in the field of metagenomic research.A Linux implementation of COGNIZER is freely available for download from the following links: http://metagenomics.atc.tcs.com/cognizer, https://metagenomics.atc.tcs.com/function/cognizer.http://europepmc.org/articles/PMC4641738?pdf=render
spellingShingle Tungadri Bose
Mohammed Monzoorul Haque
Cvsk Reddy
Sharmila S Mande
COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
PLoS ONE
title COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
title_full COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
title_fullStr COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
title_full_unstemmed COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
title_short COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.
title_sort cognizer a framework for functional annotation of metagenomic datasets
url http://europepmc.org/articles/PMC4641738?pdf=render
work_keys_str_mv AT tungadribose cognizeraframeworkforfunctionalannotationofmetagenomicdatasets
AT mohammedmonzoorulhaque cognizeraframeworkforfunctionalannotationofmetagenomicdatasets
AT cvskreddy cognizeraframeworkforfunctionalannotationofmetagenomicdatasets
AT sharmilasmande cognizeraframeworkforfunctionalannotationofmetagenomicdatasets