ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding

Abstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab da...

Full description

Bibliographic Details
Main Authors: Joseph Guhlin, Kevin A. T. Silverstein, Peng Zhou, Peter Tiffin, Nevin D. Young
Format: Article
Language:English
Published: BMC 2017-08-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-017-1777-7
_version_ 1818016430067548160
author Joseph Guhlin
Kevin A. T. Silverstein
Peng Zhou
Peter Tiffin
Nevin D. Young
author_facet Joseph Guhlin
Kevin A. T. Silverstein
Peng Zhou
Peter Tiffin
Nevin D. Young
author_sort Joseph Guhlin
collection DOAJ
description Abstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. Results The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. Conclusions ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.
first_indexed 2024-04-14T07:12:09Z
format Article
id doaj.art-0e79217ae7bf4a58b36634c3427f2da0
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-14T07:12:09Z
publishDate 2017-08-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-0e79217ae7bf4a58b36634c3427f2da02022-12-22T02:06:24ZengBMCBMC Bioinformatics1471-21052017-08-011811810.1186/s12859-017-1777-7ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understandingJoseph Guhlin0Kevin A. T. Silverstein1Peng Zhou2Peter Tiffin3Nevin D. Young4Department of Plant and Microbial Biology, 140 Gortner Laboratory, 1479 Gortner Avenue, University of MinnesotaMinnesota Supercomputing Institute, 599 Walter LibraryDepartment of Plant PathologyDepartment of Plant and Microbial Biology, 140 Gortner Laboratory, 1479 Gortner Avenue, University of MinnesotaDepartment of Plant PathologyAbstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. Results The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. Conclusions ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.http://link.springer.com/article/10.1186/s12859-017-1777-7Comparative genomicsNon-model speciesAnnotationGraph databaseData integration
spellingShingle Joseph Guhlin
Kevin A. T. Silverstein
Peng Zhou
Peter Tiffin
Nevin D. Young
ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
BMC Bioinformatics
Comparative genomics
Non-model species
Annotation
Graph database
Data integration
title ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
title_full ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
title_fullStr ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
title_full_unstemmed ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
title_short ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
title_sort odg omics database generator a tool for generating querying and analyzing multi omics comparative databases to facilitate biological understanding
topic Comparative genomics
Non-model species
Annotation
Graph database
Data integration
url http://link.springer.com/article/10.1186/s12859-017-1777-7
work_keys_str_mv AT josephguhlin odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding
AT kevinatsilverstein odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding
AT pengzhou odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding
AT petertiffin odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding
AT nevindyoung odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding