ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding
Abstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab da...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-08-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-017-1777-7 |
_version_ | 1818016430067548160 |
---|---|
author | Joseph Guhlin Kevin A. T. Silverstein Peng Zhou Peter Tiffin Nevin D. Young |
author_facet | Joseph Guhlin Kevin A. T. Silverstein Peng Zhou Peter Tiffin Nevin D. Young |
author_sort | Joseph Guhlin |
collection | DOAJ |
description | Abstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. Results The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. Conclusions ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries. |
first_indexed | 2024-04-14T07:12:09Z |
format | Article |
id | doaj.art-0e79217ae7bf4a58b36634c3427f2da0 |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-04-14T07:12:09Z |
publishDate | 2017-08-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-0e79217ae7bf4a58b36634c3427f2da02022-12-22T02:06:24ZengBMCBMC Bioinformatics1471-21052017-08-011811810.1186/s12859-017-1777-7ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understandingJoseph Guhlin0Kevin A. T. Silverstein1Peng Zhou2Peter Tiffin3Nevin D. Young4Department of Plant and Microbial Biology, 140 Gortner Laboratory, 1479 Gortner Avenue, University of MinnesotaMinnesota Supercomputing Institute, 599 Walter LibraryDepartment of Plant PathologyDepartment of Plant and Microbial Biology, 140 Gortner Laboratory, 1479 Gortner Avenue, University of MinnesotaDepartment of Plant PathologyAbstract Background Rapid generation of omics data in recent years have resulted in vast amounts of disconnected datasets without systemic integration and knowledge building, while individual groups have made customized, annotated datasets available on the web with few ways to link them to in-lab datasets. With so many research groups generating their own data, the ability to relate it to the larger genomic and comparative genomic context is becoming increasingly crucial to make full use of the data. Results The Omics Database Generator (ODG) allows users to create customized databases that utilize published genomics data integrated with experimental data which can be queried using a flexible graph database. When provided with omics and experimental data, ODG will create a comparative, multi-dimensional graph database. ODG can import definitions and annotations from other sources such as InterProScan, the Gene Ontology, ENZYME, UniPathway, and others. This annotation data can be especially useful for studying new or understudied species for which transcripts have only been predicted, and rapidly give additional layers of annotation to predicted genes. In better studied species, ODG can perform syntenic annotation translations or rapidly identify characteristics of a set of genes or nucleotide locations, such as hits from an association study. ODG provides a web-based user-interface for configuring the data import and for querying the database. Queries can also be run from the command-line and the database can be queried directly through programming language hooks available for most languages. ODG supports most common genomic formats as well as generic, easy to use tab-separated value format for user-provided annotations. Conclusions ODG is a user-friendly database generation and query tool that adapts to the supplied data to produce a comparative genomic database or multi-layered annotation database. ODG provides rapid comparative genomic annotation and is therefore particularly useful for non-model or understudied species. For species for which more data are available, ODG can be used to conduct complex multi-omics, pattern-matching queries.http://link.springer.com/article/10.1186/s12859-017-1777-7Comparative genomicsNon-model speciesAnnotationGraph databaseData integration |
spellingShingle | Joseph Guhlin Kevin A. T. Silverstein Peng Zhou Peter Tiffin Nevin D. Young ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding BMC Bioinformatics Comparative genomics Non-model species Annotation Graph database Data integration |
title | ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding |
title_full | ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding |
title_fullStr | ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding |
title_full_unstemmed | ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding |
title_short | ODG: Omics database generator - a tool for generating, querying, and analyzing multi-omics comparative databases to facilitate biological understanding |
title_sort | odg omics database generator a tool for generating querying and analyzing multi omics comparative databases to facilitate biological understanding |
topic | Comparative genomics Non-model species Annotation Graph database Data integration |
url | http://link.springer.com/article/10.1186/s12859-017-1777-7 |
work_keys_str_mv | AT josephguhlin odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding AT kevinatsilverstein odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding AT pengzhou odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding AT petertiffin odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding AT nevindyoung odgomicsdatabasegeneratoratoolforgeneratingqueryingandanalyzingmultiomicscomparativedatabasestofacilitatebiologicalunderstanding |