Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.

Approaches for systematizing information of relatedness between organisms is important in biology. Phylogenetic analyses based on sets of highly conserved genes are currently the basis for the Tree of Life. Genome-scale metabolic reconstructions contain high-quality information regarding the metabol...

Full description

Bibliographic Details
Main Authors: Christian Schulz, Eivind Almaas
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0240953
_version_ 1819027882479976448
author Christian Schulz
Eivind Almaas
author_facet Christian Schulz
Eivind Almaas
author_sort Christian Schulz
collection DOAJ
description Approaches for systematizing information of relatedness between organisms is important in biology. Phylogenetic analyses based on sets of highly conserved genes are currently the basis for the Tree of Life. Genome-scale metabolic reconstructions contain high-quality information regarding the metabolic capability of an organism and are typically restricted to metabolically active enzyme-encoding genes. While there are many tools available to generate draft reconstructions, expert-level knowledge is still required to generate and manually curate high-quality genome-scale metabolic models and to fill gaps in their reaction networks. Here, we use the tool AutoKEGGRec to construct 975 genome-scale metabolic draft reconstructions encoded in the KEGG database without further curation. The organisms are selected across all three domains, and their metabolic networks serve as basis for generating phylogenetic trees. We find that using all reactions encoded, these metabolism-based comparisons give rise to a phylogenetic tree with close similarity to the Tree of Life. While this tree is quite robust to reasonable levels of noise in the metabolic reaction content of an organism, we find a significant heterogeneity in how much noise an organism may tolerate before it is incorrectly placed in the tree. Furthermore, by using the protein sequences for particular metabolic functions and pathway sets, such as central carbon-, nitrogen-, and sulfur-metabolism, as basis for the organism comparisons, we generate highly specific phylogenetic trees. We believe the generation of phylogenetic trees based on metabolic reaction content, in particular when focused on specific functions and pathways, could aid the identification of functionally important metabolic enzymes and be of value for genome-scale metabolic modellers and enzyme-engineers.
first_indexed 2024-12-21T05:49:32Z
format Article
id doaj.art-464c530acc9d4970babe0697f9c6eba8
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-21T05:49:32Z
publishDate 2020-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-464c530acc9d4970babe0697f9c6eba82022-12-21T19:14:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011512e024095310.1371/journal.pone.0240953Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.Christian SchulzEivind AlmaasApproaches for systematizing information of relatedness between organisms is important in biology. Phylogenetic analyses based on sets of highly conserved genes are currently the basis for the Tree of Life. Genome-scale metabolic reconstructions contain high-quality information regarding the metabolic capability of an organism and are typically restricted to metabolically active enzyme-encoding genes. While there are many tools available to generate draft reconstructions, expert-level knowledge is still required to generate and manually curate high-quality genome-scale metabolic models and to fill gaps in their reaction networks. Here, we use the tool AutoKEGGRec to construct 975 genome-scale metabolic draft reconstructions encoded in the KEGG database without further curation. The organisms are selected across all three domains, and their metabolic networks serve as basis for generating phylogenetic trees. We find that using all reactions encoded, these metabolism-based comparisons give rise to a phylogenetic tree with close similarity to the Tree of Life. While this tree is quite robust to reasonable levels of noise in the metabolic reaction content of an organism, we find a significant heterogeneity in how much noise an organism may tolerate before it is incorrectly placed in the tree. Furthermore, by using the protein sequences for particular metabolic functions and pathway sets, such as central carbon-, nitrogen-, and sulfur-metabolism, as basis for the organism comparisons, we generate highly specific phylogenetic trees. We believe the generation of phylogenetic trees based on metabolic reaction content, in particular when focused on specific functions and pathways, could aid the identification of functionally important metabolic enzymes and be of value for genome-scale metabolic modellers and enzyme-engineers.https://doi.org/10.1371/journal.pone.0240953
spellingShingle Christian Schulz
Eivind Almaas
Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
PLoS ONE
title Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
title_full Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
title_fullStr Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
title_full_unstemmed Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
title_short Genome-scale reconstructions to assess metabolic phylogeny and organism clustering.
title_sort genome scale reconstructions to assess metabolic phylogeny and organism clustering
url https://doi.org/10.1371/journal.pone.0240953
work_keys_str_mv AT christianschulz genomescalereconstructionstoassessmetabolicphylogenyandorganismclustering
AT eivindalmaas genomescalereconstructionstoassessmetabolicphylogenyandorganismclustering