Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking

Abstract Background So far, a lot of binning approaches have been intensively developed for untangling metagenome-assembled genomes (MAGs) and evaluated by two main strategies. The strategy by comparison to known genomes prevails over the other strategy by using single-copy genes. However, there is...

Full description

Bibliographic Details
Main Authors: Ziyao Wu, Yuxiao Wang, Jiaqi Zeng, Yizhuang Zhou
Format: Article
Language:English
Published: BMC 2022-11-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-022-08967-x
_version_ 1798019166075092992
author Ziyao Wu
Yuxiao Wang
Jiaqi Zeng
Yizhuang Zhou
author_facet Ziyao Wu
Yuxiao Wang
Jiaqi Zeng
Yizhuang Zhou
author_sort Ziyao Wu
collection DOAJ
description Abstract Background So far, a lot of binning approaches have been intensively developed for untangling metagenome-assembled genomes (MAGs) and evaluated by two main strategies. The strategy by comparison to known genomes prevails over the other strategy by using single-copy genes. However, there is still no dataset with all known genomes for a real (not simulated) bacterial consortium yet. Results Here, we continue investigating the real bacterial consortium F1RT enriched and sequenced by us previously, considering the high possibility to unearth all MAGs, due to its low complexity. The improved F1RT metagenome reassembled by metaSPAdes here utilizes about 98.62% of reads, and a series of analyses for the remaining reads suggests that the possibility of containing other low-abundance organisms in F1RT is greatly low, demonstrating that almost all MAGs are successfully assembled. Then, 4 isolates are obtained and individually sequenced. Based on the 4 isolate genomes and the entire metagenome, an elaborate pipeline is then in-house developed to construct all F1RT MAGs. A series of assessments extensively prove the high reliability of the herein reconstruction. Next, our findings further show that this dataset harbors several properties challenging for binning and thus is suitable to compare advanced binning tools available now or benchmark novel binners. Using this dataset, 8 advanced binning algorithms are assessed, giving useful insights for developing novel approaches. In addition, compared with our previous study, two novel MAGs termed FC8 and FC9 are discovered here, and 7 MAGs are solidly unearthed for species without any available genomes. Conclusion To our knowledge, it is the first time to construct a dataset with almost all known MAGs for a not simulated consortium. We hope that this dataset will be used as a routine toolkit to complement mock datasets for evaluating binning methods to further facilitate binning and metagenomic studies in the future.
first_indexed 2024-04-11T16:35:05Z
format Article
id doaj.art-b90c07fb249a46b895e3472464b48c4c
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-11T16:35:05Z
publishDate 2022-11-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-b90c07fb249a46b895e3472464b48c4c2022-12-22T04:13:51ZengBMCBMC Genomics1471-21642022-11-0123111910.1186/s12864-022-08967-xConstructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarkingZiyao Wu0Yuxiao Wang1Jiaqi Zeng2Yizhuang Zhou3Guangxi Key Laboratory of Environmental Exposomics and Entire Lifecycle Health, School of Public Health, Guilin Medical UniversityGuangxi Key Laboratory of Environmental Exposomics and Entire Lifecycle Health, School of Public Health, Guilin Medical UniversityGuangxi Key Laboratory of Environmental Exposomics and Entire Lifecycle Health, School of Public Health, Guilin Medical UniversityGuangxi Key Laboratory of Environmental Exposomics and Entire Lifecycle Health, School of Public Health, Guilin Medical UniversityAbstract Background So far, a lot of binning approaches have been intensively developed for untangling metagenome-assembled genomes (MAGs) and evaluated by two main strategies. The strategy by comparison to known genomes prevails over the other strategy by using single-copy genes. However, there is still no dataset with all known genomes for a real (not simulated) bacterial consortium yet. Results Here, we continue investigating the real bacterial consortium F1RT enriched and sequenced by us previously, considering the high possibility to unearth all MAGs, due to its low complexity. The improved F1RT metagenome reassembled by metaSPAdes here utilizes about 98.62% of reads, and a series of analyses for the remaining reads suggests that the possibility of containing other low-abundance organisms in F1RT is greatly low, demonstrating that almost all MAGs are successfully assembled. Then, 4 isolates are obtained and individually sequenced. Based on the 4 isolate genomes and the entire metagenome, an elaborate pipeline is then in-house developed to construct all F1RT MAGs. A series of assessments extensively prove the high reliability of the herein reconstruction. Next, our findings further show that this dataset harbors several properties challenging for binning and thus is suitable to compare advanced binning tools available now or benchmark novel binners. Using this dataset, 8 advanced binning algorithms are assessed, giving useful insights for developing novel approaches. In addition, compared with our previous study, two novel MAGs termed FC8 and FC9 are discovered here, and 7 MAGs are solidly unearthed for species without any available genomes. Conclusion To our knowledge, it is the first time to construct a dataset with almost all known MAGs for a not simulated consortium. We hope that this dataset will be used as a routine toolkit to complement mock datasets for evaluating binning methods to further facilitate binning and metagenomic studies in the future.https://doi.org/10.1186/s12864-022-08967-xBinningMetagenomicsCompositionAbundanceBenchmark
spellingShingle Ziyao Wu
Yuxiao Wang
Jiaqi Zeng
Yizhuang Zhou
Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
BMC Genomics
Binning
Metagenomics
Composition
Abundance
Benchmark
title Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
title_full Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
title_fullStr Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
title_full_unstemmed Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
title_short Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
title_sort constructing metagenome assembled genomes for almost all components in a real bacterial consortium for binning benchmarking
topic Binning
Metagenomics
Composition
Abundance
Benchmark
url https://doi.org/10.1186/s12864-022-08967-x
work_keys_str_mv AT ziyaowu constructingmetagenomeassembledgenomesforalmostallcomponentsinarealbacterialconsortiumforbinningbenchmarking
AT yuxiaowang constructingmetagenomeassembledgenomesforalmostallcomponentsinarealbacterialconsortiumforbinningbenchmarking
AT jiaqizeng constructingmetagenomeassembledgenomesforalmostallcomponentsinarealbacterialconsortiumforbinningbenchmarking
AT yizhuangzhou constructingmetagenomeassembledgenomesforalmostallcomponentsinarealbacterialconsortiumforbinningbenchmarking