Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library

Soil microorganisms historically have been a rich resource for natural product discovery, yet the majority of these microbes remain uncultivated and their biosynthetic capacity is left underexplored. To identify the biosynthetic potential of soil microorganisms using a culture-independent approach,...

Full description

Bibliographic Details
Main Authors: Alinne L. R. Santana-Pereira, Megan Sandoval-Powers, Scott Monsma, Jinglie Zhou, Scott R. Santos, David A. Mead, Mark R. Liles
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-12-01
Series:Frontiers in Microbiology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmicb.2020.585398/full
_version_ 1819275492180623360
author Alinne L. R. Santana-Pereira
Megan Sandoval-Powers
Scott Monsma
Jinglie Zhou
Scott R. Santos
David A. Mead
Mark R. Liles
Mark R. Liles
author_facet Alinne L. R. Santana-Pereira
Megan Sandoval-Powers
Scott Monsma
Jinglie Zhou
Scott R. Santos
David A. Mead
Mark R. Liles
Mark R. Liles
author_sort Alinne L. R. Santana-Pereira
collection DOAJ
description Soil microorganisms historically have been a rich resource for natural product discovery, yet the majority of these microbes remain uncultivated and their biosynthetic capacity is left underexplored. To identify the biosynthetic potential of soil microorganisms using a culture-independent approach, we constructed a large-insert metagenomic library in Escherichia coli from a topsoil sampled from the Cullars Rotation (Auburn, AL, United States), a long-term crop rotation experiment. Library clones were screened for biosynthetic gene clusters (BGCs) using either PCR or a NGS (next generation sequencing) multiplexed pooling strategy, coupled with bioinformatic analysis to identify contigs associated with each metagenomic clone. A total of 1,015 BGCs were detected from 19,200 clones, identifying 223 clones (1.2%) that carry a polyketide synthase (PKS) and/or a non-ribosomal peptide synthetase (NRPS) cluster, a dramatically improved hit rate compared to PCR screening that targeted type I polyketide ketosynthase (KS) domains. The NRPS and PKS clusters identified by NGS were distinct from known BGCs in the MIBiG database or those PKS clusters identified by PCR. Likewise, 16S rRNA gene sequences obtained by NGS of the library included many representatives that were not recovered by PCR, in concordance with the same bias observed in KS amplicon screening. This study provides novel resources for natural product discovery and circumvents amplification bias to allow annotation of a soil metagenomic library for a more complete picture of its functional and phylogenetic diversity.
first_indexed 2024-12-23T23:25:11Z
format Article
id doaj.art-053e5c62884d4640abef443d685967bc
institution Directory Open Access Journal
issn 1664-302X
language English
last_indexed 2024-12-23T23:25:11Z
publishDate 2020-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Microbiology
spelling doaj.art-053e5c62884d4640abef443d685967bc2022-12-21T17:26:14ZengFrontiers Media S.A.Frontiers in Microbiology1664-302X2020-12-011110.3389/fmicb.2020.585398585398Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic LibraryAlinne L. R. Santana-Pereira0Megan Sandoval-Powers1Scott Monsma2Jinglie Zhou3Scott R. Santos4David A. Mead5Mark R. Liles6Mark R. Liles7Department of Biological Sciences, Auburn University, Auburn, AL, United StatesDepartment of Biological Sciences, Auburn University, Auburn, AL, United StatesLucigen Corporation, Middleton, WI, United StatesDepartment of Biological Sciences, Auburn University, Auburn, AL, United StatesDepartment of Biological Sciences, Auburn University, Auburn, AL, United StatesVarigen Biosciences Corporation, Madison, WI, United StatesDepartment of Biological Sciences, Auburn University, Auburn, AL, United StatesVarigen Biosciences Corporation, Madison, WI, United StatesSoil microorganisms historically have been a rich resource for natural product discovery, yet the majority of these microbes remain uncultivated and their biosynthetic capacity is left underexplored. To identify the biosynthetic potential of soil microorganisms using a culture-independent approach, we constructed a large-insert metagenomic library in Escherichia coli from a topsoil sampled from the Cullars Rotation (Auburn, AL, United States), a long-term crop rotation experiment. Library clones were screened for biosynthetic gene clusters (BGCs) using either PCR or a NGS (next generation sequencing) multiplexed pooling strategy, coupled with bioinformatic analysis to identify contigs associated with each metagenomic clone. A total of 1,015 BGCs were detected from 19,200 clones, identifying 223 clones (1.2%) that carry a polyketide synthase (PKS) and/or a non-ribosomal peptide synthetase (NRPS) cluster, a dramatically improved hit rate compared to PCR screening that targeted type I polyketide ketosynthase (KS) domains. The NRPS and PKS clusters identified by NGS were distinct from known BGCs in the MIBiG database or those PKS clusters identified by PCR. Likewise, 16S rRNA gene sequences obtained by NGS of the library included many representatives that were not recovered by PCR, in concordance with the same bias observed in KS amplicon screening. This study provides novel resources for natural product discovery and circumvents amplification bias to allow annotation of a soil metagenomic library for a more complete picture of its functional and phylogenetic diversity.https://www.frontiersin.org/articles/10.3389/fmicb.2020.585398/fullmetagenomesoilbiosynthetic abilitynext-generating sequencingbiases
spellingShingle Alinne L. R. Santana-Pereira
Megan Sandoval-Powers
Scott Monsma
Jinglie Zhou
Scott R. Santos
David A. Mead
Mark R. Liles
Mark R. Liles
Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
Frontiers in Microbiology
metagenome
soil
biosynthetic ability
next-generating sequencing
biases
title Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
title_full Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
title_fullStr Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
title_full_unstemmed Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
title_short Discovery of Novel Biosynthetic Gene Cluster Diversity From a Soil Metagenomic Library
title_sort discovery of novel biosynthetic gene cluster diversity from a soil metagenomic library
topic metagenome
soil
biosynthetic ability
next-generating sequencing
biases
url https://www.frontiersin.org/articles/10.3389/fmicb.2020.585398/full
work_keys_str_mv AT alinnelrsantanapereira discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT megansandovalpowers discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT scottmonsma discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT jingliezhou discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT scottrsantos discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT davidamead discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT markrliles discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary
AT markrliles discoveryofnovelbiosyntheticgeneclusterdiversityfromasoilmetagenomiclibrary