An ANI gap within bacterial species that advances the definitions of intra-species units

ABSTRACTLarge-scale surveys of prokaryotic communities (metagenomes), as well as isolate genomes, have revealed that their diversity is predominantly organized in sequence-discrete units that may be equated to species. Specifically, genomes of the same species commonly show genome-aggregate average...

Full description

Bibliographic Details
Main Authors: Luis M. Rodriguez-R, Roth E. Conrad, Tomeu Viver, Dorian J. Feistel, Blake G. Lindner, Stephanus N. Venter, Luis H. Orellana, Rudolf Amann, Ramon Rossello-Mora, Konstantinos T. Konstantinidis
Format: Article
Language:English
Published: American Society for Microbiology 2024-01-01
Series:mBio
Subjects:
Online Access:https://journals.asm.org/doi/10.1128/mbio.02696-23
_version_ 1827380679216726016
author Luis M. Rodriguez-R
Roth E. Conrad
Tomeu Viver
Dorian J. Feistel
Blake G. Lindner
Stephanus N. Venter
Luis H. Orellana
Rudolf Amann
Ramon Rossello-Mora
Konstantinos T. Konstantinidis
author_facet Luis M. Rodriguez-R
Roth E. Conrad
Tomeu Viver
Dorian J. Feistel
Blake G. Lindner
Stephanus N. Venter
Luis H. Orellana
Rudolf Amann
Ramon Rossello-Mora
Konstantinos T. Konstantinidis
author_sort Luis M. Rodriguez-R
collection DOAJ
description ABSTRACTLarge-scale surveys of prokaryotic communities (metagenomes), as well as isolate genomes, have revealed that their diversity is predominantly organized in sequence-discrete units that may be equated to species. Specifically, genomes of the same species commonly show genome-aggregate average nucleotide identity (ANI) >95% among themselves and ANI <90% to members of other species, while genomes showing ANI 90%–95% are comparatively rare. However, it remains unclear if such “discontinuities” or gaps in ANI values can be observed within species and thus used to advance and standardize intra-species units. By analyzing 18,123 complete isolate genomes from 330 bacterial species with at least 10 genome representatives each and available long-read metagenomes, we show that another discontinuity exists between 99.2% and 99.8% (midpoint 99.5%) ANI in most of these species. The 99.5% ANI threshold is largely consistent with how sequence types have been defined in previous epidemiological studies but provides clusters with ~20% higher accuracy in terms of evolutionary and gene-content relatedness of the grouped genomes, while strains should be consequently defined at higher ANI values (>99.99% proposed). Collectively, our results should facilitate future micro-diversity studies across clinical or environmental settings because they provide a more natural definition of intra-species units of diversity.IMPORTANCEBacterial strains and clonal complexes are two cornerstone concepts for microbiology that remain loosely defined, which confuses communication and research. Here we identify a natural gap in genome sequence comparisons among isolate genomes of all well-sequenced species that has gone unnoticed so far and could be used to more accurately and precisely define these and related concepts compared to current methods. These findings advance the molecular toolbox for accurately delineating and following the important units of diversity within prokaryotic species and thus should greatly facilitate future epidemiological and micro-diversity studies across clinical and environmental settings.
first_indexed 2024-03-08T13:37:30Z
format Article
id doaj.art-8dfb72290426482f9407ccf710b458a2
institution Directory Open Access Journal
issn 2150-7511
language English
last_indexed 2024-03-08T13:37:30Z
publishDate 2024-01-01
publisher American Society for Microbiology
record_format Article
series mBio
spelling doaj.art-8dfb72290426482f9407ccf710b458a22024-01-16T15:40:00ZengAmerican Society for MicrobiologymBio2150-75112024-01-0115110.1128/mbio.02696-23An ANI gap within bacterial species that advances the definitions of intra-species unitsLuis M. Rodriguez-R0Roth E. Conrad1Tomeu Viver2Dorian J. Feistel3Blake G. Lindner4Stephanus N. Venter5Luis H. Orellana6Rudolf Amann7Ramon Rossello-Mora8Konstantinos T. Konstantinidis9Department of Microbiology, and Digital Science Center (DiSC), University of Innsbruck, Innsbruck, AustriaSchool of Civil and Environmental Engineering, and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USADepartment of Animal and Microbial Biodiversity, Marine Microbiology Group, Mediterranean Institutes for Advanced Studies (IMEDEA, CSIC-UIB), Esporles, SpainSchool of Civil and Environmental Engineering, and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USASchool of Civil and Environmental Engineering, and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USADepartment of Biochemistry, Genetics and Microbiology, and Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South AfricaDepartment of Molecular Ecology, Max Planck Institute for Marine Microbiology, Bremen, GermanyDepartment of Molecular Ecology, Max Planck Institute for Marine Microbiology, Bremen, GermanyDepartment of Animal and Microbial Biodiversity, Marine Microbiology Group, Mediterranean Institutes for Advanced Studies (IMEDEA, CSIC-UIB), Esporles, SpainSchool of Civil and Environmental Engineering, and School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, USAABSTRACTLarge-scale surveys of prokaryotic communities (metagenomes), as well as isolate genomes, have revealed that their diversity is predominantly organized in sequence-discrete units that may be equated to species. Specifically, genomes of the same species commonly show genome-aggregate average nucleotide identity (ANI) >95% among themselves and ANI <90% to members of other species, while genomes showing ANI 90%–95% are comparatively rare. However, it remains unclear if such “discontinuities” or gaps in ANI values can be observed within species and thus used to advance and standardize intra-species units. By analyzing 18,123 complete isolate genomes from 330 bacterial species with at least 10 genome representatives each and available long-read metagenomes, we show that another discontinuity exists between 99.2% and 99.8% (midpoint 99.5%) ANI in most of these species. The 99.5% ANI threshold is largely consistent with how sequence types have been defined in previous epidemiological studies but provides clusters with ~20% higher accuracy in terms of evolutionary and gene-content relatedness of the grouped genomes, while strains should be consequently defined at higher ANI values (>99.99% proposed). Collectively, our results should facilitate future micro-diversity studies across clinical or environmental settings because they provide a more natural definition of intra-species units of diversity.IMPORTANCEBacterial strains and clonal complexes are two cornerstone concepts for microbiology that remain loosely defined, which confuses communication and research. Here we identify a natural gap in genome sequence comparisons among isolate genomes of all well-sequenced species that has gone unnoticed so far and could be used to more accurately and precisely define these and related concepts compared to current methods. These findings advance the molecular toolbox for accurately delineating and following the important units of diversity within prokaryotic species and thus should greatly facilitate future epidemiological and micro-diversity studies across clinical and environmental settings.https://journals.asm.org/doi/10.1128/mbio.02696-23ANIstrain definitionmicro-diversityepidemiologyclonal complex
spellingShingle Luis M. Rodriguez-R
Roth E. Conrad
Tomeu Viver
Dorian J. Feistel
Blake G. Lindner
Stephanus N. Venter
Luis H. Orellana
Rudolf Amann
Ramon Rossello-Mora
Konstantinos T. Konstantinidis
An ANI gap within bacterial species that advances the definitions of intra-species units
mBio
ANI
strain definition
micro-diversity
epidemiology
clonal complex
title An ANI gap within bacterial species that advances the definitions of intra-species units
title_full An ANI gap within bacterial species that advances the definitions of intra-species units
title_fullStr An ANI gap within bacterial species that advances the definitions of intra-species units
title_full_unstemmed An ANI gap within bacterial species that advances the definitions of intra-species units
title_short An ANI gap within bacterial species that advances the definitions of intra-species units
title_sort ani gap within bacterial species that advances the definitions of intra species units
topic ANI
strain definition
micro-diversity
epidemiology
clonal complex
url https://journals.asm.org/doi/10.1128/mbio.02696-23
work_keys_str_mv AT luismrodriguezr ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT rotheconrad ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT tomeuviver ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT dorianjfeistel ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT blakeglindner ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT stephanusnventer ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT luishorellana ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT rudolfamann ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT ramonrossellomora ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT konstantinostkonstantinidis ananigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT luismrodriguezr anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT rotheconrad anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT tomeuviver anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT dorianjfeistel anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT blakeglindner anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT stephanusnventer anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT luishorellana anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT rudolfamann anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT ramonrossellomora anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits
AT konstantinostkonstantinidis anigapwithinbacterialspeciesthatadvancesthedefinitionsofintraspeciesunits