Multiple origins of endosymbiosis within the Enterobacteriaceae (γ-Proteobacteria): convergence of complex phylogenetic approaches

<p>Abstract</p> <p>Background</p> <p>The bacterial family Enterobacteriaceae gave rise to a variety of symbiotic forms, from the loosely associated commensals, often designated as secondary (S) symbionts, to obligate mutualists, called primary (P) symbionts. Determinati...

Full description

Bibliographic Details
Main Authors: Husník Filip, Chrudimský Tomáš, Hypša Václav
Format: Article
Language:English
Published: BMC 2011-12-01
Series:BMC Biology
Online Access:http://www.biomedcentral.com/1741-7007/9/87
Description
Summary:<p>Abstract</p> <p>Background</p> <p>The bacterial family Enterobacteriaceae gave rise to a variety of symbiotic forms, from the loosely associated commensals, often designated as secondary (S) symbionts, to obligate mutualists, called primary (P) symbionts. Determination of the evolutionary processes behind this phenomenon has long been hampered by the unreliability of phylogenetic reconstructions within this group of bacteria. The main reasons have been the absence of sufficient data, the highly derived nature of the symbiont genomes and lack of appropriate phylogenetic methods. Due to the extremely aberrant nature of their DNA, the symbiotic lineages within Enterobacteriaceae form long branches and tend to cluster as a monophyletic group. This state of phylogenetic uncertainty is now improving with an increasing number of complete bacterial genomes and development of new methods. In this study, we address the monophyly versus polyphyly of enterobacterial symbionts by exploring a multigene matrix within a complex phylogenetic framework.</p> <p>Results</p> <p>We assembled the richest taxon sampling of Enterobacteriaceae to date (50 taxa, 69 orthologous genes with no missing data) and analyzed both nucleic and amino acid data sets using several probabilistic methods. We particularly focused on the long-branch attraction-reducing methods, such as a nucleotide and amino acid data recoding and exclusion (including our new approach and slow-fast analysis), taxa exclusion and usage of complex evolutionary models, such as nonhomogeneous model and models accounting for site-specific features of protein evolution (CAT and CAT+GTR). Our data strongly suggest independent origins of four symbiotic clusters; the first is formed by <it>Hamiltonella </it>and <it>Regiella </it>(S-symbionts) placed as a sister clade to <it>Yersinia</it>, the second comprises <it>Arsenophonus </it>and <it>Riesia </it>(S- and P-symbionts) as a sister clade to <it>Proteus</it>, the third <it>Sodalis</it>, <it>Baumannia</it>, <it>Blochmannia </it>and <it>Wigglesworthia </it>(S- and P-symbionts) as a sister or paraphyletic clade to the <it>Pectobacterium </it>and <it>Dickeya </it>clade and, finally, <it>Buchnera </it>species and <it>Ishikawaella </it>(P-symbionts) clustering with the <it>Erwinia </it>and <it>Pantoea </it>clade.</p> <p>Conclusions</p> <p>The results of this study confirm the efficiency of several artifact-reducing methods and strongly point towards the polyphyly of P-symbionts within Enterobacteriaceae. Interestingly, the model species of symbiotic bacteria research, <it>Buchnera </it>and <it>Wigglesworthia</it>, originated from closely related, but different, ancestors. The possible origins of intracellular symbiotic bacteria from gut-associated or pathogenic bacteria are suggested, as well as the role of facultative secondary symbionts as a source of bacteria that can gradually become obligate maternally transferred symbionts.</p>
ISSN:1741-7007