A neutral theory of genome evolution and the frequency distribution of genes

<p>Abstract</p> <p>Background</p> <p>The gene composition of bacteria of the same species can differ significantly between isolates. Variability in gene composition can be summarized in terms of gene frequency distributions, in which individual genes are ranked accordin...

Full description

Bibliographic Details
Main Authors: Haegeman Bart, Weitz Joshua S
Format: Article
Language:English
Published: BMC 2012-05-01
Series:BMC Genomics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2164/13/196
_version_ 1828948929587183616
author Haegeman Bart
Weitz Joshua S
author_facet Haegeman Bart
Weitz Joshua S
author_sort Haegeman Bart
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The gene composition of bacteria of the same species can differ significantly between isolates. Variability in gene composition can be summarized in terms of gene frequency distributions, in which individual genes are ranked according to the frequency of genomes in which they appear. Empirical gene frequency distributions possess a U-shape, such that there are many rare genes, some genes of intermediate occurrence, and many common genes. It would seem that U-shaped gene frequency distributions can be used to infer the essentiality and/or importance of a gene to a species. Here, we ask: can U-shaped gene frequency distributions, instead, arise generically via neutral processes of genome evolution?</p> <p>Results</p> <p>We introduce a neutral model of genome evolution which combines birth-death processes at the organismal level with gene uptake and loss at the genomic level. This model predicts that gene frequency distributions possess a characteristic U-shape even in the absence of selective forces driving genome and population structure. We compare the model predictions to empirical gene frequency distributions from 6 multiply sequenced species of bacterial pathogens. We fit the model with constant population size to data, matching U-shape distributions albeit without matching all quantitative features of the distribution. We find stronger model fits in the case where we consider exponentially growing populations. We also show that two alternative models which contain a "rigid" and "flexible" core component of genomes provide strong fits to gene frequency distributions.</p> <p>Conclusions</p> <p>The analysis of neutral models of genome evolution suggests that U-shaped gene frequency distributions provide less information than previously suggested regarding gene essentiality. We discuss the need for additional theory and genomic level information to disentangle the roles of evolutionary mechanisms operating within and amongst individuals in driving the dynamics of gene distributions.</p>
first_indexed 2024-12-14T05:57:13Z
format Article
id doaj.art-587102ac607f4935864471734cc90932
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-14T05:57:13Z
publishDate 2012-05-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-587102ac607f4935864471734cc909322022-12-21T23:14:32ZengBMCBMC Genomics1471-21642012-05-0113119610.1186/1471-2164-13-196A neutral theory of genome evolution and the frequency distribution of genesHaegeman BartWeitz Joshua S<p>Abstract</p> <p>Background</p> <p>The gene composition of bacteria of the same species can differ significantly between isolates. Variability in gene composition can be summarized in terms of gene frequency distributions, in which individual genes are ranked according to the frequency of genomes in which they appear. Empirical gene frequency distributions possess a U-shape, such that there are many rare genes, some genes of intermediate occurrence, and many common genes. It would seem that U-shaped gene frequency distributions can be used to infer the essentiality and/or importance of a gene to a species. Here, we ask: can U-shaped gene frequency distributions, instead, arise generically via neutral processes of genome evolution?</p> <p>Results</p> <p>We introduce a neutral model of genome evolution which combines birth-death processes at the organismal level with gene uptake and loss at the genomic level. This model predicts that gene frequency distributions possess a characteristic U-shape even in the absence of selective forces driving genome and population structure. We compare the model predictions to empirical gene frequency distributions from 6 multiply sequenced species of bacterial pathogens. We fit the model with constant population size to data, matching U-shape distributions albeit without matching all quantitative features of the distribution. We find stronger model fits in the case where we consider exponentially growing populations. We also show that two alternative models which contain a "rigid" and "flexible" core component of genomes provide strong fits to gene frequency distributions.</p> <p>Conclusions</p> <p>The analysis of neutral models of genome evolution suggests that U-shaped gene frequency distributions provide less information than previously suggested regarding gene essentiality. We discuss the need for additional theory and genomic level information to disentangle the roles of evolutionary mechanisms operating within and amongst individuals in driving the dynamics of gene distributions.</p>http://www.biomedcentral.com/1471-2164/13/196BacteriaNeutral modelPan-genomePopulation genomicsSelection
spellingShingle Haegeman Bart
Weitz Joshua S
A neutral theory of genome evolution and the frequency distribution of genes
BMC Genomics
Bacteria
Neutral model
Pan-genome
Population genomics
Selection
title A neutral theory of genome evolution and the frequency distribution of genes
title_full A neutral theory of genome evolution and the frequency distribution of genes
title_fullStr A neutral theory of genome evolution and the frequency distribution of genes
title_full_unstemmed A neutral theory of genome evolution and the frequency distribution of genes
title_short A neutral theory of genome evolution and the frequency distribution of genes
title_sort neutral theory of genome evolution and the frequency distribution of genes
topic Bacteria
Neutral model
Pan-genome
Population genomics
Selection
url http://www.biomedcentral.com/1471-2164/13/196
work_keys_str_mv AT haegemanbart aneutraltheoryofgenomeevolutionandthefrequencydistributionofgenes
AT weitzjoshuas aneutraltheoryofgenomeevolutionandthefrequencydistributionofgenes
AT haegemanbart neutraltheoryofgenomeevolutionandthefrequencydistributionofgenes
AT weitzjoshuas neutraltheoryofgenomeevolutionandthefrequencydistributionofgenes