Selection or drift: The population biology underlying transposon insertion sequencing experiments

Transposon insertion sequencing methods such as Tn-seq revolutionized microbiology by allowing the identification of genomic loci that are critical for viability in a specific environment on a genome-wide scale. While powerful, transposon insertion sequencing suffers from limited reproducibility whe...

Full description

Bibliographic Details
Main Authors: Anel Mahmutovic, Pia Abel zur Wiesch, Sören Abel
Format: Article
Language:English
Published: Elsevier 2020-01-01
Series:Computational and Structural Biotechnology Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2001037019304398
_version_ 1818950704779231232
author Anel Mahmutovic
Pia Abel zur Wiesch
Sören Abel
author_facet Anel Mahmutovic
Pia Abel zur Wiesch
Sören Abel
author_sort Anel Mahmutovic
collection DOAJ
description Transposon insertion sequencing methods such as Tn-seq revolutionized microbiology by allowing the identification of genomic loci that are critical for viability in a specific environment on a genome-wide scale. While powerful, transposon insertion sequencing suffers from limited reproducibility when different analysis methods are compared. From the perspective of population biology, this may be explained by changes in mutant frequency due to chance (drift) rather than differential fitness (selection).Here, we develop a mathematical model of the population biology of transposon insertion sequencing experiments, i.e. the changes in size and composition of the transposon-mutagenized population during the experiment. We use this model to investigate mutagenesis, the growth of the mutant library, and its passage through bottlenecks. Specifically, we study how these processes can lead to extinction of individual mutants depending on their fitness and the distribution of fitness effects (DFE) of the entire mutant population.We find that in typical in vitro experiments few mutants with high fitness go extinct. However, bottlenecks of a size that is common in animal infection models lead to so much random extinction that a large number of viable mutants would be misclassified. While mutants with low fitness are more likely to be lost during the experiment, mutants with intermediate fitness are expected to be much more abundant and can constitute a large proportion of detected hits, i.e. false positives. Thus, incorporating the DFEs of randomly generated mutations in the analysis may improve the reproducibility of transposon insertion experiments, especially when strong bottlenecks are encountered.
first_indexed 2024-12-20T09:22:49Z
format Article
id doaj.art-f559f71a31294aa987afd93eefc6fece
institution Directory Open Access Journal
issn 2001-0370
language English
last_indexed 2024-12-20T09:22:49Z
publishDate 2020-01-01
publisher Elsevier
record_format Article
series Computational and Structural Biotechnology Journal
spelling doaj.art-f559f71a31294aa987afd93eefc6fece2022-12-21T19:45:16ZengElsevierComputational and Structural Biotechnology Journal2001-03702020-01-0118791804Selection or drift: The population biology underlying transposon insertion sequencing experimentsAnel Mahmutovic0Pia Abel zur Wiesch1Sören Abel2Department of Pharmacy, Faculty of Health Sciences, UiT - The Arctic University of Norway, 9037 Tromsø, NorwayDepartment of Pharmacy, Faculty of Health Sciences, UiT - The Arctic University of Norway, 9037 Tromsø, Norway; Centre for Molecular Medicine Norway, Nordic EMBL Partnership, 0318 Oslo, Norway; Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA; Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USADepartment of Pharmacy, Faculty of Health Sciences, UiT - The Arctic University of Norway, 9037 Tromsø, Norway; Centre for Molecular Medicine Norway, Nordic EMBL Partnership, 0318 Oslo, Norway; Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA; Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, PA 16802, USA; Corresponding author at: Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 18602, USA.Transposon insertion sequencing methods such as Tn-seq revolutionized microbiology by allowing the identification of genomic loci that are critical for viability in a specific environment on a genome-wide scale. While powerful, transposon insertion sequencing suffers from limited reproducibility when different analysis methods are compared. From the perspective of population biology, this may be explained by changes in mutant frequency due to chance (drift) rather than differential fitness (selection).Here, we develop a mathematical model of the population biology of transposon insertion sequencing experiments, i.e. the changes in size and composition of the transposon-mutagenized population during the experiment. We use this model to investigate mutagenesis, the growth of the mutant library, and its passage through bottlenecks. Specifically, we study how these processes can lead to extinction of individual mutants depending on their fitness and the distribution of fitness effects (DFE) of the entire mutant population.We find that in typical in vitro experiments few mutants with high fitness go extinct. However, bottlenecks of a size that is common in animal infection models lead to so much random extinction that a large number of viable mutants would be misclassified. While mutants with low fitness are more likely to be lost during the experiment, mutants with intermediate fitness are expected to be much more abundant and can constitute a large proportion of detected hits, i.e. false positives. Thus, incorporating the DFEs of randomly generated mutations in the analysis may improve the reproducibility of transposon insertion experiments, especially when strong bottlenecks are encountered.http://www.sciencedirect.com/science/article/pii/S2001037019304398Tn-seqTransposon insertion sequencingPopulation biologyRandom birth-death processMultinomial random samplingBottleneck
spellingShingle Anel Mahmutovic
Pia Abel zur Wiesch
Sören Abel
Selection or drift: The population biology underlying transposon insertion sequencing experiments
Computational and Structural Biotechnology Journal
Tn-seq
Transposon insertion sequencing
Population biology
Random birth-death process
Multinomial random sampling
Bottleneck
title Selection or drift: The population biology underlying transposon insertion sequencing experiments
title_full Selection or drift: The population biology underlying transposon insertion sequencing experiments
title_fullStr Selection or drift: The population biology underlying transposon insertion sequencing experiments
title_full_unstemmed Selection or drift: The population biology underlying transposon insertion sequencing experiments
title_short Selection or drift: The population biology underlying transposon insertion sequencing experiments
title_sort selection or drift the population biology underlying transposon insertion sequencing experiments
topic Tn-seq
Transposon insertion sequencing
Population biology
Random birth-death process
Multinomial random sampling
Bottleneck
url http://www.sciencedirect.com/science/article/pii/S2001037019304398
work_keys_str_mv AT anelmahmutovic selectionordriftthepopulationbiologyunderlyingtransposoninsertionsequencingexperiments
AT piaabelzurwiesch selectionordriftthepopulationbiologyunderlyingtransposoninsertionsequencingexperiments
AT sorenabel selectionordriftthepopulationbiologyunderlyingtransposoninsertionsequencingexperiments