An improved probability mapping approach to assess genome mosaicism

<p>Abstract</p> <p>Background</p> <p>Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon...

Full description

Bibliographic Details
Main Authors: Gogarten J Peter, Zhaxybayeva Olga
Format: Article
Language:English
Published: BMC 2003-09-01
Series:BMC Genomics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2164/4/37
_version_ 1818937019158495232
author Gogarten J Peter
Zhaxybayeva Olga
author_facet Gogarten J Peter
Zhaxybayeva Olga
author_sort Gogarten J Peter
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability.</p> <p>Results</p> <p>Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows to focus further analyses on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes.</p> <p>Conclusion</p> <p>Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events.</p>
first_indexed 2024-12-20T05:45:18Z
format Article
id doaj.art-2f2f75c9a5064fb4b4a281bf607e689f
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-20T05:45:18Z
publishDate 2003-09-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-2f2f75c9a5064fb4b4a281bf607e689f2022-12-21T19:51:19ZengBMCBMC Genomics1471-21642003-09-014137An improved probability mapping approach to assess genome mosaicismGogarten J PeterZhaxybayeva Olga<p>Abstract</p> <p>Background</p> <p>Maximum likelihood and posterior probability mapping are useful visualization techniques that are used to ascertain the mosaic nature of prokaryotic genomes. However, posterior probabilities, especially when calculated for four-taxon cases, tend to overestimate the support for tree topologies. Furthermore, because of poor taxon sampling four-taxon analyses suffer from sensitivity to the long branch attraction artifact. Here we extend the probability mapping approach by improving taxon sampling of the analyzed datasets, and by using bootstrap support values, a more conservative tool to assess reliability.</p> <p>Results</p> <p>Quartets of orthologous proteins were complemented with homologs from selected reference genomes. The mapping of bootstrap support values from these extended datasets gives results similar to the original maximum likelihood and posterior probability mapping. The more conservative nature of the plotted support values allows to focus further analyses on those protein families that strongly disagree with the majority or plurality of genes present in the analyzed genomes.</p> <p>Conclusion</p> <p>Posterior probability is a non-conservative measure for support, and posterior probability mapping only provides a quick estimation of phylogenetic information content of four genomes. This approach can be utilized as a pre-screen to select genes that might have been horizontally transferred. Better taxon sampling combined with subtree analyses prevents the inconsistencies associated with four-taxon analyses, but retains the power of visual representation. Nevertheless, a case-by-case inspection of individual multi-taxon phylogenies remains necessary to differentiate unrecognized paralogy and shared phylogenetic reconstruction artifacts from horizontal gene transfer events.</p>http://www.biomedcentral.com/1471-2164/4/37maximum likelihood mappinglong-branch attractionhorizontal gene transfertaxon samplingbootstrap support values mapping
spellingShingle Gogarten J Peter
Zhaxybayeva Olga
An improved probability mapping approach to assess genome mosaicism
BMC Genomics
maximum likelihood mapping
long-branch attraction
horizontal gene transfer
taxon sampling
bootstrap support values mapping
title An improved probability mapping approach to assess genome mosaicism
title_full An improved probability mapping approach to assess genome mosaicism
title_fullStr An improved probability mapping approach to assess genome mosaicism
title_full_unstemmed An improved probability mapping approach to assess genome mosaicism
title_short An improved probability mapping approach to assess genome mosaicism
title_sort improved probability mapping approach to assess genome mosaicism
topic maximum likelihood mapping
long-branch attraction
horizontal gene transfer
taxon sampling
bootstrap support values mapping
url http://www.biomedcentral.com/1471-2164/4/37
work_keys_str_mv AT gogartenjpeter animprovedprobabilitymappingapproachtoassessgenomemosaicism
AT zhaxybayevaolga animprovedprobabilitymappingapproachtoassessgenomemosaicism
AT gogartenjpeter improvedprobabilitymappingapproachtoassessgenomemosaicism
AT zhaxybayevaolga improvedprobabilitymappingapproachtoassessgenomemosaicism