Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data

Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are l...

Full description

Bibliographic Details
Main Authors:	Puttick, MN, O'Reilly, JE, Tanner, AR, Fleming, JF, Clark, J, Holloway, L, Lozano-Fernandez, J, Parry, LA, Tarver, JE, Pisani, D, Donoghue, PCJ
Format:	Journal article
Language:	English
Published:	Royal Society 2017

_version_	1797074699118706688
author	Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ
author_facet	Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ
author_sort	Puttick, MN
collection	OXFORD
description	Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.
first_indexed	2024-03-06T23:39:55Z
format	Journal article
id	oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3
institution	University of Oxford
language	English
last_indexed	2024-03-06T23:39:55Z
publishDate	2017
publisher	Royal Society
record_format	dspace
spelling	oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e32022-03-26T19:27:44ZUncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3EnglishSymplectic ElementsRoyal Society2017Puttick, MNO'Reilly, JETanner, ARFleming, JFClark, JHolloway, LLozano-Fernandez, JParry, LATarver, JEPisani, DDonoghue, PCJMorphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.
spellingShingle	Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title	Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_full	Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_fullStr	Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_full_unstemmed	Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_short	Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_sort	uncertain tree discriminating among competing approaches to the phylogenetic analysis of phenotype data
work_keys_str_mv	AT puttickmn uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT oreillyje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT tannerar uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT flemingjf uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT clarkj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT hollowayl uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT lozanofernandezj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT parryla uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT tarverje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT pisanid uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT donoghuepcj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata

Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data

Similar Items