Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data

Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are l...

Full description

Bibliographic Details
Main Authors: Puttick, MN, O'Reilly, JE, Tanner, AR, Fleming, JF, Clark, J, Holloway, L, Lozano-Fernandez, J, Parry, LA, Tarver, JE, Pisani, D, Donoghue, PCJ
Format: Journal article
Language:English
Published: Royal Society 2017
_version_ 1797074699118706688
author Puttick, MN
O'Reilly, JE
Tanner, AR
Fleming, JF
Clark, J
Holloway, L
Lozano-Fernandez, J
Parry, LA
Tarver, JE
Pisani, D
Donoghue, PCJ
author_facet Puttick, MN
O'Reilly, JE
Tanner, AR
Fleming, JF
Clark, J
Holloway, L
Lozano-Fernandez, J
Parry, LA
Tarver, JE
Pisani, D
Donoghue, PCJ
author_sort Puttick, MN
collection OXFORD
description Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.
first_indexed 2024-03-06T23:39:55Z
format Journal article
id oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3
institution University of Oxford
language English
last_indexed 2024-03-06T23:39:55Z
publishDate 2017
publisher Royal Society
record_format dspace
spelling oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e32022-03-26T19:27:44ZUncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3EnglishSymplectic ElementsRoyal Society2017Puttick, MNO'Reilly, JETanner, ARFleming, JFClark, JHolloway, LLozano-Fernandez, JParry, LATarver, JEPisani, DDonoghue, PCJMorphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods.
spellingShingle Puttick, MN
O'Reilly, JE
Tanner, AR
Fleming, JF
Clark, J
Holloway, L
Lozano-Fernandez, J
Parry, LA
Tarver, JE
Pisani, D
Donoghue, PCJ
Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_full Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_fullStr Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_full_unstemmed Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_short Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
title_sort uncertain tree discriminating among competing approaches to the phylogenetic analysis of phenotype data
work_keys_str_mv AT puttickmn uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT oreillyje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT tannerar uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT flemingjf uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT clarkj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT hollowayl uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT lozanofernandezj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT parryla uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT tarverje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT pisanid uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata
AT donoghuepcj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata