Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data
Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are l...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
Royal Society
2017
|
_version_ | 1797074699118706688 |
---|---|
author | Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ |
author_facet | Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ |
author_sort | Puttick, MN |
collection | OXFORD |
description | Morphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods. |
first_indexed | 2024-03-06T23:39:55Z |
format | Journal article |
id | oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-06T23:39:55Z |
publishDate | 2017 |
publisher | Royal Society |
record_format | dspace |
spelling | oxford-uuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e32022-03-26T19:27:44ZUncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:6ef22fc9-ff89-4f7e-94c4-b4c013be15e3EnglishSymplectic ElementsRoyal Society2017Puttick, MNO'Reilly, JETanner, ARFleming, JFClark, JHolloway, LLozano-Fernandez, JParry, LATarver, JEPisani, DDonoghue, PCJMorphological data provide the only means of classifying the majority of life's history, but the choice between competing phylogenetic methods for the analysis of morphology is unclear. Traditionally, parsimony methods have been favoured but recent studies have shown that these approaches are less accurate than the Bayesian implementation of the Mk model. Here we expand on these findings in several ways: we assess the impact of tree shape and maximum-likelihood estimation using the Mk model, as well as analysing data composed of both binary and multistate characters. We find that all methods struggle to correctly resolve deep clades within asymmetric trees, and when analysing small character matrices. The Bayesian Mk model is the most accurate method for estimating topology, but with lower resolution than other methods. Equal weights parsimony is more accurate than implied weights parsimony, and maximum-likelihood estimation using the Mk model is the least accurate method. We conclude that the Bayesian implementation of the Mk model should be the default method for phylogenetic estimation from phenotype datasets, and we explore the implications of our simulations in reanalysing several empirical morphological character matrices. A consequence of our finding is that high levels of resolution or the ability to classify species or groups with much confidence should not be expected when using small datasets. It is now necessary to depart from the traditional parsimony paradigms of constructing character matrices, towards datasets constructed explicitly for Bayesian methods. |
spellingShingle | Puttick, MN O'Reilly, JE Tanner, AR Fleming, JF Clark, J Holloway, L Lozano-Fernandez, J Parry, LA Tarver, JE Pisani, D Donoghue, PCJ Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title | Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title_full | Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title_fullStr | Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title_full_unstemmed | Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title_short | Uncertain-tree: discriminating among competing approaches to the phylogenetic analysis of phenotype data |
title_sort | uncertain tree discriminating among competing approaches to the phylogenetic analysis of phenotype data |
work_keys_str_mv | AT puttickmn uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT oreillyje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT tannerar uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT flemingjf uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT clarkj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT hollowayl uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT lozanofernandezj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT parryla uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT tarverje uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT pisanid uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata AT donoghuepcj uncertaintreediscriminatingamongcompetingapproachestothephylogeneticanalysisofphenotypedata |