Dated ancestral trees from binary trait data and their application to the diversification of languages

Binary trait data record the presence or absence of distinguishing traits in individuals. We treat the problem of estimating ancestral trees with time depth from binary trait data. Simple analysis of such data is problematic. Each homology class of traits has a unique birth event on the tree, and th...

Cur síos iomlán

Sonraí bibleagrafaíochta
Príomhchruthaitheoirí: Nicholls, G, Gray, R
Formáid: Journal article
Teanga:English
Foilsithe / Cruthaithe: 2008
_version_ 1826260338313527296
author Nicholls, G
Gray, R
author_facet Nicholls, G
Gray, R
author_sort Nicholls, G
collection OXFORD
description Binary trait data record the presence or absence of distinguishing traits in individuals. We treat the problem of estimating ancestral trees with time depth from binary trait data. Simple analysis of such data is problematic. Each homology class of traits has a unique birth event on the tree, and the birth event of a trait that is visible at the leaves is biased towards the leaves. We propose a model-based analysis of such data and present a Markov chain Monte Carlo algorithm that can sample from the resulting posterior distribution. Our model is based on using a birth-death process for the evolution of the elements of sets of traits. Our analysis correctly accounts for the removal of singleton traits, which are commonly discarded in real data sets. We illustrate Bayesian inference for two binary trait data sets which arise in historical linguistics. The Bayesian approach allows for the incorporation of information from ancestral languages. The marginal prior distribution of the root time is uniform. We present a thorough analysis of the robustness of our results to model misspecification, through analysis of predictive distributions for external data, and fitting data that are simulated under alternative observation models. The reconstructed ages of tree nodes are relatively robust, whereas posterior probabilities for topology are not reliable. © 2008 Royal Statistical Society.
first_indexed 2024-03-06T19:04:02Z
format Journal article
id oxford-uuid:14861a84-c924-4061-bcf8-baa02856b5d7
institution University of Oxford
language English
last_indexed 2024-03-06T19:04:02Z
publishDate 2008
record_format dspace
spelling oxford-uuid:14861a84-c924-4061-bcf8-baa02856b5d72022-03-26T10:20:16ZDated ancestral trees from binary trait data and their application to the diversification of languagesJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:14861a84-c924-4061-bcf8-baa02856b5d7EnglishSymplectic Elements at Oxford2008Nicholls, GGray, RBinary trait data record the presence or absence of distinguishing traits in individuals. We treat the problem of estimating ancestral trees with time depth from binary trait data. Simple analysis of such data is problematic. Each homology class of traits has a unique birth event on the tree, and the birth event of a trait that is visible at the leaves is biased towards the leaves. We propose a model-based analysis of such data and present a Markov chain Monte Carlo algorithm that can sample from the resulting posterior distribution. Our model is based on using a birth-death process for the evolution of the elements of sets of traits. Our analysis correctly accounts for the removal of singleton traits, which are commonly discarded in real data sets. We illustrate Bayesian inference for two binary trait data sets which arise in historical linguistics. The Bayesian approach allows for the incorporation of information from ancestral languages. The marginal prior distribution of the root time is uniform. We present a thorough analysis of the robustness of our results to model misspecification, through analysis of predictive distributions for external data, and fitting data that are simulated under alternative observation models. The reconstructed ages of tree nodes are relatively robust, whereas posterior probabilities for topology are not reliable. © 2008 Royal Statistical Society.
spellingShingle Nicholls, G
Gray, R
Dated ancestral trees from binary trait data and their application to the diversification of languages
title Dated ancestral trees from binary trait data and their application to the diversification of languages
title_full Dated ancestral trees from binary trait data and their application to the diversification of languages
title_fullStr Dated ancestral trees from binary trait data and their application to the diversification of languages
title_full_unstemmed Dated ancestral trees from binary trait data and their application to the diversification of languages
title_short Dated ancestral trees from binary trait data and their application to the diversification of languages
title_sort dated ancestral trees from binary trait data and their application to the diversification of languages
work_keys_str_mv AT nichollsg datedancestraltreesfrombinarytraitdataandtheirapplicationtothediversificationoflanguages
AT grayr datedancestraltreesfrombinarytraitdataandtheirapplicationtothediversificationoflanguages