A Hierarchical Bayesian Language Model based on Pitman-Yor Processes

We propose a new hierarchical Bayesian n-gram model of natural languages. Our model makes use of a generalization of the commonly used Dirichlet distributions called Pitman-Yor processes, which produce power-law distributions more closely resembling those in natural languages. We show that an approxi...

Bibliographic details
Main authors: Teh, Y, COLING
Format: Journal article
Language: English
Published in: 2006