A Hierarchical Bayesian Language Model based on Pitman-Yor Processes

We propose a new hierarchical Bayesian n-gram model of natural languages. Our model makes use of a generalization of the commonly used Dirichlet distributions called Pitman-Yor processes which produce power-law distributions more closely resembling those in natural languages. We show that an approxi...

Full description

Bibliographic Details
Main Authors: Teh, Y, COLING
Format: Journal article
Language:English
Published: 2006