A Hierarchical Bayesian Language Model based on Pitman-Yor Processes

We propose a new hierarchical Bayesian n-gram model of natural languages. Our model makes use of a generalization of the commonly used Dirichlet distributions called Pitman-Yor processes which produce power-law distributions more closely resembling those in natural languages. We show that an approxi...

Fuld beskrivelse

Bibliografiske detaljer
Main Authors: Teh, Y, COLING
Format: Journal article
Sprog:English
Udgivet: 2006