Scaling laws and fluctuations in the statistics of word frequencies

In this paper, we combine statistical analysis of written texts and simple stochastic models to explain the appearance of scaling laws in the statistics of word frequencies. The average vocabulary of an ensemble of fixed-length texts is known to scale sublinearly with the total number of words (Heap...

Full description

Bibliographic Details
Main Authors: Martin Gerlach, Eduardo G Altmann
Format: Article
Language:English
Published: IOP Publishing 2014-01-01
Series:New Journal of Physics
Subjects:
Online Access:https://doi.org/10.1088/1367-2630/16/11/113010