Randomness versus specifics for word-frequency distributions

Xiao-Yong Yan and P. Minnhagen
Physica A: Statistical Mechanics and its Applications
The text-length dependence of real word-frequency distributions can be connected to the general properties of a random book. This finding has strong implications for deciding between two conceptually different views on word-frequency distributions: the specific "Zipf's view" and the non-specific "Randomness view". It is also noted that the text-length transformation of a random book has an exact scaling property precisely for the power-law…
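
The idea of relating a text's word-frequency distribution to that of a shorter random book can be illustrated with a minimal sketch. It assumes that the text-length transformation amounts to keeping a uniformly random subset of the word tokens, which is an interpretation of the abstract and not necessarily the authors' exact procedure. The function names `frequency_distribution` and `shrink_random_book` and the toy sentence are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def frequency_distribution(tokens):
    """Map k -> number of distinct words that occur exactly k times."""
    word_counts = Counter(tokens)                # word -> occurrences
    return dict(Counter(word_counts.values()))   # occurrences -> number of words


def shrink_random_book(tokens, new_length, seed=0):
    """Assumed transformation: keep a uniformly random subset of tokens."""
    rng = random.Random(seed)                    # fixed seed for reproducibility
    return rng.sample(tokens, new_length)


# Toy "book" of 13 word tokens (illustrative data only).
tokens = "the cat sat on the mat and the dog sat on the log".split()

full = frequency_distribution(tokens)
half = frequency_distribution(shrink_random_book(tokens, len(tokens) // 2))

print("full text:", full)
print("half text:", half)
```

Comparing `full` and `half` for a real text at several lengths shows how the shape of the distribution shifts as the text is shortened, which is the kind of text-length dependence the abstract refers to.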
10 Citations

Dependence of exponents on text length versus finite-size scaling for word-frequency distributions.
Two Universality Properties Associated with the Monkey Model of Zipf's Law
The likely determines the unlikely
The Dependence of Frequency Distributions on Multiple Meanings of Words, Codes and Signs
Benford's Law and First Letter of Word
Assessing sustainability performance of high-tech firms through a hybrid approach


Random texts exhibit Zipf's-law-like word frequency distribution
  • Wentian Li
  • Mathematics, Computer Science
  • IEEE Trans. Inf. Theory
  • 1992
Scaling Laws in Human Language
A Paradoxical Property of the Monkey Book
Least effort and the origins of scaling in human language
  • R. F. Cancho, R. Solé
  • Computer Science, Medicine
  • Proceedings of the National Academy of Sciences of the United States of America
  • 2003
Dynamics of Text Generation with Realistic Zipf's Distribution
Language as an evolving word web
  • S. N. Dorogovtsev, J. Mendes
  • Computer Science, Medicine
  • Proceedings of the Royal Society of London. Series B: Biological Sciences
  • 2001