Heaps' law

Known as: Heaps law, Herdan's law 
In linguistics, Heaps' law (also called Herdan's law) is an empirical law which describes the number of distinct words in a document (or set of…
Wikipedia
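As a rough illustration of the definition above: Heaps' law is commonly written as V(n) = K · n^β, where V(n) is the number of distinct words seen in the first n tokens and K and β are fitted constants. This power-law form is the standard statement of the law and is not spelled out in the truncated snippet. The sketch below counts vocabulary growth and estimates K and β by least squares on the log-log values. The function names and the synthetic Zipf-like token stream are assumptions made for the example, not taken from any of the papers listed here.

```python
import math
import random


def vocabulary_growth(tokens):
    """Return [(n, V(n))]: the distinct-token count after each of the first n tokens."""
    seen = set()
    growth = []
    for n, tok in enumerate(tokens, start=1):
        seen.add(tok)
        growth.append((n, len(seen)))
    return growth


def fit_heaps(growth):
    """Fit log V = log K + beta * log n by ordinary least squares; return (K, beta)."""
    xs = [math.log(n) for n, _ in growth]
    ys = [math.log(v) for _, v in growth]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    K = math.exp(mean_y - beta * mean_x)
    return K, beta


# Hypothetical data: tokens drawn from a Zipf-like distribution (weight 1/rank).
random.seed(0)
vocab = [f"w{i}" for i in range(1, 5001)]
weights = [1.0 / i for i in range(1, 5001)]
tokens = random.choices(vocab, weights=weights, k=20000)

growth = vocabulary_growth(tokens)
K, beta = fit_heaps(growth)
print(f"K = {K:.2f}, beta = {beta:.3f}")
```

On real text, `tokens` would come from a tokenizer applied to the corpus. The fitted values depend heavily on tokenization and corpus size, so the numbers from this synthetic run should not be read as typical.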

Topic mentions per year, 1977-2017

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
2016
This article is devoted to the verification of the empirical Heaps law in European languages using Google Books Ngram corpus data…
2016
Herdan-Heaps law and Lotka’s law are two important laws in linguistics and many other fields, which are often found to coexist in…
2014
Seeking information online can be an exercise in time wasted wading through repetitive, verbose text with little actual content…
2014
In this paper we combine statistical analysis of large text databases and simple stochastic models to explain the appearance of…
Review
2012
The relaxed Hilberg conjecture is a proposition about natural language which states that mutual information between two adjacent…
2008
Power-law distributions have been observed in a wide variety of areas. To our knowledge however, there has been no systematic…
2007
and this formula represents the formulation of Herdan’s law: The logarithm of vocabulary size divided by the logarithm of text…
2004
The number of features to be considered in a text classification system is given by the size of the vocabulary and this is…
2003
A speech recognition system targeting high inflective languages is described that combines the traditional trigram language model…
2001
We observed that the coefficients of two important empirical statistical laws of language – Zipf law and Heaps law – are…