Zipf’s, Heaps’ and Taylor’s Laws are Determined by the Expansion into the Adjacent Possible

Francesca Tria, Vittorio Loreto, Vito Domenico Pietro Servedio
Zipf’s, Heaps’ and Taylor’s laws are ubiquitous in many different systems where innovation processes are at play. Together, they represent a compelling set of stylized facts regarding the overall statistics, the innovation rate and the scaling of fluctuations for systems as diverse as written texts and cities, ecological systems and stock markets. Many modeling schemes have been proposed in literature to explain those laws, but only recently a modeling framework has been introduced that… 

Taylor’s Law in Innovation Processes

It is suggested that Taylor’s law is a fundamental complement to Zipf’s and Heaps’ laws in unveiling the complex dynamical processes underlying the evolution of systems featuring innovation.

Twitter as an innovation process with damping effect

This work introduces a very general model whose key element is an update function that can be chosen to reproduce the behaviour of the empirical data, and the model performs well with respect to Heaps’ law.

Heaps’ Law and Heaps functions in tagged texts: evidences of their linguistic relevance

The relationship between vocabulary size and text length in a corpus of 75 literary works in English is studied, distinguishing between the contributions of three grammatical classes (or ‘tags,’ namely, nouns, verbs and others), and the progressive appearance of new words of each tag along each individual text is analyzed.

The Compressed Vocabulary of Microbial Life

The vocabularies of proteins (proteomes) and functions (functionomes) revealed a significant universal lexical core supporting a universal common ancestor, an ancestral evolutionary link between Bacteria and Eukarya, and distinct reductive evolutionary strategies of language compression in Archaea and Bacteria.

The exploration of the Adjacent Possible explains the emergence and evolution of social networks

This work uses a recent mathematical formalization of the adjacent possible space to propose a first-principles theory of social exploration based on simple microscopic rules defining how people get in touch and interact and predicts both microscopic and macroscopic features of social networks.

On the Dual Nature of Adoption Processes in Complex Networks

This work proposes to look at adoption processes in socio-technological systems as two complementary aspects of the same process: on the one hand, items spreading over a social network of individuals influencing each other, and on the other hand, individuals exploring a network of similarities among items to adopt.

Adjacent Possible

Adjacent, i.e., nearby, possibilities constantly emerge in a multitude of settings for a multitude of entities. When these possibilities are explored, yet new possibilities emerge. The concept of the

The Language of Innovation

The methods proposed in this paper provide a completely new way of understanding and forecasting innovation, by tackling it from a revealing perspective and opening interesting scenarios for a number of applications and further analytic approaches.

Interacting discovery processes on complex networks

A model in which many urns, representing different explorers, are coupled through the links of a social network and exploit opportunities coming from their contacts is proposed, showing that the pace of discovery of an explorer depends on its centrality in the social network.

A Standardized Project Gutenberg Corpus for Statistical Analysis of Natural Language and Quantitative Linguistics

The Standardized Project Gutenberg Corpus (SPGC), an open science approach to a curated version of the complete PG data containing more than 50,000 books and more than 3×10^9 word-tokens, is presented, providing a reproducible, pre-processed, full-size version of Project Gutenberg as a new scientific resource for corpus linguistics, natural language processing, and information retrieval.



Zipf's Law Leads to Heaps' Law: Analyzing Their Relation in Finite-Size Systems

The present analysis provides a clear picture of the relation between Zipf's law and Heaps' law without the help of any specific stochastic model, namely that Heaps' law is indeed a derivative phenomenon of Zipf's law.

There is More than a Power Law in Zipf

The largest cities, the most frequently used words, the income of the richest countries, and the most wealthy billionaires, can be all described in terms of Zipf’s Law, a rank-size rule capturing the

Zipf's Law everywhere

At the 100th anniversary of the birth of George Kingsley Zipf, one striking fact about the statistical regularity that bears his name, Zipf's law, is that it seems to appear everywhere. We may ask

Untangling Herdan's law and Heaps' law: Mathematical and informetric arguments

  • L. Egghe
  • Mathematics
    J. Assoc. Inf. Sci. Technol.
  • 2007
Heaps’ law and Herdan’s law are equivalent, apart from the less important constant k, which is not well-known in linguistics: Heaps is a specialist in information retrieval.

The dynamics of correlated novelties

A simple mathematical model is proposed that mimics the process of exploring a physical, biological, or conceptual space that enlarges whenever a novelty occurs and predicts statistical laws for the rate at which novelties happen and for the probability distribution on the space explored.

Zipf’s word frequency law in natural language: A critical review and future directions

It is shown that human language has a highly complex, reliable structure in the frequency distribution over and above Zipf’s law, although prior data visualization methods have obscured this fact.

Power laws, Pareto distributions and Zipf's law

When the probability of measuring a particular value of some quantity varies inversely as a power of that value, the quantity is said to follow a power law, also known variously as Zipf's law or the

Modeling Statistical Properties of Written Text

A generative model is introduced that explains the simultaneous emergence of the bursty nature of rare words and the topical organization of texts, with dynamic word ranking and memory across documents as key mechanisms explaining the nontrivial organization of written text.

Dynamics on Expanding Spaces: Modeling the Emergence of Novelties

Novelties are part of our daily lives. We constantly adopt new technologies, conceive new ideas, meet new people, and experiment with new situations. Occasionally, we as individual, in a complicated

Waves of novelties in the expansion into the adjacent possible

This paper quantifies the emergence of novelties and their rise and fall in popularity by empirically looking at different systems that display innovation at very different levels: the creation of hashtags on Twitter, the evolution of online code repositories, the creation of texts and the listening to songs on online platforms.