Predicting the unpredictable

  title={Predicting the unpredictable},
  author={Sandy L. Zabell},
  • S. Zabell
  • Published 1 February 1992
  • Mathematics
  • Synthese
A major difficulty for currently existing theories of inductive inference involves the question of what to do when novel, unknown, or previously unsuspected phenomena occur. In this paper one particular instance of this difficulty is considered, the so-called sampling of species problem.The classical probabilistic theories of inductive inference due to Laplace, Johnson, de Finetti, and Carnap adopt a model of simple enumerative induction in which there are a prespecified number of types or… 

Inductive Inference and Partition Exchangeability in Classification

It is proven that, in contrast to classifiers based on de Finetti type exchangeability which can optimally handle test items independently of each other in the presence of infinite amounts of training data, a classifier based on partition exchangeability still continues to benefit from a joint prediction of labels for the whole population of test items.

An Alternative Prior Process for Nonparametric Bayesian Clustering

This paper investigates the predictive probabilities that underlie the Dirichlet and Pitman-Yor processes, and the implicit "rich-get-richer" characteristic of the resulting partitions, and explores an alternative prior for nonparametric Bayesian clustering|the uniform process|for applications where the rich- get-ricer property is undesirable.

The two-parameter Ewens distribution: a finitary approach

The well-known Ewens Sampling Formula has been generalized by Pitman recently. We derive some essential feature of the model without introducing notions like frequency spectrum, structure

A Finitary Characterization of the Ewens Sampling Formula

The clustering of agents in the market is a typical problem dealt with in recent approaches to macroeconomic modeling, describing macroscopic variables in terms of the behavior of a large collection

Models with Random Exchangeable Structures, and Coexistence of Several Types Agents in the Long-Run: New Implementations of Schumpeter’s Dynamics

We propose a procedure for building macroeconomic models of a large number of interacting agents by dividing the collection of agents by ”types” into clusters or subsets. These clusters form random

Twitter as an innovation process with damping effect

This work introduces an extremely general model, whose key element is the update function, that can be suitably chosen in order to reproduce the behaviour exhibited by the empirical data, and shows great performances with respect to Heaps’ law.

Process Capability Analysis

  • Yohannes Yebabe Tesfay
  • Computer Science
    Developing Structured Procedural and Methodological Engineering Designs
  • 2021



Posterior Distribution of Percentiles: Bayes' Theorem for Sampling From a Population

A Bayesian approach to inference about the percentiles and other characteristics of a finite population is proposed. The approach does not depend upon, though it need not exclude, the use of

Posterior Moments of the Number of Species in a Finite Population and the Posterior Probability of Finding a New Species

Abstract By using a model of Hill for sampling from a finite population, the posterior moments of the number of species in the population and the posterior probability of finding a new species are


A sample of size N is drawn at random from a population of animals of various species. Methods are given for estimating, knowing only the contents of this sample, the number of species which will be

The rule of succession

Zipf's Law and Prior Distributions for the Composition of a Population

Abstract The limiting distribution of frequencies of frequencies as the population size becomes large is obtained under Bose-Einstein and Maxwell-Boltzmann forms of the classical occupancy problem,