# Species sampling models: consistency for the number of species

@article{Bissiri2013SpeciesSM, title={Species sampling models: consistency for the number of species}, author={Pier Giovanni Bissiri and Andrea Ongaro and Stephen G. Walker}, journal={Biometrika}, year={2013}, volume={100}, pages={771-777} }

This paper considers species sampling models using constructions that arise from Bayesian nonparametric prior distributions. A discrete random measure, used to generate a species sampling model, can have either a countably infinite number of atoms, which has been the emphasis in the recent literature, or a finite number of atoms K, while allowing K to be assigned a prior probability distribution on the positive integers. It is the latter class of models we consider here, due to the…
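The finite-atom construction described in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: the shifted-Poisson prior on K, the symmetric Dirichlet weights, and the names `poisson`, `sample_species`, `k_prior_mean` and `alpha` are all assumptions chosen only to make the idea concrete.

```python
import math
import random


def poisson(rng, lam):
    """Draw a Poisson(lam) variate with Knuth's method (adequate for small lam)."""
    threshold = math.exp(-lam)
    count, prod = 0, rng.random()
    while prod > threshold:
        count += 1
        prod *= rng.random()
    return count


def sample_species(n, rng, k_prior_mean=5.0, alpha=1.0):
    """Draw n species labels from a random measure with a finite number of atoms K.

    Assumptions (illustrative, not the paper's choices):
      - K = 1 + Poisson(k_prior_mean - 1), so K lives on the positive integers;
      - atom weights follow a symmetric Dirichlet(alpha) distribution.
    """
    k = 1 + poisson(rng, k_prior_mean - 1.0)
    # A symmetric Dirichlet draw is a vector of normalized Gamma(alpha, 1) variates.
    gammas = [rng.gammavariate(alpha, 1.0) for _ in range(k)]
    total = sum(gammas)
    weights = [g / total for g in gammas]
    # Each observation picks one of the K atoms (species) according to its weight.
    labels = rng.choices(range(k), weights=weights, k=n)
    return k, labels


rng = random.Random(0)
k, labels = sample_species(200, rng)
observed = len(set(labels))
print(f"true K = {k}, distinct species observed = {observed}")
```

The number of distinct species observed in a sample can never exceed K, so under this kind of model the observed count is a lower bound on the number of species.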

## 7 Citations

On the topological support of species sampling priors

- Mathematics
- 2014

In Bayesian nonparametric statistics, it is crucial that the support of the prior is very large. Here, we consider species sampling priors. Such priors are widely used within mixture models and…

Generalized Ewens–Pitman model for Bayesian clustering

- Computer Science
- 2014

A Bayesian method is proposed for clustering from discrete data structures that commonly arise in genetics and other applications. The method is equivariant with respect to relabelling units, and its posterior predictive distribution enables supervised learning based on a partial clustering of the sample.

Generalized Ewens–Pitman model for Bayesian clustering

- Computer Science
- 2015

A Bayesian method is proposed for clustering from discrete data structures that commonly arise in genetics and other applications. The method is equivariant with respect to relabelling units, and its posterior predictive distribution enables supervised learning based on a partial clustering of the sample.

Stick-breaking processes with exchangeable length variables

- Mathematics
- 2020

We investigate the general class of stick-breaking processes with exchangeable length variables. These generalize well-known Bayesian non-parametric priors in an unexplored direction. We give…

A Comparative overview of some recent Bayesian nonparametric approaches for the size of a population

- 2018

We review some recent approaches that have been used to address the difficult problem of estimating the unknown size of a finite population. We start from illustrating what types of inferential…

Bayesian Spatial Nonparametric Models for Confounding Manifest Variables with an Application to China Earthquake Data

- Mathematics
- 2017 13th International Conference on Computational Intelligence and Security (CIS)
- 2017

The model is built on a class of Gaussian Conditional Autoregressive models, in combination with dependent species sampling models (SSM) and a probit stick-breaking process prior, to account for complex interactions and high correlations in the data.

Statistical Inference From Stem Cell Barcoding Data Using Adaptive Approximate Bayesian Computation

- Computer Science
- 2021

A truncated population approximate Bayesian computation (ABC) algorithm, derived from sequential Monte Carlo ABC (SMC-ABC), is applied to the symmetric Dirichlet-multinomial model proposed by Zhang et al. (2005). The results suggest that the flexibility of the asymmetric Dirichlet-multinomial helps to obtain insight into the heterogeneity of proliferating cell systems such as HSC.

## References

Bayesian Nonparametric Estimation of the Probability of Discovering New Species

- Mathematics
- 2007

We consider the problem of evaluating the probability of discovering a certain number of new species in a new sample of population units, conditional on the number of species recorded in a basic…

Estimating the Number of Species in a Stochastic Abundance Model

- Mathematics
- Biometrics
- 2002

Simulation studies show that this estimator compares well with maximum likelihood estimators (i.e., empirical Bayes estimators from the Bayesian viewpoint) for which an iterative numerical procedure is needed and may be infeasible.

Investigation of a generalized multinomial model for species data

- Mathematics
- 2005

The question of how to estimate the number of species in a region given the species frequency distribution for a sample of animals from the region has been of interest for more than 60 years. Data…

Posterior Moments of the Number of Species in a Finite Population and the Posterior Probability of Finding a New Species

- Mathematics
- 1979

By using a model of Hill for sampling from a finite population, the posterior moments of the number of species in the population and the posterior probability of finding a new species are…

Bayesian non‐parametric inference for species variety with a two‐parameter Poisson–Dirichlet process prior

- Mathematics
- 2009

A Bayesian non‐parametric methodology has been recently proposed to deal with the issue of prediction within species sampling problems. Such problems concern the evaluation, conditional on…

Estimating species richness by a Poisson-compound gamma model.

- Mathematics
- Biometrika
- 2010

We propose a Poisson-compound gamma approach for species richness estimation. Based on the denseness and nesting properties of the gamma mixture, we fix the shape parameter of each gamma component at…

A multinomial Bayesian approach to the estimation of population and vocabulary size

- Mathematics
- 1987

We approach estimation of the size of a population or a vocabulary through a Bayesian analysis of the multinomial distribution. We view the sample as being generated from such a distribution with an…

A Penalized Nonparametric Maximum Likelihood Approach to Species Richness Estimation

- Mathematics
- 2005

We propose a class of penalized nonparametric maximum likelihood estimators (NPMLEs) for the species richness problem. We use a penalty term on the likelihood because likelihood estimators that lack…

Estimating the Number of Classes via Sample Coverage

- Mathematics, Computer Science
- 1992

This work generalizes the result of Esty to a nonparametric approach and extends Darroch and Ratcliff to incorporate the heterogeneity of the class probabilities, which plays an important role in the recommended estimation procedures.

Estimating the Number of Unseen Species: How Many Words did Shakespeare Know?

- Linguistics
- 2008

This paper is the first of two written by Brad Efron and Ron Thisted studying the frequency distribution of words in the Shakespearean canon. The key idea due to Fisher in the context of sampling of…