Species sampling models: consistency for the number of species

@article{Bissiri2013SpeciesSM,
  title={Species sampling models: consistency for the number of species},
  author={Pier Giovanni Bissiri and Andrea Ongaro and Stephen G. Walker},
  journal={Biometrika},
  year={2013},
  volume={100},
  pages={771-777}
}
This paper considers species sampling models using constructions that arise from Bayesian nonparametric prior distributions. A discrete random measure, used to generate a species sampling model, can have either a countable infinite number of atoms, which has been the emphasis in the recent literature, or a finite number of atoms K, while allowing K to be assigned a prior probability distribution on the positive integers. It is the latter class of model we consider here, due to the… 
On the topological support of species sampling priors
: In Bayesian nonparametric statistics, it is crucial that the sup- port of the prior is very large. Here, we consider species sampling priors. Such priors are widely used within mixture models and
Generalized Ewens–Pitman model for Bayesian clustering
TLDR
A Bayesian method for clustering from discrete data structures that commonly arise in genetics and other applications is proposed, which is equivariant with respect to relabelling units and the posterior predictive distribution enables supervised learning based on a partial clustering of the sample.
Generalized Ewens–Pitman model for Bayesian clustering
TLDR
A Bayesian method for clustering from discrete data structures that commonly arise in genetics and other applications is proposed, which is equivariant with respect to relabelling units and the posterior predictive distribution enables supervised learning based on a partial clustering of the sample.
Stick-breaking processes with exchangeable length variables
We investigate the general class of stick-breaking processes with exchangeable length variables. These generalize well-known Bayesian non-parametric priors in an unexplored direction. We give
A Comparative overview of some recent Bayesian nonparametric approaches for the size of a population Una rassegna comparativa su alcuni recenti approcci nonparametrici bayesiani per la stima della numerosità di una popolazione
We review some recent approaches that have been used to address the difficult problem of estimating the unknown size of a finite population. We start from illustrating what types of inferential
Bayesian Spatial Nonparametric Models for Confounding Manifest Variables with an Application to China Earthquake Data
  • Yingzi Fu, Dexin Ren
  • Mathematics
    2017 13th International Conference on Computational Intelligence and Security (CIS)
  • 2017
TLDR
The model is built on a class of Gaussian Conditional Autoregressive models, in combination with dependent sampling models (SSM) as well as probit stick-breaking process prior for accounting for complex interactions and high correlations of data.
Statistical Inference From Stem Cell Barcoding Data Using Adaptive Approximate Bayesian Computation
TLDR
A truncated population approximate Bayesian computation (ABC) algorithm which is derived from sequential Monte Carlo ABC (SMC-ABC) and applied the method to the symmetric Dirichlet-multinomial model proposed by Zhang et al. (2005) suggests that flexibility of the asymmetric DirICHlet- Multinomial helps to obtain insight into heterogeneity of proliferating cell systems such as HSC.

References

SHOWING 1-10 OF 28 REFERENCES
Bayesian Nonparametric Estimation of the Probability of Discovering New Species
We consider the problem of evaluating the probability of discovering a certain number of new species in a new sample of population units, conditional on the number of species recorded in a basic
Estimating the Number of Species in a Stochastic Abundance Model
TLDR
Simulation studies show that this estimator compares well with maximum likelihood estimators (i.e., empirical Bayes estimators from the Bayesian viewpoint) for which an iterative numerical procedure is needed and may be infeasible.
Investigation of a generalized multinomial model for species data
The question of how to estimate the number of specie in a region given the species frequency distribution for a sample of animals from the region has been of interest for more than 60 years. Data
Posterior Moments of the Number of Species in a Finite Population and the Posterior Probability of Finding a New Species
Abstract By using a model of Hill for sampling from a finite population, the posterior moments of the number of species in the population and the posterior probability of finding a new species are
Bayesian non‐parametric inference for species variety with a two‐parameter Poisson–Dirichlet process prior
Summary.  A Bayesian non‐parametric methodology has been recently proposed to deal with the issue of prediction within species sampling problems. Such problems concern the evaluation, conditional on
Estimating species richness by a Poisson-compound gamma model.
We propose a Poisson-compound gamma approach for species richness estimation. Based on the denseness and nesting properties of the gamma mixture, we fix the shape parameter of each gamma component at
A multinomial Bayesian approach to the estimation of population and vocabulary size
We approach estimation of the size of a population or a vocabulary through a Bayesian analysis of the multinomial distribution. We view the sample as being generated from such a distribution with an
A Penalized Nonparametric Maximum Likelihood Approach to Species Richness Estimation
We propose a class of penalized nonparametric maximum likelihood estimators (NPMLEs) for the species richness problem. We use a penalty term on the likelihood because likelihood estimators that lack
Estimating the Number of Classes via Sample Coverage
TLDR
This work generalizes the result of Esty to a nonparametric approach and extends Darroch and Ratcliff to incorporate the heterogeneity of the class probabilities to play an important role in the recommended estimation procedures.
Estimating the Number of Unseen Species: How Many Words did Shakespeare Know?
This paper is the first of two written by Brad Efron and Ron Thisted studying the frequency distribution of words in the Shakespearean canon. The key idea due to Fisher in the context of sampling of
...
...