EstimateS turns 20: statistical estimation of species richness and shared species from samples, with non‐parametric extrapolation

  title={EstimateS turns 20: statistical estimation of species richness and shared species from samples, with non‐parametric extrapolation},
  author={Robert K. Colwell and Robert K. Colwell and Johanna E. Elsensohn},
EstimateS offers statistical tools for analyzing and comparing the diversity and composition of species assemblages, based on sampling data. The latest version computes a wide range of biodiversity statistics for both sample-based and individual-based data, including analytical rarefaction and non-parametric extrapolation, estimators of asymptotic species richness, diversity indices, Hill numbers, and (for sample-based data) measures of compositional similarity among assemblages. In the first… 

Figures from this paper

Thirty years of progeny from Chao’s inequality: Estimating and comparing richness with incidence data and incomplete sampling

In the context of capture-recapture studies, Chao (1987) derived an inequality among capture frequency counts to obtain a lower bound for the size of a population based on individuals’

Tourist species bias estimates of extrapolated species density in dispersive taxa: a case study from a litter beetle assemblage in temperate woodland

Extrapolative nonparametric estimators of species density are commonly used in community ecology. However, they are dependent on either (1) their use on non-dispersive taxa, or (2) the ability to

Inventory statistics meet big data: complications for estimating numbers of species

The problem is documented via simulations of sampling from virtual biotas, its potential is illustrated using a large empirical dataset (bird records from Cape May, New Jersey, USA), and the circumstances under which these problems may be expected to emerge are outlined.

Variation in the diversity and richness of parasitoid wasps based on sampling effort

It is demonstrated that even with an intense sampling effort the community is incompletely sampled, and the use of only a few traps and for very short periods severely limits the estimates of richness.

β-Diversity partitioning of moth communities within and between different forest types

It is demonstrated that β-diversity partitioning can help to detect differences in magnitude and role of processes determining the composition of forest moth communities as in different forests the same pattern can be due to opposite processes, providing strong ecological insights into managing forest biodiversity.

Sampling biases shape our view of the natural world

Spatial patterns of biodiversity are inextricably linked to their collection methods, yet no synthesis of bias patterns or their consequences exists. As such, views of organismal distribution and the

Ant Species Diversity, Distribution, and Community Composition in Different Forest Types in Papua New Guinea

Previous studies have provided important scientific information on ant species richness and composition relating to the effects of elevation, sampling approaches, stratification, and forest

Parasite species richness and intensity of interspecific interactions increase with latitude in two wide-ranging hosts.

Counter to typical patterns, it is found that species richness, levels of parasitism, and intensity of intraguild predation increased with latitude, and this system may serve as a useful tool to provide insight into what processes drive diversity gradients in general.



Rarefaction and extrapolation with Hill numbers: a framework for sampling and estimation in species diversity studies

Quantifying and assessing changes in biological diversity are central aspects of many ecological studies, yet accurate methods of estimating biological diversity from sampling data have been elusive.

Quantifying biodiversity: procedures and pitfalls in the measurement and comparison of species richness

A series of common pitfalls in quantifying and comparing taxon richness are surveyed, including category‐subcategory ratios (species-to-genus and species-toindividual ratios) and rarefaction methods, which allow for meaningful standardization and comparison of datasets.

Models and estimators linking individual-based and sample-based rarefaction, extrapolation and comparison of assemblages

Aims In ecology and conservation biology, the number of species counted in a biodiversity study is a key metric but is usually a biased underestimate of total species richness because many rare

Estimating terrestrial biodiversity through extrapolation.

The importance of using 'reference' sites to assess the true richness and composition of species assemblages, to measure ecologically significant ratios between unrelated taxa, toMeasure taxon/sub-taxon (hierarchical) ratios, and to 'calibrate' standardized sampling methods is discussed.

A new statistical approach for assessing similarity of species composition with incidence and abundance data

This work provides a probabilistic derivation for the classic, incidence-based forms of Jaccard and Sorensen indices of compositional similarity and proposes estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence- or abundancebased sample data.

Estimating species richness: still a long way off!

Xu et al., in this issue of the Journal of Vegetation Science, compare several species richness estimators. All the non-parametric estimators, such as Chao and jackknife estimators, underestimated

Estimating the Richness of a Population When the Maximum Number of Classes Is Fixed: A Nonparametric Solution to an Archaeological Problem

A new method for estimating class richness: doubly-bounded confidence intervals (both lower and upper bounds are known) is introduced, allowing conclusions to be drawn that were not otherwise possible with singly-Bounded richness estimates, namely, that Lower Great Lakes Clovis Paleoindians utilized a settlement pattern that was probably more logistical in nature than residential.


In statistical ecology, the number of shared species is a standard measure of similarity between two communities. Assume that a multinomial sample is drawn from each of the two target communities.