Similarity indices, sample size and diversity

  title={Similarity indices, sample size and diversity},
  author={Henk Wolda},
  • H. Wolda
  • Published 1 September 1981
  • Environmental Science
  • Oecologia
SummaryThe effect of sample size and species diversity on a variety of similarity indices is explored. Real values of a similarity index must be evaluated relative to the expected maximum value of that index, which is the value obtained for samples randomly drawn from the same universe, with the diversity and sample sizes of the real samples. It is shown that these expected maxima differ from the theoretical maxima, the values obtained for two identical samples, and that the relationship… 

Accounting for differences in species frequency distributions when calculating beta diversity in the fossil record

Beta diversity is a measure of the taxonomic differentiation between habitats/localities within an assemblage, and is normally calculated as a set of pairwise taxonomic “distances” between the

Reliable estimates of beta diversity with incomplete sampling.

This work assesses the correlation between complete community compositional data and reduced subsets of a varying number of dominant species to find that gross beta diversity is usually depicted accurately when only the 80th percentile or five of the most abundant species of each site is considered.

Entropy and diversity

The standard similarity measure based on untransformed indices is shown to give misleading results, but transforming the indices or entropies to effective numbers of species produces a stable, easily interpreted, sensitive general similarity measure.

Undersampling and the measurement of beta diversity

Beta diversity is a conceptual link between diversity at local and regional scales. Various additional methodologies of quantifying this and related phenomena have been applied. Among them, measures

Abundance‐Based Similarity Indices and Their Estimation When There Are Unseen Species in Samples

This work provides a new probabilistic derivation for any incidence-based index that is symmetric and homogeneous and proposes estimators that adjust for the effect of unseen shared species on the authors' abundance-based indices.


Obtaining an adequate, representative sample of ecological communities to make taxon richness (TR) or compositional comparisons among sites is a continuing challenge. Although randomization in the

Comparative Analysis of Diversity and Similarity Indices with Special Relevance to Vegetations around Sewage Drains

Indices summarizing community structure are used to evaluate fundamental community ecology, species interaction, biogeographical factors, and environmental stress. Some of these indices are

Reliable sample sizes for estimating similarity among macroinvertebrate assemblages in tropical streams

Studies in tropical streams are relatively few, and one of the still-unresolved methodological is- sues is sample size. Adequate sample size for temperate streams cannot be extrapolated for tropical

A new twist on a very old binary similarity coefficient.

  • J. Alroy
  • Environmental Science
  • 2015
The corrected coefficient indicates that local assemblages of North American mammals are random subsamples of larger species pools by returning similarity of values of around 1, and it suggests a more consistent relationship between biome- scale comparisons and local-scale comparisons.

A new statistical approach for assessing similarity of species composition with incidence and abundance data

This work provides a probabilistic derivation for the classic, incidence-based forms of Jaccard and Sorensen indices of compositional similarity and proposes estimators for these indices that include the effect of unseen shared species, based on either (replicated) incidence- or abundancebased sample data.

The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal Population

Part 1. It is shown that in a large collection of Lepidoptera captured in Malaya the frequency of the number of species represented by different numbers of individuals fitted somewhat closely to a

Kendall's “Tau” Coefficient as an Index of Similarity in Comparisons of Plant or Animal Communities

  • A. Ghent
  • Mathematics
    The Canadian Entomologist
  • 1963
Abstract Comparisons of ecologic communities are often limited to presentations of frequency lists in tabular or bar-graph form. Kendall's “Tau” coefficient is appropriate as a measure of rank

Similarity of Binary Data

There is just one type of similarity which is really useful both in Q and R analysis and in the whole of biology as well as in the humanities, and this new coefficient is proposed to obviate this inconvenience.

Evaluation of different similarity indices as measures of succession in arthropod communities of the forest floor after clear-cutting

  • V. Huhta
  • Environmental Science
  • 2004
Communities of spiders and beetles living in the soil and litter of clear-cut areas were compared with those of intact forest stands and showed that succession in the spider community was divergent for at least 7 years after felling.

A Graphic Computation Procedure for Kendall's Tau Suited to Extensive Species-Density Comparisons

A rapid computation for Kendall's tau is presented in the context of a species-density comparison involving 53 bird species (with extensive tied frequencies) in two spruce-fir communities. Paired

Spatial variation in the timing of the seasonal occurrence in coprophagous beetles.

It is concluded that, due to spatial variation in the species composition, intraspecific diversity in the seasonal occurrence of different dung-inhabiting beetles is increased, and this will affect the spatio-temporal structure of populations.


Various numerical coefficients have been employed in comparisons of taxa or bioassociational units, especially in studies involving large arrays of multivariate data. Nomenclatural and conceptual

Measurement of "Overlap" in Comparative Ecological Studies

  • H. S. Horn
  • Environmental Science
    The American Naturalist
  • 1966
Objective, empirical measures of overlap between samples of items distributed proportionally into various qualitative categories derived from either probability or information theory should prove useful to the ecologist in comparative studies of diet, habitat preference, seasonal patterns of abundance, faunal lists, or similar data.

An Ordination of the Upland Forest Communities of Southern Wisconsin

It is shown that nature of unit variation is a naajor problenl in systematies, and that whether this variation is diserete, continuous, or in some other form, there is a need for appliGation of (uantitative and statistical methods.

An Introduction to Numerical Classification

Interestingly, introduction to numerical classification that you really wait for now is coming. It's significant to wait for the representative and beneficial books to read. Every book that is