Identifiability of a Coalescent-Based Population Tree Model

  title={Identifiability of a Coalescent-Based Population Tree Model},
  author={Arindam Roychoudhury},
  journal={Journal of Applied Probability},
  pages={921 - 929}
  • A. Roychoudhury
  • Published 12 April 2013
  • Computer Science, Mathematics
  • Journal of Applied Probability
Identifiability of evolutionary tree models has been a recent topic of discussion and some models have been shown to be nonidentifiable. A coalescent-based rooted population tree model, originally proposed by Nielsen et al. (1998), has been used by many authors in the last few years and is a simple tool to accurately model the changes in allele frequencies in the tree. However, the identifiability of this model has never been proven. Here we prove this model to be identifiable by showing that… 
4 Citations

Journal of Applied Probability Volume 51 (2014): Index

  • Mathematics
    Journal of Applied Probability
  • 2014
pages Albrecher, H., Boxma, O. J. and Ivanovs, J. On simple ruin expressions in dependent Sparre Andersen risk models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Evolutionary Bioinformatics

  • D. Forsdyke
  • Biology
    Springer International Publishing
  • 2016
This research presents a novel probabilistic method called “spot-time PCR” that allows for real-time analysis of the response of the immune system to natural disasters.

Approximate Likelihood Estimation of Divergence Time Range Using a Coalescent-based Model

An estimation of divergence time-range based on a coalescent model, taking into account the effect of incomplete lineage sorting, which is much faster and as accurate a simulation-based approach as well as less computationally intensive.



Identifiability of a Markovian model of molecular evolution with gamma-distributed rates

This is the first proof of identifiability of a phylogenetic model with a continuous distribution of rates, and one of the most widely used models (GTR + Γ) is identifiable for generic parameters, and for all parameter choices in the case of four-state (DNA) models.

Estimating species phylogenies using coalescence times among sequences.

It can be shown that the 2 methods are statistically consistent under the multispecies coalescent model, and it is suggested that STAR consistently outperforms STEAC, SC, and GLASS when the substitution rates among lineages are highly variable.


Abstract We develop a Monte Carlo–based likelihood method for estimating migration rates and population divergence times from data at unlinked loci at which mutation rates are sufficiently low that,


The maximum‐likelihood estimator provides a statistical framework for the analysis of population history given genetic data and is compared to a commonly applied estimator based on Wright's FST statistic.

Phylogenetic mixtures on a single tree can mimic a tree of another topology.

It is shown that the assumption that mixture model data on one topology can be distinguished from data evolved on an unmixed tree of another topology given enough data and the "correct" method can be false.

Composite likelihood-based inferences on genetic data from dependent loci

This work studies the local maxima of the composite likelihood (ECLE, the efficient composite likelihood estimators), which is straightforward to compute and establishes desirable properties of the ECLE and provides an estimator of the variance of MCLE and ECLE.

A Two-Stage Pruning Algorithm for Likelihood Computation for a Population Tree

A pruning algorithm for likelihood estimation of a tree of populations that utilizes the differences accumulated by random genetic drift in allele count data from single-nucleotide polymorphisms (SNPs), ignoring the effect of mutation after divergence from the common ancestral population.

Gene genealogy and variance of interpopulational nucleotide differences.

The time of divergence between the two most closely related genes can be used as an approximate estimate of the time of population splitting only when T identical to t/(2N) is small, where t and N are the number of generations and the effective population size, respectively.

Ascertainment correction for a population tree via a pruning algorithm for likelihood computation.

Reconstructing Trees When Sequence Sites Evolve at Variable Rates

This work shows that, given suitable restrictions on the rate distribution, the true tree is uniquely identified by its sequence spectrum, and exploits a novel theorem on the action of polynomials with non-negative coefficients on sequences.