A polynomial time algorithm for calculating the probability of a ranked gene tree given a species tree

@article{Stadler2011APT,
  title={A polynomial time algorithm for calculating the probability of a ranked gene tree given a species tree},
  author={Tanja Stadler and James H. Degnan},
  journal={Algorithms for Molecular Biology : AMB},
  year={2011},
  volume={7},
  pages={7--7}
}
Background: The ancestries of genes form gene trees which do not necessarily have the same topology as the species tree due to incomplete lineage sorting. Available algorithms determining the probability of a gene tree given a species tree require exponential computational runtime.

Results: In this paper, we provide a polynomial time algorithm to calculate the probability of a ranked gene tree topology for a given species tree, where a ranked tree topology is a tree topology with the internal…
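To make the ranked versus unranked distinction concrete, here is a minimal sketch that counts both kinds of rooted binary labeled topologies for a given number of leaves. It is not the paper's algorithm. It assumes the standard reading of "ranked" (internal nodes ordered in time, also called labeled histories), and the function names are hypothetical.

```python
from math import comb


def count_ranked_topologies(n_leaves: int) -> int:
    """Count ranked labeled topologies (labeled histories) on n_leaves leaves.

    Going back in time, each coalescence merges one of C(k, 2) pairs among
    the k remaining lineages, so the count is the product of C(k, 2) for
    k = 2..n_leaves.
    """
    if n_leaves < 1:
        raise ValueError("n_leaves must be at least 1")
    total = 1
    for k in range(2, n_leaves + 1):
        total *= comb(k, 2)
    return total


def count_unranked_topologies(n_leaves: int) -> int:
    """Count rooted binary labeled topologies, ignoring the order of nodes.

    This is the double factorial (2n - 3)!! = 3 * 5 * ... * (2n - 3).
    """
    if n_leaves < 1:
        raise ValueError("n_leaves must be at least 1")
    total = 1
    for odd in range(3, 2 * n_leaves - 2, 2):
        total *= odd
    return total


if __name__ == "__main__":
    for n in range(2, 7):
        print(n, count_unranked_topologies(n), count_ranked_topologies(n))
```

For four leaves this gives 15 unranked and 18 ranked topologies: each of the three balanced four-leaf topologies admits two orderings of its two cherries.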
A Characterization of the Set of Species Trees that Produce Anomalous Ranked Gene Trees
TLDR
This work fully characterizes the set of unranked species tree topologies that give rise to ARGTs, showing that this set contains all species tree topologies with five or more taxa, with the exceptions of caterpillars and pseudocaterpillars.
On the unranked topology of maximally probable ranked gene tree topologies.
TLDR
It is shown that although the set of maximally probable ranked gene tree topologies for a ranked species tree can contain ranked trees with different unranked topologies, at least one of these maximal ranked gene tree topologies must have the same unranked topology as the species tree.
A lattice structure for ancestral configurations arising from the relationship between gene trees and species trees
TLDR
This work shows that a specific set of paths on the digraph of ancestral configurations is in bijection with the set of labeled histories — a well-known phylogenetic object that enumerates possible temporal orderings of the coalescences of a tree.
Probabilities of unranked and ranked anomaly zones under birth-death models.
TLDR
This article studies how the parameters of a species tree simulated under a constant rate birth-death process can affect the probability that the species tree lies in the anomaly zone, and designs the software PRANC, which computes probabilities of ranked gene tree topologies given a species tree under the coalescent model.
On the Number of Ranked Species Trees Producing Anomalous Ranked Gene Trees
TLDR
This work studies enumerative and probabilistic properties of two classes of ranked labeled species trees, focusing on the presence or avoidance of certain subtree patterns associated with the production of ARGTs, and provides exact enumerations and asymptotic estimates for cardinalities.
The probability of monophyly of a sample of gene lineages on a species tree
TLDR
These analyses reveal new behavior, including the maintenance of nontrivial monophyly probabilities for gene lineage samples that span multiple species and even for lineages that do not derive from a monophyletic species group.
The probability of joint monophyly of all species in an arbitrary species tree
TLDR
The results expand the scope of joint monophyly calculations beyond small numbers of species, subsuming past formulas that have been used in simpler cases and enabling computation of relatively simple lower and upper bounds on the joint monophyly probability.
PRANC: ML species tree estimation from the ranked gene trees under coalescence
TLDR
PRANC computes the Probabilities of RANked gene tree topologies under the multispecies Coalescent and estimates the maximum likelihood tree with estimated branch lengths in coalescent units.

References

The probability distribution of ranked gene trees on a species tree.
Properties of consensus methods for inferring species trees from gene trees.
TLDR
The results show that for some combinations of species-tree branch lengths, increasing the number of independent loci can make the rooted majority-rule consensus tree more likely to be at least partially unresolved, and suggest a method for using multiple loci to infer the species-tree topology, even when it is discordant with the most likely gene tree.
COALESCENT‐BASED SPECIES TREE INFERENCE FROM GENE TREE TOPOLOGIES UNDER INCOMPLETE LINEAGE SORTING BY MAXIMUM LIKELIHOOD
  • Y. Wu
  • Computer Science
    Evolution; international journal of organic evolution
  • 2012
TLDR
An improved method for computing the probability of a gene tree topology given a species tree, which is much faster than an existing algorithm by Degnan and Salter (2005) and is implemented in a program that is downloadable from the author’s web page.
The probability of topological concordance of gene trees and species trees.
TLDR
The concordance of gene trees and species trees is reconsidered in detail, allowing for samples of arbitrary size to be taken from the species, and the term speciodendric is introduced to refer to genes whose trees are topologically concordant with species trees.
Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions.
TLDR
A Bayesian hierarchical model to estimate the phylogeny of a group of species using multiple estimated gene tree distributions, such as those that arise in a Bayesian analysis of DNA sequence data, is proposed and applied to two multilocus data sets of DNA sequences.
Relationships between gene trees and species trees.
It is well known that a phylogenetic tree (gene tree) constructed from DNA sequences for a genetic locus does not necessarily agree with the tree that represents the actual evolutionary pathway of
GENE TREE DISTRIBUTIONS UNDER THE COALESCENT PROCESS
  • J. Degnan, L. Salter
  • Computer Science, Environmental Science
    Evolution; international journal of organic evolution
  • 2005
TLDR
A method for computing the distribution of gene tree topologies given a bifurcating species tree for trees with an arbitrary number of taxa in the case that there is one gene sampled per species is derived.
Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent
TLDR
It is shown that when there are four species with one gene sampled per species, the distribution of unrooted gene tree topologies identifies the unrooted species tree topology and some, but not all, information in the species tree edges (branch lengths).