Phylogenetic Stochastic Mapping Without Matrix Exponentiation

@article{Irvahn2014PhylogeneticSM,
  title={Phylogenetic Stochastic Mapping Without Matrix Exponentiation},
  author={Jan Irvahn and Vladimir N. Minin},
  journal={Journal of computational biology : a journal of computational molecular cell biology},
  year={2014},
  volume={21},
  number={9},
  pages={676-90}
}
  • Jan Irvahn, V. Minin
  • Published 20 March 2014
  • Computer Science
  • Journal of computational biology : a journal of computational molecular cell biology
Phylogenetic stochastic mapping is a method for reconstructing the history of trait changes on a phylogenetic tree relating the species/organisms carrying the trait. State-of-the-art methods assume that the trait evolves according to a continuous-time Markov chain (CTMC) and work well for small state spaces. The computations slow down considerably for larger state spaces (e.g., space of codons), because current methodology relies on exponentiating CTMC infinitesimal rate matrices, an operation whose… 
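As a rough illustration of the operation the abstract refers to, the transition probabilities of a CTMC with rate matrix Q over a branch of length t are P(t) = exp(Qt). The sketch below approximates that matrix with a truncated uniformization series in pure Python. The function name `transition_probs`, the tolerance, and the two-state example rates are assumptions made for illustration only; this is not presented as the algorithm of the paper.

```python
import math

def transition_probs(Q, t, tol=1e-12, max_terms=10_000):
    """Approximate P(t) = exp(Q t) for a CTMC rate matrix Q (list of rows).

    Uses the uniformization series  P(t) = sum_k Poisson(k; mu*t) * B^k,
    where B = I + Q/mu and mu is at least the largest exit rate.
    Hypothetical helper written for this illustration.
    """
    n = len(Q)
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    mu = max(-Q[i][i] for i in range(n))  # largest exit rate
    if mu == 0.0 or t == 0.0:
        return identity
    # B is a stochastic matrix: rows are non-negative and sum to one.
    B = [[identity[i][j] + Q[i][j] / mu for j in range(n)] for i in range(n)]
    power = identity                      # B^0
    weight = math.exp(-mu * t)            # Poisson probability of 0 events
    total = weight                        # accumulated Poisson mass
    P = [[weight * power[i][j] for j in range(n)] for i in range(n)]
    for k in range(1, max_terms + 1):
        if 1.0 - total <= tol:            # remaining Poisson mass is negligible
            break
        # power <- power @ B, costing O(n^3) per term for a dense matrix
        power = [[sum(power[i][m] * B[m][j] for m in range(n))
                  for j in range(n)] for i in range(n)]
        weight *= mu * t / k              # Poisson(k) from Poisson(k - 1)
        total += weight
        for i in range(n):
            for j in range(n):
                P[i][j] += weight * power[i][j]
    return P

# Two-state example with assumed rates 0 -> 1 at 1.0 and 1 -> 0 at 0.5.
Q_example = [[-1.0, 1.0],
             [0.5, -0.5]]
P_example = transition_probs(Q_example, t=2.0)
```

Each term of the series needs a dense matrix product, so the cost grows quickly with the number of states, which is consistent with the slowdown the abstract describes for codon-sized state spaces.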
Bayesian analysis of continuous time Markov chains with applications to phylogenetics
TLDR
It is shown that an important challenge with Bayesian generalized linear modelling of Continuous Time Markov Chains (CTMCs) is that classical Markov Chain Monte Carlo techniques are too ineffective to be practical in that setup and two computational methods are proposed to address this issue.
Bayesian Analysis of Continuous Time Markov Chains with Application to Phylogenetic Modelling
TLDR
It is shown that an important challenge with Bayesian generalized linear modelling of continuous time Markov chains is that classical Markov chain Monte Carlo techniques are too ineffective to be practical in that setup, and this issue is addressed with an auxiliary variable construction combined with an adaptive Hamiltonian Monte Carlo algorithm.
Phylogenetic Models Linking Speciation and Extinction to Chromosome and Mating System Evolution
TLDR
This dissertation aims to advance statistical and computational approaches that model the timing and nature of these transitions over evolutionary trees by introducing a novel metric, the missing sequence decisiveness score (MSDS), and demonstrating a new method of stochastic character mapping for state-dependent speciation and extinction models.
Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST
TLDR
This work proposes incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process at individual sites to vary across lineages, and implements a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, to mitigate the increased computational demands associated with MMMs.
Inferring Long-Term Effective Population Size with Mutation–Selection Models
TLDR
The reconstructed history of Ne in these groups appears to correlate with LHTs or ecological variables in a way that suggests that the reconstruction is reasonable, at least in its global trends, and some of the assumptions of the model, in particular concerning the assumed absence of epistatic interactions between sites, are potentially problematic.
Inferring long-term effective population size with Mutation-Selection models
TLDR
The reconstructed history of Ne in these groups appears to correlate with LHTs or ecological variables in a way that suggests that the reconstruction is reasonable, at least in its global trends, and some of the assumptions of the model, in particular concerning the assumed absence of epistatic interactions between sites, are potentially problematic.
Stochastic Character Mapping of State‐Dependent Diversification Reveals the Tempo of Evolutionary Decline in Self‐Compatible Onagraceae Lineages
TLDR
This work introduces a new stochastic character mapping algorithm that overcomes limitations, and applies it to study mating system evolution over a time‐calibrated phylogeny of the plant family Onagraceae, and demonstrates that stochastic character mapping of SSE models is a powerful tool for examining the timing and nature of both character state transitions and shifts in diversification rates over the phylogeny.
chromploid: An R package for chromosome number evolution across the plant tree of life
TLDR
The R package chromploid is introduced, which allows scientists to perform inference of chromosomal evolution rates across large phylogenetic trees and enables researchers to perform robust likelihood‐based inferences using complex models of chromosome number evolution across large phylogenies.
Stochastic character mapping of state-dependent diversification reveals the tempo of evolutionary decline in self-compatible Onagraceae lineages
TLDR
The results provide empirical evidence for the "senescing" diversification rates predicted in highly selfing lineages: the loss of self-incompatibility is followed by a short-term spike in speciation rates, which declines after a time lag of several million years resulting in negative net diversification.
The tempo of evolutionary decline in self-compatible plant lineages
TLDR
The results provide the first empirical evidence for the “senescing” diversification rates predicted in highly selfing lineages: the authors' mapped character histories show that the loss of self-incompatibility is followed by a short-term spike in speciation rates, which declines after a time lag of several million years resulting in negative net diversification.

References

Conjugate Gibbs Sampling for Bayesian Phylogenetic Models
TLDR
The conjugate Gibbs formalism allows one to propose efficient implementations of complex models, for instance assuming site-specific substitution processes, that would not be accessible to standard MCMC methods.
Uniformization for sampling realizations of Markov processes: applications to Bayesian implementations of codon substitution models
TLDR
A general method, based on a uniformization technique, which can be utilized to generate realizations of a Markovian substitution process conditional on an alignment of character states and a given tree topology is described.
Maximum-likelihood phylogenetic analysis under a covarion-like model.
  • N. Galtier
  • Biology
    Molecular biology and evolution
  • 2001
TLDR
Here, a model allowing covarion-like evolution of DNA sequences is introduced, and this model allows the site-specific rate to vary between lineages by adding as few as two parameters to the widely used among-site rate variation model.
Bayesian Phylogeography Finds Its Roots
TLDR
It is concluded that the Bayesian phylogeographic framework will make an important asset in molecular epidemiology that can be easily generalized to infer biogeography from genetic data for many organisms.
PhyloSim - Monte Carlo simulation of sequence evolution in the R statistical computing environment
TLDR
PhyloSim is an extensible framework for the Monte Carlo simulation of sequence evolution, written in R, using the Gillespie algorithm to integrate the actions of many concurrent processes such as substitutions, insertions and deletions and allows for the incorporation of selective constraints on indel events.
Stochastic mapping of morphological characters.
TLDR
The utility of the method described by Nielsen is demonstrated for mapping morphological characters on trees under continuous-time Markov models and for identifying character correlation.
PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment.
TLDR
A message passing interface version of PhyloBayes is introduced, implementing the Dirichlet process mixture models as well as more classical empirical matrices and finite mixtures, allowing faster phylogenetic reconstruction under complex mixture models.
SIMMAP: Stochastic character mapping of discrete traits on phylogenies
TLDR
SIMMAP has been developed to implement stochastic character mapping that is useful to molecular evolutionists, systematists, and bioinformaticians, and enables users to address questions that require mapping characters onto phylogenies using a probabilistic approach that does not rely on parsimony.
Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories.
TLDR
Partial sampling of substitutions dramatically improves the utility of likelihood approaches for analyzing complex evolutionary processes on large data sets, and scales exceptionally well with dimensions of model complexity and data set size.
Bayesian analysis of biogeography when the number of areas is large.
TLDR
A Bayesian approach for inferring biogeographic history that extends the application of biogeographical models to the analysis of more realistic problems that involve a large number of areas, and develops this approach in a Bayesian framework, marginalizing over all possible biogeographic histories using Markov chain Monte Carlo.