Probabilistic Graphical Model Representation in Phylogenetics
@article{Hhna2013ProbabilisticGM, title={Probabilistic Graphical Model Representation in Phylogenetics}, author={Sebastian H{\"o}hna and Tracy A. Heath and Bastien Boussau and Michael J. Landis and Fredrik Ronquist and John P. Huelsenbeck}, journal={Systematic Biology}, year={2013}, volume={63}, pages={753 - 771} }
Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to…
90 Citations
A road map for phylogenetic models of species trees.
- BiologyMolecular phylogenetics and evolution
- 2022
Universal probabilistic programming offers a powerful approach to statistical phylogenetics
- Biology, Computer ScienceCommunications Biology
- 2021
This work develops automated generation of sequential Monte Carlo algorithms for PPL descriptions of arbitrary biological diversification (birth-death) models, and shows that few hurdles remain before these techniques can be effectively applied to the full range of phylogenetic models.
Phylogenetic Inference Using RevBayes
- BiologyCurrent protocols in bioinformatics
- 2017
This unit describes how to specify standard phylogenetic models and perform Bayesian phylogenetic analyses in RevBayes and describes the basic analysis of inferring a phylogeny from single and multiple loci, a hypothesis‐testing approach, and point to advanced topics.
Software for Systematics and Evolution
- Computer Science
- 2016
RevBayes is a new open-source software package that is entirely based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models, and outperforms competing software for several standard analyses.
Universal probabilistic programming: a powerful new approach to statistical phylogenetics
- Computer Science, BiologybioRxiv
- 2020
This work develops automated generation of sequential Monte Carlo algorithms for PPL descriptions of arbitrary biological diversification (birth-death) models, and shows that few hurdles remain before these techniques can be effectively applied to the full range of phylogenetic models.
RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language
- Computer ScienceSystematic biology
- 2016
RevBayes is a new open-source software package based on probabilistic graphical models, a powerful generic framework for specifying and analyzing statistical models that outperforms competing software for several standard analyses and needs to explicitly specify each part of the model and analysis.
Revticulate: An R framework for interaction with RevBayes
- Computer ScienceMethods in Ecology and Evolution
- 2022
The package Revticulate is described, which provides an R-based interface to the software RevBayes, a Bayesian phylogenetic program that implements an R‐like computing language, but does not interface with R itself.
Phycas: software for Bayesian phylogenetic analysis.
- BiologySystematic biology
- 2015
Phycas is open source, freely available Bayesian phylogenetics software written primarily in C++ but with a Python interface that provides for several choices for edge length priors, including a hierarchical model as well as the recently described compound Dirichlet prior.
Phylogenetic Models Linking Speciation and Extinction to Chromosome and Mating System Evolution
- Biology
- 2017
This dissertation aims to advance statistical and computational approaches that model the timing and nature of these transitions over evolutionary trees by introducing a novel metric, the missing sequence decisiveness score (MSDS), and demonstrating a new method of stochastic character mapping for state-dependent speciation and extinction models.
Stepwise Bayesian Phylogenetic Inference
- BiologybioRxiv
- 2020
The stepwise approach to Bayesian inference uses importance sampling to generate observations for the next step of an analysis pipeline from the posterior produced in the previous step, and can be more computationally efficient than the traditional joint approach when multiple models are being tested.
References
SHOWING 1-10 OF 77 REFERENCES
Bayesian model adequacy and choice in phylogenetics.
- BiologyMolecular biology and evolution
- 2002
This article presents a Bayesian phylogenetic method that evaluates the adequacy of evolutionary models using posterior predictive distributions and, unlike the likelihood-ratio test and parametric bootstrap, accounts for uncertainty in the phylogeny and model parameters.
PuMA: Bayesian analysis of partitioned (and unpartitioned) model adequacy
- BiologyBioinform.
- 2009
PuMA implements a posterior predictive simulation approach to assessing the adequacy of partitioned, unpartitioned and mixture models of DNA sequence evolution in a Bayesian context.
Phylogenies and the Comparative Method
- BiologyThe American Naturalist
- 1985
A method of correcting for the phylogeny has been proposed, which specifies a set of contrasts among species, contrasts that are statistically independent and can be used in regression or correlation studies.
MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space
- Computer ScienceSystematic biology
- 2012
The new version provides convergence diagnostics and allows multiple analyses to be run in parallel with convergence progress monitored on the fly, and provides more output options than previously, including samples of ancestral states, site rates, site dN/dS rations, branch rates, and node dates.
A Branch-Heterogeneous Model of Protein Evolution for Efficient Inference of Ancestral Sequences
- BiologySystematic biology
- 2013
A new branch-nonhomogeneous and nonstationary model of protein evolution that captures more accurately the high complexity of sequence evolution that makes use of a correspondence analysis to reduce the number of parameters to be optimized through maximum likelihood, focusing on most of the compositional variation observed in the data.
A structural EM algorithm for phylogenetic inference
- Computer ScienceJ. Comput. Biol.
- 2002
This paper describes a new algorithm that uses Structural-EM for learning maximum likelihood trees, and proves that each iteration of this procedure increases the likelihood of the topology, and thus the procedure must converge.
Fast simulation of reconstructed phylogenies under global time-dependent birth-death processes
- BiologyBioinform.
- 2013
This model is considered as the model a global time-dependent birth-death process where each species has the same rates but rates may vary over time and can be used to simulate efficiently reconstructed phylogenetic trees when conditioning on the number of species, the time of the process or both.
A Mixture Model and a Hidden Markov Model to Simultaneously Detect Recombination Breakpoints and Reconstruct Phylogenies
- BiologyEvolutionary bioinformatics online
- 2009
A Mixture Model on trees and a phylogenetic Hidden Markov Model are proposed and implemented to reveal recombination breakpoints while searching for the various evolutionary histories that are present in an alignment known to have undergone homologous recombination.
A phylogenetic model for investigating correlated evolution of substitution rates and continuous phenotypic characters.
- BiologyMolecular biology and evolution
- 2011
A probabilistic framework for testing the coupling between continuous characters and parameters of the molecular substitution process is developed and a negative correlation between the rate of substitution and mass and longevity is observed, thus in partial agreement with the nearly neutral theory.
On the use of nucleic acid sequences to infer early branchings in the tree of life.
- BiologyMolecular biology and evolution
- 1995
A (discretized) gamma distribution is made use of to account for variable rates of substitution among sites and built models that allowed for unequal base frequencies in different sequences of small-subunit rRNAs.