Quantifying the Impact of Dependent Evolution among Sites in Phylogenetic Inference

Abstract

Nearly all commonly used methods of phylogenetic inference assume that characters in an alignment evolve independently of one another. This assumption is attractive for simplicity and computational tractability but is not biologically reasonable for RNAs and proteins that have secondary and tertiary structures. Here, we simulate RNA and protein-coding DNA sequence data under a general model of dependence in order to assess the robustness of traditional methods of phylogenetic inference to violation of the assumption of independence among sites. We find that the accuracy of independence-assuming methods is reduced by the dependence among sites; for proteins this reduction is relatively mild, but for RNA this reduction may be substantial. We introduce the concept of effective sequence length and its utility for considering information content in phylogenetics.

DOI: 10.1093/sysbio/syq074

Extracted Key Phrases

9 Figures and Tables

Cite this paper

@inproceedings{Nasrallah2011QuantifyingTI, title={Quantifying the Impact of Dependent Evolution among Sites in Phylogenetic Inference}, author={Chris A. Nasrallah and David H. Mathews and John P. Huelsenbeck}, booktitle={Systematic biology}, year={2011} }