SKEWNESS AND PERMUTATION

@article{Kllersj1992SKEWNESSAP,
  title={SKEWNESS AND PERMUTATION},
  author={Mari K{\"a}llersj{\"o} and James S Farris and Arnold G. Kluge and Carol J. Bult},
  journal={Cladistics},
  year={1992},
  volume={8}
}
Abstract— The skewness criterion of phylogenetic structure in data is too sensitive to character state frequencies, is not sensitive enough to number of characters (degree of corroboration) and relies on counts of arbitrarily‐resolved bifurcating trees. For these reasons it can give misleading results. Permutation tests lack those drawbacks and can be performed quickly by using approximate parsimony calculations, but the test based on minimal tree length can imply strong structure in ambiguous… 
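As a rough illustration of the kind of permutation test on minimal tree length that the abstract refers to, the sketch below handles the smallest possible case: four taxa, for which only three unrooted trees exist and exact Fitch parsimony lengths can be found exhaustively. It is not the authors' approximate parsimony procedure. The function names (`fitch_steps`, `min_tree_length`, `ptp`), the matrix layout (one tuple of four states per character), and the convention of counting the observed data set among the replicates are all assumptions made for this example.

```python
import random

# The three unrooted topologies for four taxa, written as two sister pairs
# of taxon indices (assumed taxon order: A=0, B=1, C=2, D=3).
QUARTET_TREES = [((0, 1), (2, 3)), ((0, 2), (1, 3)), ((0, 3), (1, 2))]


def fitch_steps(char, tree):
    """Fitch parsimony steps for one unordered character on one quartet tree."""
    steps = 0
    node_sets = []
    for i, j in tree:
        a, b = {char[i]}, {char[j]}
        common = a & b
        if common:
            node_sets.append(common)
        else:
            # Sister taxa disagree: one change is needed below this node.
            node_sets.append(a | b)
            steps += 1
    # A further change is needed on the internal branch if the two
    # internal nodes share no possible state.
    if not (node_sets[0] & node_sets[1]):
        steps += 1
    return steps


def min_tree_length(matrix):
    """Length of the shortest of the three quartet trees for the matrix."""
    return min(sum(fitch_steps(c, t) for c in matrix) for t in QUARTET_TREES)


def ptp(matrix, n_perm=199, seed=0):
    """Permutation tail probability based on minimal tree length.

    Each character's states are shuffled among taxa independently, which
    keeps state frequencies fixed while removing covariation among
    characters. The observed data set is counted as one replicate
    (an assumed convention).
    """
    rng = random.Random(seed)
    observed = min_tree_length(matrix)
    hits = 0
    for _ in range(n_perm):
        permuted = [tuple(rng.sample(c, len(c))) for c in matrix]
        if min_tree_length(permuted) <= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Ten characters that all support the split AB|CD (hypothetical data).
structured = [(0, 0, 1, 1)] * 10
# Ten characters that each single out one taxon and so fit every tree equally.
uninformative = [(0, 0, 0, 1)] * 10
```

In this toy setting, `ptp(structured)` should be small because permuted characters rarely agree on a single split, while `ptp(uninformative)` equals 1.0 because permutation cannot change the length contributed by such characters.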
When Are Random Data Not Random, or Is the PTP Test Useful?
TLDR
Applying an appropriate statistical approach, the authors demonstrate that the PTP test is extremely weak in detecting the absence of character covariation and advocate the use of simulation studies when testing the performance of statistical tools applied to phylogenetic data.
Measuring Support for Phylogenies: The “Proportional Support Index”
TLDR
This index measures the actual support for a cladogram relative to the maximum potential support as determined by the number of informative characters, and is thus a more accurate means to compare the strength of phylogenetic signals in different data sets.
BRANCH SUPPORT AND TREE STABILITY
Abstract— Branch support is quantified as the extra length needed to lose a branch in the consensus of near‐most‐parsimonious trees. This approach is based solely on the original data, as opposed to
Discriminating and locating character covariance: an application of permutation tail probability (PTP) analyses.
TLDR
The use of permutation tail probability (PTP) analyses to discriminate between character covariance and noise is investigated with both hypothetical and published data sets, and a three-step PTP procedure for locating covaried characters is presented.
Data exploration in phylogenetic inference: scientific, heuristic, or neither
  • Taran Grant, A. Kluge
  • Biology
    Cladistics : the international journal of the Willi Hennig Society
  • 2003
TLDR
It is concluded that undue emphasis has been placed on data exploration in phylogenetic inference, and phylogeneticists are urged to consider more carefully the relevance of the methods that they employ.
Decay analysis of large sets of phylogenetic data
TLDR
A summary matrix was constructed so that all trees up to five steps longer than the shortest could be saved and examined, resulting in overestimates of support for monophyletic groups and two methods of decay analysis that use constraint.
Phylogenetic Noise Leads to Unbalanced Cladistic Tree Reconstructions
Cladistic tree balance is the extent to which internal nodes on a cladistic tree define clades of equal size. More robust maximum-parsimony trees taken from the literature are more balanced.
Maximum Parsimony and the Skewness Test: A Simulation Study of the Limits of Applicability
TLDR
Large-scale computations with simulated phylogenetic data are employed to estimate the probability that MP succeeds in finding the true phylogeny for up to twelve taxa and 256 characters, and the skewness test of Hillis is found to perform well on simulated data.
Assessing progress in systematics with continuous jackknife function analysis.
TLDR
Convergence of data on a reference tree does not guarantee historical accuracy, but it does predict that the accumulation of further data under the sampling model will not lead to rapid changes in the hypothesis.

References

Tree-Length Distribution Skewness: An Indicator of Phylogenetic Information
Computer simulations in which phylogenies were generated under various conditions were used to examine the relationship between the phylogenetic signal of a character data set, the skewness of the
Cladistic permutation tests for monophyly and nonmonophyly
TLDR
Application of the bootstrap test revealed significant support for the hypothesis that the thylacine is not an outgroup to the Australian marsupials, and is the sister of the Dasyuridae.
DISTANCE DATA REVISITED
  • J. Farris
  • Environmental Science
    Cladistics : the international journal of the Willi Hennig Society
  • 1985
Abstract— Objections to my earlier demonstration, that the branch lengths of trees fitted to distance matrices have no physical interpretation, are shown to be ill‐founded. In particular the
COULD A CLADOGRAM THIS SHORT HAVE ARISEN BY CHANCE ALONE?: ON PERMUTATION TESTS FOR CLADISTIC STRUCTURE
TLDR
A means of quantitative evaluation is presented, based on the idea that the length of the most parsimonious tree reflects the degree to which the observed characters co‐vary such that a single tree topology can explain shared character states among the taxa.
A randomization test for phylogenetic information in systematic data
A randomization procedure is proposed to determine if sets of data used for phylogenetic analysis contain phylogenetically nonrandom information. The method compares the observed number of steps on
FREQUENCY DISTRIBUTIONS OF LENGTHS OF POSSIBLE NETWORKS FROM A DATA MATRIX
TLDR
An algorithm has been found for calculation of the expected mean, variance and skewness for random binary data with up to 13 OTUs, based on the number of characters representing each type of partition of the OTUs.
A Successive Approximations Approach to Character Weighting
TLDR
Results indicate that the successive weighting procedure can be highly successful, even when cladistically reliable characters are heavily outnumbered by unreliable ones, and computer simulation tests of the technique are described.
EXCESS HOMOPLASY RATIOS
Abstract Archie (1990) prefers his “homoplasy excess ratio” HER to Farris' (1989) ensemble retention index R. HER, he writes, lacks R's defects: R's minimum is not zero, and varies with number of
THE LIMITS OF AMINO ACID SEQUENCE DATA IN ANGIOSPERM PHYLOGENETIC RECONSTRUCTION
  • K. Bremer
  • Biology
    Evolution; international journal of organic evolution
  • 1988
TLDR
Cladistic analysis of amino acid sequence data, including evaluation of all equally or almost equally parsimonious cladograms, shows that much homoplasy (parallelisms and reversals) is present and that few or no well supported monophyletic groups of families can be demonstrated.
MAJOR CLADES OF THE ANGIOSPERMS
TLDR
A data matrix for a sample of the angiosperms with 61 characters for 49 taxa, mostly magnoliifloran and related families, is presented, illustrative of the great homoplasy in the available character information, and of how little actually is known about fundamental angiosperm interrelationships or phylogeny.