Identifying optimal incomplete phylogenetic data sets from sequence databases.

@article{Yan2005IdentifyingOI,
  title={Identifying optimal incomplete phylogenetic data sets from sequence databases.},
  author={Changhui Yan and John Gordon Burleigh and Oliver Eulenstein},
  journal={Molecular Phylogenetics and Evolution},
  year={2005},
  volume={35},
  number={3},
  pages={528--535}
}
We introduce a new method for identifying optimal incomplete data sets from large sequence databases based on the graph theoretic concept of alpha-quasi-bicliques. The quasi-biclique method searches large sequence databases to identify useful phylogenetic data sets with a specified amount of missing data while maintaining the necessary amount of overlap among genes and taxa. The utility of the quasi-biclique method is demonstrated on large simulated sequence databases and on a data set of green…
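To make the alpha-quasi-biclique idea concrete, the following is a minimal sketch of a membership check, not the authors' search procedure. It assumes the usual definition: in the bipartite taxon/gene graph, a set of taxa and a set of genes form an alpha-quasi-biclique when every chosen taxon lacks sequences for at most a fraction alpha of the chosen genes, and every chosen gene is missing from at most a fraction alpha of the chosen taxa. The function name `is_quasi_biclique`, the `availability` mapping, and the example data are hypothetical and exist only for illustration.

```python
from typing import Dict, Iterable, Set


def is_quasi_biclique(
    availability: Dict[str, Set[str]],
    taxa: Iterable[str],
    genes: Iterable[str],
    alpha: float,
) -> bool:
    """Check whether (taxa, genes) is an alpha-quasi-biclique.

    availability maps each taxon to the set of genes sequenced for it
    (an assumed, illustrative representation of a sequence database).
    """
    taxa_set, gene_set = set(taxa), set(genes)
    if not taxa_set or not gene_set:
        return False

    # Each taxon may lack at most alpha * |genes| of the selected genes.
    for taxon in taxa_set:
        missing = len(gene_set - availability.get(taxon, set()))
        if missing > alpha * len(gene_set):
            return False

    # Each gene may be absent from at most alpha * |taxa| of the selected taxa.
    for gene in gene_set:
        missing = sum(
            1 for taxon in taxa_set if gene not in availability.get(taxon, set())
        )
        if missing > alpha * len(taxa_set):
            return False

    return True


# Hypothetical toy database: four taxa, four genes, three missing sequences.
availability = {
    "taxonA": {"g1", "g2", "g3", "g4"},
    "taxonB": {"g1", "g2", "g3"},
    "taxonC": {"g1", "g2", "g4"},
    "taxonD": {"g2", "g3", "g4"},
}
all_genes = {"g1", "g2", "g3", "g4"}

tolerant = is_quasi_biclique(availability, availability, all_genes, alpha=0.25)
strict = is_quasi_biclique(availability, availability, all_genes, alpha=0.0)
```

With alpha set to 0 the check reduces to an ordinary biclique, that is, a data set with no missing sequences. Raising alpha admits larger taxon and gene sets at the cost of more missing data.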
This paper has 48 citations.

