Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes

  title={Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes},
  author={Jerome Kelleher and Alison M. Etheridge and Gilean McVean},
  journal={PLoS computational biology},
  volume={12 5},
A central challenge in the analysis of genetic variation is to provide realistic genome simulation across millions of samples. Present day coalescent simulations do not scale well, or use approximations that fail to capture important long-range linkage properties. Analysing the results of simulations also presents a substantial challenge, as current methods to store genealogies consume a great deal of space, are slow to parse and do not take advantage of shared structure in correlated trees. We… CONTINUE READING
Highly Cited
This paper has 26 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 21 extracted citations


Publications referenced by this paper.
Showing 1-10 of 118 references

DendroPy: a Python library for phylogenetic computing

Bioinformatics • 2010
View 4 Excerpts
Highly Influenced

Fast and flexible simulation of DNA sequence data.

Genome research • 2009
View 7 Excerpts
Highly Influenced

Approximating the coalescent with recombination.

Philosophical transactions of the Royal Society of London. Series B, Biological sciences • 2005
View 5 Excerpts
Highly Influenced