Accurate identification of centromere locations in yeast genomes using Hi-C


Centromeres are essential for proper chromosome segregation. Despite extensive research, centromere locations in yeast genomes remain difficult to infer, and in most species they are still unknown. Recently, the chromatin conformation capture assay, Hi-C, has been re-purposed for diverse applications, including de novo genome assembly, deconvolution of metagenomic samples and inference of centromere locations. We describe a method, Centurion, that jointly infers the locations of all centromeres in a single genome from Hi-C data by exploiting the centromeres' tendency to cluster in three-dimensional space. We first demonstrate the accuracy of Centurion in identifying known centromere locations from high coverage Hi-C data of budding yeast and a human malaria parasite. We then use Centurion to infer centromere locations in 14 yeast species. Across all microbes that we consider, Centurion predicts 89% of centromeres within 5 kb of their known locations. We also demonstrate the robustness of the approach in datasets with low sequencing depth. Finally, we predict centromere coordinates for six yeast species that currently lack centromere annotations. These results show that Centurion can be used for centromere identification for diverse species of yeast and possibly other microorganisms.

DOI: 10.1093/nar/gkv424

Extracted Key Phrases

4 Figures and Tables

Citations per Year

614 Citations

Semantic Scholar estimates that this publication has 614 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Varoquaux2015AccurateIO, title={Accurate identification of centromere locations in yeast genomes using Hi-C}, author={Nelle Varoquaux and Ivan Liachko and Ferhat Ay and Joshua N. Burton and Jay Shendure and Maitreya J. Dunham and Jean-Philippe Vert and William S. Noble}, booktitle={Nucleic acids research}, year={2015} }