Scalable and Distributed Clustering via Lightweight Coresets

  title={Scalable and Distributed Clustering via Lightweight Coresets},
  author={Olivier Bachem and Mario Lucic and Andreas Krause},
Coresets are compact representations of data sets such that models trained on a coreset are provably competitive with models trained on the full data set. As such, they have been successfully used to scale up clustering models to massive data sets. While existing approaches generally only allow for multiplicative approximation errors, we propose a novel notion of coresets called lightweight coresets that allows for both multiplicative and additive errors. We provide a single algorithm to… CONTINUE READING
Related Discussions
This paper has been referenced on Twitter 12 times. VIEW TWEETS

From This Paper

Figures, tables, and topics from this paper.


Publications referenced by this paper.

Similar Papers

Loading similar papers…