A divide-and-merge methodology for clustering

@inproceedings{Cheng2005ADM,
  title={A divide-and-merge methodology for clustering},
  author={David Cheng and Santosh S. Vempala and Ravi Kannan and Grant Wang},
  booktitle={PODS},
  year={2005}
}
We present a divide-and-merge methodology for clustering a set of objects that combines a top-down "divide" phase with a bottom-up "merge" phase. In contrast, previous algorithms either use top-down or bottom-up methods to construct a hierarchical clustering or produce a flat clustering using local search (e.g., k-means). Our divide phase produces a tree whose leaves are the elements of the set. For this phase, we use an efficient spectral algorithm. The merge phase quickly finds an optimal… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 110 CITATIONS

Data Mining Applications of Singular Value Decomposition

VIEW 8 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Mitigating spam using network-level features

VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

A spectral-based clustering algorithm for categorical data using data summaries

  • KDD Workshop on Data Mining using Matrices and Tensors
  • 2009
VIEW 10 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

BNAK-Divide-and-Merge Clustering Algorithm

  • 2009 First International Conference on Information Science and Engineering
  • 2009
VIEW 6 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Spectral clustering in telephone call graphs

VIEW 9 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

An Approach to Enhance the CPI Using Porter Stemming

VIEW 3 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

Non-uniform Sampling in Clustering and Streaming

VIEW 3 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

FILTER CITATIONS BY YEAR

2005
2019

CITATION STATISTICS

  • 12 Highly Influenced Citations

References

Publications referenced by this paper.
SHOWING 1-3 OF 3 REFERENCES

Principal Direction Divisive Partitioning

  • Data Mining and Knowledge Discovery
  • 1998
VIEW 7 EXCERPTS
HIGHLY INFLUENTIAL

On clusterings-good, bad and spectral

Ravi Kannan, Santosh Vempala, A. Veta
  • Proceedings 41st Annual Symposium on Foundations of Computer Science
  • 2000
VIEW 3 EXCERPTS
HIGHLY INFLUENTIAL