Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing
@article{Koga2006FastAH, title={Fast agglomerative hierarchical clustering algorithm using Locality-Sensitive Hashing}, author={H. Koga and T. Ishibashi and T. Watanabe}, journal={Knowledge and Information Systems}, year={2006}, volume={12}, pages={25-53} }
The single linkage method is a fundamental agglomerative hierarchical clustering algorithm. [...] In the agglomeration step, it connects the pair of clusters whose nearest members are closest to each other. This step is repeated until only one cluster remains. The single linkage method can efficiently detect clusters of arbitrary shape. However, a drawback of this method is its large time complexity of O(n^2), where n represents the number of data points. This time complexity makes…
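The agglomeration step described above can be illustrated with a minimal Python sketch. This is a deliberately naive version of plain single linkage, written only to show the merge rule; it is not the LSH-based algorithm of the paper, and it is slower (roughly cubic in n) than the O(n^2) bound quoted above. The function name `single_linkage` and the merge-record format are assumptions made for this illustration.

```python
import math
from itertools import combinations


def single_linkage(points):
    """Naive single-linkage clustering (illustrative sketch, not the paper's method).

    Returns the merges in the order they happen, as (cluster_a, cluster_b, distance).
    Input points get ids 0..n-1; each merged cluster gets the next free id.
    """
    clusters = {i: [i] for i in range(len(points))}  # cluster id -> member indices
    merges = []
    next_id = len(points)

    while len(clusters) > 1:
        best = None
        # Examine every pair of current clusters.
        for a, b in combinations(clusters, 2):
            # Single linkage: cluster distance is the distance between nearest members.
            d = min(
                math.dist(points[i], points[j])
                for i in clusters[a]
                for j in clusters[b]
            )
            if best is None or d < best[0]:
                best = (d, a, b)

        # Connect the closest pair and record the merge.
        d, a, b = best
        clusters[next_id] = clusters.pop(a) + clusters.pop(b)
        merges.append((a, b, d))
        next_id += 1

    return merges


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (5, 0), (5, 1.5)]
    for a, b, d in single_linkage(pts):
        print(f"merge {a} + {b} at distance {d:.2f}")
```

Each pass of the loop rescans all cluster pairs, which is what makes the naive version expensive; avoiding that exhaustive nearest-pair search is the kind of cost a faster method has to reduce.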
87 Citations
Hierarchical clustering of large text datasets using Locality-Sensitive Hashing
- Computer Science
- 2015
- 3
- PDF
Distance Based Fast Hierarchical Clustering Method for Large Datasets
- Mathematics, Computer Science
- RSCTC
- 2010
- 15
- PDF
Hashing-based clustering in high dimensional data
- Mathematics, Computer Science
- Expert Syst. Appl.
- 2016
- 12
SAHN Clustering in Arbitrary Metric Spaces Using Heuristic Nearest Neighbor Search
- Computer Science
- WALCOM
- 2014
- 7
- PDF
Practical SAHN Clustering for Very Large Data Sets and Expensive Distance Metrics
- Mathematics, Computer Science
- J. Graph Algorithms Appl.
- 2014
- 10
- PDF
SPARCL: an effective and efficient algorithm for mining arbitrary shape-based clusters
- Mathematics, Computer Science
- Knowledge and Information Systems
- 2009
- 27
- PDF
A distance based clustering method for arbitrary shaped clusters in large datasets
- Mathematics, Computer Science
- Pattern Recognit.
- 2011
- 35
- PDF
Approximative distance computation by random hashing
- Computer Science
- The Journal of Supercomputing
- 2011
- 2
- PDF