Incremental, distributed single-linkage hierarchical clustering algorithm using mapreduce

  title={Incremental, distributed single-linkage hierarchical clustering algorithm using mapreduce},
  author={Chen Jin and Zhengzhang Chen and William Hendrix and Ankit Agrawal and Alok N. Choudhary},
Single-linkage hierarchical clustering is one of the prominent and widely-used data mining techniques for its informative representation of clustering results. However, the parallelization of this algorithm is challenging as it exhibits inherent data dependency during the hierarchical tree construction. Moreover, in many modern applications, new data is continuously added into the already huge datasets. It would be impractical to reapply the clustering algorithm on the augmented datasets from… CONTINUE READING


Publications citing this paper.
Showing 1-3 of 3 extracted citations

Parallel hierarchical subspace clustering for segmenting large text corpuses

2017 International Conference on Trends in Electronics and Informatics (ICEI) • 2017
View 9 Excerpts
Highly Influenced

A Fast, Scalable SLINK Algorithm for Commodity Cluster Computing Exploiting Spatial Locality

2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) • 2016
View 1 Excerpt


Publications referenced by this paper.
Showing 1-10 of 28 references

Shortest connection networks and some generalizations

R. C. Prim
Bell System Technology Journal, • 1957
View 5 Excerpts
Highly Influenced

On the shortest spanning subtree of a graph and the traveling salesman problem

J. B. Kruskal
Proceedings of the American Mathematical Society, • 1956
View 5 Excerpts
Highly Influenced

Jistém Problému Minimálnı́m (About a Certain Minimal Problem) (in Czech, German summary)

O O. Boruvka
Práce Mor. Prı́rodoved. Spol. v Brne III, • 1926
View 5 Excerpts
Highly Influenced

Finding connected components in map-reduce in logarithmic rounds

2013 IEEE 29th International Conference on Data Engineering (ICDE) • 2013
View 1 Excerpt

Parallel hierarchical clustering on shared memory platforms

2012 19th International Conference on High Performance Computing • 2012
View 6 Excerpts

Topic mining on web-shared videos

2008 IEEE International Conference on Acoustics, Speech and Signal Processing • 2008
View 1 Excerpt

Similar Papers

Loading similar papers…