A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining

@inproceedings{Huang1997AFC,
  title={A Fast Clustering Algorithm to Cluster Very Large Categorical Data Sets in Data Mining},
  author={Joshua Zhexue Huang},
  booktitle={DMKD},
  year={1997}
}
Partitioning a large set of objects into homogeneous clusters is a fundamental operation in data mining. The k-means algorithm is best suited for implementing this operation because of its efficiency in clustering large data sets. However, working only on numeric values limits its use in data mining because data sets in data mining often contain categorical values. In this paper we present an algorithm, called k-modes, to extend the k-means paradigm to categorical domains. We introduce new…
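
As a rough illustration of how the k-means idea can carry over to categorical data, the sketch below replaces cluster means with per-attribute modes and Euclidean distance with a count of mismatching attributes. It is a minimal batch-update variant written for this summary, not the paper's exact procedure: the function names, the seeding rule (the first k distinct records), and the tie-breaking are all assumptions made for illustration.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """Count the attributes on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """Return the most frequent category of each attribute.

    Ties go to the category seen first (Counter ordering in Python 3.7+).
    """
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def k_modes(records, k, max_iter=100):
    """Cluster tuples of categorical values into k groups.

    Assumption: the first k distinct records are used as initial modes.
    """
    modes = list(dict.fromkeys(records))[:k]
    if len(modes) < k:
        raise ValueError("need at least k distinct records")

    labels = None
    for _ in range(max_iter):
        # Allocation step: assign each record to its nearest mode.
        new_labels = [
            min(range(k), key=lambda j: matching_dissimilarity(r, modes[j]))
            for r in records
        ]
        if new_labels == labels:
            break  # no record changed cluster, so the modes are stable
        labels = new_labels

        # Update step: recompute each mode from its current members.
        for j in range(k):
            members = [r for r, lab in zip(records, labels) if lab == j]
            if members:  # keep the old mode if a cluster becomes empty
                modes[j] = column_modes(members)
    return labels, modes
```

Each record is expected to be a tuple of hashable category values, all of the same length. Like k-means, this kind of procedure only finds a local optimum, so the result depends on the initial modes.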


CITATION STATISTICS

  • 40 Highly Influenced Citations

  • Averaged 19 Citations per year over the last 3 years
