Clustering validity assessment: finding the optimal partitioning of a data set

  title={Clustering validity assessment: finding the optimal partitioning of a data set},
  author={M. Halkidi and M. Vazirgiannis},
  journal={Proceedings 2001 IEEE International Conference on Data Mining},
  • M. Halkidi, M. Vazirgiannis
  • Published 2001
  • Computer Science
  • Proceedings 2001 IEEE International Conference on Data Mining
  • Clustering is a mostly unsupervised procedure and the majority of clustering algorithms depend on certain assumptions in order to define the subgroups present in a data set. [...] Key Method We define a validity index, S Dbw, based on well-defined clustering criteria enabling the selection of optimal input parameter values for a clustering algorithm that result in the best partitioning of a data set. We evaluate the reliability of our index both theoretically and experimentally, considering three representative…Expand Abstract
    357 Citations
    Clustering validity assessment using multi representatives
    • 54
    • PDF
    A density-based cluster validity approach using multi-representatives
    • 77
    • PDF
    A Clustering Validity Assessment Index
    • 8
    • Highly Influenced
    NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set
    • 977
    • Highly Influenced
    • PDF
    Density-Based Clustering Validation
    • 76
    • PDF
    Chapter 16 – Cluster Validity
    • 4
    Clustering validity based on the most similarity
    • 1
    • PDF
    A new cluster validity index using maximum cluster spread based compactness measure
    • 14


    An examination of procedures for determining the number of clusters in a data set
    • 2,728
    ROCK: A Robust Clustering Algorithm for Categorical Attributes
    • 555
    • Highly Influential
    • PDF
    A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise
    • 15,083
    • Highly Influential
    • PDF
    Data clustering: a review
    • 13,075
    • Highly Influential
    • PDF
    CURE: an efficient clustering algorithm for large databases
    • 2,771
    • PDF
    Automatic subspace clustering of high dimensional data for data mining applications
    • 2,588
    • PDF
    Unsupervised Optimal Fuzzy Clustering
    • I. Gath, A. Geva
    • Mathematics, Computer Science
    • IEEE Trans. Pattern Anal. Mach. Intell.
    • 1989
    • 1,695