Extension of Partitional Clustering Methods for Handling Mixed Data

Abstract

Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals to develop clustering methods that support mixed attributes. There are three basic categories of clustering methods: partitional methods, hierarchical methods and density-based methods. This paper proposes an extension of partitional clustering methods devoted to mixed attributes. The proposed extension looks to create several partitions by using numerical attributes-based clustering methods and then chooses the one that maximizes a measure---called ``homogeneity degree"---of these partitions according to categorical attributes.

DOI: 10.1109/ICDMW.2008.85
View Slides

Extracted Key Phrases

6 Figures and Tables

Cite this paper

@article{Naja2008ExtensionOP, title={Extension of Partitional Clustering Methods for Handling Mixed Data}, author={Yosr Na{\"{i}ja and Salem Chakhar and Kaouther Blibech Sinaoui and Riadh Robbana}, journal={2008 IEEE International Conference on Data Mining Workshops}, year={2008}, pages={257-266} }