Context-Based Distance Learning for Categorical Data Clustering

  title={Context-Based Distance Learning for Categorical Data Clustering},
  author={Dino Ienco and Ruggero G. Pensa and Rosa Meo},
Clustering data described by categorical attributes is a challenging task in data mining applications. Unlike numerical attributes, it is difficult to define a distance between pairs of values of the same categorical attribute, since they are not ordered. In this paper, we propose a method to learn a context-based distance for categorical attributes. The key intuition of this work is that the distance between two values of a categorical attribute Ai can be determined by the way in which the… CONTINUE READING
Highly Cited
This paper has 54 citations. REVIEW CITATIONS
33 Extracted Citations
17 Extracted References
Similar Papers

Citing Papers

Publications influenced by this paper.
Showing 1-10 of 33 extracted citations

54 Citations

Citations per Year
Semantic Scholar estimates that this publication has 54 citations based on the available data.

See our FAQ for additional information.

Referenced Papers

Publications referenced by this paper.
Showing 1-10 of 17 references

Dataset generator, perfect data for an imperfect world

  • G. Melli
  • 2008
1 Excerpt

Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn

  • I. H. Witten, E. Frank
  • Data Management Systems. Morgan Kaufmann, San…
  • 2005
2 Excerpts

Similar Papers

Loading similar papers…