Automatic Subspace Clustering of High Dimensional Data

@article{Agrawal2005AutomaticSC,
  title={Automatic Subspace Clustering of High Dimensional Data},
  author={R. Agrawal and J. Gehrke and D. Gunopulos and P. Raghavan},
  journal={Data Mining and Knowledge Discovery},
  year={2005},
  volume={11},
  pages={5-33}
}
  • R. Agrawal, J. Gehrke, +1 author P. Raghavan
  • Published 2005
  • Mathematics, Computer Science
  • Data Mining and Knowledge Discovery
  • Data mining applications place special requirements on clustering algorithms including: the ability to find clusters embedded in subspaces of high dimensional data, scalability, end-user comprehensibility of the results, non-presumption of any canonical data distribution, and insensitivity to the order of input records. We present CLIQUE, a clustering algorithm that satisfies each of these requirements. CLIQUE identifies dense clusters in subspaces of maximum dimensionality. It generates… CONTINUE READING
    353 Citations
    Locally adaptive metrics for clustering high dimensional data
    • 221
    • PDF
    A weighting k-modes algorithm for subspace clustering of categorical data
    • 43
    • PDF
    Mining Projected Clusters in High-Dimensional Spaces
    • 59
    • PDF
    A Comprehensive Study of Challenges and Approaches for Clustering High Dimensional Data
    • 2
    Subspace clustering for complex data
    • 2
    • PDF
    Mining Significant Subspaces
    • PDF
    High-Dimensional Clustering Method for High Performance Data Mining
    • PDF

    References

    SHOWING 1-10 OF 63 REFERENCES
    Finding generalized projected clusters in high dimensional spaces
    • 517
    • PDF
    Subspace Clustering of High Dimensional Data
    • 188
    • PDF
    Fast algorithms for projected clustering
    • 1,107
    • PDF
    BIRCH: an efficient data clustering method for very large databases
    • 4,598
    • Highly Influential
    • PDF
    CURE: an efficient clustering algorithm for large databases
    • 2,775
    • Highly Influential
    • PDF
    A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise
    • 15,274
    • Highly Influential
    • PDF
    SLIQ: A Fast Scalable Classifier for Data Mining
    • 871
    • PDF
    Efficient and Effective Clustering Methods for Spatial Data Mining
    • 2,019
    • Highly Influential
    • PDF