• Publications
  • Influence
CURE: an efficient clustering algorithm for large databases
TLDR
We propose a new clustering algorithm called CURE that is more robust to outliers, and identifies clusters having non-spherical shapes and wide variances in size. Expand
  • 2,793
  • 211
  • PDF
ROCK: a robust clustering algorithm for categorical attributes
TLDR
We study clustering algorithms for data with Boolean and categorical attributes. Expand
  • 1,506
  • 161
  • PDF
Efficient algorithms for mining outliers from large data sets
TLDR
In this paper, we propose a novel formulation for distance-based outliers that is based on the distance of a point from its kth nearest neighbor. Expand
  • 1,334
  • 97
  • PDF
SPIRIT: Sequential Pattern Mining with Regular Expression Constraints
TLDR
An infrared generator wherein an ellipsoidal reflector has a source rich in infra red at one focus thereof. Expand
  • 589
  • 48
  • PDF
Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases
TLDR
We introduce a new model of similarity of time sequences that captures the intuitive notion that two sequences should be considered similar if they have enough non-overlapping time-ordered pairs of subsequences thar are similar. Expand
  • 769
  • 37
  • PDF
Approximate query processing using wavelets
TLDR
In this paper, we propose the use of multi-dimensional wavelets as an effective tool for general-purpose approximate query processing in modern, high-dimensional applications. Expand
  • 503
  • 33
  • PDF
Cure: An Efficient Clustering Algorithm for Large Databases
TLDR
We propose a new clustering algorithm called CURE that is more robust to outliers, and identifies clusters having non-spherical shapes and wide variances in size. Expand
  • 422
  • 28
Optimizing queries with materialized views
TLDR
We analyze the optimization question of optimizing queries in the presence of materialised views and provide a comprehensive and efficient solution. Expand
  • 464
  • 26
  • PDF
APEX: an adaptive path index for XML data
TLDR
We propose APEX, an adaptive path index for XML data. Expand
  • 333
  • 20
  • PDF
WALRUS: A Similarity Retrieval Algorithm for Image Databases
TLDR
We propose WALRUS (wavelet-based retrieval of user-specified scenes), a novel similarity retrieval algorithm that is robust to scaling and translation of objects within an image. Expand
  • 191
  • 16