Outlier mining in large high-dimensional data sets

  title={Outlier mining in large high-dimensional data sets},
  author={Fabrizio Angiulli and Clara Pizzuti},
  journal={IEEE Transactions on Knowledge and Data Engineering},
A new definition of distance-based outlier and an algorithm, called HilOut, designed to efficiently detect the top n outliers of a large and high-dimensional data set are proposed. Given an integer k, the weight of a point is defined as the sum of the distances separating it from its k nearest-neighbors. Outlier are those points scoring the largest values of weight. The algorithm HilOut makes use of the notion of space-filling curve to linearize the data set, and it consists of two phases. The… CONTINUE READING
Highly Influential
This paper has highly influenced 19 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 245 citations. REVIEW CITATIONS
145 Extracted Citations
29 Extracted References
Similar Papers

Citing Papers

Publications influenced by this paper.
Showing 1-10 of 145 extracted citations

245 Citations

Citations per Year
Semantic Scholar estimates that this publication has 245 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…