DOLPHIN: An efficient algorithm for mining distance-based outliers in very large datasets

@article{Angiulli2009DOLPHINAE,
  title={DOLPHIN: An efficient algorithm for mining distance-based outliers in very large datasets},
  author={Fabrizio Angiulli and Fabio Fassetti},
  journal={TKDD},
  year={2009},
  volume={3},
  pages={4:1-4:57}
}
In this work a novel distance-based outlier detection algorithm, named DOLPHIN, working on disk-resident datasets and whose I/O cost corresponds to the cost of sequentially reading the input dataset file twice, is presented. It is both theoretically and empirically shown that the main memory usage of DOLPHIN amounts to a small fraction of the dataset and that DOLPHIN has linear time performance with respect to the dataset size. DOLPHIN gains efficiency by naturally merging together in a… CONTINUE READING
Highly Cited
This paper has 88 citations. REVIEW CITATIONS
47 Citations
6 References
Similar Papers

Citations

Publications citing this paper.
Showing 1-10 of 47 extracted citations

88 Citations

01020'10'12'14'16'18
Citations per Year
Semantic Scholar estimates that this publication has 88 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…