A Survey of Outlier Detection Methodologies

  title={A Survey of Outlier Detection Methodologies},
  author={Victoria J. Hodge and J. Austin},
  journal={Artificial Intelligence Review},
  • Victoria J. Hodge, J. Austin
  • Published 2004
  • Artificial Intelligence Review
  • Outlier detection has been used for centuries to detect and, where appropriate, remove anomalous observations from data. Outliers arise due to mechanical faults, changes in system behaviour, fraudulent behaviour, human error, instrument error or simply through natural deviations in populations. Their detection can identify system faults and fraud before they escalate with potentially catastrophic consequences. It can identify errors and remove their contaminating effect on the data set and as… CONTINUE READING
    43 Citations

    Figures from this paper

    Machine learning to detect anomalies in datacenter
    • Highly Influenced
    • PDF
    Cyber Data Anomaly Detection Using Autoencoder Neural Networks
    • Highly Influenced
    Ensemble based unsupervised anomaly detection
    • Highly Influenced
    An Approach to Outlier Detection of Software Measurement Data using the K-means Clustering Method
    • 42
    • Highly Influenced
    • PDF
    Abnormality Detection in Hard Disk Drive Assembly Process Using Support Vector Machine
    • Masayuti Simongyi, P. Chongstitvatana
    • Computer Science
    • 2018 15th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)
    • 2018
    Circadian clock-dependent and -independent posttranscriptional regulation underlies temporal mRNA accumulation in mouse liver
    • 37
    • PDF
    A real-time anomaly detection algorithm/or water quality data using dual time-moving windows
    • 3


    Statistical Independence and Novelty Detection with Information Preserving Nonlinear Maps
    • 162
    • Highly Influential
    • PDF
    On-Line Novelty Detection through self-organisation with application to inspection robotics
    • 34
    • Highly Influential
    A System for the Analysis of Jet System Vibration Data
    • Integrated ComputerAided Engineering
    • 1999
    A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise
    • 15,288
    • Highly Influential
    • PDF
    A massively parallel architecture for a self-organizing neural pattern recognition machine
    • 2,785
    • Highly Influential
    • PDF
    Robust regression and outlier detection
    • 5,193
    • Highly Influential
    UCI Repository of machine learning databases
    • 12,990
    • Highly Influential
    A study of distance-based machine learning algorithms
    • 140
    • Highly Influential
    Novelty detection & Neural Network validation
    • Proceedings of the IEE Conference on Vision, Image and Signal Processing
    • 1994
    Novelty detection and neural network validation
    • 543
    • Highly Influential