Corpus ID: 3590788

Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm

@inproceedings{Goldstein2012HistogrambasedOS,
  title={Histogram-based Outlier Score (HBOS): A fast Unsupervised Anomaly Detection Algorithm},
  author={Markus Goldstein and Andreas R. Dengel},
  year={2012}
}
Unsupervised anomaly detection is the process of nding outliers in data sets without prior training. In this paper, a histogrambased outlier detection (HBOS) algorithm is presented, which scores records in linear time. It assumes independence of the features making it much faster than multivariate approaches at the cost of less precision. A comparative evaluation on three UCI data sets and 10 standard algorithms show, that it can detect global outliers as reliable as state-of-theart algorithms… Expand
A Novel Density-Based Clustering Approach for Outlier Detection in High-Dimensional Data
TLDR
A new outlier detection technique is proposed based on a feature selection strategy to avoid the curse of dimensionality, named Infinite Feature Selection DBSCAN, to reduce the dimensions of a high dimensional data set in order to efficiently identify outliers using clustering techniques. Expand
Anomaly Detection in Wireless Sensor Networks Data by Using Histogram Based Outlier Score Method
  • I. Kalayci, Tuncay Ercan
  • Computer Science
  • 2018 2nd International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT)
  • 2018
TLDR
This study proposes the Histogram Based Outlier Score (HBOS) method to detect anomalies in data acquired by wireless sensor networks, which is unsupervised and provide fast solutions. Expand
Linear time identification of local and global outliers
TLDR
An extensive comparison with 18 state-of-the-art anomaly detection algorithms on a diverse collection of 1000 annotated datasets is provided and a new ”outlierness” criterion is proposed to infer the local or global identity of outliers. Expand
Outlier Detection Based on Low Density Models
TLDR
Results show that SDO satisfactorily competes with the best ranked outlier detection alternatives and makes SDO highly flexible and adaptable to stand-alone frameworks that must detect outliers fast with accuracy rates equivalent to lazy learning algorithms. Expand
Applying KSE-test and K-means clustering towards scalable unsupervised intrusion detection
We proposed a scalable outlier detection method to identify outliers in large datasets with a goal to create unsupervised intrusion detection. In our work, the strength of Kolmogorov-Smirnov test andExpand
Improved histogram-based anomaly detector with the extended principal component features
TLDR
The results show that using the original input features together with principal components improves the detection accuracy of histogram-based anomaly detector significantly without compromising much in terms of run-time. Expand
Correlation-based Anomaly Detection in Time Series
Anomaly detection is one of the most fundamental tasks in time series analysis. The proposed techniques belong generally to one of the following categories: supervised, semi-supervised orExpand
Application of Histogram-Based Outlier Scores to Detect Computer Network Anomalies
Misuse activity in computer networks constantly creates new challenges and difficulties to ensure data confidentiality, integrity, and availability. The capability to identify and quickly stop theExpand
A Case Study of Cluster-based and Histogram-based Multivariate Anomaly Detection Approach in General Ledgers
TLDR
The main aim of this research was to find anomalies in general ledgers of a real company in Bosnia and Herzegovina, as well as results achieved using synthetic data. Expand
Comparison of Outlier Detection Techniques for Structured Data
TLDR
The goal of this work is to highlight and compare some of the existing outlier detection techniques for the data scientists to use that information for outlier algorithm selection while building a machine learning model. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 14 REFERENCES
FastLOF: An Expectation-Maximization based Local Outlier detection algorithm
  • Markus Goldstein
  • Computer Science
  • Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012)
  • 2012
TLDR
An Expectation-Maximization algorithm is proposed which computes the Local Outlier Factor (LOF) incrementally and up to 80% faster than the standard method, and Evaluation on real world data sets reveal that FastLOF performs comparable to the best outlier detection algorithms although being significantly faster. Expand
Nearest-Neighbor and Clustering based Anomaly Detection Algorithms for RapidMiner
Unsupervised anomaly detection is the process of finding outlying records in a given dataset without prior need for training. In this paper we introduce an anomaly detection extension for RapidMinerExpand
Comparison of Unsupervised Anomaly Detection Techniques Bachelor Thesis
TLDR
A RapidMiner extension that contains the most applicable unsupervised anomaly detection algorithms to enable non-experts to easily apply them is implemented and the proposed algorithms showed promising results where they outperformed CBLOF. Expand
Discovering cluster-based local outliers
TLDR
A measure for identifying the physical significance of an outlier is designed, which is called cluster-based local outlier factor (CBLOF), which is meaningful and provides importance to the local data behavior. Expand
LOF: identifying density-based local outliers
TLDR
This paper contends that for many scenarios, it is more meaningful to assign to each object a degree of being an outlier, called the local outlier factor (LOF), and gives a detailed formal analysis showing that LOF enjoys many desirable properties. Expand
LOCI: fast outlier detection using the local correlation integral
TLDR
Experiments show that LOCI and aLOCI can automatically detect outliers and micro-clusters, without user-required cut-offs, and that they quickly spot both expected and unexpected outliers. Expand
Histogram-based traffic anomaly detection
TLDR
This work describes a new approach to feature-based anomaly detection that constructs histograms of different traffic features, models histogram patterns, and identifies deviations from the created models. Expand
Efficient algorithms for mining outliers from large data sets
TLDR
A novel formulation for distance-based outliers that is based on the distance of a point from its kth nearest neighbor is proposed and the top n points in this ranking are declared to be outliers. Expand
Anomaly detection: A survey
TLDR
This survey tries to provide a structured and comprehensive overview of the research on anomaly detection by grouping existing techniques into different categories based on the underlying approach adopted by each technique. Expand
Packetscore: statistics-based overload control against distributed denial-of-service attacks
TLDR
A key idea is to prioritize packets based on a per-packet score which estimates the legitimacy of a packet given the attribute values it carries, and perform score-based selective packet discarding where the dropping threshold is dynamically adjusted based on the score distribution of recent incoming packets. Expand
...
1
2
...