GrammarViz 2.0: A Tool for Grammar-Based Pattern Discovery in Time Series

@inproceedings{Senin2014GrammarViz2A,
  title={GrammarViz 2.0: A Tool for Grammar-Based Pattern Discovery in Time Series},
  author={Pavel Senin and Jessica Lin and Xing Wang and Tim Oates and S. Gandhi and Arnold P. Boedihardjo and Crystal Chen and Susan Frankenstein and Manfred Lerner},
  booktitle={ECML/PKDD},
  year={2014}
}
The problem of frequent and anomalous patterns discovery in time series has received a lot of attention in the past decade. Addressing the common limitation of existing techniques, which require a pattern length to be known in advance, we recently proposed grammar-based algorithms for efficient discovery of variable length frequent and rare patterns. In this paper we present GrammarViz 2.0, an interactive tool that, based on our previous work, implements algorithms for grammar-driven mining and… Expand

Figures and Topics from this paper

GrammarViz 3.0
TLDR
GrammarViz 3.0 is presented—a software package that provides implementations of proposed algorithms and graphical user interface for interactive variable-length time series pattern discovery and introduces an experimental procedure for automated discretization parameter selection that builds upon the minimum cardinality maximum cover principle and aids the time series recurrent and anomalous pattern discovery. Expand
IterativE Grammar-Based Framework for Discovering Variable-Length Time Series Motifs
TLDR
This work proposes an iterative framework based on grammar induction that can find motifs of the same quality, with much faster running time compared to the state-of-the-art variable-length exact motif discovery algorithm in real world time series data. Expand
Time series anomaly discovery with grammar-based compression
TLDR
These approaches not only allow discovery of multiple variable-length anomalous subsequences at once, but also significantly outperform the current state-of-the-art exact algorithms for time series anomaly detection. Expand
Ensemble Grammar Induction For Detecting Anomalies in Time Series
TLDR
It is demonstrated that the proposed ensemble approach can outperform existing grammar-induction-based approaches with different criteria for selection of parameter values and achieve performance similar to that of the state-of-the-art distance-based anomaly detection algorithm. Expand
RPM: Representative Pattern Mining for Efficient Time Series Classification
TLDR
This work proposes a novel algorithm for time series classication based on the discovery of class-specic representative patterns that selects the most representative patterns of a class as a set of subsequences that has the greatest discriminative power to distinguish one class of time series from another. Expand
Shape grammar extraction for efficient query-by-sketch pattern matching in long time series
TLDR
This work proposes an efficient method for exploring user-sketched patterns, incorporating the domain expert's knowledge, in time series data through a shape grammar based approach and demonstrates the effectiveness of the method in a case study on stock market data. Expand
A Framework for Pattern Mining and Anomaly Detection in Multi-dimensional Time Series and Event Logs
TLDR
This work proposes a framework, consisting of generic transformations, that allows to combine state-of-the-art time series representation, pattern mining, and pattern-based anomaly detection algorithms, and presents an open-source, lightweight, interactive tool that assists both pattern mining and domain experts to select algorithms, specify parameters, and visually inspect the results. Expand
Efficient discovery of time series motifs with large length range in million scale time series
TLDR
This work introduces an approximate algorithm called HierarchIcal based Motif Enumeration (HIME) to detect variable-length motifs with a large enumeration range in million-scale time series and shows that the scalability of the proposed algorithm is significantly better than that of the state-of-theart algorithm. Expand
Exposé Master’s Thesis: A Synthetic Motif Generator
In colloquial language a time series is a (mostly long) sequence of reals and a subsequence is a sequence of consecutive values in a time series. Two sequences of reals are matching if both have theExpand
HIME: discovering variable-length motifs in large-scale time series
TLDR
This work introduces an approximate algorithm called hierarchical-based motif enumeration (HIME) to detect variable-length motifs with a large enumeration range in million-scale time series and shows that the scalability of the proposed algorithm is significantly better than that of the state-of-the-art algorithm. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 12 REFERENCES
Finding Motifs in Time Series
TLDR
An efficient motif discovery algorithm for time series would be useful as a tool for summarizing and visualizing massive time series databases and could be used as a subroutine in various other data mining tasks, including the discovery of association rules, clustering and classification. Expand
Mining motifs in massive time series databases
TLDR
This paper carefully motivate, then introduces, a nontrivial definition of time series motifs, and proposes an efficient algorithm to discover them, and demonstrates the utility and efficiency of the approach on several real world datasets. Expand
Visualizing Variable-Length Time Series Motifs
TLDR
It is demonstrated that grammar induction in time series can effectively identify repeated patterns without prior knowledge of their lengths, and a motif visualization system based on grammar induction is developed. Expand
Visual exploration of frequent patterns in multivariate time series
TLDR
Four novel visual analytics methods are introduced to interactively examine motifs and gain new insights into the recurring patterns to analyze system operations, and both power consumption and server utilization in data centers are predicted. Expand
Motif discovery in spatial trajectories using grammar inference
TLDR
This work proposes a fast and robust grammar induction algorithm called mSEQUITUR to infer a grammar rule set from a trajectory for motif generation and designed the Symbolic Trajectory Analysis and VIsualization System (STAVIS), the first of its kind trajectory analytical system. Expand
Identifying Hierarchical Structure in Sequences: A linear-time algorithm
SEQUITUR is an algorithm that infers a hierarchical structure from a sequence of discrete symbols by replacing repeated phrases with a grammatical rule that generates the phrase, and continuing thisExpand
HOT SAX: efficiently finding the most unusual time series subsequence
TLDR
The utility of discords with objective experiments on domains as diverse as Space Shuttle telemetry monitoring, medicine, surveillance, and industry, and the effectiveness of the discord discovery algorithm with more than one million experiments, on 82 different datasets from diverse domains are demonstrated. Expand
Detecting Anomalies In Time Series Data
TLDR
An anomaly detection algorithm as a monitoring aid applied to time series data from the pulp and paper industry, developed for the company Eurocon MOPSsys AB is shown to be able to find deviating patterns. Expand
Visually mining and monitoring massive time series
TLDR
VizTree is introduced, a novel time-series visualization tool to aid the Aerospace analysts who must make these engineering assessments, and is unique in that the same tool is used for mining archival data and monitoring incoming live telemetry. Expand
Grammar-driven anomaly discovery in time series
  • CSDL Techreport 14-05
  • 2014
...
1
2
...