ASAP: Prioritizing Attention via Time Series Smoothing

@article{Rong2017ASAPPA,
  title={ASAP: Prioritizing Attention via Time Series Smoothing},
  author={Kexin Rong and Peter D. Bailis},
  journal={Proc. VLDB Endow.},
  year={2017},
  volume={10},
  pages={1358-1369}
}
Time series visualization of streaming telemetry (i.e., charting of key metrics such as server load over time) is increasingly prevalent in modern data platforms and applications. However, many existing systems simply plot the raw data streams as they arrive, often obscuring large-scale trends due to small-scale noise. We propose an alternative: to better prioritize end users' attention, smooth time series visualizations as much as possible to remove noise, while retaining large-scale structure… 

Prioritizing Attention via Time Series Smoothing

This work develops a new analytics operator called ASAP that automatically smooths streaming time series by adaptively optimizing the trade-off between noise reduction and trend retention, and introduces metrics to quantitatively assess the quality of smoothed plots.

OnlineSTL: Scaling Time Series Decomposition by 100x

OnlineSTL is proposed, a novel online algorithm for time series decomposition which is highly scalable and is deployed for real-time metrics monitoring on high-resolution, high-ingest rate data.

Pattern Matching with Adaptive Granularity Over Streaming Time Series

A novel approach to solve fine-grained matching of streaming time series data from sensors with lower latency and limited computing resource is proposed, which outperforms the brute-force method and MSM, a multi-step filter mechanism over the multi-scaled representation, by orders of magnitude.

MacroBase: Prioritizing Attention in Fast Data

MacroBase is able to deliver order-of-magnitude speedups over alternatives by optimizing the combination of explanation and classification tasks and by leveraging a new reservoir sampler and heavy-hitters sketch specialized for fast data streams.

Demonstration: MacroBase, A Fast Data Analysis Engine

MacroBase is developed, a fast data analytics engine that acts as a search engine over fast data streams that provides a set of highly-optimized, modular operators for streaming feature transformation, classification, and explanation.

Towards a Near Universal Time Series Data Mining Tool: Introducing the Matrix Profile

The utility of matrix profile is demonstrated for many time series data mining problems, including motif discovery, discord discovery, weakly labeled time series classification, and representation learning on domains as diverse as seismology, entomology, music processing, bioinformatics, human activity monitoring, electrical power-demand monitoring, and medicine.

Map-Based Visual Exploration of Geolocated Time Series

This work presents an approach that relies on a hybrid spatial-time series index to allow for interactive map-based visual exploration and summarization of geolocated time series data, and uses the BTSR-tree index, which extends the R-tree by maintaining bounds for the time series indexed at each node.

Stream Data Cleaning under Speed and Acceleration Constraints

This article proposes the constraint-based approach for cleaning stream data, and includes a polynomial time algorithm for global optimum, linear time algorithm towards local optimum under an efficient median-based solution, and experiments demonstrate that the method can show significantly lower L1 error than the existing approaches such as smoother.

Mr. Plotter: Unifying Data Reduction Techniques in Storage and Visualization Systems

A new architecture is presented in which the visualizer and data store are tightly coupled, which enables a new class of visualization clients that automatically manage data, quickly and transparently requesting data from the underlying database without requiring the user to explicitly initiate queries.

TSExplain: Surfacing Evolving Explanations for Time Series

TSExplain is demonstrated, a system to help users understand the underlying evolving explanations for any aggregated time-series, and models the explanation problem as a segmentation problem over the time dimension.

References

SHOWING 1-10 OF 67 REFERENCES

Prioritizing Attention via Time Series Smoothing

This work develops a new analytics operator called ASAP that automatically smooths streaming time series by adaptively optimizing the trade-off between noise reduction and trend retention, and introduces metrics to quantitatively assess the quality of smoothed plots.

M4: A Visualization-Oriented Time Series Data Aggregation

This work introduces M4, an aggregation-based time series dimensionality reduction technique that provides error-free visualizations at high data reduction rates and explains in detail the drawbacks of existing data reduction techniques and how this approach outperforms state of the art, by respecting the process of line rasterization.

Prioritizing Attention in Analytic Monitoring

A set of design principles for the design of fast data analytics engines that leverage the relative scarcity of human attention and overabundance of data are presented: return fewer results, prioritize iterative analysis, and filter fast to compute less.

Gorilla: A Fast, Scalable, In-Memory Time Series Database

Gorilla, Facebook's in-memory TSDB, is introduced and insight is that users of monitoring systems do not place much emphasis on individual data points but rather on aggregate analysis, and recent data points are of much higher value than older points to quickly detect and diagnose the root cause of an ongoing problem.

MacroBase: Prioritizing Attention in Fast Data

MacroBase is able to deliver order-of-magnitude speedups over alternatives by optimizing the combination of explanation and classification tasks and by leveraging a new reservoir sampler and heavy-hitters sketch specialized for fast data streams.

General Incremental Sliding-Window Aggregation

Reactive Aggregator (RA), a new framework for incremental sliding-window aggregation, is presented, which has an algorithmic complexity of O(m + m log (n/m)), rivaling the best prior algorithms for any m.

zenvisage: Effortless Visual Data Exploration

Data visualization is by far the most commonly used mechanism to explore data, especially by novice data analysts and data scientists. And yet, current visual analytics tools are rather limited in

HOT SAX: efficiently finding the most unusual time series subsequence

The utility of discords with objective experiments on domains as diverse as Space Shuttle telemetry monitoring, medicine, surveillance, and industry, and the effectiveness of the discord discovery algorithm with more than one million experiments, on 82 different datasets from diverse domains are demonstrated.

WaveScope: a signal-oriented data stream management system

This poster shows how WaveScope supports a large number of signal-oriented streaming sensor applications, such as preventive maintenance of industrial equipment; detection of fractures and ruptures in various structures; in situ animal behavior studies using acoustic sensing; network traffic analysis; and medical applications such as anomaly detection in EKGs.

Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System

Data visualization is by far the most commonly used mechanism to explore data, especially by novice data analysts and data scientists. And yet, current visual analytics tools are rather limited in
...