Spreadsheets for stream processing with unbounded windows and partitions

@article{Hirzel2016SpreadsheetsFS,
  title={Spreadsheets for stream processing with unbounded windows and partitions},
  author={Martin Hirzel and Rodric M. Rabbah and Philippe Suter and Olivier Tardieu and Mandana Vaziri},
  journal={Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems},
  year={2016}
}
  • Martin Hirzel, R. Rabbah, M. Vaziri
  • Published 13 June 2016
  • Computer Science
  • Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems
Stream processing is a computational paradigm that allows the analysis of live data streams as they are produced. This paper describes a programming model, based on enhancements to spreadsheets, that enables users with limited programming experience to participate directly in the development of complex streaming applications. The programming model augments a conventional spreadsheet with streaming features that permit operating over unbounded data sets despite the finite interface provided by… 

Figures and Tables from this paper

SPL: An Extensible Language for Distributed Stream Processing
TLDR
SPL is now the gateway for the IBM Streams platform, used by the customers for stream processing in a broad range of application domains, with an emphasis on the language design, distributed runtime, and extensibility mechanism.
Hardware-Conscious Stream Processing
TLDR
A systematic survey of recent work in the field of DSPSs, particularly along with the following three directions: 1) computation optimization, 2) stream I/O optimization, and 3) query deployment is conducted.
Hardware-Conscious Stream Processing: A Survey
TLDR
A systematic survey of recent work in the field of DSPSs, particularly along with the following three directions: 1) computation optimization, 2) stream I/O optimization, and 3) query deployment is conducted.
Scalable Complex Event Processing on a Notebook: Demo
TLDR
This work implements WSO2 Data Analytics Server (DAS)'s event processor in Apache Zeppelin notebook environment and demonstrates how such network could be extended for distributed stream processing scenario using WSO1 DAS and Apache Storm.
Sliding-Window Aggregation Algorithms: Tutorial
TLDR
This paper was written to accompany a tutorial, but can be read as a stand-alone survey that aims to better educate the community about fast sliding-window aggregation algorithms for a variety of common aggregation operations and window types.
S Sliding-Window Aggregation Algorithms
TLDR
A sliding-window aggregation algorithm updates the aggregate value of the input collection, often using incremental-computation techniques, as the window contents change over time, as illustrated in Fig. 1.
In-Order Sliding-Window Aggregation in Worst-Case Constant Time
TLDR
A new variant of DABA, the first algorithm for sliding-window aggregation with worst-case constant time, is introduced, which achieves the same time bounds in less memory and only requires space for storing partial aggregates.
Sliding-Window Aggregation Algorithms
Stream Processing Languages and Abstractions
Tutorial: Sliding-Window Aggregation Algorithms
  • 2017
...
...

References

SHOWING 1-10 OF 46 REFERENCES
Stream Processing with a Spreadsheet
TLDR
The design and implementation of an enhanced spreadsheet is presented that enables visualizing live streams, live programming to compute new streams, and exporting computations to be run on a server where they can be shared with other users, and persisted beyond the life of the spreadsheet.
A Spreadsheet Model for Handling Streaming Data
TLDR
The prototype tool presents techniques to let the user stream data from web services and web input elements to a spreadsheet without preprogramming those sources into the tool, allowing the user to view and manipulate streaming data using temporal information.
Spreadsheets for Stream Partitions and Windows
TLDR
It is argued that, while spreadsheets can function as powerful models for stream operators, their fundamental boundedness limits their scope of application.
Haxcel A spreadsheet interface to Haskell written in Java.
TLDR
An extended array library for Haskell is presented, which provides a number of typical array-language facilities and provides an interactive environment that can be used both for development of general Haskell code and for array-oriented spreadsheet calculations.
IBM Streams Processing Language: Analyzing Big Data in motion
TLDR
SPL abstracts away the complexity of the distributed system, instead exposing a simple graph-of-operators view to the user and provides a strong type system and user-defined operator models.
Generic windowing support for extensible stream processing systems
  • B. Gedik
  • Computer Science
    Softw. Pract. Exp.
  • 2014
TLDR
Windowing makes it possible to implement streaming versions of the traditionally blocking relational operators, such as streaming aggregations, joins, and sorts, as well as any other analytic operator that requires keeping the most recent tuples as state,such as time series analysis operators and signal processing operators.
SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems
TLDR
SECRET is a descriptive model that allows users to analyze the behavior of systems and understand the results of window-based queries for a broad range of heterogeneous SPEs.
Auto-parallelizing stateful distributed streaming applications
TLDR
This paper presents a compiler and runtime system that automatically extract data parallelism for distributed stream processing, guaranteeing safety, even in the presence of stateful, selective, and user-defined operators.
Deadlock avoidance for streaming computations with filtering
TLDR
This paper formalizes a model of streaming computation systems with filtering, and proposes provably correct mechanisms to avoid deadlock in distributed systems of diverse computing architectures, where global coordination or modification of buffer sizes may be difficult or impossible in practice.
A catalog of stream processing optimizations
TLDR
A survey of optimizations for stream processing, in a style similar to catalogs of design patterns or refactorings, to help future streaming system builders to stand on the shoulders of giants from not just their own community.
...
...