Corpus ID: 2992790

Lightweight Asynchronous Snapshots for Distributed Dataflows

@article{Carbone2015LightweightAS,
  title={Lightweight Asynchronous Snapshots for Distributed Dataflows},
  author={Paris Carbone and Gyula F{\'o}ra and Stephan Ewen and Seif Haridi and Kostas Tzoumas},
  journal={ArXiv},
  year={2015},
  volume={abs/1506.08603}
}
Distributed stateful stream processing enables the deployment and execution of large scale continuous computations in the cloud, targeting both low latency and high throughput. [...] Key Method We implemented ABS on Apache Flink, a distributed analytics engine that supports stateful stream processing. Our evaluation shows that our algorithm does not have a heavy impact on the execution, maintaining linear scalability and performing well with frequent snapshots.Expand
75 Citations
Fast and Precise recovery in Stream processing based on Distributed Cache
Efficient Migration of Very Large Distributed State for Scalable Stream Processing
  • 3
  • PDF
Large-Scale Data Stream Processing Systems
  • 12
StreamScope: Continuous Reliable Distributed Processing of Big Data Streams
  • 61
  • PDF
Stream processing platforms for analyzing big dynamic data
  • PDF
Lineage stash: fault tolerance off the critical path
  • 12
  • Highly Influenced
  • PDF
AutoFlow: Hotspot-Aware, Dynamic Load Balancing for Distributed Stream Processing
  • PDF
Asynchronous snapshots of actor systems for latency-sensitive applications
  • 2
  • Highly Influenced
  • PDF
TSpoon: Transactions on a stream processor
  • 1
Drizzle: Fast and Adaptable Stream Processing at Scale
  • 122
  • PDF
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 16 REFERENCES
Naiad: a timely dataflow system
  • 649
  • Highly Influential
  • PDF
TimeStream: reliable stream computation in the cloud
  • 245
  • PDF
Making State Explicit for Imperative Big Data Processing
  • 43
  • PDF
Discretized Streams: An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters
  • 495
  • PDF
I An intrqduction to snapshot I algorithms in distributed computing
  • 16
Comet: batched stream processing for data intensive distributed computing
  • 161
  • PDF
The Stratosphere platform for big data analytics
  • 419
  • PDF
...
1
2
...