Comparison of Dataflow Control Techniques In Distributed Data-Intensive Systems

Abstract

In dataflow architectures, each dataflow node (i.e., operation) is typically executed on a single physical node. We are concerned with distributed data-intensive systems, in which each base (i.e., persistent) set of data has been declustered over many physical nodes to achieve load balancing. Because of large base set size, each operation is executed where… (More)
DOI: 10.1145/55595.55614

8 Figures and Tables

Topics

  • Presentations referencing similar topics