Performance Modeling and Optimization of Deadline-Driven Pig Programs

Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo
Many applications associated with live business intelligence are written as complex data analysis programs defined by directed acyclic graphs of MapReduce jobs, for example, using the Pig, Hive, or Scope frameworks. An increasing number of these applications have additional requirements for completion time guarantees. In this article, we consider the popular Pig framework, which provides a high-level SQL-like abstraction on top of the MapReduce engine for processing large data sets. There is a lack of…