HadoopToSQL: a mapReduce query optimizer

Ming-Yee Iu and Willy Zwaenepoel
MapReduce is a cost-effective way to achieve scalable performance for many log-processing workloads. These workloads typically process their entire dataset. MapReduce can be inefficient, however, when handling business-oriented workloads, especially when these workloads access only a subset of the data. HadoopToSQL seeks to improve MapReduce performance for the latter class of workloads by transforming MapReduce queries to use the indexing, aggregation and grouping features provided by SQL…
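To make the idea in the abstract concrete, the sketch below contrasts a MapReduce-style job that scans every record with a SQL query that computes the same result using a `WHERE` filter, `GROUP BY` and `SUM`. It is only an illustration under stated assumptions: the `orders` table, its columns, the sample rows and all function names are hypothetical, the SQL is written by hand rather than produced by HadoopToSQL, and SQLite stands in for whatever SQL database would actually be used.

```python
import sqlite3
from collections import defaultdict

# Hypothetical order records: (order_id, region, amount).
ORDERS = [
    (1, "EU", 120.0),
    (2, "US", 80.0),
    (3, "EU", 40.0),
    (4, "ASIA", 200.0),
    (5, "US", 20.0),
]


def map_fn(record):
    """Emit (region, amount) only for the regions of interest."""
    _order_id, region, amount = record
    if region in ("EU", "US"):
        yield region, amount


def reduce_fn(key, values):
    """Sum all amounts emitted for one region."""
    return key, sum(values)


def run_mapreduce(records):
    """Scan every record, group the map output by key, then reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in map_fn(record):
            groups[key].append(value)
    return dict(reduce_fn(key, values) for key, values in groups.items())


def run_sql(records):
    """Compute the same result with a hand-written SQL query (an assumption,
    not the output of HadoopToSQL)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (order_id INTEGER, region TEXT, amount REAL)"
    )
    # An index on the filtered column lets the database avoid a full scan.
    conn.execute("CREATE INDEX idx_orders_region ON orders (region)")
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    rows = conn.execute(
        "SELECT region, SUM(amount) FROM orders "
        "WHERE region IN ('EU', 'US') GROUP BY region"
    ).fetchall()
    conn.close()
    return dict(rows)


mapreduce_result = run_mapreduce(ORDERS)
sql_result = run_sql(ORDERS)
```

Both functions return the same per-region totals. The difference is where the work happens: the MapReduce version reads every record and filters inside the map function, while the SQL version hands the filter, grouping and aggregation to the database, which can use the index to touch only the matching rows.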
This paper has 58 citations.
