The HiBench benchmark suite: Characterization of the MapReduce-based data analysis

  title={The HiBench benchmark suite: Characterization of the MapReduce-based data analysis},
  author={Shengsheng Huang and Jie Huang and Jinquan Dai and Tao Xie and Bo Huang},
  journal={2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)},
The MapReduce model is becoming prominent for the large-scale data analysis in the cloud. In this paper, we present the benchmarking, evaluation and characterization of Hadoop, an open-source implementation of MapReduce. We first introduce HiBench, a new benchmark suite for Hadoop. It consists of a set of Hadoop programs, including both synthetic micro-benchmarks and real-world Hadoop applications. We then evaluate and characterize the Hadoop framework using HiBench, in terms of speed (i.e… CONTINUE READING
Highly Influential
This paper has highly influenced 81 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 669 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 368 extracted citations

670 Citations

Citations per Year
Semantic Scholar estimates that this publication has 670 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-7 of 7 references

A Comparison of Approaches to Large- Scale Data Analysis

  • A. Pavlo, A. Rasin, +5 authors D. J. Abadi
  • 2009
2 Excerpts

Having fun with PageRank and MapReduce

  • P. Castagna
  • Hadoop User Group UK talk. Available: http…
  • 2009
1 Excerpt

Optimizing Hadoop Deployments

  • Nurcan Coskun
  • Hadoop World 2009 Presentation
  • 2009
1 Excerpt

Winning a 60 Second Dash with a Yellow Elephant

  • O. O’Malley, A. C. Murthy
  • Idle Period
  • 2009
1 Excerpt

Using the Wikipedia page-to-page link database

  • H. Haselgrove
  • Available:…
1 Excerpt

Similar Papers

Loading similar papers…