Load balancing in MapReduce on homogeneous and heterogeneous clusters: an in-depth review

  title={Load balancing in MapReduce on homogeneous and heterogeneous clusters: an in-depth review},
  author={Mohammad Javad Kargar and Meysam Vakili},
Numbers of various programming models have been proposed to process big data in recent years. However, MapReduce is the most famous programming model amongst cloud computing environments and includes many advantages, yet there are several challenges to deal with. Load balancing is considered as one of the most significant downsides of MapReduce which causes the increase in applications’ runtime and accordingly results in less-efficiency, where there is no appropriate proposed mechanism… CONTINUE READING


Publications citing this paper.

Performance evaluation and analysis of load balancing algorithms in cloud computing environments

2016 Second International Conference on Web Research (ICWR) • 2016
View 4 Excerpts
Highly Influenced


Publications referenced by this paper.
Showing 1-10 of 29 references

PIKACHU: How to Rebalance Load in Optimizing MapReduce On Heterogeneous Clusters

USENIX Annual Technical Conference • 2013
View 8 Excerpts
Highly Influenced

Improving MapReduce performance through data placement in heterogeneous Hadoop clusters

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW) • 2010
View 4 Excerpts
Highly Influenced

LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud

2010 IEEE Second International Conference on Cloud Computing Technology and Science • 2010
View 10 Excerpts
Highly Influenced

Balancing reducer workload for skewed data using sampling-based partitioning

Computers & Electrical Engineering • 2014
View 3 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…