Improving Data Locality of MapReduce by Scheduling in Homogeneous Computing Environments

  title={Improving Data Locality of MapReduce by Scheduling in Homogeneous Computing Environments},
  author={Xiaohong Zhang and Zhiyong Zhong and Shengzhong Feng and Bibo Tu and Jianping Fan},
  journal={2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with Applications},
Data Locality is one of the critical factors to affect performance. This paper proposes a next-k-node scheduling (NKS) method to improve the data locality of map tasks. The method first calculates the probabilities of each map task, and then preferentially schedules the one with the highest probability. It generates low probabilities for the tasks which satisfy node locality with the nodes to issue requests, so it can reserve these tasks to these nodes. We have implemented the NKS method in… CONTINUE READING
Highly Cited
This paper has 62 citations. REVIEW CITATIONS
37 Citations
12 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 37 extracted citations

62 Citations

Citations per Year
Semantic Scholar estimates that this publication has 62 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 12 references

Adaptive task scheduling for multijob mapreduce environments

  • J. Polo, D. de Nadal, +4 authors E. Ayguadé
  • Technical report UPC-DAC- RR-CAP-2009-28…
  • 2009
1 Excerpt

Research. development and practice of cloud computing in china

  • X. Huang
  • 2009
1 Excerpt

Similar Papers

Loading similar papers…