[Figure: traffic caused by map tasks and traffic caused by reduce tasks between racks (Rack 1, Rack 2) connected through an aggregation switch.]

Running MapReduce in a shared cluster has recently become a common way to process large-scale data analytics applications while improving cluster utilization. However, sharing the network among various applications can leave MapReduce applications with constrained and heterogeneous network bandwidth. This further increases the severity of network…