Data splits MASTER Reducer Input data Output data Mapper Job descriptor Distributed File System Mapper Mapper Intermediate results Reducer Final results

@inproceedings{Marozzo2011DataSM,
  title={Data splits MASTER Reducer Input data Output data Mapper Job descriptor Distributed File System Mapper Mapper Intermediate results Reducer Final results},
  author={Fabrizio Marozzo and Domenico Talia and Paolo Trunfio},
  year={2011}
}
MapReduce is a programming model for parallel data processing widely used in Cloud computing environments. Current MapReduce implementations are based on centralized master-slave architectures that do not cope well with dynamic Cloud infrastructures, like a Cloud of clouds, in which nodes may join and leave the network at high rates. We have designed an adaptive MapReduce framework, called P2P-MapReduce, which exploits a peer-to-peer model to manage node churn, master failures, and job recovery… CONTINUE READING