MapReduce: Simplified Data Processing on Large Clusters

  title={MapReduce: Simplified Data Processing on Large Clusters},
  author={Jeffrey Dean and Sanjay Ghemawat},
MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the… CONTINUE READING
Highly Influential
This paper has highly influenced 2,606 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 29,553 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.


Publications citing this paper.
Showing 1-10 of 12,627 extracted citations

Meteorological Data Analysis Using MapReduce

TheScientificWorldJournal • 2014
View 14 Excerpts
Highly Influenced

Analytics for the Internet of Things: A Survey

ACM Comput. Surv. • 2018
View 7 Excerpts
Highly Influenced

CloneHadoop: Process Cloning to Reduce Hadoop's Long Tail

2018 IEEE/ACM 5th International Conference on Big Data Computing Applications and Technologies (BDCAT) • 2018
View 15 Excerpts
Method Support
Highly Influenced

Exploring Textures in Traffic Matrices to Classify Data Center Communications

2018 IEEE 32nd International Conference on Advanced Information Networking and Applications (AINA) • 2018
View 11 Excerpts
Method Support
Highly Influenced

Merlin: A Language for Managing Network Resources

IEEE/ACM Transactions on Networking • 2018
View 6 Excerpts
Highly Influenced

Parallelizing Shortest Average-Distance Query Processing

2018 1st International Cognitive Cities Conference (IC3) • 2018
View 11 Excerpts
Method Support
Highly Influenced

29,554 Citations

Citations per Year
Semantic Scholar estimates that this publication has 29,554 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.