MapReduce

Known as: Hadoop map, Map/reduce, Map Reduce 
MapReduce is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed… (More)
Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2010
Highly Cited
2010
MapReduce is a popular framework for data-intensive distributed computing of batch jobs. To simplify fault tolerance, the output… (More)
  • figure 2
  • figure 4
  • figure 5
  • figure 6
  • figure 7
Is this relevant?
Highly Cited
2010
Highly Cited
2010
MapReduce advantages over parallel databases include storage-system independence and fine-grain fault tolerance for large jobs. 
Is this relevant?
Highly Cited
2010
Highly Cited
2010
MapReduce programming model has simplified the implementation of many data parallel applications. The simplicity of the… (More)
  • figure 1
  • figure 2
  • figure 3
  • table 1
  • figure 4
Is this relevant?
Highly Cited
2010
Highly Cited
2010
MapReduce complements DBMSs since databases are not designed for extract-transform-load tasks, a MapReduce specialty. 
Is this relevant?
Highly Cited
2009
Highly Cited
2009
MOTIVATION Next-generation DNA sequencing machines are generating an enormous amount of sequence data, placing unprecedented… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2009
Highly Cited
2009
Data clustering has been received considerable attention in many applications, such as data mining, document retrieval, image… (More)
  • figure 1
Is this relevant?
Highly Cited
2009
Highly Cited
2009
Sharing a MapReduce cluster between users is attractive because it enables statistical multiplexing (lowering costs) and allows… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2008
Highly Cited
2008
We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming… (More)
  • figure 1
  • figure 2
  • table 1
  • figure 3
  • table 2
Is this relevant?
Highly Cited
2008
Highly Cited
2008
Most scientific data analyses comprise analyzing voluminous data collected from various instruments. Efficient parallel… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • figure 5
Is this relevant?
Highly Cited
2004
Highly Cited
2004
MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable… (More)
  • figure 1
  • figure 2
  • figure 3
  • figure 4
  • table 1
Is this relevant?