The map-reduce model

CS 300 (PDC)

Origins

Structure of Hadoop computation

Fault-tolerance in Hadoop

Some additional terms

reduce shard
A subset of key values assigned to a particular single node during the reduce phase of a map-reduce computation.
______
______
______
______
______
______
______
______
______
______
______
______
______
______
______
______
______
______
____________
______