Introduction and MapReduce - SNAP - Stanford University
Introduction and MapReduce - SNAP - Stanford University
Introduction and MapReduce - SNAP - Stanford University
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
M map tasks, R reduce tasks<br />
Rule of a thumb:<br />
Make M <strong>and</strong> R much larger than the number of<br />
nodes in cluster<br />
One DFS chunk per map is common<br />
Improves dynamic load balancing <strong>and</strong> speeds<br />
recovery from worker failure<br />
Usually R is smaller than M<br />
because output is spread across R files<br />
1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 47