19.08.2013 Views

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

M map tasks, R reduce tasks<br />

Rule of a thumb:<br />

Make M <strong>and</strong> R much larger than the number of<br />

nodes in cluster<br />

One DFS chunk per map is common<br />

Improves dynamic load balancing <strong>and</strong> speeds<br />

recovery from worker failure<br />

Usually R is smaller than M<br />

because output is spread across R files<br />

1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 47

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!