19.08.2013 Views

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Input <strong>and</strong> final output are stored on a<br />

distributed file system:<br />

Scheduler tries to schedule map tasks “close” to<br />

physical storage location of input data<br />

Intermediate results are stored on local FS<br />

of map <strong>and</strong> reduce workers<br />

Output is often input to another map<br />

reduce task<br />

1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 44

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!