Introduction and MapReduce - SNAP - Stanford University


For a map-reduce algorithm:

Communication cost = input file size + 2 × (sum of the sizes of all files passed from Map processes to Reduce processes) + the sum of the output sizes of the Reduce processes.

Elapsed communication cost is the sum of the largest input + output for any Map process, plus the same for any Reduce process.
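The two definitions above can be checked with a small calculation. The sketch below is illustrative only: the `Task` class, the function names, and all sizes are hypothetical and do not come from the slide. It assumes sizes are given in a common unit (say, megabytes) and that each task's input and output sizes are known.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical record of the data one Map or Reduce process reads and writes."""
    input_size: int
    output_size: int


def communication_cost(input_file_size, map_to_reduce_sizes, reduce_output_sizes):
    """Input size + 2 x (total size of Map-to-Reduce files) + total Reduce output size."""
    return input_file_size + 2 * sum(map_to_reduce_sizes) + sum(reduce_output_sizes)


def elapsed_communication_cost(map_tasks, reduce_tasks):
    """Largest input + output over Map processes, plus the same over Reduce processes."""
    largest_map = max(t.input_size + t.output_size for t in map_tasks)
    largest_reduce = max(t.input_size + t.output_size for t in reduce_tasks)
    return largest_map + largest_reduce


# Made-up sizes: a 100-unit input split across two Map processes.
map_tasks = [Task(input_size=50, output_size=30), Task(input_size=50, output_size=20)]
reduce_tasks = [Task(input_size=30, output_size=5), Task(input_size=20, output_size=10)]

total = communication_cost(
    input_file_size=100,
    map_to_reduce_sizes=[t.output_size for t in map_tasks],
    reduce_output_sizes=[t.output_size for t in reduce_tasks],
)
elapsed = elapsed_communication_cost(map_tasks, reduce_tasks)

print(total)    # 100 + 2 * (30 + 20) + (5 + 10) = 215
print(elapsed)  # (50 + 30) + (30 + 5) = 115
```

With these numbers the total communication cost counts all data moved, while the elapsed cost follows only the most heavily loaded Map process and the most heavily loaded Reduce process.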

1/8/2012 Jure Leskovec, Stanford CS246: Mining Massive Datasets, http://cs246.stanford.edu
