19.08.2013 Views

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

A big data-mining risk is that you will<br />

“discover” patterns that are meaningless.<br />

Bonferroni’s principle: (roughly) if you look in<br />

more places for interesting patterns than your<br />

amount of data will support, you are bound to<br />

find crap<br />

1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 20

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!