19.08.2013 Views

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

Introduction and MapReduce - SNAP - Stanford University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Discovery of patterns <strong>and</strong> models that are:<br />

Valid: hold on new data with some certainty<br />

Useful: should be possible to act on the item<br />

Unexpected: non-obvious to the system<br />

Underst<strong>and</strong>able: humans should be able to<br />

interpret the pattern<br />

Subsidiary issues:<br />

Data cleansing: detection of bogus data<br />

Visualization: something better than MBs of output<br />

Warehousing of data (for retrieval)<br />

1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!