Introduction and MapReduce - SNAP - Stanford University
Introduction and MapReduce - SNAP - Stanford University
Introduction and MapReduce - SNAP - Stanford University
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Discovery of patterns <strong>and</strong> models that are:<br />
Valid: hold on new data with some certainty<br />
Useful: should be possible to act on the item<br />
Unexpected: non-obvious to the system<br />
Underst<strong>and</strong>able: humans should be able to<br />
interpret the pattern<br />
Subsidiary issues:<br />
Data cleansing: detection of bogus data<br />
Visualization: something better than MBs of output<br />
Warehousing of data (for retrieval)<br />
1/8/2012 Jure Leskovec, <strong>Stanford</strong> CS246: Mining Massive Datasets, http://cs246.stanford.edu 16