11.07.2015 Views

2DkcTXceO

2DkcTXceO

2DkcTXceO

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

534 Rise of the machinesFIGURE 44.5Ridge finder applied to simulated cosmic web data.of theoretical computer science and ML. Running an MCMC on an NP-hardproblem might be meaningless. Instead, it may be better to approximate theNP-hard problem with a simpler problem. How often do we teach this to ourstudents?44.6 The evolving meaning of dataFor most of us in statistics, data means numbers. But data now includesimages, documents, videos, web pages, twitter feeds and so on. Traditionaldata — numbers from experiments and observational studies — are still ofvital importance but they represent a tiny fraction of the data out there. Ifwe take the union of all the data in the world, what fraction is being analyzedby statisticians? I think it is a small number.This comes back to education. If our students can’t analyze giant datasetslike millions of twitter feeds or millions of web pages, then other people willanalyze those data. We will end up with a small cut of the pie.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!