13.07.2015 Views

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

Data Mining: Practical Machine Learning Tools and ... - LIDeCC

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1.2 SIMPLE EXAMPLES: THE WEATHER PROBLEM AND OTHERS 9computers. To decide whether something has actually learned, you need to seewhether it intended to or whether there was any purpose involved. That makesthe concept moot when applied to machines because whether artifacts can behavepurposefully is unclear. Philosophic discussions of what is really meant by “learning,”like discussions of what is really meant by “intention” or “purpose,” arefraught with difficulty. Even courts of law find intention hard to grapple with.<strong>Data</strong> miningFortunately, the kind of learning techniques explained in this book do notpresent these conceptual problems—they are called machine learning withoutreally presupposing any particular philosophic stance about what learning actuallyis. <strong>Data</strong> mining is a practical topic <strong>and</strong> involves learning in a practical, nota theoretical, sense. We are interested in techniques for finding <strong>and</strong> describingstructural patterns in data as a tool for helping to explain that data <strong>and</strong> makepredictions from it. The data will take the form of a set of examples—examplesof customers who have switched loyalties, for instance, or situations in whichcertain kinds of contact lenses can be prescribed. The output takes the form ofpredictions about new examples—a prediction of whether a particular customerwill switch or a prediction of what kind of lens will be prescribed under givencircumstances. But because this book is about finding <strong>and</strong> describing patternsin data, the output may also include an actual description of a structure thatcan be used to classify unknown examples to explain the decision. As well asperformance, it is helpful to supply an explicit representation of the knowledgethat is acquired. In essence, this reflects both definitions of learning consideredpreviously: the acquisition of knowledge <strong>and</strong> the ability to use it.Many learning techniques look for structural descriptions of what is learned,descriptions that can become fairly complex <strong>and</strong> are typically expressed as setsof rules such as the ones described previously or the decision trees describedlater in this chapter. Because they can be understood by people, these descriptionsserve to explain what has been learned <strong>and</strong> explain the basis for new predictions.Experience shows that in many applications of machine learning todata mining, the explicit knowledge structures that are acquired, the structuraldescriptions, are at least as important, <strong>and</strong> often very much more important,than the ability to perform well on new examples. People frequently use datamining to gain knowledge, not just predictions. Gaining knowledge from datacertainly sounds like a good idea if you can do it. To find out how, read on!1.2 Simple examples: The weather problem <strong>and</strong> othersWe use a lot of examples in this book, which seems particularly appropriate consideringthat the book is all about learning from examples! There are several

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!