01.04.2015 Views

1FfUrl0

1FfUrl0

1FfUrl0

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Preface<br />

Chapter 4, Topic Modeling, takes us beyond assigning each post to a single cluster<br />

and shows us how assigning them to several topics as real text can deal with<br />

multiple topics.<br />

Chapter 5, Classification – Detecting Poor Answers, explains how to use logistic<br />

regression to find whether a user's answer to a question is good or bad. Behind<br />

the scenes, we will learn how to use the bias-variance trade-off to debug machine<br />

learning models.<br />

Chapter 6, Classification II – Sentiment Analysis, introduces how Naive Bayes<br />

works, and how to use it to classify tweets in order to see whether they are<br />

positive or negative.<br />

Chapter 7, Regression – Recommendations, discusses a classical topic in handling<br />

data, but it is still relevant today. We will use it to build recommendation<br />

systems, a system that can take user input about the likes and dislikes to<br />

recommend new products.<br />

Chapter 8, Regression – Recommendations Improved, improves our recommendations<br />

by using multiple methods at once. We will also see how to build recommendations<br />

just from shopping data without the need of rating data (which users do not<br />

always provide).<br />

Chapter 9, Classification III – Music Genre Classification, illustrates how if someone has<br />

scrambled our huge music collection, then our only hope to create an order is to let<br />

a machine learner classify our songs. It will turn out that it is sometimes better to<br />

trust someone else's expertise than creating features ourselves.<br />

Chapter 10, Computer Vision – Pattern Recognition, explains how to apply classifications<br />

in the specific context of handling images, a field known as pattern recognition.<br />

Chapter 11, Dimensionality Reduction, teaches us what other methods exist<br />

that can help us in downsizing data so that it is chewable by our machine<br />

learning algorithms.<br />

Chapter 12, Big(ger) Data, explains how data sizes keep getting bigger, and how<br />

this often becomes a problem for the analysis. In this chapter, we explore some<br />

approaches to deal with larger data by taking advantage of multiple core or<br />

computing clusters. We also have an introduction to using cloud computing<br />

(using Amazon's Web Services as our cloud provider).<br />

Appendix, Where to Learn More about Machine Learning, covers a list of wonderful<br />

resources available for machine learning.<br />

[ 2 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!