10.11.2016 Views

Learning Data Mining with Python

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

preprocessing, using pipelines<br />

about 35, 36<br />

example 36, 37<br />

features 35<br />

features, of animal 35<br />

standard preprocessing 37<br />

workflow, creating 38<br />

pricing alerts<br />

URL 295<br />

Principal Component Analysis (PCA) 96, 97<br />

prior belief 122<br />

probabilistic graphical models<br />

URL 308<br />

probabilities<br />

computing 290<br />

programmers, for <strong>Python</strong> language<br />

URL 4<br />

Project Gutenberg<br />

URL 189<br />

Pydoop<br />

about 307<br />

URL 307<br />

Pylearn2<br />

about 306<br />

URL 306<br />

<strong>Python</strong><br />

defining 106<br />

installing 3, 4<br />

URL 3<br />

using 3<br />

<strong>Python</strong> 3.4 3<br />

Q<br />

quotequail package 205<br />

R<br />

RandomForestClassifier 56<br />

random forests<br />

about 26<br />

applying 56, 57<br />

defining 54, 55<br />

ensembles, working 55<br />

new features, engineering 58<br />

parameters 56<br />

reasons, feature selection<br />

complexity, reducing 88<br />

noise, reducing 88<br />

readable models, creating 88<br />

recall 129<br />

recommendation engine<br />

building 307<br />

URL 307<br />

reddit<br />

about 212-215<br />

references 213<br />

URL 215<br />

regularization<br />

URL 97<br />

reinforcement learning<br />

URL 304<br />

RESTful interface (Representational<br />

State Transfer) 213<br />

rules<br />

confidence 10<br />

finding 13<br />

support 10<br />

S<br />

sample size<br />

increasing 304<br />

scikit-learn<br />

installing 6<br />

URL 7<br />

scikit-learn estimators<br />

algorithm, running 32, 33<br />

dataset, loading 29-31<br />

defining 25, 26<br />

distance metrics 27, 28<br />

fit() function 31<br />

Nearest neighbors 26, 27<br />

parameters, setting 33-35<br />

predict() 25<br />

predict() function 31<br />

standard workflow, defining 31<br />

scikit-learn library<br />

about 25<br />

estimators 25<br />

pipelines 25<br />

transformers 25<br />

[ 315 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!