08.06.2015 Views

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

classification model<br />

building 35, 37<br />

evaluating 38-40<br />

loss function 40<br />

search procedure 40<br />

structure 40<br />

classification performance<br />

improving, <strong>with</strong> Mel Frequency<br />

Cepstral Coefficients 193-196<br />

classifier<br />

building, FFT used 186, 187<br />

classes, using 130, 132<br />

creating 95, 128-130<br />

integrating, into site 115<br />

parameters, tuning 132-136<br />

performance, improving 101, 102<br />

performance, measuring 97<br />

slimming 114<br />

training 97, 187<br />

classifier, classy answers<br />

tuning 90<br />

classifier performance<br />

measuring, receiver operator characteristic<br />

(ROC) used 190, 191<br />

classifier performance, improving<br />

Bias-variance 102<br />

high bias, fixing 102<br />

high bias or low bias 103-105<br />

high variance, fixing 103<br />

classy answers<br />

classifier, tuning 90<br />

classifying 90<br />

instance, tuning 90<br />

cloud machine<br />

jug, running on 254, 255<br />

cluster generation<br />

automating, <strong>with</strong> starcluster 255-258<br />

clustering<br />

about 50, 62<br />

flat clustering 63<br />

hierarchical clustering 63<br />

KMeans algorithm 63-65<br />

test data, obtaining for idea<br />

evaluation 65-67<br />

cluster package 17<br />

CommentCount attribute 93<br />

complex classifiers<br />

building 40, 41<br />

complex dataset 41<br />

computer vision 199<br />

confusion matrix<br />

used, for accuracy measurement in<br />

multiclass problems 188-190<br />

constants package 18<br />

correlation<br />

about 223<br />

using 223-225<br />

cost function 41<br />

CountVectorizer 52<br />

Coursera<br />

URL 261<br />

CreationDate attribute 93<br />

Cross Validated<br />

about 11, 262<br />

URL 262<br />

cross-validation 38, 39<br />

cross-validation, for regression 151<br />

cross-validation schedule 40<br />

D<br />

data<br />

fetching 91, 92<br />

slimming down, to chewable chunks 92<br />

data analysis<br />

jug, using for 246-248<br />

data, machine learning application<br />

cleaning 20, 21<br />

preprocessing 20, 21<br />

reading 19, 20<br />

data sources, machine language 263<br />

dimensionality reduction 222<br />

dot() function 16<br />

E<br />

Elastic net model 154<br />

Elastic nets<br />

using, in scikit-Learn 154<br />

ensemble learning 170<br />

Enthought <strong>Python</strong> Distribution<br />

URL 12<br />

[ 266 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!