10.11.2016 Views

Learning Data Mining with Python

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

scikit-learn package<br />

references 305<br />

sepal length 16<br />

sepal width 16<br />

shapes adding, CAPTCHAs<br />

URL 303<br />

Silhouette Coefficient<br />

about 155<br />

computing 155<br />

parameters 158<br />

similarity graph<br />

creating 147-151<br />

SNAP<br />

URL 303<br />

softmax nonlinearity 252<br />

Spam detection<br />

references 302<br />

spam filter 129<br />

sparse matrix 27<br />

sparse matrix format 66<br />

sports outcome prediction<br />

about 49<br />

features 50<br />

stacking 53<br />

standings<br />

loading 50-54<br />

standings data<br />

obtaining 50<br />

URL 50<br />

Stratified K Fold 32<br />

style sheets 218<br />

stylometry 186<br />

subgraphs<br />

connected components 151-153<br />

criteria, optimizing 155-159<br />

finding 151<br />

subreddits 212, 215<br />

SVMs<br />

about 196<br />

classifying <strong>with</strong> 197<br />

kernels 198<br />

URL 197<br />

system<br />

building, for taking image as<br />

input 242-244<br />

T<br />

temporal analysis 305<br />

text<br />

about 106<br />

extracting, from arbitrary websites 218<br />

text-based datasets 105<br />

text transformers<br />

bag-of-words model 118-120<br />

defining 118<br />

features 121<br />

n-grams 120<br />

word, counting in dataset 118-120<br />

tf-idf 120<br />

Theano<br />

about 248, 249<br />

URL 260<br />

using 249<br />

Torch<br />

URL 306<br />

train_feature_value() function 20<br />

transformer<br />

creating 98<br />

implementing 99, 100<br />

transformer API 99<br />

unit testing 101, 102<br />

tutorial, Google<br />

URL 307<br />

tutorial, Yahoo<br />

URL 307<br />

tweets<br />

about 106<br />

F1-score, used for evaluation 129, 130<br />

features, obtaining from models 130, 132<br />

loading 128, 129<br />

Twitter<br />

follower information, obtaining<br />

from 140, 141<br />

Twitter account<br />

URL 107<br />

twitter documentation<br />

URL 107<br />

[ 316 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!