10.11.2016 Views

Learning Data Mining with Python

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

NetworkX<br />

about 145<br />

defining 303<br />

URL 150, 303<br />

neural network<br />

about 26, 162, 246, 247<br />

back propagation (backprop)<br />

algorithm 173, 174<br />

classifying 172, 173<br />

defining 247<br />

implementing 247, 248<br />

implementing, <strong>with</strong> nolearn 254-257<br />

Lasagne, defining 249-253<br />

Theano, defining 248, 249<br />

training 172, 173, 246<br />

URL 308<br />

words, predicting 175-179<br />

neural network layers, Lasagne<br />

dropout layers 250<br />

network-in-network layers 250<br />

noise layers 250<br />

neurons 162<br />

news articles<br />

clustering algorithms, using as<br />

transformers 229, 230<br />

data, obtaining 216-218<br />

grouping 222, 223<br />

k-means algorithm 223-226<br />

obtaining 211, 212<br />

reddit, as data source 215, 216<br />

results, evaluating 226-228<br />

topic information, extracting from<br />

clusters 228, 229<br />

web API used, for obtaining<br />

data 212-215<br />

n-grams<br />

about 120, 198<br />

advantages 121<br />

disadvantages 121<br />

NLTK<br />

references 302<br />

NLTK installation instructions<br />

URL 126<br />

noise<br />

adding 301<br />

nolearn package<br />

neural networks, implementing<br />

<strong>with</strong> 254-257<br />

nonprogrammers, for <strong>Python</strong> language<br />

URL 4<br />

O<br />

object classification 242<br />

OneR 18<br />

one-versus-all classifier<br />

creating 197<br />

online learning<br />

about 236<br />

defining 236<br />

implementing 237-239<br />

ordinal 85<br />

output layer 164<br />

overfitting 21<br />

P<br />

pagination 140<br />

pandas<br />

references 300<br />

URL 42, 300<br />

pandas documentation<br />

URL 59<br />

pandas.read_csv function 44<br />

parameters, ensemble process<br />

n_estimators 56<br />

n_jobs 56<br />

oob_score 56<br />

petal length 16<br />

petal width 16<br />

pip 5, 145<br />

pipeline<br />

about 38, 39<br />

BernoulliNB classifier 128<br />

creating 103, 126<br />

DictVectorizer transformer 128<br />

NLTKBOW transformer 128<br />

URL 299<br />

precision 129<br />

[ 314 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!