08.06.2015 Views

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

sklearn.naive_bayes package 127<br />

sklearn package 52<br />

sobel filtering 213<br />

sparse package 18<br />

sparsity 78<br />

spatial package 18<br />

specgram() function 183<br />

special package 18<br />

spectrogram 182, 183<br />

Speeded Up Robust Features. See SURF<br />

starcluster<br />

cluster generation, automating<br />

<strong>with</strong> 255-258<br />

Starcluster<br />

URL, for documentation 258<br />

stats package 18<br />

stemming<br />

about 57<br />

NLTK, installing 58<br />

NLTK, using 58<br />

supermarket shopping baskets<br />

analyzing 173-175<br />

supervised learning 33, 34<br />

support vector machines (SVM) 9<br />

SURF 216<br />

system<br />

demonstrating, for new post 68-72<br />

T<br />

Talkbox SciKit 193<br />

task<br />

about 242<br />

example 243-245<br />

term frequency - inverse document<br />

frequency (TF-IDF) 60<br />

testing error 38<br />

text preprocessing phase<br />

achievements 61<br />

goals 61<br />

thresholding 202-205<br />

Title attribute 93<br />

topic model<br />

about 75, 81<br />

building 76-80<br />

topic modeling 75, 86<br />

topics<br />

about 76<br />

selecting 86, 87<br />

topic space<br />

similarity, comparing 80-82<br />

training error 38<br />

transform method 54<br />

tweets<br />

cleaning 136-138<br />

Twitter data<br />

fetching 118<br />

TwoToReal<br />

about 262<br />

URL 262<br />

U<br />

University of California at Irvine (UCI) 42<br />

V<br />

vectorization 51<br />

vectorizer<br />

extending, <strong>with</strong> NLTK's stemmer 59<br />

ViewCount attribute 93<br />

visualization, Iris dataset 34, 35<br />

W<br />

wave format<br />

MP3 files, converting into 182<br />

Wikipedia<br />

modeling 83-86<br />

URL, for dumps 83<br />

word count vectors<br />

normalizing 56<br />

wordle<br />

URL 80<br />

word sense disambiguation 142<br />

word types<br />

about 138<br />

determining 139, 140<br />

wrappers<br />

using 230, 231<br />

[ 271 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!