10.11.2016 Views

Learning Data Mining with Python

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 2<br />

While there is a lot of variance, the plot shows a decreasing trend as the number of<br />

neighbors increases.<br />

Preprocessing using pipelines<br />

When taking measurements of real-world objects, we can often get features in<br />

very different ranges. For instance, if we are measuring the qualities of an animal,<br />

we might have several features, as follows:<br />

• Number of legs: This is between the range of 0-8 for most animals, while<br />

some have many more!<br />

• Weight: This is between the range of only a few micrograms, all the way<br />

to a blue whale <strong>with</strong> a weight of 190,000 kilograms!<br />

• Number of hearts: This can be between zero to five, in the case of<br />

the earthworm.<br />

[ 35 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!