10.11.2016 Views

Learning Data Mining with Python

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 1<br />

In the preceding dataset, we had an obvious feature type. We wanted to know if<br />

people are short or tall, so we collected their heights. This engineering feature is an<br />

important problem in data mining. In later chapters, we will discuss methods for<br />

choosing good features to collect in your dataset. Ultimately, this step often requires<br />

some expert domain knowledge or at least some trial and error.<br />

In this book, we will introduce data mining through <strong>Python</strong>. In some<br />

cases, we choose clarity of code and workflows, rather than the most<br />

optimized way to do this. This sometimes involves skipping some<br />

details that can improve the algorithm's speed or effectiveness.<br />

Using <strong>Python</strong> and the I<strong>Python</strong> Notebook<br />

In this section, we will cover installing <strong>Python</strong> and the environment that we will use<br />

for most of the book, the I<strong>Python</strong> Notebook. Furthermore, we will install the numpy<br />

module, which we will use for the first set of examples.<br />

Installing <strong>Python</strong><br />

The <strong>Python</strong> language is a fantastic, versatile, and an easy to use language.<br />

For this book, we will be using <strong>Python</strong> 3.4, which is available for your system from<br />

the <strong>Python</strong> Organization's website: https://www.python.org/downloads/.<br />

There will be two major versions to choose from, <strong>Python</strong> 3.4 and <strong>Python</strong> 2.7.<br />

Remember to download and install <strong>Python</strong> 3.4, which is the version tested<br />

throughout this book.<br />

In this book, we will be assuming that you have some knowledge of programming<br />

and <strong>Python</strong> itself. You do not need to be an expert <strong>with</strong> <strong>Python</strong> to complete this<br />

book, although a good level of knowledge will help.<br />

If you do not have any experience <strong>with</strong> programming, I recommend that you<br />

pick up the <strong>Learning</strong> <strong>Python</strong> book from.<br />

[ 3 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!