08.06.2015 Views

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

Building Machine Learning Systems with Python - Richert, Coelho

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 12<br />

Summary<br />

We saw how to use jug, a little <strong>Python</strong> framework, to manage computations in a<br />

way that takes advantage of multiple cores or multiple machines. Although this<br />

framework is generic, it was built specifically to address the data analysis needs of<br />

its author (who is also an author of this book). Therefore, it has several aspects that<br />

make it fit in <strong>with</strong> the rest of the <strong>Python</strong> machine learning environment.<br />

We also learned about AWS, the Amazon cloud. Using cloud computing is often<br />

a more effective use of resources than building an in-house computing capacity.<br />

This is particularly true if your needs are not constant, but changing. Starcluster<br />

even allows for clusters that automatically grow as you launch more jobs and shrink<br />

as they terminate.<br />

This is the end of the book. We have come a long way. We learned how to perform<br />

classification when we have labeled data and clustering when we do not. We learned<br />

about dimensionality reduction and topic modeling to make sense of large datasets.<br />

Towards the end, we looked at some specific applications, such as music genre<br />

classification and computer vision. For implementations, we relied on <strong>Python</strong>. This<br />

language has an increasingly expanding ecosystem of numeric computing packages<br />

built on top of NumPy. Whenever possible, we relied on scikit-learn but also used<br />

other packages when necessary. Due to the fact that they all use the same basic data<br />

structure (the NumPy multidimensional array), it is possible to mix functionality<br />

from different packages seamlessly. All of the packages used in this book are open<br />

source and available for use in any project.<br />

Naturally, we did not cover every machine learning topic. In the Appendix, Where<br />

to Learn More About <strong>Machine</strong> <strong>Learning</strong>, we provide pointers to a selection of other<br />

resources that will help interested readers learn more about machine learning.<br />

[ 259 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!