10.11.2016 Views

Learning Data Mining with Python

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 7 – Discovering Accounts to<br />

Follow Using Graph <strong>Mining</strong><br />

Appendix<br />

More complex algorithms<br />

https://www.cs.cornell.edu/home/kleinber/link-pred.pdf<br />

There has been extensive research on predicting links in graphs, including for<br />

social networks. For instance, David Liben-Nowell and Jon Kleinberg published a<br />

paper on this topic that would serve as a great place for more complex algorithms,<br />

linked above.<br />

NetworkX<br />

https://networkx.github.io/<br />

If you are going to be using graphs and networks more, going in-depth into the<br />

NetworkX package is well worth your time—the visualization options are great and<br />

the algorithms are well implemented. Another library called SNAP is also available<br />

<strong>with</strong> <strong>Python</strong> bindings, at http://snap.stanford.edu/snappy/index.html.<br />

Chapter 8 – Beating CAPTCHAs <strong>with</strong><br />

Neural Networks<br />

Better (worse?) CAPTCHAs<br />

http://scikit-image.org/docs/dev/auto_examples/applications/plot_<br />

geometric.html<br />

The CAPTCHAs we beat in this example were not as complex as those normally<br />

used today. You can create more complex variants using a number of techniques<br />

as follows:<br />

• Applying different transformations such as the ones in scikit-image<br />

(see the link above)<br />

• Using different colors and colors that don't translate well to graeyscale<br />

• Adding lines or other shapes to the image: http://scikit-image.org/<br />

docs/dev/api/skimage.draw.html<br />

[ 303 ]

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!