25.11.2014 Views

Conference Notes and Best Practices Release 1.3 ... - Read the Docs

Conference Notes and Best Practices Release 1.3 ... - Read the Docs

Conference Notes and Best Practices Release 1.3 ... - Read the Docs

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Conference</strong> <strong>Notes</strong> <strong>and</strong> <strong>Best</strong> <strong>Practices</strong>, <strong>Release</strong> <strong>1.3</strong><br />

3.<strong>1.3</strong> Thursday<br />

Guerilla APIs<br />

Presenter: Russell Keith-Magee<br />

Track: I<br />

Description:<br />

In an ideal world, every web system would provide a well designed REST API with oAuth au<strong>the</strong>ntication.<br />

But what do you do when those things don’t exist?<br />

Guerilla APIs<br />

• Big companies do not embrace API centric development<br />

• In order to get this data you have to use unconventional tactics.<br />

• Can build user facing API that works how we want to<br />

• Not productized (but could be)<br />

RSS h<strong>and</strong>ling<br />

• FeedParser<br />

• Time tasks/cron to retrieve content<br />

• Pythonic interface for RSS h<strong>and</strong>ling<br />

Email H<strong>and</strong>ling<br />

• Get pure text<br />

• HTML<br />

• Attachments<br />

• Emails may be recursive<br />

• Mimetype<br />

• DOCX<br />

• DOC<br />

PDF Processing<br />

• PDF is a printing format<br />

• Internally, vector based drawing instructions<br />

• May contain attachments<br />

• PDFMiner<br />

• Some PDFs are locked down<br />

Use Selenium to open a browser session to actually submit forms programatically.<br />

44 Chapter 3. <strong>Conference</strong>s

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!