Conference Notes and Best Practices Release 1.3 ... - Read the Docs
Conference Notes and Best Practices Release 1.3 ... - Read the Docs
Conference Notes and Best Practices Release 1.3 ... - Read the Docs
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Conference</strong> <strong>Notes</strong> <strong>and</strong> <strong>Best</strong> <strong>Practices</strong>, <strong>Release</strong> <strong>1.3</strong><br />
3.<strong>1.3</strong> Thursday<br />
Guerilla APIs<br />
Presenter: Russell Keith-Magee<br />
Track: I<br />
Description:<br />
In an ideal world, every web system would provide a well designed REST API with oAuth au<strong>the</strong>ntication.<br />
But what do you do when those things don’t exist?<br />
Guerilla APIs<br />
• Big companies do not embrace API centric development<br />
• In order to get this data you have to use unconventional tactics.<br />
• Can build user facing API that works how we want to<br />
• Not productized (but could be)<br />
RSS h<strong>and</strong>ling<br />
• FeedParser<br />
• Time tasks/cron to retrieve content<br />
• Pythonic interface for RSS h<strong>and</strong>ling<br />
Email H<strong>and</strong>ling<br />
• Get pure text<br />
• HTML<br />
• Attachments<br />
• Emails may be recursive<br />
• Mimetype<br />
• DOCX<br />
• DOC<br />
PDF Processing<br />
• PDF is a printing format<br />
• Internally, vector based drawing instructions<br />
• May contain attachments<br />
• PDFMiner<br />
• Some PDFs are locked down<br />
Use Selenium to open a browser session to actually submit forms programatically.<br />
44 Chapter 3. <strong>Conference</strong>s