03.02.2014 Views

php|architect's Guide to Web Scraping with PHP - Wind Business ...



Tidy Extension

Output

Obtaining the output that results from tidy repairing a document is fairly simple.

While the object-oriented API offers no public declaration of the magic method __toString, a tidy object can be cast to a string as well as output directly using the echo construct.
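
Both approaches can be sketched roughly as follows. This is a minimal illustration, assuming the tidy extension is installed and enabled; the helper names repair_procedural and repair_oo and the sample markup are hypothetical and chosen only for this example.

```php
<?php
// Hypothetical helper: repair markup with the procedural API
// and return the repaired document as a string.
function repair_procedural(string $html, array $config = []): string
{
    $tidy = tidy_parse_string($html, $config, 'utf8');
    tidy_clean_repair($tidy);

    return tidy_get_output($tidy);
}

// Hypothetical helper: the same repair with the object-oriented API.
// The tidy object is cast to a string to obtain its output.
function repair_oo(string $html, array $config = []): string
{
    $tidy = new tidy();
    $tidy->parseString($html, $config, 'utf8');
    $tidy->cleanRepair();

    return (string) $tidy;
}

// Guarded demo so the script does not fail where tidy is unavailable.
if (extension_loaded('tidy')) {
    $tidy = new tidy();
    $tidy->parseString('<p>Unclosed paragraph', [], 'utf8');
    $tidy->cleanRepair();

    // The object can also be passed to echo directly.
    echo $tidy;
}
```

Given the same input and configuration, both helpers should return the same repaired markup, so the choice between them is mostly a matter of which API style the rest of the code uses.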

Wrap-Up

This concludes the chapter. At this point, your retrieved document should be in a format suitable for input to an XML extension. The following few chapters will be devoted to using specific extensions to search for and extract data from repaired documents.

For the PHP manual section on the tidy extension, see http://php.net/tidy.

For documentation on the tidy library itself, see http://tidy.sourceforge.net/#docs.

For a tidy configuration setting reference, see http://tidy.sourceforge.net/docs/quickref.html.
