17.05.2014 Views

PDFlib Text Extraction Toolkit (TET) Manual



Fig. 6.1 Acrobat’s advanced search dialog

> Sample code for the TET library: dumper mini sample
> TETML element: /TET/Document/DocInfo/Custom
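The TETML element path above can be read with any XML parser. The following is a minimal sketch in Python using only the standard library. It rests on several assumptions: the sample TETML fragment is invented for illustration, its namespace URI is a placeholder, and the `key` attribute on `Custom` is assumed rather than taken from real TET output. The helper names (`local_name`, `find_path`, `custom_docinfo`) are hypothetical. Matching on local element names keeps the sketch independent of the actual TETML namespace.

```python
import xml.etree.ElementTree as ET

# Hypothetical TETML fragment for illustration only. The namespace URI and
# the `key` attribute on Custom are assumptions, not copied from real output.
SAMPLE_TETML = """<TET xmlns="urn:example:tetml">
  <Document filename="sample.pdf">
    <DocInfo>
      <Author>Jane Doe</Author>
      <Custom key="Department">Engineering</Custom>
      <Custom key="ReviewState">approved</Custom>
    </DocInfo>
  </Document>
</TET>"""


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag name."""
    return tag.rsplit("}", 1)[-1]


def find_path(root, path):
    """Yield elements matching a /A/B/C path of local names, ignoring namespaces."""
    steps = [s for s in path.split("/") if s]
    if not steps or local_name(root.tag) != steps[0]:
        return
    current = [root]
    for step in steps[1:]:
        # Descend one level, keeping only children whose local name matches.
        current = [child for elem in current for child in elem
                   if local_name(child.tag) == step]
    yield from current


def custom_docinfo(tetml_text):
    """Return custom document info entries as a {key: value} dictionary."""
    root = ET.fromstring(tetml_text)
    return {elem.get("key"): (elem.text or "").strip()
            for elem in find_path(root, "/TET/Document/DocInfo/Custom")}


if __name__ == "__main__":
    print(custom_docinfo(SAMPLE_TETML))
```

Predefined entries such as `Author` are skipped because only `Custom` children of `DocInfo` match the path.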

XMP metadata on document level. XMP metadata consists of an XML stream containing extended metadata.

> How to display with Acrobat X/XI: File, Properties..., Additional Metadata... (not available in the free Adobe Reader)
> How to search a single PDF with Acrobat X/XI: not available
> How to search multiple PDFs with Acrobat X/XI: click Edit, [Advanced] Search and Show More Options. In the Look In: pull-down select a folder of PDF documents, and in the pull-down menu Use these additional criteria select XMP Metadata (not available in the free Adobe Reader).

> Sample code for the TET library: dumper mini sample
> TETML element: /TET/Document/Metadata
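Because XMP is itself XML, the document-level metadata in TETML can be queried with ordinary namespace-aware lookups. The sketch below, in Python with the standard library only, pulls the Dublin Core `dc:creator` entries out of the `Metadata` element. The sample TETML fragment, its placeholder namespace URI, and the presence of an `x:xmpmeta` wrapper inside `Metadata` are assumptions for illustration; the RDF and Dublin Core namespace URIs are the standard ones used by XMP. The function names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Standard namespaces used inside XMP packets.
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical TETML fragment for illustration only. The TETML namespace URI
# is a placeholder, and the exact nesting inside Metadata is an assumption.
SAMPLE_TETML = """<TET xmlns="urn:example:tetml">
  <Document filename="sample.pdf">
    <Metadata>
      <x:xmpmeta xmlns:x="adobe:ns:meta/">
        <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
          <rdf:Description rdf:about=""
              xmlns:dc="http://purl.org/dc/elements/1.1/">
            <dc:creator>
              <rdf:Seq>
                <rdf:li>Jane Doe</rdf:li>
                <rdf:li>John Roe</rdf:li>
              </rdf:Seq>
            </dc:creator>
          </rdf:Description>
        </rdf:RDF>
      </x:xmpmeta>
    </Metadata>
  </Document>
</TET>"""


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag name."""
    return tag.rsplit("}", 1)[-1]


def document_metadata(root):
    """Yield Metadata elements that are direct children of Document."""
    for doc in root:
        if local_name(doc.tag) == "Document":
            for child in doc:
                if local_name(child.tag) == "Metadata":
                    yield child


def xmp_creators(tetml_text):
    """Return the dc:creator entries from document-level XMP, in order."""
    root = ET.fromstring(tetml_text)
    creators = []
    for metadata in document_metadata(root):
        # './/' tolerates wrapper elements between Metadata and rdf:RDF.
        for li in metadata.findall(".//dc:creator/rdf:Seq/rdf:li", NS):
            creators.append((li.text or "").strip())
    return creators


if __name__ == "__main__":
    print(xmp_creators(SAMPLE_TETML))
```

Restricting the lookup to direct children of `Document` keeps metadata attached to other components, such as images, out of the result.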

XMP metadata on image level. XMP metadata can be attached to document components such as images, pages, and fonts. In practice, however, XMP is usually found only on the image level (in addition to the document level).

