17.05.2014 Views

PDFlib Text Extraction Toolkit (TET) Manual

PDFlib Text Extraction Toolkit (TET) Manual

PDFlib Text Extraction Toolkit (TET) Manual

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

TeX documents. PDF documents produced with the TeX documents often contain numerical<br />

glyph names, Type 3 fonts and other features which prevent other products<br />

from successfully extracting the text. <strong>TET</strong> contains many heuristics and workarounds<br />

for dealing with such documents. However, a particular flavor of TeX documents can<br />

only be processed with a workaround that requires more processing time, and is disabled<br />

by default. You can enable more CPU-intensive font processing for these documents<br />

with the following document option:<br />

checkglyphlists=true<br />

68 Chapter 5: Configuration

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!