17.05.2014

PDFlib Text Extraction Toolkit (TET) Manual


Index

A
annotations 59
API reference 121
area of text extraction 64
arrays 109

B
bookmarks 59
Byte Order Mark (BOM) 61, 123

C
C binding 22
C++ binding 24
categories of resources 51
character references 122
characters 61
CJK (Chinese, Japanese, Korean) 67
compatibility forms 67
configuration 7
CJK support 12
codelist 77
COM binding 25
command-line tool 15
comments 59
composite characters 62
concordance (XSLT sample) 101
connector 35
content analysis 71
coordinate system 64
CSV format 103
CUS (Corporate Use Subarea) 68

D
dehyphenation 73
dictionaries 109
Dispose( ) 123
document and page functions 129
document domains 57
document info entries 57
document info fields 105
document styles 74

E
EBCDIC-based systems 152
encrypted PDF documents 119
encryption status 106
end points of glyphs and words 66
evaluation version 7
examples
  document info fields 105
  encryption status 106
  fonts in a document 106
  number of pages 105
  page size 106
  pCOS paths 105
  text extraction status 49
  writing mode 106
  XSLT 101
exception handling 21
  in C 22

F
fake bold removal 73
file attachments 59
file searching 52
font filtering (XSLT sample) 101
font statistics (XSLT sample) 102
FontReporter plugin 11, 76
fonts in a document 106
form fields 59
fullwidth variants 67

G
glyph metrics 64
glyph rules 79
glyphlist 79
glyphs 61
granularity 71

H
halfwidth variants 67
highlighting 66
HTML converter (XSLT sample) 103

I
IFilter for Microsoft products 44
image
  small image removal 86
image extraction 81
image objects 81
images
  color fidelity 87
  geometry 83
  merging 85
  resolution 83
