17.05.2014 Views

PDFlib Text Extraction Toolkit (TET) Manual

PDFlib Text Extraction Toolkit (TET) Manual

PDFlib Text Extraction Toolkit (TET) Manual

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Table 6.1 CJK compatibility decomposition examples (suboptions for the decompose option)<br />

decomposition<br />

name description affected Unicode characters<br />

narrow<br />

small<br />

square<br />

vertical<br />

wide<br />

Narrow (hankaku)<br />

compatibility characters<br />

Small forms for CNS<br />

11643 compatibility<br />

CJK squared font<br />

variants<br />

Vertical layout presentation<br />

forms<br />

Wide (zenkaku) compatibility<br />

forms<br />

U+FF61-U+FFDC,<br />

U+FFE8-U+FFEE<br />

U+FE50-U+FE6B<br />

U+3250,<br />

U+32CC-U+32CF,<br />

U+3300-U+3357,<br />

U+3371-U+33DF,<br />

U+337B-U+337F,<br />

U+33FF,<br />

U+1F131-U+1F14E,<br />

U+1F190,<br />

U+1F200,<br />

U+1F210-U+1F231<br />

U+309F,<br />

U+30FF,<br />

U+FE10-U+FE19<br />

U+FE30-U+FE48<br />

U+3000,<br />

U+FF01-U+FF60,<br />

U+FFE0-U+FFE6<br />

decompositions<br />

enabled (default)<br />

<br />

U+30F2<br />

<br />

U+002C<br />

<br />

U+30AD U+30ED<br />

<br />

U+FE37<br />

<br />

U+00A3<br />

<br />

decompositions<br />

disabled<br />

<br />

U+FF66<br />

<br />

U+FE50<br />

<br />

U+3314<br />

<br />

U+007B<br />

<br />

U+FFE1<br />

6.3 Chinese, Japanese, and Korean <strong>Text</strong> 81

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!