symbols-a4
symbols-a4
symbols-a4
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Dec Hex Char L ATEX 2ε<br />
128 80 € \texteuro (tc)<br />
130 82 ‚ \quotesinglbase (T1)<br />
131 83 f \textit{f}<br />
132 84 „ \quotedblbase (T1)<br />
133 85 . . . \dots<br />
134 86 † \dag<br />
135 87 ‡ \ddag<br />
136 88 ˆ \textasciicircum<br />
137 89 ‰ \textperthousand (tc)<br />
138 8A ˇ S \v{S}<br />
139 8B ‹ \guilsinglleft (T1)<br />
140 8C Œ \OE<br />
142 8E ˇ Z \v{Z}<br />
Table 330: L ATEX 2ε Code Page 1252 Table<br />
Dec Hex Char L ATEX 2ε<br />
145 91 ‘ ‘<br />
146 92 ’ ’<br />
147 93 “ ‘‘<br />
148 94 ” ’’<br />
149 95 • \textbullet<br />
150 96 – --<br />
151 97 — ---<br />
152 98 ˜ \textasciitilde<br />
153 99 \texttrademark<br />
154 9A ˇs \v{s}<br />
155 9B › \guilsinglright (T1)<br />
156 9C œ \oe<br />
158 9E ˇz \v{z}<br />
159 9F ¨ Y \"{Y}<br />
While too large to incorporate into this document, a listing of ISO 8879:1986 SGML/XML character entities<br />
and their L ATEX equivalents is available from http://www.bitjungle.com/~isoent/. Some of the characters<br />
presented there make use of isoent, a L ATEX 2ε package (available from the same URL) that fakes some of the<br />
missing ISO glyphs using the L ATEX picture environment. 14<br />
8.7 Unicode characters<br />
Unicode is a “universal character set”—a standard for encoding (i.e., assigning unique numbers to) the <strong>symbols</strong><br />
appearing in many of the world’s languages. While ASCII can represent 128 <strong>symbols</strong> and Latin 1 can represent<br />
256 <strong>symbols</strong>, Unicode can represent an astonishing 1,114,112 <strong>symbols</strong>.<br />
Because TEX and L ATEX predate the Unicode standard and Unicode fonts by almost a decade, support for<br />
Unicode has had to be added to the base TEX and L ATEX systems. Note first that L ATEX distinguishes between<br />
input encoding—the characters used in the .tex file—and output encoding—the characters that appear in the<br />
generated .dvi, .pdf, etc. file.<br />
Inputting Unicode characters<br />
To include Unicode characters in a .tex file, load the ucs package and load the inputenc package with the utf8x<br />
(“UTF-8 extended”) option. 15 These packages enable L ATEX to translate UTF-8 sequences to L ATEX commands,<br />
which are subsequently processed as normal. For example, the UTF-8 text “Copyright © 2009”—“©” is not<br />
an ASCII character and therefore cannot be input directly without packages such as ucs/inputenc—is converted<br />
internally by inputenc to “Copyright \textcopyright{} 2009” and therefore typeset as “Copyright © 2009”.<br />
The ucs/inputenc combination supports only a tiny subset of Unicode’s million-plus <strong>symbols</strong>.<br />
Additional <strong>symbols</strong> can be added manually using the \DeclareUnicodeCharacter command.<br />
\DeclareUnicodeCharacter takes two arguments: a Unicode number and a L ATEX command to execute<br />
when the corresponding Unicode character is encountered in the input. For example, the Unicode character<br />
“degree celsius” (“ ℃ ”) appears at character position U+2103. 16 However, “ ℃ ” is not one of the characters<br />
that ucs and inputenc recognize. The following document shows how to use \DeclareUnicodeCharacter to<br />
tell L ATEX that the “ ℃ ” character should be treated as a synonym for \textcelsius:<br />
\documentclass{article}<br />
\usepackage{ucs}<br />
\usepackage[utf8x]{inputenc}<br />
14isoent is not featured in this document, because it is not available from CTAN and because the faked <strong>symbols</strong> are not “true”<br />
characters; they exist in only one size, regardless of the body text’s font size.<br />
15UTF-8 is the 8-bit Unicode Transformation Format, a popular mechanism for representing Unicode symbol numbers as<br />
sequences of one to four bytes.<br />
16The Unicode convention is to express character positions as “U+〈hexadecimal number〉”.<br />
117