14.03.2013 Views

symbols-a4

symbols-a4

symbols-a4

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Dec Hex Char L ATEX 2ε<br />

128 80 € \texteuro (tc)<br />

130 82 ‚ \quotesinglbase (T1)<br />

131 83 f \textit{f}<br />

132 84 „ \quotedblbase (T1)<br />

133 85 . . . \dots<br />

134 86 † \dag<br />

135 87 ‡ \ddag<br />

136 88 ˆ \textasciicircum<br />

137 89 ‰ \textperthousand (tc)<br />

138 8A ˇ S \v{S}<br />

139 8B ‹ \guilsinglleft (T1)<br />

140 8C Π\OE<br />

142 8E ˇ Z \v{Z}<br />

Table 330: L ATEX 2ε Code Page 1252 Table<br />

Dec Hex Char L ATEX 2ε<br />

145 91 ‘ ‘<br />

146 92 ’ ’<br />

147 93 “ ‘‘<br />

148 94 ” ’’<br />

149 95 • \textbullet<br />

150 96 – --<br />

151 97 — ---<br />

152 98 ˜ \textasciitilde<br />

153 99 \texttrademark<br />

154 9A ˇs \v{s}<br />

155 9B › \guilsinglright (T1)<br />

156 9C œ \oe<br />

158 9E ˇz \v{z}<br />

159 9F ¨ Y \"{Y}<br />

While too large to incorporate into this document, a listing of ISO 8879:1986 SGML/XML character entities<br />

and their L ATEX equivalents is available from http://www.bitjungle.com/~isoent/. Some of the characters<br />

presented there make use of isoent, a L ATEX 2ε package (available from the same URL) that fakes some of the<br />

missing ISO glyphs using the L ATEX picture environment. 14<br />

8.7 Unicode characters<br />

Unicode is a “universal character set”—a standard for encoding (i.e., assigning unique numbers to) the <strong>symbols</strong><br />

appearing in many of the world’s languages. While ASCII can represent 128 <strong>symbols</strong> and Latin 1 can represent<br />

256 <strong>symbols</strong>, Unicode can represent an astonishing 1,114,112 <strong>symbols</strong>.<br />

Because TEX and L ATEX predate the Unicode standard and Unicode fonts by almost a decade, support for<br />

Unicode has had to be added to the base TEX and L ATEX systems. Note first that L ATEX distinguishes between<br />

input encoding—the characters used in the .tex file—and output encoding—the characters that appear in the<br />

generated .dvi, .pdf, etc. file.<br />

Inputting Unicode characters<br />

To include Unicode characters in a .tex file, load the ucs package and load the inputenc package with the utf8x<br />

(“UTF-8 extended”) option. 15 These packages enable L ATEX to translate UTF-8 sequences to L ATEX commands,<br />

which are subsequently processed as normal. For example, the UTF-8 text “Copyright © 2009”—“©” is not<br />

an ASCII character and therefore cannot be input directly without packages such as ucs/inputenc—is converted<br />

internally by inputenc to “Copyright \textcopyright{} 2009” and therefore typeset as “Copyright © 2009”.<br />

The ucs/inputenc combination supports only a tiny subset of Unicode’s million-plus <strong>symbols</strong>.<br />

Additional <strong>symbols</strong> can be added manually using the \DeclareUnicodeCharacter command.<br />

\DeclareUnicodeCharacter takes two arguments: a Unicode number and a L ATEX command to execute<br />

when the corresponding Unicode character is encountered in the input. For example, the Unicode character<br />

“degree celsius” (“ ℃ ”) appears at character position U+2103. 16 However, “ ℃ ” is not one of the characters<br />

that ucs and inputenc recognize. The following document shows how to use \DeclareUnicodeCharacter to<br />

tell L ATEX that the “ ℃ ” character should be treated as a synonym for \textcelsius:<br />

\documentclass{article}<br />

\usepackage{ucs}<br />

\usepackage[utf8x]{inputenc}<br />

14isoent is not featured in this document, because it is not available from CTAN and because the faked <strong>symbols</strong> are not “true”<br />

characters; they exist in only one size, regardless of the body text’s font size.<br />

15UTF-8 is the 8-bit Unicode Transformation Format, a popular mechanism for representing Unicode symbol numbers as<br />

sequences of one to four bytes.<br />

16The Unicode convention is to express character positions as “U+〈hexadecimal number〉”.<br />

117

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!