here - OSTA - Optical Storage Technology Association
here - OSTA - Optical Storage Technology Association
here - OSTA - Optical Storage Technology Association
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
2.1 Part 1 - General<br />
2.1.1 Character Sets<br />
The character set used by UDF for the structures defined in this document is the<br />
CS0 character set. The <strong>OSTA</strong> CS0 character set is defined as follows:<br />
<strong>OSTA</strong> CS0 shall consist of the d-characters specified in The Unicode Standard,<br />
Version 2.0 (ISBN 0-201-48345-9 from Addison-Wesley Publishing Company<br />
http://www.awl.com/ , see also http://www.unicode.org), excluding #FEFF and<br />
FFFE, stored in the <strong>OSTA</strong> Compressed Unicode format which is defined as<br />
follows:<br />
<strong>OSTA</strong> Compressed Unicode format<br />
RBP Length Name Contents<br />
0 1 Compression ID Uint8<br />
1 ?? Compressed Bit Stream Byte<br />
The CompressionID shall identify the compression algorithm used to compress<br />
the CompressedBitStream field. The following algorithms are currently<br />
supported:<br />
Compression Algorithm<br />
Value<br />
Description<br />
0 - 7 Reserved<br />
8 Value indicates t<strong>here</strong> are 8 bits per character<br />
in the CompressedBitStream.<br />
9-15 Reserved<br />
16 Value indicates t<strong>here</strong> are 16 bits per<br />
character in the CompressedBitStream.<br />
17-253 Reserved<br />
254 Value indicates the CS0 expansion is empty<br />
and unique. Compression Algorithm 8 is<br />
used for compression.<br />
255 Value indicates the CS0 expansion is empty<br />
and unique. Compression Algorithm 16 is<br />
used for compression.<br />
For a CompressionID of 8 or 16, the value of the CompressionID shall specify<br />
the number of BitsPerCharacter for the d-characters defined in the<br />
CharacterBitStream field. Each sequence of CompressionID bits in the<br />
CharacterBitStream field shall represent an <strong>OSTA</strong> Compressed Unicode d-<br />
character. The bits of the character being encoded shall be added to the<br />
CharacterBitStream from most- to least-significant-bit. The bits shall be added to<br />
the CharacterBitStream starting from the most significant bit of the current byte<br />
being encoded into.<br />
UDF 2.01<br />
11<br />
March50 April