05.07.2014 Views

here - OSTA - Optical Storage Technology Association

here - OSTA - Optical Storage Technology Association

here - OSTA - Optical Storage Technology Association

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.1 Part 1 - General<br />

2.1.1 Character Sets<br />

The character set used by UDF for the structures defined in this document is the<br />

CS0 character set. The <strong>OSTA</strong> CS0 character set is defined as follows:<br />

<strong>OSTA</strong> CS0 shall consist of the d-characters specified in The Unicode Standard,<br />

Version 2.0 (ISBN 0-201-48345-9 from Addison-Wesley Publishing Company<br />

http://www.awl.com/ , see also http://www.unicode.org), excluding #FEFF and<br />

FFFE, stored in the <strong>OSTA</strong> Compressed Unicode format which is defined as<br />

follows:<br />

<strong>OSTA</strong> Compressed Unicode format<br />

RBP Length Name Contents<br />

0 1 Compression ID Uint8<br />

1 ?? Compressed Bit Stream Byte<br />

The CompressionID shall identify the compression algorithm used to compress<br />

the CompressedBitStream field. The following algorithms are currently<br />

supported:<br />

Compression Algorithm<br />

Value<br />

Description<br />

0 - 7 Reserved<br />

8 Value indicates t<strong>here</strong> are 8 bits per character<br />

in the CompressedBitStream.<br />

9-15 Reserved<br />

16 Value indicates t<strong>here</strong> are 16 bits per<br />

character in the CompressedBitStream.<br />

17-253 Reserved<br />

254 Value indicates the CS0 expansion is empty<br />

and unique. Compression Algorithm 8 is<br />

used for compression.<br />

255 Value indicates the CS0 expansion is empty<br />

and unique. Compression Algorithm 16 is<br />

used for compression.<br />

For a CompressionID of 8 or 16, the value of the CompressionID shall specify<br />

the number of BitsPerCharacter for the d-characters defined in the<br />

CharacterBitStream field. Each sequence of CompressionID bits in the<br />

CharacterBitStream field shall represent an <strong>OSTA</strong> Compressed Unicode d-<br />

character. The bits of the character being encoded shall be added to the<br />

CharacterBitStream from most- to least-significant-bit. The bits shall be added to<br />

the CharacterBitStream starting from the most significant bit of the current byte<br />

being encoded into.<br />

UDF 2.01<br />

11<br />

March50 April

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!