13.07.2015 Views

Extensions of the UNIX File Command and Magic File for File Type ...

Extensions of the UNIX File Command and Magic File for File Type ...

Extensions of the UNIX File Command and Magic File for File Type ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

MIME types have parameters such as charset, type <strong>and</strong> codecs. Additional parameters have beenadded to <strong>the</strong> MIME types in <strong>the</strong> <strong>File</strong> Format Library to indicate <strong>the</strong> version <strong>of</strong> <strong>the</strong> file <strong>for</strong>mat (1,1.1, iv) <strong>and</strong> encoding <strong>of</strong> files (encrypted compressed, <strong>and</strong> rle). For Windows bitmaps(image/bmp), <strong>the</strong> parameter colors (16, 16bit, 24bit, 256, 32bit) has been added.For executable programs, <strong>for</strong> example a Windows 32-bit executable, <strong>the</strong> MIME type includes atype parameter, <strong>for</strong> example,application/octet-stream; type=win32-exeThe value <strong>of</strong> extensions is a list <strong>of</strong> filename extensions commonly used <strong>for</strong> files in this file<strong>for</strong>mat. This value <strong>of</strong> this attribute can be null.In <strong>the</strong> GTRI <strong>File</strong> Format Library, if <strong>the</strong>re is a PRONOM fmt or x-fmt identifier <strong>for</strong> <strong>the</strong> file<strong>for</strong>mat, it has been entered in <strong>the</strong> PUID field. If <strong>the</strong>re is no PRONOM PUID, <strong>the</strong> PUID attributevalue is null. To have unique identifiers <strong>for</strong> all file <strong>for</strong>mats, <strong>for</strong>mats without PUIDs could beassigned an identifier such as x-fmt/gtrinumeric_identifer.The value <strong>for</strong> Precedes is a list <strong>of</strong> <strong>File</strong> Format Ids. The interpretation is that <strong>the</strong> file signature(magic number) tests <strong>for</strong> <strong>the</strong> current <strong>File</strong> Format must precede <strong>the</strong> tests <strong>for</strong> <strong>the</strong> file <strong>for</strong>mats whoseids are <strong>the</strong> values <strong>of</strong> Precedes. Precedence relationships are necessary because some tests <strong>for</strong> file<strong>for</strong>mats must be per<strong>for</strong>med be<strong>for</strong>e o<strong>the</strong>rs, or <strong>the</strong> file <strong>for</strong>mat will be incorrectly identified. Forexample, <strong>the</strong> OpenDocument Text <strong>for</strong>mat must be recognized be<strong>for</strong>e <strong>the</strong> Zip file <strong>for</strong>mat, because<strong>the</strong> <strong>for</strong>mer is a special case <strong>of</strong> <strong>the</strong> latter.In <strong>the</strong> future, <strong>the</strong> attributes <strong>of</strong> a file <strong>for</strong>mat might be extended to include:Plat<strong>for</strong>m (OS, hardware)Digital Object Class (e.g., 3D model, image, video, audio)Description (History, relationship to o<strong>the</strong>r <strong>for</strong>mats, etc.)Release date/supported until dateMacOS Creator/<strong>Type</strong> codesMacOS X Uni<strong>for</strong>m <strong>Type</strong> Identifier (UTI)The magic file released with <strong>the</strong> file comm<strong>and</strong> <strong>of</strong>ten uses a sequence <strong>of</strong> magic statements tocharacterize multiple file <strong>for</strong>mats <strong>and</strong> in addition extracts additional metadata <strong>for</strong> a file type. Thefile signature tests that have been created <strong>for</strong> <strong>the</strong> <strong>File</strong> Format Library contain criteria <strong>for</strong>individual file <strong>for</strong>mats <strong>and</strong> do not extract additional metadata. In <strong>the</strong> future, an attribute could beadded to file <strong>for</strong>mat whose values were tests <strong>for</strong> technical metadata as well as identification <strong>of</strong>file type.<strong>File</strong> signature tests have been created <strong>for</strong> more than 800 file <strong>for</strong>mats. Figure 2 shows <strong>the</strong> magictests characterizing <strong>the</strong> OpenDocument Text file <strong>for</strong>mat.10

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!