17.05.2014 Views

PDFlib TET PDF IFilter 4.0 Manual


3.2 Metadata Organization

Metadata is organized in the following hierarchical way:

> Properties are the fundamental building blocks for metadata. Properties in the Windows operating system and the IFilter interface are identified by a unique numeric identifier (see below).

> Property sets comprise a group of properties which usually have some logical relationship. All properties in a set share the same GUID (see below). Property sets can be specified in the XML configuration file.

> Property set collections comprise a group of property sets. TET PDF IFilter implements several predefined property set collections. They can be used to enable or disable multiple property sets together. It is not required to configure additional property set collections.
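
The three levels above can be pictured as nested containers. The following Python sketch is illustrative only: the class names Property, PropertySet and PropertySetCollection, the enabled flag, and the property labels are assumptions made for this example and are not part of TET PDF IFilter or its XML configuration format. Only the GUID is taken from this section.

```python
from dataclasses import dataclass, field
from typing import List
from uuid import UUID


@dataclass(frozen=True)
class Property:
    """A single metadata property, identified within its set by a numeric ID."""
    prop_id: int
    label: str  # hypothetical human-readable label, for illustration only


@dataclass
class PropertySet:
    """A group of related properties that all share one GUID."""
    guid: UUID
    properties: List[Property] = field(default_factory=list)
    enabled: bool = True


@dataclass
class PropertySetCollection:
    """A group of property sets that can be enabled or disabled together."""
    name: str
    sets: List[PropertySet] = field(default_factory=list)

    def set_enabled(self, enabled: bool) -> None:
        # Apply the same switch to every property set in the collection.
        for property_set in self.sets:
            property_set.enabled = enabled


# The GUID is the sample value from this section; labels and IDs are made up.
sample_set = PropertySet(
    guid=UUID("7a737220-0cd0-11dd-bd75-0002a5d5c51b"),
    properties=[Property(2, "title"), Property(3, "author")],
)
collection = PropertySetCollection(name="example", sets=[sample_set])
collection.set_enabled(False)
```

The point of the collection level in this sketch is the single set_enabled call, which mirrors the idea of switching several property sets on or off at once.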

Property identification and GUIDs. Properties are identified in the IFilter interface by an identifier which consists of two parts:

> The first part is the Globally Unique Identifier, GUID (sometimes also called Universally Unique Identifier, UUID), a unique 128-bit identifier which has the same value for all properties in a property set. Details on GUIDs can be found at
www.itu.int/ITU-T/studygroups/com17/oid/X.667-E.pdf
There are various tools available for creating GUIDs; you can also use online services, e.g. the one which is available at
www.itu.int/ITU-T/asn1/uuid.html
A sample GUID looks as follows: 7a737220-0cd0-11dd-bd75-0002a5d5c51b.

> The second part uniquely identifies the property within its property set. It can consist of a positive integer called the identifier, or ID for short. Property identifiers in a set must start with the value 2, but are otherwise arbitrary. Property identifiers are supported in all IFilter clients.

Alternatively, the second part may consist of a cleartext name. The use of names instead of IDs is deprecated and is not supported by some IFilter clients, e.g. Windows Search. However, it can make configuration more convenient for those IFilter clients which support it, e.g. SharePoint. See »XML configuration for GUID+name treatment of properties«, page 43, for information on enabling the GUID+name method.
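
The two-part identifier described above can be sketched as a small value type. This is a minimal illustration, not the IFilter interface itself: the PropertyKey class, its field names, and the name "title" are hypothetical. The check on the ID reads "must start with the value 2" as a lower bound of 2, which is an assumption. The uuid calls are from the Python standard library.

```python
from dataclasses import dataclass
from typing import Union
from uuid import UUID, uuid4


@dataclass(frozen=True)
class PropertyKey:
    """A GUID plus either a numeric ID (preferred) or a cleartext name (deprecated)."""
    guid: UUID
    second: Union[int, str]

    def __post_init__(self) -> None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(self.second, bool) or not isinstance(self.second, (int, str)):
            raise TypeError("second part must be an integer ID or a name")
        # Assumption: IDs below 2 are not allowed in a property set.
        if isinstance(self.second, int) and self.second < 2:
            raise ValueError("property IDs must be 2 or greater")
        if isinstance(self.second, str) and not self.second:
            raise ValueError("property name must not be empty")

    @property
    def uses_name(self) -> bool:
        """True for the GUID+name form, False for the GUID+ID form."""
        return isinstance(self.second, str)


# Sample GUID from this section; it parses to a 128-bit (16-byte) value.
sample_guid = UUID("7a737220-0cd0-11dd-bd75-0002a5d5c51b")

by_id = PropertyKey(sample_guid, 2)          # GUID+ID form
by_name = PropertyKey(sample_guid, "title")  # GUID+name form, hypothetical name

# A fresh random GUID, e.g. for a new custom property set.
new_guid = uuid4()
```

Keeping both forms in one type makes it easy to flag GUID+name keys before handing a configuration to a client that only accepts GUID+ID.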

The GUID+ID or GUID+name combination is required to configure metadata property queries in search products. Other aspects of metadata properties are detailed in Section 3.4, »Custom Metadata Properties«, page 42.

