PDFlib TET PDF IFilter 4.0 Manual
3.2 Metadata Organization
Metadata is organized in the following hierarchical way:
> Properties are the fundamental building blocks for metadata. In the Windows
operating system and the IFilter interface, each property is identified by a unique
numeric identifier (see below).
> Property sets group properties which usually have some logical relationship.
All properties in a set share the same GUID (see below). Property sets can be
specified in the XML configuration file.
> Property set collections group property sets. TET PDF IFilter implements
several predefined property set collections, which can be used to enable
or disable multiple property sets together. Configuring additional
property set collections is not required.
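The three-level hierarchy above can be pictured as a small data model. The following Python sketch is illustrative only: the class names, the `label` field, and the `enabled` flag are assumptions made for this example and are not part of TET PDF IFilter or its XML configuration format.

```python
from dataclasses import dataclass, field
from typing import List, Tuple
from uuid import UUID


@dataclass(frozen=True)
class Property:
    """A single metadata property, identified within its set by a numeric ID."""
    prop_id: int
    label: str  # hypothetical human-readable label, for illustration only


@dataclass
class PropertySet:
    """A group of related properties that all share one GUID."""
    guid: UUID
    properties: List[Property] = field(default_factory=list)


@dataclass
class PropertySetCollection:
    """A group of property sets that can be enabled or disabled together."""
    name: str
    property_sets: List[PropertySet] = field(default_factory=list)
    enabled: bool = True  # hypothetical switch standing in for the real configuration

    def active_properties(self) -> List[Tuple[UUID, int]]:
        """Return (GUID, ID) pairs for all properties, or nothing if disabled."""
        if not self.enabled:
            return []
        return [
            (pset.guid, prop.prop_id)
            for pset in self.property_sets
            for prop in pset.properties
        ]


# Example data: the GUID is the sample value from this manual; the labels are made up.
demo_set = PropertySet(
    guid=UUID("7a737220-0cd0-11dd-bd75-0002a5d5c51b"),
    properties=[Property(2, "example-a"), Property(3, "example-b")],
)
demo_collection = PropertySetCollection("demo", [demo_set])
```

Disabling the collection (`demo_collection.enabled = False`) removes all of its property sets from the result at once, which mirrors the purpose of property set collections described above.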
Property identification and GUIDs. Properties are identified in the IFilter interface by
an identifier which consists of two parts:
> The first part is the Globally Unique Identifier (GUID), sometimes also called Universally
Unique Identifier (UUID): a unique 128-bit identifier which has the same value for all
properties in a property set. Details on GUIDs can be found at
www.itu.int/ITU-T/studygroups/com17/oid/X.667-E.pdf
Various tools are available for creating GUIDs; you can also use online services,
e.g. the one available at
www.itu.int/ITU-T/asn1/uuid.html
A sample GUID looks as follows: 7a737220-0cd0-11dd-bd75-0002a5d5c51b.
> The second part uniquely identifies the property within its property set. It can consist
of a positive integer called the identifier, or ID for short. Property identifiers in a
set must start with the value 2, but are otherwise arbitrary. Property identifiers are
supported in all IFilter clients.
Alternatively, the second part may consist of a cleartext name. The use of names instead
of IDs is deprecated, and is not supported by some IFilter clients, e.g. Windows
Search. However, it can make configuration more convenient for those IFilter clients
which support it, e.g. SharePoint. See »XML configuration for GUID+name treatment
of properties«, page 43, for information on enabling the GUID+name method.
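The two-part identification scheme can be sketched with Python's standard `uuid` module. This is a minimal illustration, not TET PDF IFilter code: the `PropertyKey` class and its field names are hypothetical, and the check that numeric IDs are at least 2 is one reading of the rule stated above.

```python
from dataclasses import dataclass
from typing import Union
from uuid import UUID, uuid4

# The sample GUID quoted in this manual.
SAMPLE_GUID = UUID("7a737220-0cd0-11dd-bd75-0002a5d5c51b")


@dataclass(frozen=True)
class PropertyKey:
    """Hypothetical GUID+ID or GUID+name pair identifying one property."""
    guid: UUID
    second: Union[int, str]  # numeric ID, or cleartext name (deprecated method)

    def __post_init__(self) -> None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(self.second, bool) or not isinstance(self.second, (int, str)):
            raise TypeError("second part must be an integer ID or a name")
        # Assumption: IDs below 2 are not usable, following the rule above.
        if isinstance(self.second, int) and self.second < 2:
            raise ValueError("numeric property IDs start at 2")
        if isinstance(self.second, str) and not self.second:
            raise ValueError("property name must not be empty")

    @property
    def uses_name(self) -> bool:
        """True for GUID+name, which not all IFilter clients support."""
        return isinstance(self.second, str)


# A GUID is a 128-bit value, i.e. 16 bytes.
guid_bits = len(SAMPLE_GUID.bytes) * 8

# One of many ways to create a fresh GUID for a custom property set.
new_guid = uuid4()

by_id = PropertyKey(SAMPLE_GUID, 2)          # GUID+ID, supported by all clients
by_name = PropertyKey(SAMPLE_GUID, "title")  # GUID+name; "title" is a made-up name
```

Keeping both variants in a single key type makes it easy to flag GUID+name keys before they reach a client that only accepts GUID+ID.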
The GUID+ID or GUID+name combination is required to configure metadata property
queries in search products. Other aspects of metadata properties are detailed in Section
3.4, »Custom Metadata Properties«, page 42.
40 Chapter 3: Metadata Properties