url - Universität zu Lübeck
url - Universität zu Lübeck
url - Universität zu Lübeck
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
56 CHAPTER 4. INTRODUCTION TO RECENT APPROACHES IN XML INDEXING<br />
sions; these approaches are itemized as hybrid indexes.<br />
The selectivity on an index states whether it always covers the whole XML data<br />
or is tunable for specific and user-defined fragments. A non-selective index has<br />
to be updated whenever the original data is modified. A selective index consumes<br />
less space and can be tuned for the typical usage of the database leading to less<br />
update operations. A relational index is selective because it is defined upon a<br />
table and a column.<br />
Key-queries may return an element which differs from the key-element(s) that<br />
is/are used for the value comparison. For instance, the general path expression<br />
//item[quantity > x 1 ] returns item elements whereas the value used for the comparison<br />
belongs to a quantity element. The majority of index approaches can<br />
only return the indexed key-element leading to additional expenses for navigation<br />
if the return element is different. For large paths between key and the return<br />
value this may add significant costs for the query processor. Some approaches<br />
like KeyX and the Refined Path from the Index Fabric are able to directly return<br />
the requested element without further navigation in the XML data.<br />
In order to explain and illustrate the different indexing approaches in a quickly<br />
understandable manner we use some XML data taken from the XMark project<br />
and generate a specific index for each approach to be evaluated. The sample data<br />
consists of two items, one located in Asia and two in Europe. The items have<br />
different child elements describing the properties of the item. Additionally, the<br />
sample data contains two persons with their addresses. The textual representation<br />
of the sample data is presented in figure 4.1.<br />
1 <br />
2 <br />
3 <br />
4 <br />
5 Singapur<br />
6 2<br />
7 512 MB USB Stick<br />
8 Money order<br />
9 Cash<br />
10 <br />
11 <br />
12 <br />
13 <br />
14 Hamburg<br />
15 1<br />
16 Beuys Sculpture <br />
17 <br />
18 <br />
19 Paris<br />
20 2<br />
21 Louvre Tickets<br />
22 Cash<br />
23 <br />
24 <br />
25