30.08.2014 Views

url - Universität zu Lübeck

url - Universität zu Lübeck

url - Universität zu Lübeck

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

56 CHAPTER 4. INTRODUCTION TO RECENT APPROACHES IN XML INDEXING<br />

sions; these approaches are itemized as hybrid indexes.<br />

The selectivity on an index states whether it always covers the whole XML data<br />

or is tunable for specific and user-defined fragments. A non-selective index has<br />

to be updated whenever the original data is modified. A selective index consumes<br />

less space and can be tuned for the typical usage of the database leading to less<br />

update operations. A relational index is selective because it is defined upon a<br />

table and a column.<br />

Key-queries may return an element which differs from the key-element(s) that<br />

is/are used for the value comparison. For instance, the general path expression<br />

//item[quantity > x 1 ] returns item elements whereas the value used for the comparison<br />

belongs to a quantity element. The majority of index approaches can<br />

only return the indexed key-element leading to additional expenses for navigation<br />

if the return element is different. For large paths between key and the return<br />

value this may add significant costs for the query processor. Some approaches<br />

like KeyX and the Refined Path from the Index Fabric are able to directly return<br />

the requested element without further navigation in the XML data.<br />

In order to explain and illustrate the different indexing approaches in a quickly<br />

understandable manner we use some XML data taken from the XMark project<br />

and generate a specific index for each approach to be evaluated. The sample data<br />

consists of two items, one located in Asia and two in Europe. The items have<br />

different child elements describing the properties of the item. Additionally, the<br />

sample data contains two persons with their addresses. The textual representation<br />

of the sample data is presented in figure 4.1.<br />

1 <br />

2 <br />

3 <br />

4 <br />

5 Singapur<br />

6 2<br />

7 512 MB USB Stick<br />

8 Money order<br />

9 Cash<br />

10 <br />

11 <br />

12 <br />

13 <br />

14 Hamburg<br />

15 1<br />

16 Beuys Sculpture <br />

17 <br />

18 <br />

19 Paris<br />

20 2<br />

21 Louvre Tickets<br />

22 Cash<br />

23 <br />

24 <br />

25

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!