url - Universität zu Lübeck

More documents

Recommendations

Info

5.4. QUERY PROCESSING 89 5.4 Query Processing Executing a query without an index implies a sequential search in the full XML data leading to linear complexity in the size of the XML data. For larger data, e.g. a list of available books in a bookstore, this implies a prohibitive query execution time. 5.4.1 Query Execution with matching KeyX Indexes The query optimizer extracts the keys of a given query q and looks for an appropriate index. In the following an index j is defined by its declaration consisting of the linear path expressions to the keys, qualifiers and the return value. Formally, j is the tuple (K j , Q j , v j ) see section 5.1.1 for details). An XPath based query q consists of analogous path expressions; q = (K q , Q q , v q ). If the path expressions of the query q and an index j are exactly the same then the index matches best and can instantly be used. The key(s) are searched in the tree structure of the index; like in relational indexes this is done in logarithmic time. If the key is found the attached value is either the reference to the corresponding return value in the XML data (single-key index) or a search tree of a lower level (multi-key index) so that further (recursive) key retrieval is performed. The references to the return values in the XML data are returned as the result of the query. 5.4.2 Index Usage with Deviating Return Values If the path expression of the keys and the qualifiers of the index j and the query q are the same (K q = K j , Q q = Q j ) but the path expressions to the return value differ (v q ≠ v j ) then the index might still be used in some cases with additional postprocessing: If the elements requested by v q are reachable from v j by a linear relative path expression p △ = v q − v j that contains no wildcard or descendant operator we can evaluate p △ on each node that is referenced by the index j through v j . Example 12 Lets say we have an index j that indexes item elements by their name value. The corresponding XPath expression is //item[name=’x’] with K j = {//item/name}, Q j = ∅ and v j = //item. The query q = //item/location[../name = ′ Sinus MP 3 P layer ′ ] has the same key as the index j but the return value differs: //item ≠ //item/location. The relative linear path expression p △ describes the path from the elements that are referenced by the index to the elements that are requested by the query. In this case p △ = //item/location − //item = /location navigates to the location children of each item element.
90 CHAPTER 5. THE KEY-ORIENTED XML INDEX KEYX The postprocessing raises additional costs but if the set of elements referenced by v j is small the total costs of the query evaluation with the index will still outperform the exhaustive evaluation over the whole document. If there is a wildcard operator in the path expressions of the query the index may not be suitable anymore because it does not cover all requested elements. For instance, the index defined by /site/regions/asia/item[name=’x’] indexes all items located in asia. The query /site/regions/*/item[name=’Sinus MP3 Player’] does not have a regional restriction. We could calculate the path expression p △ = ../∗ that navigates from the item elements in asia to all children of its parent but this would not lead to success because the index does not cover the values of name elements that do not belong to asia. The decision whether we can use an index or not relies on the subset relationship (containment) of the corresponding keys. 5.4.3 Containment Problem In general, a selective index covers all queries with a result set being a subset of the query that defined the index. For instance, an index that is designed to accelerate queries of the form /dblp/ ∗ [author = ′ x ′ ] is also capable of evaluating queries like /dblp/book[author = ′ x ′ ] or /dblp/article[author = ′ x ′ ] because the selected keys are a subset of the keys of the index. When using an index that covers a superset of the elements that are selected by the query an additional postprocessing step has to filter wrong hits: A simple node test checks if the selected nodes are of the requested element type (e.g. an element selected by ∗ is checked if it has the label book). Like in the previous case the postprocessing requires linear complexity in the size of the elements that are returned by the index. If an index is defined with a non-empty set of qualifiers (e.g. only books with an isbn child) it cannot be used to process a query that ignores the qualifier because the index does not cover all requested elements. In contrast, a query that poses more qualifiers than the index can be processed by the index with additional postprocessing because the query’s result is a subset of the elements that are indexed. The decision whether the selected nodes of one XPath expression p are a subset of the result set of a second expression p ′ (p ⊆ p ′ ) can be solved using the containment algorithm presented by Miklau and Suciu [82]. This algorithm constructs tree patterns for the path expressions p and p ′ and creates two (alternating) tree automata A and A ′ accepting XML data that can be queried by p and p ′ . Containment holds (p ⊆ p ′ ) if lang(A) ⊆ lang(A ′ ). A third automaton A ′′ accepting the complement of lang(A ′ ) (lang(A ′ )) is built on the base of A ′ by exchanging all accepting states with the non-accepting states. If the product automaton B = A x A ′′ has no reachable accepting state it holds, that lang(A)∩lang(A ′ ) = ∅. This is equiv-
Page 1 and 2:
Aus dem Institut für Informationss
Page 3 and 4:
Acknowledgments I would like to tha
Page 5 and 6:
2 CONTENTS 4 Introduction to Recent
Page 7 and 8:
4 CONTENTS 10.3 XML Schema . . . .
Page 9 and 10:
6 CHAPTER 1. INTRODUCTION Due to th
Page 11 and 12:
8 CHAPTER 2. FUNDAMENTALS In contra
Page 13 and 14:
10 CHAPTER 2. FUNDAMENTALS data is
Page 15 and 16:
12 CHAPTER 2. FUNDAMENTALS XML supp
Page 17 and 18:
14 CHAPTER 2. FUNDAMENTALS 2.2 Docu
Page 19 and 20:
16 CHAPTER 2. FUNDAMENTALS be const
Page 21 and 22:
18 CHAPTER 2. FUNDAMENTALS plies th
Page 23 and 24:
20 CHAPTER 2. FUNDAMENTALS the stru
Page 25 and 26:
22 CHAPTER 2. FUNDAMENTALS 2.3 XML
Page 27 and 28:
24 CHAPTER 2. FUNDAMENTALS Axes for
Page 29 and 30:
26 CHAPTER 2. FUNDAMENTALS Node Tes
Page 31 and 32:
28 CHAPTER 2. FUNDAMENTALS //item[c
Page 33 and 34:
30 CHAPTER 2. FUNDAMENTALS 21 i f (
Page 35 and 36:
32 CHAPTER 2. FUNDAMENTALS FLWOR-Ex
Page 37 and 38:
34 CHAPTER 2. FUNDAMENTALS 21 22 {
Page 39 and 40:
36 CHAPTER 2. FUNDAMENTALS 1 2 3
Page 41 and 42: 38 CHAPTER 2. FUNDAMENTALS 2.5 XML
Page 43 and 44: 40 CHAPTER 2. FUNDAMENTALS the valu
Page 45 and 46: 42 CHAPTER 2. FUNDAMENTALS tables a
Page 47 and 48: 44 CHAPTER 2. FUNDAMENTALS signific
Page 49 and 50: 46 CHAPTER 3. FORMAL MODELS FOR XML
Page 59 and 60: 56 CHAPTER 4. INTRODUCTION TO RECEN
Page 81 and 82: 78 CHAPTER 5. THE KEY-ORIENTED XML
Page 91: 88 CHAPTER 5. THE KEY-ORIENTED XML
Page 107 and 108: 104 CHAPTER 6. THE INDEX SELECTION
Page 127 and 128: 124 CHAPTER 7. THE XML INDEX UPDATE
Page 143 and 144:
140 CHAPTER 7. THE XML INDEX UPDATE
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
148 CHAPTER 8. KEYX IMPLEMENTATION
Page 153 and 154:
Page 155 and 156:
Page 157 and 158:
Page 159 and 160:
Page 161 and 162:
158 CHAPTER 9. CONCLUSION AND FUTUR
Page 163 and 164:
160 CHAPTER 9. CONCLUSION AND FUTUR
Page 165 and 166:
162 CHAPTER 10. APPENDIX 23 relKeyP
Page 167 and 168:
164 CHAPTER 10. APPENDIX Title: On
Page 169 and 170:
166 CHAPTER 10. APPENDIX Title: The
Page 171 and 172:
168 CHAPTER 10. APPENDIX
Page 173 and 174:
170 BIBLIOGRAPHY [12] Alberto Capra
Page 175 and 176:
172 BIBLIOGRAPHY [37] Roy Goldman a
Page 177 and 178:
174 BIBLIOGRAPHY [59] Raghav Kaushi
Page 179 and 180:
176 BIBLIOGRAPHY [84] David G. Mitc
Page 181 and 182:
178 BIBLIOGRAPHY [113] W3 Schools.
Page 183 and 184:
180 Index D, 78, 108, 110, 111, 144
Page 185 and 186:
182 INDEX Oracle XML DB, 41 Parent,
show all

url - Universität zu Lübeck

Create successful ePaper yourself

Delete template?

Save as template?