url - Universität zu Lübeck

More documents

Recommendations

Info

Chapter 4 Introduction to Recent Approaches in XML Indexing In this section we classify and describe recent approaches indexing XML and semistructured data. Some approaches were published before XML gained the current importance and generally operate on semistructured data. We transferred these approaches to XML. The basic idea of an index for semistructured data and XML is to accelerate the execution of path expressions, for instance XPath. The more complex XQuery expressions benefit from an index, too, because XQuery relies on the execution of XPath expressions for addressing the nodes of the sequences. All indexing approaches have in common that they try to avoid the linear inspection of XML nodes when performing node tests or checking predicates. For instance, when evaluating the XPath expression //item[/name=’MP3 Player’] every element is treated as if it has the label item or not. Second, for each item element all children are checked whether they have the label name. Third, for all name elements the corresponding text value is compared with the given string. For larger databases this evaluation method leads to unacceptable processing times. Although all indexing approaches have the same goal, their methodology, the internal data structures, and the query processing vary significantly. For this reason we establish some criteria in order to classify and compare the related work on XML indexing. Some index approaches index the structure of the XML data without regarding the value of elements or attributes. These approaches are called structural indexes or pure-path indexes. On the other hand some indexes cover only the value of elements and attributes without reflecting the leading path to these values; these approaches are called value indexes. Advanced approaches cover both structure and values leading to an acceleration of more general and realistic path expres-
56 CHAPTER 4. INTRODUCTION TO RECENT APPROACHES IN XML INDEXING sions; these approaches are itemized as hybrid indexes. The selectivity on an index states whether it always covers the whole XML data or is tunable for specific and user-defined fragments. A non-selective index has to be updated whenever the original data is modified. A selective index consumes less space and can be tuned for the typical usage of the database leading to less update operations. A relational index is selective because it is defined upon a table and a column. Key-queries may return an element which differs from the key-element(s) that is/are used for the value comparison. For instance, the general path expression //item[quantity > x 1 ] returns item elements whereas the value used for the comparison belongs to a quantity element. The majority of index approaches can only return the indexed key-element leading to additional expenses for navigation if the return element is different. For large paths between key and the return value this may add significant costs for the query processor. Some approaches like KeyX and the Refined Path from the Index Fabric are able to directly return the requested element without further navigation in the XML data. In order to explain and illustrate the different indexing approaches in a quickly understandable manner we use some XML data taken from the XMark project and generate a specific index for each approach to be evaluated. The sample data consists of two items, one located in Asia and two in Europe. The items have different child elements describing the properties of the item. Additionally, the sample data contains two persons with their addresses. The textual representation of the sample data is presented in figure 4.1. 1 2 3 4 5 Singapur 6 2 7 512 MB USB Stick 8 Money order 9 Cash 10 11 12 13 14 Hamburg 15 1 16 Beuys Sculpture 17 18 19 Paris 20 2 21 Louvre Tickets 22 Cash 23 24 25
Page 1 and 2:
Aus dem Institut für Informationss
Page 3 and 4:
Acknowledgments I would like to tha
Page 5 and 6:
2 CONTENTS 4 Introduction to Recent
Page 7 and 8: 4 CONTENTS 10.3 XML Schema . . . .
Page 9 and 10: 6 CHAPTER 1. INTRODUCTION Due to th
Page 11 and 12: 8 CHAPTER 2. FUNDAMENTALS In contra
Page 13 and 14: 10 CHAPTER 2. FUNDAMENTALS data is
Page 15 and 16: 12 CHAPTER 2. FUNDAMENTALS XML supp
Page 17 and 18: 14 CHAPTER 2. FUNDAMENTALS 2.2 Docu
Page 19 and 20: 16 CHAPTER 2. FUNDAMENTALS be const
Page 21 and 22: 18 CHAPTER 2. FUNDAMENTALS plies th
Page 23 and 24: 20 CHAPTER 2. FUNDAMENTALS the stru
Page 25 and 26: 22 CHAPTER 2. FUNDAMENTALS 2.3 XML
Page 27 and 28: 24 CHAPTER 2. FUNDAMENTALS Axes for
Page 29 and 30: 26 CHAPTER 2. FUNDAMENTALS Node Tes
Page 31 and 32: 28 CHAPTER 2. FUNDAMENTALS //item[c
Page 33 and 34: 30 CHAPTER 2. FUNDAMENTALS 21 i f (
Page 35 and 36: 32 CHAPTER 2. FUNDAMENTALS FLWOR-Ex
Page 37 and 38: 34 CHAPTER 2. FUNDAMENTALS 21 22 {
Page 39 and 40: 36 CHAPTER 2. FUNDAMENTALS 1 2 3
Page 41 and 42: 38 CHAPTER 2. FUNDAMENTALS 2.5 XML
Page 43 and 44: 40 CHAPTER 2. FUNDAMENTALS the valu
Page 45 and 46: 42 CHAPTER 2. FUNDAMENTALS tables a
Page 47 and 48: 44 CHAPTER 2. FUNDAMENTALS signific
Page 49 and 50: 46 CHAPTER 3. FORMAL MODELS FOR XML
Page 57: 54 CHAPTER 3. FORMAL MODELS FOR XML
Page 61 and 62: 58 CHAPTER 4. INTRODUCTION TO RECEN
Page 81 and 82: 78 CHAPTER 5. THE KEY-ORIENTED XML
Page 107 and 108: 104 CHAPTER 6. THE INDEX SELECTION
Page 109 and 110:
106 CHAPTER 6. THE INDEX SELECTION
Page 111 and 112:
Page 113 and 114:
Page 115 and 116:
Page 117 and 118:
Page 119 and 120:
Page 121 and 122:
Page 123 and 124:
Page 125 and 126:
Page 127 and 128:
124 CHAPTER 7. THE XML INDEX UPDATE
Page 129 and 130:
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
148 CHAPTER 8. KEYX IMPLEMENTATION
Page 153 and 154:
Page 155 and 156:
Page 157 and 158:
Page 159 and 160:
Page 161 and 162:
158 CHAPTER 9. CONCLUSION AND FUTUR
Page 163 and 164:
160 CHAPTER 9. CONCLUSION AND FUTUR
Page 165 and 166:
162 CHAPTER 10. APPENDIX 23 relKeyP
Page 167 and 168:
164 CHAPTER 10. APPENDIX Title: On
Page 169 and 170:
166 CHAPTER 10. APPENDIX Title: The
Page 171 and 172:
168 CHAPTER 10. APPENDIX
Page 173 and 174:
170 BIBLIOGRAPHY [12] Alberto Capra
Page 175 and 176:
172 BIBLIOGRAPHY [37] Roy Goldman a
Page 177 and 178:
174 BIBLIOGRAPHY [59] Raghav Kaushi
Page 179 and 180:
176 BIBLIOGRAPHY [84] David G. Mitc
Page 181 and 182:
178 BIBLIOGRAPHY [113] W3 Schools.
Page 183 and 184:
180 Index D, 78, 108, 110, 111, 144
Page 185 and 186:
182 INDEX Oracle XML DB, 41 Parent,
show all

url - Universität zu Lübeck

Create successful ePaper yourself

Delete template?

Save as template?