url - Universität zu Lübeck

More documents

Recommendations

Info

6.3. INDEX SELECTION PROBLEM APPLIED TO KEYX INDEXES 109 Definition 22 (Index Candidates) The index candidates are defined as a function ican : P → P(D) returning a set containing all possible index declarations for a given path expression p. The following definition combines and permutes the key nodes: ican(p) = {([k 1 , k 2 , . . . , k m ], value(p)) | k j ∈ key(p) ∧ In total we have to consider ∑ m n=0 1 ≤ j ≤ m ∧ 1 ≤ m ≤ |key(p)|} m! (m−n)! − 1 different possible indexes. As most of these indexes are dropped during ISP calculation, we call them index candidates of the path expression p. Index candidates are virtual and not materialized in the database. Example 16 A query o with the path expression p 5 = /dblp/article[author = ”X” and title = ”Y ”] has the following key and value nodes: key(p 5 ) = {/dlbp/article/author, /dlpb/article/title} value(p 5 ) = /dlbp/article All index candidates are listed below. As the order of key nodes matters, the first two index candidates of ican(p 5 ) are not equivalent. ican(p 5 ) = {i 1 p 5 , i 2 p 5 , i 3 p 5 , i 4 p 5 } with i 1 p 5 = ([/dblp/article/author, /dblp/article/title], /dblp/article) i 2 p 5 = ([/dblp/article/title, /dblp/article/author], /dblp/article) i 3 p 5 = ([/dblp/article/author], /dblp/article) i 4 p 5 = ([/dblp/article/title], /dblp/article) The two multi-key indexes i 1 p 5 and i 2 p 5 constitute the best suitable indexes for p 5 as they reflect both key nodes. In contrast the two indexes i 3 p 5 and i 4 p 5 require additional processing of the referenced nodes, but are still more efficient than an evaluation of the plain path expression without an index. Please notice that the number of index candidates grows exponentially with the number of key nodes of a path expression and increases the costs of solving the ISP dramatically. A path expression with 4 key nodes will lead to 64 index candidates! Heuristics to decrease the computational expense have to start at this point by reducing the number of index candidates. To consider the whole workload we need to regard the index candidates of all database operations o ∈ W . This is done by unifying the index candidates of all operations to the total index candidate set.
110 CHAPTER 6. THE INDEX SELECTION PROBLEM Definition 23 (Total Index Candidates) The index candidates of all database operations of the workload W are unified to the total index candidates set T IC W : T IC W = ⋃ o∈W ican(path(o)) The set T IC consists of all (l) possible index declarations d 1 , . . . , d l ∈ D which are relevant for the workload W . T IC = {d 1 , d 2 , . . . , d l } with d j ∈ D and 1 ≤ j ≤ l The set T IC is constant for a given workload and stays unchanged while exploring the ISP. The number of index candidates in T IC may be less than the sum of all index candidates in all sets of ican(o) as they may contain duplicates. Nonetheless the number of indexes in T IC grows exponentially. 6.3.2 Index Configuration |T IC W | ≤ ∑ o j ∈W |ican(path(o j ))| A lot of the index candidates of T IC are dropped when calculating the set of indexes of T IC which is optimal for the workload. A set of indexes is called index configuration. Definition 24 (Index Configuration) An index configuration c points out which index declarations of T IC are materialized and available when executing the workload. A configuration c is a vector of flags identifying which index candidates from T IC are selected or not. c = (f 1 , f 2 , . . . , f l ) with f j ∈ {0, 1} and 1 ≤ j ≤ l The j-th flag f j identifies if the j-th index i j from T IC is selected (f j = 1) or not (f j = 0). The set of all possible index configurations is denoted by CONF T ICW . Example 17 The configuration c 1 = (1, 0, 0, 1, 1) indicates that the first and the last two indexes are materialized while the second and third are inactive. Because every combination of indexes is a different index configuration it is obvious that |T IC| |CONF T IC | = 2 6.3.3 Cost Functions In order to find the best index configuration for the workload W we have to compare the costs and profits of all index configurations. Materializing each index
Page 1 and 2:
Aus dem Institut für Informationss
Page 3 and 4:
Acknowledgments I would like to tha
Page 5 and 6:
2 CONTENTS 4 Introduction to Recent
Page 7 and 8:
4 CONTENTS 10.3 XML Schema . . . .
Page 9 and 10:
6 CHAPTER 1. INTRODUCTION Due to th
Page 11 and 12:
8 CHAPTER 2. FUNDAMENTALS In contra
Page 13 and 14:
10 CHAPTER 2. FUNDAMENTALS data is
Page 15 and 16:
12 CHAPTER 2. FUNDAMENTALS XML supp
Page 17 and 18:
14 CHAPTER 2. FUNDAMENTALS 2.2 Docu
Page 19 and 20:
16 CHAPTER 2. FUNDAMENTALS be const
Page 21 and 22:
18 CHAPTER 2. FUNDAMENTALS plies th
Page 23 and 24:
20 CHAPTER 2. FUNDAMENTALS the stru
Page 25 and 26:
22 CHAPTER 2. FUNDAMENTALS 2.3 XML
Page 27 and 28:
24 CHAPTER 2. FUNDAMENTALS Axes for
Page 29 and 30:
26 CHAPTER 2. FUNDAMENTALS Node Tes
Page 31 and 32:
28 CHAPTER 2. FUNDAMENTALS //item[c
Page 33 and 34:
30 CHAPTER 2. FUNDAMENTALS 21 i f (
Page 35 and 36:
32 CHAPTER 2. FUNDAMENTALS FLWOR-Ex
Page 37 and 38:
34 CHAPTER 2. FUNDAMENTALS 21 22 {
Page 39 and 40:
36 CHAPTER 2. FUNDAMENTALS 1 2 3
Page 41 and 42:
38 CHAPTER 2. FUNDAMENTALS 2.5 XML
Page 43 and 44:
40 CHAPTER 2. FUNDAMENTALS the valu
Page 45 and 46:
42 CHAPTER 2. FUNDAMENTALS tables a
Page 47 and 48:
44 CHAPTER 2. FUNDAMENTALS signific
Page 49 and 50:
46 CHAPTER 3. FORMAL MODELS FOR XML
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
56 CHAPTER 4. INTRODUCTION TO RECEN
Page 61 and 62: 58 CHAPTER 4. INTRODUCTION TO RECEN
Page 81 and 82: 78 CHAPTER 5. THE KEY-ORIENTED XML
Page 107 and 108: 104 CHAPTER 6. THE INDEX SELECTION
Page 111: 108 CHAPTER 6. THE INDEX SELECTION
Page 127 and 128: 124 CHAPTER 7. THE XML INDEX UPDATE
Page 151 and 152: 148 CHAPTER 8. KEYX IMPLEMENTATION
Page 161 and 162: 158 CHAPTER 9. CONCLUSION AND FUTUR
Page 163 and 164:
160 CHAPTER 9. CONCLUSION AND FUTUR
Page 165 and 166:
162 CHAPTER 10. APPENDIX 23 relKeyP
Page 167 and 168:
164 CHAPTER 10. APPENDIX Title: On
Page 169 and 170:
166 CHAPTER 10. APPENDIX Title: The
Page 171 and 172:
168 CHAPTER 10. APPENDIX
Page 173 and 174:
170 BIBLIOGRAPHY [12] Alberto Capra
Page 175 and 176:
172 BIBLIOGRAPHY [37] Roy Goldman a
Page 177 and 178:
174 BIBLIOGRAPHY [59] Raghav Kaushi
Page 179 and 180:
176 BIBLIOGRAPHY [84] David G. Mitc
Page 181 and 182:
178 BIBLIOGRAPHY [113] W3 Schools.
Page 183 and 184:
180 Index D, 78, 108, 110, 111, 144
Page 185 and 186:
182 INDEX Oracle XML DB, 41 Parent,
show all

url - Universität zu Lübeck

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?