
6.4. AUTONOMOUS XML INDEXING 121

Test Scenario 3

The previous two scenarios were constructed to evaluate isolated characteristics of the KeyX auto index system and operated on artificial data. To determine the overall performance of KeyX, we set up a more realistic test using real XML data from the DBLP project [70], the well-known computer science bibliography. The full DBLP data set consists of approximately 500,000 publications, mainly articles, inproceedings, and books.

Our concrete test data is an extract of the full DBLP of roughly 26 megabytes and consists of 586,546 element, attribute, and text nodes; more precisely, 534 articles, 57,000 inproceedings, and 1,024 proceedings.

For the test we set up 27 different XPath-based queries. Each operation o has one index candidate of class ican1(o) that supports the query best. We created an initial workload by randomly selecting 25 of these database operations. In general, some operations are selected multiple times while others are not part of the workload at all. Additionally, the operations in the workload are randomly designated as querying or modifying according to a predefined ratio.
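The workload construction described above can be sketched as follows. This is an illustrative reconstruction, not the KeyX implementation; the names (`make_workload`, `query_ratio`, the numeric operation identifiers) are our own assumptions:

```python
import random

# Hypothetical stand-in for the 27 XPath-based operations,
# identified here simply by index.
OPERATIONS = list(range(27))

def make_workload(ops, size=25, query_ratio=0.8, rng=random):
    """Draw `size` operations at random with replacement (so some
    operations may repeat and others be absent), then mark each entry
    as querying or modifying according to the predefined ratio."""
    workload = []
    for _ in range(size):
        op = rng.choice(ops)
        kind = "query" if rng.random() < query_ratio else "modify"
        workload.append((op, kind))
    return workload

workload = make_workload(OPERATIONS)
```

Drawing with replacement is what produces the effect noted above: some operations appear several times in the workload while others do not appear at all.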

Further workloads are created by a delta algorithm that exchanges one operation from the workload for a new one selected randomly from the set of 27 operations. The total size of the workload stays unchanged.
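A minimal sketch of one such delta step, assuming the workload is held as a list of operations (the function name is ours, not taken from the thesis):

```python
import random

def delta_step(workload, all_ops, rng=random):
    """Exchange one randomly chosen workload entry for an operation
    drawn at random from the full operation set. The workload size
    is left unchanged."""
    new_workload = list(workload)
    pos = rng.randrange(len(new_workload))
    new_workload[pos] = rng.choice(all_ops)
    return new_workload
```

Because the replacement is drawn from the full set, the new operation may occasionally equal the one it replaces, so each step changes the workload by at most one entry.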

The delta algorithm guarantees small, random changes in the workload, both in the contained path expressions and in the ratio of querying to modifying operations. This simulates a real database application that changes over time. Because the workload changes slowly, the ISP Tool is able to adapt the KeyX index system: the index selection tool is called periodically (every 30 runs) and finds a new index configuration that better suits the changed workload. Of course, each run of this non-deterministic algorithm generates different results.
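The overall test loop implied by this description might look like the following sketch; `execute` and `select_indexes` are placeholders for the KeyX cost measurement and the ISP Tool, respectively, and all parameter names are our own:

```python
import random

def run_experiment(operations, runs=120, period=30, workload_size=25,
                   rng=random, execute=lambda w: None,
                   select_indexes=lambda w: None):
    """Execute the workload each run, mutate it by one operation per
    run (the delta step), and invoke index selection every `period`
    runs so the index configuration can follow the drifting workload."""
    workload = [rng.choice(operations) for _ in range(workload_size)]
    for run in range(1, runs + 1):
        execute(workload)                    # measure workload cost
        if run % period == 0:
            select_indexes(workload)         # periodic ISP Tool call
        workload = list(workload)            # delta step: swap one entry
        workload[rng.randrange(len(workload))] = rng.choice(operations)
    return workload
```

With `runs=120` and `period=30` this structure yields four index selection calls, matching the periodic re-adaptation described above.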

The costs of dropping and creating indexes are not taken into account because in realistic scenarios with more slowly changing workloads the index selection tool would be called less frequently, and index updates can be performed at times when the CPU is less utilized. We present the measurements of a representative test run in figure 6.7.

The first four workloads are executed without any index. Then the index selection tool is called and creates indexes that accelerate the ongoing workloads. The delta algorithm changes the workload more and more, so the established indexes perform increasingly poorly. At every 30th run, an edge in the curve indicates that the index selection tool has updated the index configuration. This sawtooth pattern is typical for this test.
