url - Universität zu Lübeck

More documents

Recommendations

Info

Chapter 8 KeyX Implementation Details This chapter provides an overview of the architecture of the KeyX indexing system and gives some implementation details. KeyX with all its modules is implemented in over 110 Java classes, interfaces and exceptions that cannot be completely introduced in this thesis. Therefore, we concentrate on the most important aspects, methods and paradigms. The implementations were mainly done in the software developing environment Eclipse[25]. 8.1 Architecture KeyX is an application that works on top of a native XML database management system (XDBMS) and uses services that are provided through defined interfaces. This layered architecture abstracts from the particular implementation of the XML persistency and makes the exchange of the underlying modules a lot easier as all communication uses the interfaces. We have chosen the XDBMS Infonyte DB [55] because it comes with all required standardized interfaces like DOM or XPath. The architecture of Infonyte DB is shown in figure 8.1. Figure 8.1: The basic architecture of the Infonyte DB XDBMS KeyX is an application on top of Infonyte DB that may use any of the provided interfaces. Basically, all XML data are represented in the tree-structure DOM that is stored persistently as a PDOM document in the file system. Persistency is
148 CHAPTER 8. KEYX IMPLEMENTATION DETAILS a task of the XDBMS Infonyte DB so that the developer of an application has not to regard any persistency aspects like invoking serializations, etc. Compared to highly sophisticated commercial database products, Infonyte DB is a rudimentary database management system. Infonyte DB does not offer advanced features like multi-user control mechanisms or transactions. In particular, it cannot be guaranteed that some stored XML data is always valid concerning a given schema, because elements can be inserted or deleted in the DOM without consistency controls. Validation is always performed on demand and then on the whole XML data so that it is too expensive to validate the data after each modification. Infonyte comes with no satisfying indexing approach; it supports some structural queries in a rudimentary way. The performance measurements of section 5.5 show that the query execution time of Infonyte DB without a KeyX index can become very long. Infonyte offers no XML update language like XUPdate. Therefore, all manipulations must be performed in a low level manner on the DOM tree. Recapitulating, one can say that Infonyte’s state of development is not comparable to commercial relational products, so a company typically would not use it in productive environments. But this holds for most XML DBMS that are often initiated as research projects. An exception could be Tamino from the Software AG that is a fully commercial XDBMS. Anyhow, because Infonyte DB offers all features required for KeyX and the Software AG published only very few technical informations about Tamino the choice to use Infonyte DB was no fault. KeyX uses the Infonyte DB interfaces DOM and XPath. The XPath engine is used to evaluate queries when building an index or when no index is available for processing a query. For a selected node in the DOM tree Infonyte can return a unique id. The time to dereference an id is constant and not dependent on the size of the XML data or the position of the element. Any XDBMS that offers these two features may be used with the KeyX implementation in principle. The XPath interface of the XDBMS is handed over to the database applications of the highest layer with the difference that covered queries are executed upon an index. The database application is not aware if an index is used or not because it still sends XPath queries and gets XML nodes. This architecture makes it easy to integrate an indexing system in existing applications. Therefore, the results of this thesis may be used universally for native XDBML; they do not dependent on the specific DBMS Infonyte DB. A block diagram of the KeyX system can be found in figure 8.2. The architecture consists mainly of three parts: the query engine with the query optimizer as the central part, the ISP Tool that analyzes the workloads and optimizes the index configuration and a collection of the XML data, the workload and statistics. These data have to be stored persistently in order to survive system restarts. In a realistic environment it makes sense to log the workload for several weeks or months to get an impression of the typical usage of the database. The workload is a compressed file logging all occurred database operations in a
Page 1 and 2:
Aus dem Institut für Informationss
Page 3 and 4:
Acknowledgments I would like to tha
Page 5 and 6:
2 CONTENTS 4 Introduction to Recent
Page 7 and 8:
4 CONTENTS 10.3 XML Schema . . . .
Page 9 and 10:
6 CHAPTER 1. INTRODUCTION Due to th
Page 11 and 12:
8 CHAPTER 2. FUNDAMENTALS In contra
Page 13 and 14:
10 CHAPTER 2. FUNDAMENTALS data is
Page 15 and 16:
12 CHAPTER 2. FUNDAMENTALS XML supp
Page 17 and 18:
14 CHAPTER 2. FUNDAMENTALS 2.2 Docu
Page 19 and 20:
16 CHAPTER 2. FUNDAMENTALS be const
Page 21 and 22:
18 CHAPTER 2. FUNDAMENTALS plies th
Page 23 and 24:
20 CHAPTER 2. FUNDAMENTALS the stru
Page 25 and 26:
22 CHAPTER 2. FUNDAMENTALS 2.3 XML
Page 27 and 28:
24 CHAPTER 2. FUNDAMENTALS Axes for
Page 29 and 30:
26 CHAPTER 2. FUNDAMENTALS Node Tes
Page 31 and 32:
28 CHAPTER 2. FUNDAMENTALS //item[c
Page 33 and 34:
30 CHAPTER 2. FUNDAMENTALS 21 i f (
Page 35 and 36:
32 CHAPTER 2. FUNDAMENTALS FLWOR-Ex
Page 37 and 38:
34 CHAPTER 2. FUNDAMENTALS 21 22 {
Page 39 and 40:
36 CHAPTER 2. FUNDAMENTALS 1 2 3
Page 41 and 42:
38 CHAPTER 2. FUNDAMENTALS 2.5 XML
Page 43 and 44:
40 CHAPTER 2. FUNDAMENTALS the valu
Page 45 and 46:
42 CHAPTER 2. FUNDAMENTALS tables a
Page 47 and 48:
44 CHAPTER 2. FUNDAMENTALS signific
Page 49 and 50:
46 CHAPTER 3. FORMAL MODELS FOR XML
Page 51 and 52:
Page 53 and 54:
Page 55 and 56:
Page 57 and 58:
Page 59 and 60:
56 CHAPTER 4. INTRODUCTION TO RECEN
Page 61 and 62:
Page 63 and 64:
Page 65 and 66:
Page 67 and 68:
Page 69 and 70:
Page 71 and 72:
Page 73 and 74:
Page 75 and 76:
Page 77 and 78:
Page 79 and 80:
Page 81 and 82:
78 CHAPTER 5. THE KEY-ORIENTED XML
Page 83 and 84:
Page 85 and 86:
Page 87 and 88:
Page 89 and 90:
Page 91 and 92:
Page 93 and 94:
Page 95 and 96:
Page 97 and 98:
Page 99 and 100: 96 CHAPTER 5. THE KEY-ORIENTED XML
Page 107 and 108: 104 CHAPTER 6. THE INDEX SELECTION
Page 127 and 128: 124 CHAPTER 7. THE XML INDEX UPDATE
Page 149: 146 CHAPTER 7. THE XML INDEX UPDATE
Page 153 and 154: 150 CHAPTER 8. KEYX IMPLEMENTATION
Page 161 and 162: 158 CHAPTER 9. CONCLUSION AND FUTUR
Page 163 and 164: 160 CHAPTER 9. CONCLUSION AND FUTUR
Page 165 and 166: 162 CHAPTER 10. APPENDIX 23 relKeyP
Page 167 and 168: 164 CHAPTER 10. APPENDIX Title: On
Page 169 and 170: 166 CHAPTER 10. APPENDIX Title: The
Page 171 and 172: 168 CHAPTER 10. APPENDIX
Page 173 and 174: 170 BIBLIOGRAPHY [12] Alberto Capra
Page 175 and 176: 172 BIBLIOGRAPHY [37] Roy Goldman a
Page 177 and 178: 174 BIBLIOGRAPHY [59] Raghav Kaushi
Page 179 and 180: 176 BIBLIOGRAPHY [84] David G. Mitc
Page 181 and 182: 178 BIBLIOGRAPHY [113] W3 Schools.
Page 183 and 184: 180 Index D, 78, 108, 110, 111, 144
Page 185 and 186: 182 INDEX Oracle XML DB, 41 Parent,
show all

url - Universität zu Lübeck

Create successful ePaper yourself

Delete template?

Save as template?