Download - Academy Publisher

More documents

Recommendations

Info

ISBN 978-952-5726-09-1 (Print) Proceedings of the Second International Symposium on Networking and Network Security (ISNNS ’10) Jinggangshan, P. R. China, 2-4, April. 2010, pp. 054-057 Research for the Algorithm of Query to Compressed XML Data Guojia Yu 1 , Huizhong Qiu 2 , and Lin Tian 3 1 Scholl of Computer Science and Engineerin University of Electronic Science and Technology of China, ChengDU, China Email:yuguojia@foxmail.com 2 Scholl of Computer Science and Engineering University of Electronic Science and Technology of China, ChengDU, China Email: hzqiu@ uestc.edu.cn, ruan052@126.com Abstract—Because XML data is increasingly becoming the standard of transmission and distribution of Internet and enterprise's data in a common format. Efficient algorithms of compression and query in XML data can directly reduce the cost of storage of data and shorten response time of query. Studying in this aspect is widely promising. This article proposed an equivalence relation on the basis of characters of XML, and proved the rationality of the index and the feasibility of query algorithm on this method, then put forward a new query algorithm on the compressed index. Finally, compared with XGrind that supports query on the partial decompression of compressed XML data in experiment. The efficiency of query on the compressed index was significantly higher than Xgrind's in several sets of data . Index Terms—XML date; compressed index; query; algorithm I. INTRODUCTION In this paper,build XML compressed index, and query efficiently on this index. Complete it in three parts in main: First, code XML data of tags and attribute names with dictionary, then use Huffman[1] coding to compress the element values and attribute values. Secondly, expand SAX generic events into another events. And compress the original XML tree structure to build a new compressed index, reduce greatly data redundancy by the structured data itself. Finally, query efficiently some data on the compressed index. II. BUILD THE COMPRESSED INDEX A. Pre-Compression The first step: use the dictionary of pre-compression to encode XML data in the non-content nodes, scan the DTD or Schema whose XML document would be compressed, store the labels’ name and attributes’ name into two dictionaries, and then the values of the dictionary instead of these labels’ name and attributes’ name. After that,build the compressed indexed. In the previous,article shows some concepts of terminology: 1 The same name item: the same name of tags basing on the same parent node compose the same name item. 2 Different chain: the first element in all of the same items based on the same parent node composed of the different chain. 3Repetition rate of XML data: (the number of lable elements of XML ― the number of the same name item) / the number of lable elements of XML. 4 The judgement event: combine two adjacent events (that’s geniric events)of SAX,then expand API of the event-driven SAX parser to a judgement event B. TP Equivalence Relations Illuminationed by the indexs of APEX[2], Fabric[3], XQueC[4], XBZip[5] and other methods, this article will convert the tree structure of XML to another index who could guarantee to support efficient query. So introduce a TP equivalence relations (tree to tree-graph)with two structures can be interchangeable, that is isomorphic, as follows. Figure 1. XML document tree structure [Definition 1] TP equivalence relation: given a tree G: G ( V , E) , where V is the set of nodes in G, E is the set of edges in G. Convert G, you can get another form of treegraph G '( V ', E' ) . Based on G and G',we can define a binary relation R. If R satisfies the following conditions, that R is a TP equivalence relations of G and G': 1) any node u in G has exactly the same and unique corresponding node u' in G'. 2) If there is a node in G, whose child pointer p point to the p 1 -the first child on the left, that they have a p→ α p relationship 1, then, in G' there must also exist a corresponding element of q, which child pointer point to © 2010 ACADEMY PUBLISHER AP-PROC-CS-10CN006 54
its first child q 1 on the left, that they have a q→ α q . relationship : 1 3) p in G points to all child nodes in C = p , p , L, p ) , but there has its corresponding ( 2 3 n C' = ( q2 , q3, L, qn ) in G', starting from q 1 , build the same name item from the same name pointer and build the different name item from the next name chain. 4) Similarly, we can define the relationship from nodes in G' to nodes in G. It’s omitted at here. Figure 2. CSTP Index Converted map is shown in Fig.2. Fig.1 and Fig.2 show that the two structures are equivalent, but due to the query on the index, so this need for further analysis of their relationship and the corresponding characters. Theorem 1: CSTP compressed query index designed by this paper is a tree -graph, that has hierarchy of a tree that the parent node is unique,and also has connectivity of a graph on the same layer node.We could query from the root firstly and aslo do the breadth-first . The findings and order is the same as the original XML content. C. Expand Events of SAX Parser For the need of establishing index CSTP, this paper extend API of the event-driven SAX parser to judgement event after combining two adjacent events of SAX. For example, XML data: element contents 1 element contents 2 ... .... SAX after the expansion is disposed as following: Judgement (Start element (a), Characters (element contents 1)) in order to send a judgement event, and then build node and dispose corresponding 7 judgement events proved and summed up at the later. SAX continues reading: Judgement (Characters (element contents 1), End element (a)) after completion, as described above for the corresponding treatment, SAX continues reading and puting out Judgement (End element (a), Start element (b) ), ... ..., and so on, in order to guarantee building the CSTP index proposed in this paper only by reading through SAX. D. The Building of The Index Lemma 1: For a pure XML document (such as the DTD, Schema or CSS which is separated from XML data), all of the information in it can be divided into five kinds: 1 : 2 : 3 : 4 : 5 : element content Prove: According to the W3C’s definition and specifications for XML documents, it is not difficult to prove the lemma above. Character 1: XML has a total of five kinds of generic events, and a maximum of 25 judgement events. Because the SAX sends generic event from the start tag, end tag and element content in XML, each information of the five kinds divided above lemma is corresponded to a genetic event. Expand the genetic event of SAX API, 2 genetic events constitute a judgement event, then five kinds of genetic events could constitute 25 species of judgement events. They are (It illustrates the relationship or their characters between the two genetic events behind genetic each event): (1) (2) (3) (4) (5) element contents (6) (7) (8) (9) (10) element contents (11) (12) (13) (14) (15) element contents (16) (17) (18) (19) (20) element contents (21) element contents (22) element contents (23) element contents
Page 1 and 2:
Proceedings The Second Internationa
Page 3 and 4:
Table of Contents Message from the
Page 5 and 6:
Zuming Xiao, Zhan Guo, Bin Tan, and
Page 7 and 8:
Message from the Symposium Chairs T
Page 9 and 10:
Second International Symposium on N
Page 11 and 12:
ISBN 978-952-5726-09-1 (Print) Proc
Page 13 and 14: model that can deal with time serie
Page 15 and 16: ISBN 978-952-5726-09-1 (Print) Proc
Page 17 and 18: training for the Wushu competition
Page 21 and 22: information, called weak uncertain
Page 23 and 24: Student side Student side Student s
Page 27 and 28: If we define element of student as
Page 31 and 32: QoS, each frame data is divided int
Page 35 and 36: for Imaging Two and Three Phase Flo
Page 39 and 40: A. Profiling&following control algo
Page 41 and 42: Research and Realization about Conv
Page 43 and 44: PDF document structure is a tree st
Page 47 and 48: support 10 Mb / s. But ENC28J60 onl
Page 51 and 52: condition the first byte output of
Page 55 and 56: According to maximum membership deg
Page 59 and 60: ubber according to the mass ratio o
Page 61 and 62: Ⅲ. AN IMPROVED DNA ALGORITHM FOR
Page 63: Ⅴ.CONCLUSION REMARKS DNA computer
Page 67 and 68: chains can greatly improve efficien
Page 69 and 70: II. RELATED WORK A. Mobile Service
Page 71 and 72: special services. SOAP is used to b
Page 73 and 74: detection methods of DDoS attacks m
Page 75 and 76: Step4 calculate the new subordinate
Page 77 and 78: Figure 1. An analysis of the partit
Page 81 and 82: The preceding three formulas can be
Page 85 and 86: And the decay speed of buffer seque
Page 89 and 90: distance of view point. Given that
Page 93 and 94: Ⅳ. EVALUATION OF BLENDED LEARNING
Page 97 and 98: symmetric with respect to the origi
Page 101 and 102: each sample belongs to each categor
Page 105 and 106: A. Data Preparing To generate our t
Page 109 and 110: oth α and β . determination of Qu
Page 113 and 114: Strong earthquake 0.1< M L
Page 115 and 116:
ISBN 978-952-5726-09-1 (Print) Proc
Page 117 and 118:
indirect causes, and the logical re
Page 119 and 120:
egression. Granger causality test i
Page 121 and 122:
B. Evaluation We have evaluated thi
Page 123 and 124:
used as a source and neighboring ce
Page 125 and 126:
Figure 3. Drainage networks generat
Page 127 and 128:
Combinational logic unit failures i
Page 129 and 130:
Tabal.1 Combinational fault logic t
Page 131 and 132:
under the endorsement of both the m
Page 133 and 134:
TABLE I. DESCRIPTION OF PROPOSITION
Page 135 and 136:
Figure 2. The process of the invers
Page 137 and 138:
Figure 9. (a)the original image.(b)
Page 139 and 140:
ISBN 978-952-5726-09-1 (Print) Proc
Page 141 and 142:
In order to reduce to the number of
Page 143 and 144:
ISBN 978-952-5726-09-1 (Print) Proc
Page 145 and 146:
that studying being going to be to
Page 147 and 148:
Reference[10] analyzed the evolutio
Page 149 and 150:
module, communication module, apper
Page 151 and 152:
After the comprehensive performance
Page 153 and 154:
system testing can be seen that the
Page 155 and 156:
III. AUTONOMIC RESOURCE ALLOCATION
Page 157 and 158:
The QoE i (T i ) is the ith user’
Page 159 and 160:
ISBN 978-952-5726-09-1 (Print) Proc
Page 161 and 162:
nodes will be formed one cluster, t
Page 163 and 164:
ISBN 978-952-5726-09-1 (Print) Proc
Page 165 and 166:
Where n is the number of data point
Page 167 and 168:
ISBN 978-952-5726-09-1 (Print) Proc
Page 169 and 170:
Keyboard event Application User mod
Page 171 and 172:
ISBN 978-952-5726-09-1 (Print) Proc
Page 173 and 174:
SQL Azure will eventually include a
Page 175 and 176:
ISBN 978-952-5726-09-1 (Print) Proc
Page 177 and 178:
The nodes in the suffix tree are dr
Page 179 and 180:
[3] Y. Li, S. M. Chung, and J. D. H
Page 181 and 182:
some drilling fluid produces hydrog
Page 183 and 184:
"normalization", whose membership b
Page 185 and 186:
and minimum structural elements in
Page 187 and 188:
esults of the spatial transform par
Page 189 and 190:
B. Research on protocol actions Com
Page 191 and 192:
ACKNOWLEDGMENT This work is funded
Page 193 and 194:
A. Problem Description In a small b
Page 195 and 196:
[4] Huang, Hung, and J. Y. jen Hsu.
Page 197 and 198:
Further by calculating, the followi
Page 199 and 200:
TABLE II. THE CONCENTRATION BETWEEN
Page 201 and 202:
private key that obtained by using
Page 203 and 204:
ISBN 978-952-5726-09-1 (Print) Proc
Page 205 and 206:
ontology, and the domain dictionary
Page 207 and 208:
ISBN 978-952-5726-09-1 (Print) Proc
Page 209 and 210:
Eq.4 is NP-hard and can be solved b
Page 211 and 212:
ISBN 978-952-5726-09-1 (Print) Proc
Page 213 and 214:
Where: G i is the selection field o
Page 215 and 216:
If there is X j in a generation, wh
Page 217 and 218:
Theorem 2.4([10]). Let L 1 and L 2
Page 219 and 220:
The relation between the lattice im
Page 221 and 222:
Figure 2. Example of single-step de
Page 223 and 224:
evocation and the fourth group stor
Page 225 and 226:
focused mainly on rule-based forms
Page 227 and 228:
Ⅵ. CONCLUSION Figure 6. The Flask
Page 229 and 230:
EDCF in comparison with DCF, has so
Page 231 and 232:
B. Simulation results and analysis
Page 233 and 234:
ISBN 978-952-5726-09-1 (Print) Proc
Page 235 and 236:
in routing table. When search resou
Page 237 and 238:
ISBN 978-952-5726-09-1 (Print) Proc
Page 239 and 240:
others through the selective emotio
Page 241 and 242:
such as weather information, techno
Page 243 and 244:
the opening window, a related page
Page 245 and 246:
1) Process context storage areas. I
Page 247 and 248:
V. CONCLUSION In this thesis, sCPU-
Page 249 and 250:
main types of horizontal search env
Page 251 and 252:
Enterprise Portal security provides
Page 253 and 254:
() t = [ S () t S () t ] T N S ,...
Page 255 and 256:
[2] Kumaravel, N., and Kavitha, V.,
Page 257 and 258:
output, so BP network has been wide
Page 259 and 260:
input to the artificial neural netw
Page 261 and 262:
(2) In a process (tokens from outsi
Page 263 and 264:
The reduction process consists of t
Page 265 and 266:
all eight normal vectors are identi
Page 267 and 268:
is B object , and the number of emp
Page 269 and 270:
xi, j y' = εα ( (1 + tanh( )) −
Page 271 and 272:
Error 8000 7000 6000 5000 4000 3000
Page 273 and 274:
Architecture (AMBA) a new bus archi
Page 275 and 276:
In order to ensure the smooth proce
Page 277 and 278:
ISBN 978-952-5726-09-1 (Print) Proc
Page 279 and 280:
In this paper, we adopt two Sobel o
Page 281 and 282:
ISBN 978-952-5726-09-1 (Print) Proc
Page 283 and 284:
four layers.The far right of the gr
Page 285 and 286:
ISBN 978-952-5726-09-1 (Print) Proc
Page 287 and 288:
TABLE II. OPERATIONAL EMPLOYEE TABL
Page 289 and 290:
A. Object-relational Type To audit
Page 291 and 292:
to execution time. Also, DBA would
Page 293 and 294:
Liping Chen .......................
show all

Download - Academy Publisher

Create successful ePaper yourself

Delete template?

Save as template?