Download - Academy Publisher
Download - Academy Publisher
Download - Academy Publisher
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
its first child q 1 on the left, that they have a<br />
q→ α q .<br />
relationship : 1<br />
3) p in G points to all child nodes<br />
in C = p , p , L,<br />
p ) , but there has its corresponding<br />
(<br />
2 3 n<br />
C' = ( q2 , q3,<br />
L,<br />
qn<br />
) in G', starting from q 1<br />
, build the<br />
same name item from the same name pointer and build<br />
the different name item from the next name chain.<br />
4) Similarly, we can define the relationship from<br />
nodes in G' to nodes in G. It’s omitted at here.<br />
Figure 2. CSTP Index<br />
Converted map is shown in Fig.2. Fig.1 and Fig.2<br />
show that the two structures are equivalent, but due to the<br />
query on the index, so this need for further analysis of<br />
their relationship and the corresponding characters.<br />
Theorem 1: CSTP compressed query index designed<br />
by this paper is a tree -graph, that has hierarchy of a tree<br />
that the parent node is unique,and also has connectivity of<br />
a graph on the same layer node.We could query from the<br />
root firstly and aslo do the breadth-first . The findings and<br />
order is the same as the original XML content.<br />
C. Expand Events of SAX Parser<br />
For the need of establishing index CSTP, this paper<br />
extend API of the event-driven SAX parser to judgement<br />
event after combining two adjacent events of SAX.<br />
For example, XML data:<br />
element contents 1 <br />
element contents 2 <br />
... ....<br />
SAX after the expansion is disposed as following:<br />
Judgement (Start element (a), Characters (element<br />
contents 1)) in order to send a judgement event, and then<br />
build node and dispose corresponding 7 judgement events<br />
proved and summed up at the later. SAX continues<br />
reading: Judgement (Characters (element contents 1), End<br />
element (a)) after completion, as described above for the<br />
corresponding treatment, SAX continues reading and<br />
puting out Judgement (End element (a), Start element<br />
(b) ), ... ..., and so on, in order to guarantee building the<br />
CSTP index proposed in this paper only by reading<br />
through SAX.<br />
D. The Building of The Index<br />
Lemma 1: For a pure XML document (such as the<br />
DTD, Schema or CSS which is separated from XML<br />
data), all of the information in it can be divided into five<br />
kinds:<br />
1 : <br />
2 : <br />
3 : <br />
4 : <br />
5 : element content<br />
Prove: According to the W3C’s definition and<br />
specifications for XML documents, it is not difficult to<br />
prove the lemma above.<br />
Character 1: XML has a total of five kinds of generic<br />
events, and a maximum of 25 judgement events.<br />
Because the SAX sends generic event from the start<br />
tag, end tag and element content in XML, each<br />
information of the five kinds divided above lemma is<br />
corresponded to a genetic event. Expand the genetic event<br />
of SAX API, 2 genetic events constitute a judgement<br />
event, then five kinds of genetic events could constitute<br />
25 species of judgement events. They are (It illustrates<br />
the relationship or their characters between the two<br />
genetic events behind genetic each event):<br />
(1) <br />
(2) <br />
(3) <br />
(4) <br />
(5) element contents<br />
(6) <br />
(7) <br />
(8) <br />
(9) <br />
(10) element contents<br />
(11) <br />
(12) <br />
(13) <br />
(14) <br />
(15) element<br />
contents<br />
(16) <br />
(17) <br />
(18) <br />
(19) <br />
(20) element contents<br />
(21) element contents<br />
(22) element contents<br />
(23) element contents