url - Universität zu Lübeck
url - Universität zu Lübeck
url - Universität zu Lübeck
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
7.4. EVALUATION 137<br />
Figure 7.5: Measurements of the intersection algorithm<br />
7.4.2 Evaluation of Satisfiability<br />
In a second scenario we evaluated the more complex algorithm checking the satisfiability<br />
of two path expressions p, p ′ ∈ XP {[],∗,//,NOT } . The algorithm has an<br />
exponential runtime because all variables must be checked in order to satisfy<br />
the qualifiers. In order to show that the exponential complexity is no significant<br />
limitation in the database context we determine the expected value for the times<br />
expenses for typical operations with an average case simulation.<br />
In general, the expected value EX of a random variable X : Ω → R is<br />
∫<br />
EX = X(ω) · p(ω) dω<br />
ω∈Ω<br />
with Ω = XP {[],∗,//,NOT } the set of all path expressions, X(ω) is the runtime of the<br />
algorithm for a specific path expression ω ∈ Ω and p(ω) its probability.<br />
There are three variables for a path expression: the length l p of the path linearize(p)<br />
which affects the intersection algorithm. Because intersection is determined very<br />
quickly (see previous experiment) l p has only a little effect. The second and third<br />
variables are the number of qualifiers and element names in a path expression.<br />
The element names correspond to the variables in 3SAT. Because Ω is infinite in<br />
general we have to restrict it to a reasonable boundary: We think that 100 qualifiers<br />
in a path expression provide a realistic upper value for database operations<br />
(comparable to a SELECT statement with 100 WHERE clauses in SQL). Second,<br />
because we have no distribution function for the probabilities of path expressions<br />
we assume an equal distribution.