Cost-Based Optimization of Integration Flows - Datenbanken ...
Cost-Based Optimization of Integration Flows - Datenbanken ...
Cost-Based Optimization of Integration Flows - Datenbanken ...
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
5 Multi-Flow <strong>Optimization</strong><br />
Thus, for the worst case, T L (m i ) = lc, while for all other cases, T L (m i ) ≤ lc is true. Hence,<br />
Theorem 5.3 holds.<br />
Probability P(R’)<br />
Probability P(k’)<br />
E(R’)<br />
E(k’) with E(k’) = Σ (k i’ · P(k’=k i’))<br />
(a) Message Arrival Rate R ′<br />
Message<br />
Rate R’<br />
(b) Batch Size k ′<br />
Batch Size<br />
k’<br />
Figure 5.14: Waiting Time Computation With Skewed Message Arrival Rate<br />
So far, we have assumed a fixed message rate R. Now, we formally analyze the influence<br />
<strong>of</strong> the assumption <strong>of</strong> a stochastic arrival rate R ′ that exhibits a potentially skewed probability<br />
distribution D such as the Poisson distribution. Figure 5.14(a) illustrates the probability<br />
<strong>of</strong> a specific arrival rate (for a continuous arrival rate distribution function), while<br />
Figure 5.14(b) illustrates the resulting batch sizes k ′ . Here, D can be a discrete or continuous<br />
function, where the expected value E(R ′ ) is computed as the probability-weighted<br />
integral <strong>of</strong> continuous values or as the probability-weighted sum <strong>of</strong> discrete values, respectively.<br />
For example, the discrete Poisson distribution is relevant because Xiao et al. argued<br />
that the arrival process <strong>of</strong> workflow instances is typically Poisson-distributed [XCY06].<br />
The main difference is that we now include uncertainty—in the form <strong>of</strong> an arbitrary message<br />
rate—into the min ˆT L computation because until now, we have used k ′ = R · ∆tw as<br />
batch size estimation. For D = poisson, the fixed arrival rate R is substituted with an<br />
uncertain arrival rate R ′ such that<br />
k ′ = R ′ · ∆tw with P R (R ′ = r) = Rr<br />
r! · e−R , (5.19)<br />
with an expected value <strong>of</strong> E(R ′ ) = R. Due to the introduced uncertainty, we need to<br />
extend Theorem 5.3 to skewed probability distributions functions.<br />
Theorem 5.4 (Extended S<strong>of</strong>t Guarantee <strong>of</strong> Maximum Latency). The waiting time computation<br />
ensures that—for a given uncertain message rate R ′ with skewed probability distribution<br />
function D—the latency time <strong>of</strong> a single message T L (m i ) with m i ∈ M ′ will, in<br />
expectation, and for non-overload situations, not exceed the maximum latency constraint<br />
lc with T L (m i ) ≤ lc.<br />
Pro<strong>of</strong>. Recall the worst case <strong>of</strong> T L (M ′ ) = lc with<br />
⌈ |M<br />
T L (M ′ ′ ⌉<br />
|<br />
) = · ∆tw + W (P ′ , k ′ ). (5.20)<br />
k ′<br />
There, the over- and underestimation <strong>of</strong> batch size k ′ does affect the execution time<br />
W (P ′ , k ′ ). Hence, the worst case is given, where ∆tw = W (P ′ , k ′ ). With regard to<br />
the uncertain arrival rate, the average discrete over- and under-estimation <strong>of</strong> k ′ is, in<br />
expectation, equal with<br />
∑<br />
k ′ i ≤E(k′ )<br />
(<br />
k<br />
′<br />
i · P (k ′ = k ′ i) ) = ∑<br />
k ′ i ≥E(k′ )<br />
(<br />
k<br />
′<br />
i · P (k ′ = k ′ i) ) (5.21)<br />
154