25.01.2015 Views

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

5 Multi-Flow <strong>Optimization</strong><br />

Thus, for the worst case, T L (m i ) = lc, while for all other cases, T L (m i ) ≤ lc is true. Hence,<br />

Theorem 5.3 holds.<br />

Probability P(R’)<br />

Probability P(k’)<br />

E(R’)<br />

E(k’) with E(k’) = Σ (k i’ · P(k’=k i’))<br />

(a) Message Arrival Rate R ′<br />

Message<br />

Rate R’<br />

(b) Batch Size k ′<br />

Batch Size<br />

k’<br />

Figure 5.14: Waiting Time Computation With Skewed Message Arrival Rate<br />

So far, we have assumed a fixed message rate R. Now, we formally analyze the influence<br />

<strong>of</strong> the assumption <strong>of</strong> a stochastic arrival rate R ′ that exhibits a potentially skewed probability<br />

distribution D such as the Poisson distribution. Figure 5.14(a) illustrates the probability<br />

<strong>of</strong> a specific arrival rate (for a continuous arrival rate distribution function), while<br />

Figure 5.14(b) illustrates the resulting batch sizes k ′ . Here, D can be a discrete or continuous<br />

function, where the expected value E(R ′ ) is computed as the probability-weighted<br />

integral <strong>of</strong> continuous values or as the probability-weighted sum <strong>of</strong> discrete values, respectively.<br />

For example, the discrete Poisson distribution is relevant because Xiao et al. argued<br />

that the arrival process <strong>of</strong> workflow instances is typically Poisson-distributed [XCY06].<br />

The main difference is that we now include uncertainty—in the form <strong>of</strong> an arbitrary message<br />

rate—into the min ˆT L computation because until now, we have used k ′ = R · ∆tw as<br />

batch size estimation. For D = poisson, the fixed arrival rate R is substituted with an<br />

uncertain arrival rate R ′ such that<br />

k ′ = R ′ · ∆tw with P R (R ′ = r) = Rr<br />

r! · e−R , (5.19)<br />

with an expected value <strong>of</strong> E(R ′ ) = R. Due to the introduced uncertainty, we need to<br />

extend Theorem 5.3 to skewed probability distributions functions.<br />

Theorem 5.4 (Extended S<strong>of</strong>t Guarantee <strong>of</strong> Maximum Latency). The waiting time computation<br />

ensures that—for a given uncertain message rate R ′ with skewed probability distribution<br />

function D—the latency time <strong>of</strong> a single message T L (m i ) with m i ∈ M ′ will, in<br />

expectation, and for non-overload situations, not exceed the maximum latency constraint<br />

lc with T L (m i ) ≤ lc.<br />

Pro<strong>of</strong>. Recall the worst case <strong>of</strong> T L (M ′ ) = lc with<br />

⌈ |M<br />

T L (M ′ ′ ⌉<br />

|<br />

) = · ∆tw + W (P ′ , k ′ ). (5.20)<br />

k ′<br />

There, the over- and underestimation <strong>of</strong> batch size k ′ does affect the execution time<br />

W (P ′ , k ′ ). Hence, the worst case is given, where ∆tw = W (P ′ , k ′ ). With regard to<br />

the uncertain arrival rate, the average discrete over- and under-estimation <strong>of</strong> k ′ is, in<br />

expectation, equal with<br />

∑<br />

k ′ i ≤E(k′ )<br />

(<br />

k<br />

′<br />

i · P (k ′ = k ′ i) ) = ∑<br />

k ′ i ≥E(k′ )<br />

(<br />

k<br />

′<br />

i · P (k ′ = k ′ i) ) (5.21)<br />

154

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!