25.01.2015 Views

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.5 Experimental Evaluation<br />

We conducted further experiments where we executed n = 100,000 instances <strong>of</strong> the<br />

plan P 8 and varied the scale factor data size d (in 100 kB) in order to simulate changing<br />

workload characteristics. Figure 3.28(a) illustrates the monitored plan execution times<br />

with annotated input data sizes d. <strong>Based</strong> on these statistics, we evaluated the influences<br />

<strong>of</strong> workload aggregation methods, their parameters, and <strong>of</strong> the sliding time window size<br />

in detail.<br />

(a) Execution Time<br />

(b) Workload Aggregation Method<br />

(c) EMA Smoothing Parameter α<br />

(d) Sliding Time Window Size ∆w<br />

Figure 3.28: Influence <strong>of</strong> Parameters on the Sensibility <strong>of</strong> Workload Adaptation<br />

First, Figure 3.28(b) shows the influence <strong>of</strong> the workload aggregation method, where<br />

we fixed ∆w = 1,000 s and illustrate the estimated costs continuously (∆t = 1 s). The<br />

real execution times contain several outliers and a major skew. Obviously, MA causes the<br />

slowest adaptation, while WMA and EMA cause faster adaptation. This is reasoned by the<br />

fact that MA (constant weights) computes a linear average, while WMA (linear weights) and<br />

EMA (exponential weights) compute weighted averages over the sliding window, where the<br />

latest items have higher influence. Further, LR causes the fastest adaptation due to extrapolation.<br />

However, it is important to note that LR tends to strongly over- and underestimate<br />

on abrupt workload shifts such as at the 80,000 th plan instance. The results support our<br />

decision to use the exponential moving average (EMA) as default aggregation method due<br />

to fast but robust workload adaptation. There, our default setting <strong>of</strong> the EMA parameter<br />

α = 0.002 was set empirically based on the observed variance <strong>of</strong> plan execution times<br />

83

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!