25.01.2015 Views

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6.1 Motivation and Problem Description<br />

Receive (o1)<br />

[service: s3, out: msg1]<br />

Monitored Statistics (per plan instance):<br />

|dsout1(o1)|, W(o1)<br />

Selection (o2)<br />

[in: msg1, out: msg2]<br />

Selection (o3)<br />

[in: msg2, out: msg3]<br />

Selection (o4)<br />

[in: msg3, out: msg4]<br />

oc1<br />

oc2<br />

oc3<br />

|dsin1(o2)|, |dsout1(o2)|, W(o2)<br />

|dsin1(o3)|, |dsout1(o3)|, W(o3)<br />

|dsin1(o4)|, |dsout1(o4)|, W(o4)<br />

@type='SCM'<br />

Translation (o6)<br />

[in: msg4, out: msg5]<br />

Switch (o5)<br />

[in: msg4]<br />

oc4<br />

@type='MAT'<br />

Translation (o7)<br />

[in: msg4, out: msg5]<br />

|dsin1(o5)|, W(expA), W(expB)<br />

|dsin1(o6)|, |dsout1(o6)|, W(o6)<br />

|dsin1(o7)|, |dsout1(o7)|, W(o7)<br />

Assign (o8)<br />

[in: msg5, out: msg6]<br />

|dsin1(o8)|, |dsout1(o8)|, W(o8)<br />

Invoke (o9)<br />

[service s6, in: msg6]<br />

|dsin1(o9)|, W(o9)<br />

Figure 6.2: Example Plan P 5 and Monitored Statistics<br />

optimization algorithm A-PMO iterates over all operators and applies relevant optimization<br />

techniques according to the operator type.<br />

To summarize, the periodical re-optimization has several advantages that reason its<br />

application instead <strong>of</strong> existing approaches from the area <strong>of</strong> adaptive query processing.<br />

However, it exhibits four major drawbacks:<br />

Problem 6.1 (Drawbacks <strong>of</strong> Periodical Re-<strong>Optimization</strong> 18 ). First, the generic gathering<br />

<strong>of</strong> statistics for all operators leads to the maintenance <strong>of</strong> statistics that might not be used<br />

by the optimizer. Second, periodical re-optimization finds a new plan only if workload<br />

characteristics have changed. Otherwise, we trigger many unnecessary invocations <strong>of</strong> the<br />

optimizer that evaluates the complete search space. Third, if a workload change occurs,<br />

it takes a while until re-optimization is triggered. Thus, during this adaptation delay, we<br />

are using a suboptimal plan and we are missing optimization opportunities. Fourth, the<br />

parameter ∆t has high influence on optimization and execution times and hence, parameterization<br />

requires awareness <strong>of</strong> changing workloads. We already presented experiments to<br />

all <strong>of</strong> these four drawbacks in Section 3.5. Varying the parameter ∆t (e.g., Figure 3.21)<br />

showed the spectrum from high re-optimization overheads to a degradation <strong>of</strong> the execution<br />

time to the unoptimized case. In contrast, the generic gathering <strong>of</strong> statistics (e.g., Figure<br />

3.26) requires further discussion because the overhead <strong>of</strong> our estimator is negligible.<br />

However, including automatic parameter re-estimation techniques (e.g., for continuously<br />

computing the optimal smoothing constant α <strong>of</strong> EMA) or using more complex workload<br />

aggregation methods would significantly increase this overhead.<br />

The drawbacks <strong>of</strong> periodical re-optimization and other optimization models are reasoned<br />

by the underlying fundamental problem <strong>of</strong> the strict separation between optimization,<br />

execution and statistics monitoring, which prevents the exchange <strong>of</strong> detailed information<br />

about when and how to re-optimize. This problem <strong>of</strong> a black-box optimizer was recently reconsidered<br />

by Chaudhuri, who argued for rethinking the optimizer contract [Cha09] in the<br />

context <strong>of</strong> DBMS. With regard to related work <strong>of</strong> the area <strong>of</strong> adaptive query processing, as<br />

18 Potential alternatives <strong>of</strong> periodical re-optimization such as on-idle re-optimization (re-optimization on<br />

free cycles) or anticipatory re-optimization (prediction <strong>of</strong> workload shifts) also have these drawbacks.<br />

169

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!