21.01.2014 Views

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

To evaluate the average performance, 10 independent experiments with different<br />

scientific workflow sizes (ranging from 2,000 to 50,000 workflow activities with<br />

their mean activity durations between 30 to 3,000 time units) are executed 100 times<br />

each. All the activity durations are generated by the normal distribution model. The<br />

st<strong>and</strong>ard deviation is set as 10% <strong>of</strong> the mean activity duration to represent dynamic<br />

system performance. We have also implemented other representative distribution<br />

models such as exponential, uniform <strong>and</strong> a mixture <strong>of</strong> them. Since the experimental<br />

results are similar, this chapter only demonstrates the results with normal<br />

distribution.<br />

The constraint setting utilises the strategy introduced in [55] <strong>and</strong> the initial<br />

probability is set reasonably as 90% to serve as a type <strong>of</strong> QoS contract between<br />

users <strong>and</strong> service providers which is agreed at scientific workflow build time. Here<br />

the initial probability means that a scientific workflow has a 90% probability to<br />

finish on time, or in other words, 90% workflow instances can finish on time.<br />

Therefore, in our experiment, we specify the “satisfactory temporal correctness” as a<br />

temporal violation rate below 10% so as to meet the QoS contract. The average<br />

length <strong>of</strong> the workflow segments is set as 20 which is a moderate size for a<br />

workflow sub-process similar to those high-level activities depicted in Figure 1.1.<br />

Meanwhile, although under normal circumstances, the experiments can be<br />

conducted without noises (i.e. 0% noise), to simulate some worse case scenarios,<br />

r<strong>and</strong>om noises (i.e. a fixed rate <strong>of</strong> delays at r<strong>and</strong>om activities) are also injected to<br />

simulate extra delays accordingly along workflow execution due to potential<br />

unpredictable causes such as system overload <strong>and</strong> resource unavailability. For the<br />

comparison purpose, four rounds <strong>of</strong> simulation experiments with different r<strong>and</strong>om<br />

noise levels <strong>of</strong> 0%, 5%, 15% <strong>and</strong> 25% are conducted. Since the st<strong>and</strong>ard deviation<br />

<strong>of</strong> activity durations are set as 10% <strong>of</strong> the mean values, according to the “ 3 σ ” rule,<br />

there is a scarce chance that the noises, i.e. the delays, would exceed 30% <strong>of</strong> the<br />

mean durations [87]. Therefore, we set the upper bound <strong>of</strong> noises as 25% to<br />

investigate the effectiveness <strong>of</strong> our strategy under extreme situations. There are<br />

many temporal violation h<strong>and</strong>ling strategies available as will be introduced in<br />

Chapter 8. In our experiments, similar to the settings in Section 6.3.1, we use pseudo<br />

workflow local rescheduling with a reasonable average time compensation rate <strong>of</strong><br />

50% [56, 100]. The size <strong>of</strong> rescheduled workflow activities is r<strong>and</strong>omly selected up<br />

113

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!