Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...
Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...
Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Figure 6.2 Checkpoint Selection with Different Noises<br />
The checkpoint selection results with different distribution models are depicted<br />
in Figure 6.3. The result with 100% normal distribution models is also included as a<br />
benchmark. Similarly to Figure 6.2, the number <strong>of</strong> checkpoint is growing<br />
accordingly with the increase <strong>of</strong> workflow size. However, given different<br />
combinations with different percentages <strong>of</strong> the three distribution models, the number<br />
<strong>of</strong> selected checkpoints is very close to each other. For example, when the number<br />
<strong>of</strong> workflow activities is 6,000, the numbers <strong>of</strong> checkpoints are 298, 294, 293, 292,<br />
286 <strong>and</strong> 287 for 100%Norm, MIX(50%Norm, 30%Uni, 20%Exp), MIX(40%Norm,<br />
40%Uni, 20%Exp), MIX(30%Norm, 40%Uni, 30%Exp), MIX(20%Norm, 40%Uni,<br />
40%Exp) <strong>and</strong> MIX(10%Norm, 50%Uni, 40%Exp) respectively. The maximum<br />
difference is 11. Therefore, even if we regard that all the activity durations follow<br />
normal distribution models, the lowest accuracy rate <strong>of</strong> our checkpoint selection<br />
strategy is 1-(11/298)=0.96, i.e. 96%. Specifically, the lowest accuracy rates for the<br />
10 scientific workflows with workflow sizes ranging from 200 to 10,000 are 91%,<br />
93%, 96%, 95%, 96%, 96%, 95%, 96%, 95% <strong>and</strong> 96%. Therefore, given our<br />
checkpoint selection strategy, it can be observed that when the workflow size is<br />
large enough, the accuracy rate is stable no matter what the actual underlying<br />
distribution models are. However, since the real world system environments<br />
normally cannot be simulated exactly by a single distribution model or even several<br />
mixed distribution models, the experiments here cannot evaluate our strategy under<br />
all possible circumstances but selected representatives. The idea here is to<br />
demonstrate that the performance <strong>of</strong> our strategy will not be affected significantly by<br />
the underlying distribution models, although it is built on the normal distribution<br />
models.<br />
102