21.01.2014 Views

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Figure 6.2 Checkpoint Selection with Different Noises<br />

The checkpoint selection results with different distribution models are depicted<br />

in Figure 6.3. The result with 100% normal distribution models is also included as a<br />

benchmark. Similarly to Figure 6.2, the number <strong>of</strong> checkpoint is growing<br />

accordingly with the increase <strong>of</strong> workflow size. However, given different<br />

combinations with different percentages <strong>of</strong> the three distribution models, the number<br />

<strong>of</strong> selected checkpoints is very close to each other. For example, when the number<br />

<strong>of</strong> workflow activities is 6,000, the numbers <strong>of</strong> checkpoints are 298, 294, 293, 292,<br />

286 <strong>and</strong> 287 for 100%Norm, MIX(50%Norm, 30%Uni, 20%Exp), MIX(40%Norm,<br />

40%Uni, 20%Exp), MIX(30%Norm, 40%Uni, 30%Exp), MIX(20%Norm, 40%Uni,<br />

40%Exp) <strong>and</strong> MIX(10%Norm, 50%Uni, 40%Exp) respectively. The maximum<br />

difference is 11. Therefore, even if we regard that all the activity durations follow<br />

normal distribution models, the lowest accuracy rate <strong>of</strong> our checkpoint selection<br />

strategy is 1-(11/298)=0.96, i.e. 96%. Specifically, the lowest accuracy rates for the<br />

10 scientific workflows with workflow sizes ranging from 200 to 10,000 are 91%,<br />

93%, 96%, 95%, 96%, 96%, 95%, 96%, 95% <strong>and</strong> 96%. Therefore, given our<br />

checkpoint selection strategy, it can be observed that when the workflow size is<br />

large enough, the accuracy rate is stable no matter what the actual underlying<br />

distribution models are. However, since the real world system environments<br />

normally cannot be simulated exactly by a single distribution model or even several<br />

mixed distribution models, the experiments here cannot evaluate our strategy under<br />

all possible circumstances but selected representatives. The idea here is to<br />

demonstrate that the performance <strong>of</strong> our strategy will not be affected significantly by<br />

the underlying distribution models, although it is built on the normal distribution<br />

models.<br />

102

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!