21.01.2014 Views

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

the probability for self-recovery is 50%. If PTR ( a p ) is twice as large as PTD ( a p ) ,<br />

i.e. T is equal to 1, the probability for self-recovery is 84.13% [87]. Note that in<br />

practice, historical data can be employed to discover <strong>and</strong> modify the actual<br />

probability distribution models. Nevertheless, the strategy proposed in this chapter<br />

can be applied in a similar manner.<br />

7.2.2 Temporal Violation H<strong>and</strong>ling Point Selection Strategy<br />

After the probability <strong>of</strong> self-recovery is defined, a probability threshold is required.<br />

The probability threshold can be regarded as the minimum confidence for skipping<br />

temporal violation h<strong>and</strong>ling on a selected checkpoint yet still retaining satisfactory<br />

temporal correctness. Given a probability threshold, a temporal violation h<strong>and</strong>ling<br />

point selection rule is defined as follows.<br />

Temporal Violation H<strong>and</strong>ling Point Selection Rule:<br />

At activity<br />

a p , with the probability <strong>of</strong> self-recovery P (T ) <strong>and</strong> the probability<br />

threshold PT ( 0 < PT < 1), the rule for temporal violation h<strong>and</strong>ling point selection<br />

is as follows: if<br />

P ( T ) > PT , then the current checkpoint is not selected as a h<strong>and</strong>ling<br />

point; otherwise, the current checkpoint is selected as a h<strong>and</strong>ling point.<br />

The probability threshold PT is an important parameter in the temporal<br />

violation h<strong>and</strong>ling point selection rule <strong>and</strong> it needs to be defined to facilitate<br />

violation h<strong>and</strong>ling point selection at workflow runtime. However, whether selfrecovery<br />

will be successful or not can only be determined after the execution <strong>of</strong> the<br />

subsequent activities. It is difficult, if not impossible, to specify the optimal<br />

probability threshold which can select the minimal number <strong>of</strong> h<strong>and</strong>ling points while<br />

maintaining satisfactory temporal correctness. To address this problem, we borrow<br />

the idea from adaptive testing where the new testing cases are generated based on<br />

the knowledge <strong>of</strong> previous testing cases [24]. In our strategy, the probability<br />

threshold can start from any moderate initial values, e.g. 0.3 or 0.5, <strong>and</strong> it is then<br />

adaptively modified based on the results <strong>of</strong> previous temporal violation h<strong>and</strong>ling<br />

along scientific workflow execution. Therefore, the initial threshold has limited<br />

impact on the effectiveness <strong>of</strong> our strategy. The process <strong>of</strong> adaptive modification for<br />

PT is described as follows.<br />

109

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!