21.01.2014 Views

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

Xiao Liu PhD Thesis.pdf - Faculty of Information and Communication ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

4.2 Related Work <strong>and</strong> Problem Analysis<br />

4.2.1 Related Work<br />

In this thesis, we focus on time-series based interval forecasting strategies. Five<br />

service performance estimation approaches including simulation, analytical<br />

modelling, historical data, on-line learning <strong>and</strong> hybrid, are presented in [98]. In<br />

general, time-series forecasting belongs to historical data based approaches.<br />

Currently, much work utilises multivariate models <strong>and</strong> focuses on the prediction <strong>of</strong><br />

CPU load since it is the dominant factor for the durations <strong>of</strong> computation intensive<br />

activities. Typical linear time-series models such as AR, MA <strong>and</strong> Box-Jenkins are<br />

fitted to perform host load prediction in [31] where it claims that most time-series<br />

models are sufficient for host load prediction <strong>and</strong> recommends AR(16) which<br />

consumes miniscule computation cost. The work in [52] proposes a one-step-ahead<br />

<strong>and</strong> low-overhead time-series forecasting strategy which tracks recent trends by<br />

giving more weight to recent data. A hybrid model which integrates the AR model<br />

with confidence interval is proposed in [97] to perform n-step-ahead load prediction.<br />

A different strategy proposed in [85] predicts application duration with historical<br />

information where search techniques are employed to determine those application<br />

characteristics that yield the best definition <strong>of</strong> similarity. The work in [3]<br />

investigates extended forecast <strong>of</strong> both CPU <strong>and</strong> network load on computation grid to<br />

achieve effective load balancing. The authors in [97] mention the problem <strong>of</strong> turning<br />

points which causes large prediction errors to time-series models. However, few<br />

solutions have been proposed to investigate more affecting factors <strong>and</strong> h<strong>and</strong>le<br />

turning points. Different from the above work, there are a few literatures utilise<br />

univariate models. NWS [93] implements three types <strong>of</strong> univariate time-series<br />

forecasting methods including mean-based, median-based <strong>and</strong> autoregressive<br />

methods, <strong>and</strong> it dynamically chooses the one exhibiting the lowest cumulative error<br />

measure at any given moment to generate a forecast. Hence, NWS automatically<br />

identifies the best forecasting methods for any given resource. The work in [33]<br />

proposes a Dynamic Exponential Smoothing (DES) method which forecasts the<br />

future duration only based on its past values. The basic motivation for DES is to<br />

h<strong>and</strong>le the problems <strong>of</strong> sudden peaks <strong>and</strong> level switches. DES computes the optimal<br />

43

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!