Contents Telektronikk - Telenor
Contents Telektronikk - Telenor
Contents Telektronikk - Telenor
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
200<br />
˜X = X + β(C − E(C))<br />
(5.1)<br />
where β is -1 in the simple case where X<br />
and C are positively correlated and in the<br />
same order of magnitude. If not, then β<br />
can be picked according to some optimum<br />
criteria, see [16] for details on what<br />
is called regression-adjusted control.<br />
It is useful to distinguish between internal<br />
and external controls:<br />
(i) Internal control<br />
The random variable C with the known<br />
expectation occurs in the same computation/simulation<br />
as X, e.g. C is the<br />
average queuing delay and X is the<br />
probability of full system.<br />
(ii) External control<br />
The random variable C with the known<br />
expectation occurs in a separate computation/simulation<br />
in addition to the<br />
one where X is recorded. For example,<br />
if we are interested in estimating X<br />
which is the probability of full system<br />
in a G/G/13-queue, we can claim that it<br />
may be correlated to the probability of<br />
full system in a M/M/14-queue, and let<br />
C be this quantity.<br />
External control has the disadvantage<br />
that extra computations are necessary to<br />
obtain the control. A general disadvantage<br />
with control variables is that it is<br />
very difficult to find a proper control<br />
variable.<br />
5.3 Rare event provoking<br />
All three methods in these first subsections<br />
are means for controlling the variability<br />
of a set of observations. It gives us<br />
no means for manipulating the processes<br />
generating the observations, e.g. to increase<br />
the sampling of the rare sequences<br />
of event that are of vital importance to<br />
the performance measure such as probability<br />
of full system in orders less than<br />
e.g. 10-9 . The next three subsections will<br />
present techniques that do.<br />
5.3.1 Stratified sampling<br />
Probably the most frequent use of stratified<br />
sampling is in a Gallup poll. The<br />
population is divided into strata in<br />
3 General arrivals and server distributions.<br />
4 Poisson arrival and neg.exp. service<br />
time distributions, easy computational,<br />
see reference example.<br />
accordance to some descriptives such as<br />
sex, age, position, education, geographical<br />
address, etc. To get a proper impression<br />
of the public opinion, a fair sample<br />
from each strata must be drawn.<br />
Applied in network performance evaluation,<br />
stratified sampling is best explained<br />
through an example, details of the method<br />
can for instance be found in Chapter<br />
11.5 in [16]. Consider the reference<br />
example M/M/1 from Section 5.1, only<br />
this time with k servers, i.e. an M/M/ksystem.<br />
The number of servers vary randomly<br />
following a known distribution,<br />
i.e. we know the probability of having k<br />
servers. Assume that we want to estimate<br />
a quantity X that is positively correlated<br />
to the number of servers K, which is the<br />
intermediary variable. We get a reduction<br />
in the variance of X if we partition<br />
(stratifies) the total calculation into strata<br />
according to the possible outcome of the<br />
intermediary variables, and then estimate<br />
X for a given stratum (e.g. a fixed number<br />
of servers active).<br />
For example, let k vary from 1 to 4 and<br />
suppose we want to estimate the probability<br />
of waiting time larger than 5 minutes<br />
(which is most likely to occur when<br />
only one server is active). We partition<br />
the number of simulation replica into 4<br />
strata, and see to that most of the simulation<br />
effort are concentrated on the strata<br />
containing 1 and 2 servers.<br />
The example illustrates the main problem<br />
with stratified sampling, namely how to<br />
divide the simulation replica into strata,<br />
and how to define the optimal number of<br />
simulation replica in each stratum, and<br />
finally how to calculate the strata probabilities.<br />
Recently, it is published a variant of<br />
stratified sampling, called transition<br />
splitting [23]. This technique is extremely<br />
efficient on models as the reference<br />
example, because it uses all available<br />
exact knowledge and leaves very little to<br />
be simulated. In the M/M/1 case the<br />
quantity of interest is known, and therefore<br />
the results of Section 5.4 should be<br />
very flattering and in favour of transition<br />
splitting. The basic idea is simply to<br />
assume that in every simulation replica<br />
(using regenerative simulation, see e.g.<br />
[24]) he starts with a sequence of upward<br />
transitions from empty until a full system<br />
is reached, and then simulate the<br />
sequence of events (transitions) from this<br />
point and back to an empty system again.<br />
The sequence of upward transitions until<br />
full system without returning to an empty<br />
system first, is the only stratum in this<br />
simple example. The probability of this<br />
stratum can in this simple case be calculated<br />
exact, e.g. by first step analysis<br />
[25]. Details on transition splitting can<br />
found in [23].<br />
Generally, it is very hard to find a way to<br />
do this transition splitting and to define<br />
the strata involved. Additionally, it is not<br />
a trivial matter to calculate the probability<br />
of each stratum. In fact, in our reference<br />
example it is more demanding to<br />
calculate the stratum probability than to<br />
find the exact state probabilities.<br />
5.3.2 Importance sampling<br />
Importance sampling (IS) is a technique<br />
for variance reduction in computer simulation<br />
where the sampling of the outcome<br />
is made in proportion with its relative<br />
importance on the result. For a general<br />
introduction see for instance [16] Chapter<br />
11.<br />
Importance sampling was originally proposed<br />
as a method for Monte Carlo integration5<br />
, see e.g. [26]. Later, the method<br />
has been applied to several areas. The<br />
relevant applications in this paper is<br />
assessments within the dependability<br />
[27,28] and teletraffic area [29,30,31]. In<br />
[32] IS is proposed as a method to provoke<br />
cell losses by a synthetic traffic<br />
generator like the STG developed at the<br />
PARASOL project. The method should<br />
be applicable to both real measurement<br />
on ATM equipment as well as simulation,<br />
see Section 6 and [33].<br />
The basic idea of importance sampling is<br />
to change the underlying stochastic process<br />
in a way that makes the rare events<br />
of interest (i.e. that is important to the<br />
performance quantity of interest) to<br />
appear more frequently. The observations<br />
made under these conditions must of<br />
course be corrected in a way that makes<br />
them look like being generated by the<br />
original process and produce unbiased<br />
estimates.<br />
By considering the reference example<br />
once more, we soon get an idea of how<br />
5 Make large number of “shots” at a<br />
value domain and count the number of<br />
“hits” in the region bounded by the<br />
function to be integrated. The ratio of<br />
hits to shots is the integration of the<br />
bounded region. This is a helpful method<br />
in cases where an analytic or<br />
numerical solution is difficult or<br />
impossible, e.g. due to discontinuity.