01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

2 1 Introduction<br />

In the case of the body fall there is a law that allows the exact computation of<br />

one of the variables h or t (for given h0 <strong>and</strong> g) as a function of the other one.<br />

Moreover, if we repeat the body-fall experiment under identical conditions, we<br />

consistently obtain the same results, within the precision of the measurements.<br />

These are the attributes of deterministic data: the same data will be obtained,<br />

within the precision of the measurements, under repeated experiments in welldefined<br />

conditions.<br />

Imagine now that we were dealing with Stock Exchange data, such as, for<br />

instance, the daily share value throughout one year of a given company. For such<br />

data there is no known law to describe how the share value evolves along the year.<br />

Furthermore, the possibility of experiment repetition with identical results does not<br />

apply here. We are, thus, in presence of what is called r<strong>and</strong>om data.<br />

Classical examples of r<strong>and</strong>om data are:<br />

− Thermal noise generated in electrical resistances, antennae, etc.;<br />

− Brownian motion of tiny particles in a fluid;<br />

− Weather variables;<br />

− Financial variables such as Stock Exchange share values;<br />

− Gambling game outcomes (dice, cards, roulette, etc.);<br />

− Conscript height at military inspection.<br />

In none of these examples can a precise mathematical law describe the data.<br />

Also, there is no possibility of obtaining the same data in repeated experiments,<br />

performed under similar conditions. This is mainly due to the fact that several<br />

unforeseeable or immeasurable causes play a role in the generation of such data.<br />

For instance, in the case of the Brownian motion, we find that, after a certain time,<br />

the trajectories followed by several particles that have departed from exactly the<br />

same point, are completely different among them. Moreover it is found that such<br />

differences largely exceed the precision of the measurements.<br />

When dealing with a r<strong>and</strong>om dataset, especially if it relates to the temporal<br />

evolution of some variable, it is often convenient to consider such dataset as one<br />

realization (or one instance) of a set (or ensemble) consisting of a possibly infinite<br />

number of realizations of a generating process. This is the so-called r<strong>and</strong>om<br />

process (or stochastic process, from the Greek “stochastikos” = method or<br />

phenomenon composed of r<strong>and</strong>om parts). Thus:<br />

− The w<strong>and</strong>ering voltage signal one can measure in an open electrical<br />

resistance is an instance of a thermal noise process (with an ensemble of<br />

infinitely many continuous signals);<br />

− The succession of face values when tossing n times a die is an instance of a<br />

die tossing process (with an ensemble of finitely many discrete sequences).<br />

− The trajectory of a tiny particle in a fluid is an instance of a Brownian<br />

process (with an ensemble of infinitely many continuous trajectories);

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!