01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

12 1 Introduction<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

P(x)<br />

F(x)<br />

1 2 3 4 5<br />

Figure 1.5. Probability <strong>and</strong> distribution functions for Example 1.2, assuming that<br />

the frequencies are correct estimates of the probabilities.<br />

Several discrete distributions are described in Appendix B. An important one,<br />

since it occurs frequently in statistical studies, is the binomial distribution. It<br />

describes the probability of occurrence of a “success” event k times, in n<br />

independent trials, performed in the same conditions. The complementary “failure”<br />

event occurs, therefore, n – k times. The probability of the “success” in a single<br />

trial is denoted p. The complementary probability of the failure is 1 – p, also<br />

denoted q. Details on this distribution can be found in Appendix B. The respective<br />

probability function is:<br />

⎛ n⎞<br />

k n−k<br />

⎛ n⎞<br />

k n−k<br />

P(<br />

X = k)<br />

= ⎜ p − p = p q<br />

k ⎟ ( 1 ) ⎜<br />

k ⎟ . 1.1<br />

⎝ ⎠<br />

⎝ ⎠<br />

1.4.2 Continuous Variables<br />

We now consider a dataset involving a continuous r<strong>and</strong>om variable. Since the<br />

variable can assume an infinite number of possible values, the probability<br />

associated to each particular value is zero. Only probabilities associated to intervals<br />

of the variable domain can be non-zero. For instance, the probability that a gunshot<br />

hits a particular point in a target is zero (the variable domain is here two-<br />

dimensional). However, the probability that it hits the “bull’s-eye” area is non-zero.<br />

For a continuous variable, X (with value denoted by the same lower case letter,<br />

x), one can assign infinitesimal probabilities ∆p(x) to infinitesimal intervals ∆x:<br />

∆ p( x)<br />

= f ( x)<br />

∆x<br />

, 1.2<br />

where f(x) is the probability density function, computed at point x.<br />

For a finite interval [a, b] we determine the corresponding probability by adding<br />

up the infinitesimal contributions, i.e., using:<br />

∫<br />

b<br />

a<br />

P ( a < X ≤ b)<br />

= f ( x)<br />

dx . 1.3<br />

x

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!