01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

3 Estimating Data Parameters<br />

Making inferences about a population based upon a r<strong>and</strong>om sample is a major task<br />

in statistical analysis. Statistical inference comprehends two inter-related<br />

problems: parameter estimation <strong>and</strong> test of hypotheses. In this chapter, we describe<br />

the estimation of several distribution parameters, using sample estimates that were<br />

presented as descriptive statistics in the preceding chapter. Because these<br />

descriptive statistics are single values, determined by appropriate formulas, they<br />

are called point estimates. Appendix C contains an introductory survey on how<br />

such point estimators may be derived <strong>and</strong> which desirable properties they should<br />

have. In this chapter, we also introduce the notion <strong>and</strong> methodology of interval<br />

estimation. In this <strong>and</strong> later chapters, we always assume that we are dealing with<br />

r<strong>and</strong>om samples. By definition, in a r<strong>and</strong>om sample x1, …, xn from a population<br />

with probability density function fX(x), the r<strong>and</strong>om variables associated with the<br />

sample values, X1, …, Xn, are i.i.d., hence the r<strong>and</strong>om sample has a joint density<br />

given by:<br />

f x , x ,..., x ) = f ( x ) f ( x )... f ( x ) .<br />

X (<br />

1,<br />

X 2 ,..., X n 1 2 n X 1 X 2<br />

A similar result applies to the joint probability function when the variables are<br />

discrete. Therefore, we rule out sampling from a finite population without<br />

replacement since, then, the r<strong>and</strong>om variables X1, …, Xn are not independent.<br />

Note, also, that in the applications one must often carefully distinguish between<br />

target population <strong>and</strong> sampled population. For instance, sometimes in the<br />

newspaper one finds estimation results concerning the proportion of votes on<br />

political parties. These results are usually presented as estimates for the whole<br />

population of a given country. However, careful reading discloses that the sample<br />

(hopefully a r<strong>and</strong>om one) was drawn using a telephone enquiry from the<br />

population residing in certain provinces. Although the target population is the<br />

population of the whole country, any inference made is only legitimate for the<br />

sampled population, i.e., the population residing in those provinces <strong>and</strong> that use<br />

telephones.<br />

3.1 Point Estimation <strong>and</strong> Interval Estimation<br />

Imagine that someone wanted to weigh a certain object using spring scales. The<br />

object has an unknown weight, ω. The weight measurement, performed with the<br />

scales, has usually two sources of error: a calibration error, because of the spring’s<br />

X<br />

n

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!