
7.1.2 Estimating the Regression Function


A popular method of estimating the regression function parameters is to use a least square error (LSE) approach, by minimising the total sum of the squares of the errors (deviations) between the observed values $y_i$ and the estimated values $b_0 + b_1 x_i$:

$E = \sum_{i=1}^{n} \varepsilon_i^2 = \sum_{i=1}^{n} (y_i - b_0 - b_1 x_i)^2$ .    7.2

where $b_0$ and $b_1$ are estimates of $\beta_0$ and $\beta_1$, respectively.
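As a minimal sketch in R (one of the tools covered in this book), assuming a small invented data set, E can be evaluated for any candidate pair $(b_0, b_1)$; the LSE solution is the pair that minimises this function:

x <- c(1, 2, 3, 4, 5)                     # hypothetical predictor values
y <- c(2.1, 3.9, 6.2, 8.1, 9.8)           # hypothetical observed values

# Sum of squared errors E of equation 7.2 for a candidate pair (b0, b1).
sse <- function(b0, b1, x, y) sum((y - b0 - b1 * x)^2)

sse(0, 2, x, y)                           # E for the candidate line y = 0 + 2x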

In order to apply the LSE method one starts by differentiating E with respect to $b_0$ and $b_1$ and setting the derivatives to zero, obtaining the so-called normal equations:

$\begin{cases} \sum y_i = n b_0 + b_1 \sum x_i \\ \sum x_i y_i = b_0 \sum x_i + b_1 \sum x_i^2 \end{cases}$ ,    7.3
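As a sketch with the same hypothetical data as above, the normal equations can be written as a 2-by-2 linear system and solved directly in R with solve():

x <- c(1, 2, 3, 4, 5); y <- c(2.1, 3.9, 6.2, 8.1, 9.8)   # hypothetical data as before
n <- length(x)
# Coefficient matrix and right-hand side of the normal equations 7.3.
A   <- matrix(c(n,      sum(x),
                sum(x), sum(x^2)), nrow = 2, byrow = TRUE)
rhs <- c(sum(y), sum(x * y))
solve(A, rhs)                             # first element is b0, second is b1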

where the summations, from now on, are always assumed to be for the n predictor values. By solving the normal equations, the following parameter estimates, $b_0$ and $b_1$, are derived:

$b_1 = \dfrac{\sum (x_i - \bar{x})(y_i - \bar{y})}{\sum (x_i - \bar{x})^2}$ .    7.4

$b_0 = \bar{y} - b_1 \bar{x}$ .    7.5
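A sketch of these closed-form estimates in R, with the same hypothetical data; the result can be checked against R's built-in lm() function, which fits the same model:

x <- c(1, 2, 3, 4, 5); y <- c(2.1, 3.9, 6.2, 8.1, 9.8)   # hypothetical data as before
b1 <- sum((x - mean(x)) * (y - mean(y))) / sum((x - mean(x))^2)   # equation 7.4
b0 <- mean(y) - b1 * mean(x)                                      # equation 7.5
c(b0, b1)
coef(lm(y ~ x))                           # lm() yields the same two estimates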

The least square estimates of the linear regression parameters enjoy a number of desirable properties (checked numerically in the sketch after this list):

i. The parameters $b_0$ and $b_1$ are unbiased estimates of the true parameters $\beta_0$ and $\beta_1$ ($E[b_0] = \beta_0$, $E[b_1] = \beta_1$), and have minimum variance among all unbiased linear estimates.

ii. The predicted (or fitted) values $\hat{y}_i = b_0 + b_1 x_i$ are point estimates of the true, observed values, $y_i$. The same is valid for the whole relation $\hat{Y} = b_0 + b_1 X$, which is the point estimate of the mean response $E[Y]$.

iii. The regression line always goes through the point $(\bar{x}, \bar{y})$.

iv. The computed errors $e_i = y_i - \hat{y}_i = y_i - b_0 - b_1 x_i$, called the residuals, are point estimates of the error values $\varepsilon_i$. The sum of the residuals is zero: $\sum e_i = 0$.

v. The residuals are uncorrelated with the predictor and the predicted values: $\sum e_i x_i = 0$; $\sum e_i \hat{y}_i = 0$.

vi. $\sum y_i = \sum \hat{y}_i \Rightarrow \bar{y} = \bar{\hat{y}}$, i.e., the predicted values have the same mean as the observed values.
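These properties can be verified numerically; a sketch in R with the same hypothetical data (the stated equalities hold up to floating-point rounding):

x <- c(1, 2, 3, 4, 5); y <- c(2.1, 3.9, 6.2, 8.1, 9.8)   # hypothetical data as before
b <- coef(lm(y ~ x)); b0 <- b[1]; b1 <- b[2]
yhat <- b0 + b1 * x                       # fitted values (property ii)
e    <- y - yhat                          # residuals (property iv)
sum(e)                                    # ~ 0 (property iv)
c(sum(e * x), sum(e * yhat))              # ~ 0 (property v)
c(mean(y), mean(yhat))                    # equal means (property vi)
mean(y) - (b0 + b1 * mean(x))             # ~ 0: line goes through (x-bar, y-bar) (property iii)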
