01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Exercises

5.23 Run the non-parametric counterparts of the tests used in Exercises 4.9, 4.10 and 4.20. Compare the results and the power of the tests with those obtained using parametric tests.

5.24 Using appropriate non-parametric tests, determine which variables of the Wines’ dataset are most discriminative of the white from the red wines.

5.25 The Neonatal dataset contains mortality data for deliveries taking place at home (MH) and at a Health Centre (MI). Assess whether there are significant differences at the 5% level between delivery conditions, using the sign and the Wilcoxon tests.
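
As a rough illustration of the sign test mentioned in Exercise 5.25, here is a minimal Python sketch that uses only the standard library. The paired values are invented for illustration and are not taken from the Neonatal dataset; the function name `sign_test` and the choice to drop tied pairs are assumptions of this sketch.

```python
from math import comb

def sign_test(x, y):
    """Two-sided exact sign test for paired samples; tied pairs are dropped."""
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    n_pos = sum(d > 0 for d in diffs)
    k = min(n_pos, n - n_pos)
    # P(X <= k) for X ~ Binomial(n, 0.5), doubled for a two-sided test
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return n_pos, n, min(1.0, 2 * tail)

# Hypothetical paired values (NOT the Neonatal data).
mh = [9.1, 7.4, 8.2, 6.9, 10.3, 8.8, 7.7, 9.5]
mi = [6.0, 6.8, 7.1, 7.3, 8.9, 7.2, 6.5, 8.1]

n_pos, n, p_value = sign_test(mh, mi)
print(f"positive differences: {n_pos} of {n}, p = {p_value:.4f}")
print("reject H0 at 5%" if p_value < 0.05 else "do not reject H0 at 5%")
```

The Wilcoxon signed-rank test also uses the magnitudes of the differences, so it may reach a different conclusion from the sign test on the same data.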

5.26 Consider the Firms’ dataset containing productivity figures (P) for a sample of Portuguese firms in four branches of activity (BRANCH). Study the dataset in order to:
a) Assess at the 5% significance level whether there are significant differences among the productivity medians of the four branches.
b) Assess at the 1% significance level whether Commerce and Industry have significantly different medians.
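
One common choice for comparing several independent groups, as in part a) of Exercise 5.26, is the Kruskal-Wallis test. The sketch below is a standard-library Python illustration under stated assumptions: the branch labels and productivity values are invented (not the Firms’ data), the helper names are hypothetical, and the p-value relies on the chi-square approximation with 3 degrees of freedom, which is only rough for samples this small.

```python
from math import erfc, exp, pi, sqrt

def midranks(values):
    """1-based ranks; tied values receive the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic with the usual correction for ties."""
    pooled = [v for g in groups for v in g]
    n = len(pooled)
    ranks = midranks(pooled)
    total, start = 0.0, 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    h = 12 / (n * (n + 1)) * total - 3 * (n + 1)
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    return h / correction

def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variable with 3 degrees of freedom."""
    return erfc(sqrt(x / 2)) + sqrt(2 * x / pi) * exp(-x / 2)

# Hypothetical productivity values for four branches (NOT the Firms' data).
branches = {
    "A": [1.2, 1.5, 1.9],
    "B": [2.1, 2.4, 2.8],
    "C": [3.0, 3.3, 3.7],
    "D": [4.1, 4.6, 5.0],
}

h_stat = kruskal_wallis_h(list(branches.values()))
p_value = chi2_sf_df3(h_stat)
print(f"H = {h_stat:.3f}, approximate p = {p_value:.4f}")
```

For part b), which involves only two branches, a two-sample rank test such as Mann-Whitney would be the natural analogue.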

5.27 Apply the appropriate non-parametric test in order to rank the discriminative capability of the features used to characterise the tissue types in the Breast Tissue dataset.

5.28 Redo Exercise 5.27 for the CTG dataset and the three-class discrimination expressed by the grouping variable NSP.

5.29 Consider the discrimination of the three clay types based on the sample data of the Clays’ dataset. Show that the null hypothesis of equal medians for the three clay types is:
a) Rejected with more than 95% confidence for all grading variables (LG, MG, HG).
b) Not rejected for the iron oxide features.
c) Rejected with higher confidence for the lime (CaO) than for the silica (SiO2).

5.30 The FHR dataset contains measurements of basal heart rate performed by three human experts and an automatic diagnostic system. Assess whether the null hypothesis of equal median measurements can be accepted at the 5% significance level for the three human experts and the automatic diagnostic system.

5.31 When analysing the contents of questions Q4, Q5, Q6 and Q7, someone said that “these questions are essentially evaluating the same thing”. Assess whether this statement can be accepted at a 5% significance level. Compute the coefficient of agreement κ and discuss its significance.

5.32 The Programming dataset contains results of an enquiry regarding freshmen’s previous knowledge of programming (PROG), Boole’s Algebra (AB), binary arithmetic (BA) and computer hardware (H). Consider the variables PROG, AB, BA and H dichotomised in a “yes/no” fashion. Can one reject with 99% confidence the hypothesis that the four dichotomised variables essentially evaluate the same thing?
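
For several related dichotomous variables, as in Exercise 5.32, one possible test is Cochran's Q; whether this is the test the exercise intends is an assumption of this sketch. The Python code below uses only the standard library, the 0/1 answers are invented (not the Programming dataset), and the p-value uses the chi-square approximation with 3 degrees of freedom, valid for four variables only.

```python
from math import erfc, exp, pi, sqrt

def cochran_q(rows):
    """Cochran's Q for a list of 0/1 rows (one row per respondent)."""
    k = len(rows[0])
    col_totals = [sum(row[j] for row in rows) for j in range(k)]
    row_totals = [sum(row) for row in rows]
    grand = sum(row_totals)
    numerator = (k - 1) * (k * sum(c ** 2 for c in col_totals) - grand ** 2)
    denominator = k * grand - sum(r ** 2 for r in row_totals)
    return numerator / denominator

def chi2_sf_df3(x):
    """Upper-tail probability of a chi-square variable with 3 degrees of freedom."""
    return erfc(sqrt(x / 2)) + sqrt(2 * x / pi) * exp(-x / 2)

# Hypothetical yes/no answers (1 = yes) in the order PROG, AB, BA, H.
answers = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 0, 0, 0],
]

q_stat = cochran_q(answers)
p_value = chi2_sf_df3(q_stat)
print(f"Q = {q_stat:.3f}, approximate p = {p_value:.4f}")
print("reject H0 at 1%" if p_value < 0.01 else "do not reject H0 at 1%")
```

Rejecting with 99% confidence corresponds to requiring a p-value below 0.01. The denominator is zero when every respondent gives the same answer to all four questions, a case this sketch does not handle.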
