17.10.2014 Views

Question 1 [22 Marks] In the 1974 Motor Trend US magazine, fuel ...

Question 1 [22 Marks] In the 1974 Motor Trend US magazine, fuel ...

Question 1 [22 Marks] In the 1974 Motor Trend US magazine, fuel ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Question</strong> 1 [<strong>22</strong> <strong>Marks</strong>]<br />

<strong>In</strong> <strong>the</strong> <strong>1974</strong> <strong>Motor</strong> <strong>Trend</strong> <strong>US</strong> <strong>magazine</strong>, <strong>fuel</strong> consumption (mpg in miles/<strong>US</strong> gallon) was related<br />

to several aspects of automobile design and performance for 32 automobiles (1973-74<br />

models). We consider three of <strong>the</strong> variables, displacement (disp in cu.in), horse power (hp)<br />

and weight (wt, lb/1000). The data is plotted below toge<strong>the</strong>r with <strong>the</strong> log transformed<br />

data.<br />

100 300 2 3 4 5<br />

4.5 5.0 5.5 6.0 0.4 0.8 1.2 1.6<br />

mpg<br />

10 15 20 25 30<br />

logmpg<br />

2.4 2.8 3.2<br />

100 300<br />

disp<br />

4.5 5.0 5.5 6.0<br />

logdisp<br />

hp<br />

50 150 250<br />

loghp<br />

4.0 4.5 5.0 5.5<br />

2 3 4 5<br />

wt<br />

0.4 0.8 1.2 1.6<br />

logwt<br />

10 15 20 25 30<br />

50 150 250<br />

2.4 2.8 3.2<br />

4.0 4.5 5.0 5.5<br />

(i) From <strong>the</strong> graphs it was decided to use <strong>the</strong> log transformed variables in <strong>the</strong> analysis.<br />

Explain why you think this may or may not be a good idea.


(ii) The following model was fitted<br />

logmpg = µ + β 1 logdisp + β 2 loghp + β 3 logwt + error<br />

Complete <strong>the</strong> summary of <strong>the</strong> fit that follows by replacing <strong>the</strong> ∗’s.<br />

Coefficients:<br />

Estimate Std. Error t value Pr(>|t|)<br />

(<strong>In</strong>tercept) 4.9462 0.2687 18.41


<strong>Question</strong> 2 [30 <strong>Marks</strong>]<br />

(This question has been modified as <strong>the</strong> original question was not suitable for <strong>the</strong> unit as<br />

presented in 2002.)<br />

The original data set in question 1 also contains <strong>the</strong> number of cylinders (cyl: 4, 6 or 8) for<br />

each car tested. Since it is quite possible <strong>the</strong> relationship between mpg and displacement<br />

will also depend on <strong>the</strong> number of cylinders it was decided to examine this relationship<br />

separately.<br />

Miles/<strong>US</strong> gallon<br />

10 15 20 25 30<br />

four<br />

six<br />

eight<br />

100 200 300 400<br />

displacement<br />

The following models were fitted and <strong>the</strong> output from R is reported:<br />

Model 1: (3 intercepts and 3 slopes)<br />

Analysis of Variance Table<br />

Response: mpg<br />

Df Sum Sq Mean Sq F value Pr(>F)<br />

cyl 2 824.78 412.39 73.3<strong>22</strong>1 2.050e-11<br />

disp 1 57.64 57.64 10.2487 0.003591<br />

cyl:disp 2 97.39 48.69 8.6574 0.001313<br />

Residuals 26 146.23 5.62


Model 2: 3 intercepts, 1 common slope (parallel lines).<br />

Analysis of Variance Table<br />

Response: mpg<br />

Df Sum Sq Mean Sq F value Pr(>F)<br />

cyl 3 13741 4580 526.434 |t|)<br />

(<strong>In</strong>tercept) 40.87196 3.02012 13.533 2.79e-13<br />

cyl6 -21.78997 5.30660 -4.106 0.000354<br />

cyl8 -18.83916 4.61166 -4.085 0.000374<br />

disp -0.13514 0.02791 -4.842 5.10e-05<br />

cyl6:disp 0.13875 0.03635 3.817 0.000753<br />

cyl8:disp 0.11551 0.02955 3.909 0.000592<br />

Residual standard error: 2.372 on 26 degrees of freedom<br />

Multiple R-Squared: 0.8701, Adjusted R-squared: 0.8452<br />

F-statistic: 34.84 on 5 and 26 DF, p-value: 9.968e-11<br />

(a) Write down <strong>the</strong> equation of <strong>the</strong> three lines.<br />

(b) Test whe<strong>the</strong>r <strong>the</strong> slopes of <strong>the</strong> lines for <strong>the</strong> 6 and 8 cylinder cars are different<br />

from <strong>the</strong> slope of <strong>the</strong> line for <strong>the</strong> 4 cylinder cars.<br />

(c) Verify <strong>the</strong> 95% CI for <strong>the</strong> slope of <strong>the</strong> line for four cylinder cars is given by<br />

(−0.193, −0.078).<br />

(d) If <strong>the</strong> 95% CI for <strong>the</strong> slope of <strong>the</strong> line for six cylinder cars is (−0.044, 0.051)<br />

and that for eight cylinder cars is (−0.040, 0.0003) what can you say about <strong>the</strong><br />

relationship of <strong>the</strong> slopes of <strong>the</strong> three lines?


(iii) The mpg was estimated for cars with a displacement of 150 cu.in and having 4, 6<br />

and 8 cylinders using R.<br />

$fit<br />

1 2 3<br />

* 19.6<strong>22</strong>8 19.0877<br />

$se.fit<br />

1 2 3<br />

1.44190 1.18564 2.07059<br />

Comment on <strong>the</strong> results after first replacing <strong>the</strong> missing value (for 4 cylinder cars).<br />

(iv) With <strong>the</strong> aid of <strong>the</strong> following plots comment on <strong>the</strong> assumptions for <strong>the</strong> model and<br />

<strong>the</strong> fit of <strong>the</strong> line.<br />

Residual Plot<br />

Normal Q−Q Plot<br />

Residual<br />

−2 0 2 4<br />

Sample Quantiles<br />

−2 0 2 4<br />

15 20 25 30<br />

Fitted<br />

−2 −1 0 1 2<br />

Theoretical Quantiles


<strong>Question</strong> 3 25 <strong>Marks</strong>]<br />

An animal experiment is designed to investigate whe<strong>the</strong>r levorphanol reduces stress as<br />

reflected in <strong>the</strong> cortical sterone level. There were four treatment groups containing five<br />

animals each. The four treatments consisted of a control (C), levorphanol only (L),<br />

epinephrine only (E), and levorphanol plus epinephrine (LE).<br />

(i) Complete <strong>the</strong> ANOVA table below.<br />

Analysis of Variance Table<br />

Response: cs<br />

Df Sum Sq Mean Sq F value Pr(>F)<br />

treat * 37.58 * * 2e-04<br />

Residuals 16 16.30 *<br />

(ii) Write down and test a suitable hypo<strong>the</strong>sis concerning treatments.<br />

(iii) Write down three contrasts to test<br />

(a) The main effect of levorphanol (levor)<br />

(b) The main effect of epinephrine (epin)<br />

(c) <strong>the</strong> interaction between levorphanol and epinephrine (levor:epin).<br />

(iv) Show <strong>the</strong> contrasts are orthogonal to each o<strong>the</strong>r.


(v) Using <strong>the</strong> following ANOVA table and table of means, write down and test hypo<strong>the</strong>ses<br />

concerning <strong>the</strong> main effects of levorphanol, epinephrine and <strong>the</strong>ir interaction.<br />

Analysis of Variance Table<br />

Response: cs<br />

Df Sum Sq Mean Sq F value Pr(>F)<br />

levor 1 12.832 12.832 12.598 0.002670<br />

epin 1 18.586 18.586 18.246 0.000584<br />

levor:epin 1 6.161 6.161 6.048 0.025692<br />

Residuals 16 16.298 1.019<br />

Tables of means<br />

Grand mean<br />

2.964<br />

levor<br />

0 1<br />

3.765 2.163<br />

epin<br />

0 1<br />

2.000 3.928<br />

levor:epin<br />

epin<br />

levor 0 1<br />

0 2.246 5.284<br />

1 1.754 2.572<br />

Standard errors for differences of means<br />

levor epin levor:epin<br />

0.4514 0.4514 0.6383<br />

replic. 10 10 5<br />

(vi) Write a conclusion.


<strong>Question</strong> 4 [20 <strong>Marks</strong>]<br />

An experiment is conducted to determine <strong>the</strong> effect of three levels of fertilization on <strong>the</strong><br />

yield of sugar cane. The experiment is conducted at four locations, randomly selected<br />

from a number of possible locations available for <strong>the</strong> experiment. Each fertilizer level is<br />

applied to three plots at each location.<br />

(i) Complete <strong>the</strong> following table<br />

Df Mean Sq Expected Mean Square F P<br />

fertilizer * 12.15514 * * *<br />

location * 4.60598 * * *<br />

fertilizer:location * 0.15544 * * *<br />

Residuals 24 0.05793<br />

(Note <strong>the</strong> Sum of Squares column has been omitted and is not needed.)<br />

(ii) Write down and test appropriate hypo<strong>the</strong>ses concerning fertilizer treatments, locations<br />

and <strong>the</strong> interaction between fertilizer treatment and location.<br />

(iii) Estimate <strong>the</strong> components of variation.<br />

(iv) Calculate <strong>the</strong> variation of a single observation.<br />

(v) The fertilizer SS is partitioned using polynomial contrasts. The significance levels<br />

are given for <strong>the</strong> resulting F values in <strong>the</strong> ANOVA table.<br />

Analysis of Variance Table<br />

Response: yield<br />

Pr(>F)<br />

linear<br />

3.32e-05<br />

quadratic<br />

1.65e-11<br />

location<br />

1.31e-12<br />

location:fertilizer 0.0389<br />

<strong>In</strong>terpret <strong>the</strong>se results with <strong>the</strong> aid of <strong>the</strong> following table of mean yields for <strong>the</strong> three<br />

fertilizer levels.<br />

1 2 3<br />

16.7567 17.9208 15.9167<br />

(vi) Write a conclusion.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!