01.06.2013 Views

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

Statistical Methods in Medical Research 4ed

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

accord<strong>in</strong>g to their values of this function. A different approach, which has been<br />

developed over the last 20 years, is the construction of a b<strong>in</strong>ary decision tree. This<br />

is a computer-<strong>in</strong>tensive method. The method is more similar to diagnostic cl<strong>in</strong>ical<br />

decision-mak<strong>in</strong>g than is the construction of a discrim<strong>in</strong>ant function.<br />

The basis of the method is that at the first stage, termed a node, the <strong>in</strong>dividuals<br />

are divided <strong>in</strong>to two sets, the branches, accord<strong>in</strong>g to the value of one of the x<br />

variables. This variable is chosen from the full set so that the subdivision<br />

maximizes the separation of <strong>in</strong>dividuals <strong>in</strong> the groups between the two branches.<br />

Also, if the variable is not already dichotomous, then it is converted to b<strong>in</strong>ary<br />

form to achieve the same maximization. Each branch leads to a new node (Fig.<br />

13.3) and the same procedure is carried out, us<strong>in</strong>g the rema<strong>in</strong><strong>in</strong>g x variables, and<br />

not<strong>in</strong>g that the variable used for subdivision at each of the two nodes will most<br />

probably be different. Some branches end at a term<strong>in</strong>al node, when the <strong>in</strong>dividuals<br />

at that node are homogenous with respect to group, and the whole process<br />

cont<strong>in</strong>ues until all branches have term<strong>in</strong>ated.<br />

The choice of the f<strong>in</strong>al decision tree is a balance between its quality, def<strong>in</strong>ed<br />

<strong>in</strong> terms of homogeneity of the term<strong>in</strong>al nodes, and a penalty for complexity.<br />

This penalty is used to control mak<strong>in</strong>g the tree unnecessarily complex whilst<br />

achiev<strong>in</strong>g only m<strong>in</strong>or improvements.<br />

For a fuller description of the method of classification and regression trees,<br />

CART TM , see Zhang et al. (1998).<br />

Discrim<strong>in</strong>ation with more than two groups: MANOVA<br />

13.3 Discrim<strong>in</strong>ant analysis 477<br />

The l<strong>in</strong>ear discrim<strong>in</strong>ant function can be generalized to the situation where there<br />

are k…>2† groups <strong>in</strong> two different ways. The first approach leads to what are<br />

known as canonical variates. We saw from (13.5) that when k ˆ 2 the l<strong>in</strong>ear<br />

discrim<strong>in</strong>ant function maximizes the ratio of the difference <strong>in</strong> means between the<br />

groups to the standard deviation with<strong>in</strong> groups. A natural generalization of this<br />

criterion is to maximize the ratio of the sum of squares (SSq) between groups to<br />

the SSq with<strong>in</strong> groups. This requirement is found to lead to a standard technique<br />

of matrix algebraÐthe calculation of eigenvalues or latent roots of a matrix. The<br />

appropriate equation, <strong>in</strong> fact, has several solutions. One solution, correspond<strong>in</strong>g<br />

to the highest latent root, gives the coefficient <strong>in</strong> the l<strong>in</strong>ear function which<br />

maximizes the ratio of SSqs. This is called the first canonical variate, W1. If<br />

one wanted as good discrim<strong>in</strong>ation as possible from one l<strong>in</strong>ear function, this<br />

would be the one to choose. The second canonical variate, W2, is the function<br />

with the highest ratio of SSqs, subject to the condition that it is uncorrelated with<br />

W1 both between and with<strong>in</strong> groups. Similarly, W3 gives the highest ratio subject<br />

to be<strong>in</strong>g uncorrelated with W1 and W2. The number of canonical variates is the<br />

smaller of p or k 1. Thus, the l<strong>in</strong>ear discrim<strong>in</strong>ant function (13.4) is for k ˆ 2 the<br />

first and only canonical variate.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!