20.03.2013 Views

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

328 CHAPTER 16. ADVANCED STATISTICAL ESTIMATION AND INFERENCE<br />

16.2.4 Example: Confidence Interval for a Measurement <strong>of</strong> Prediction Ability<br />

Suppose we have a random sample X1, ..., Xn from some population. In other words, the Xi are<br />

independent and each is distributed as in the population. Let X represent a generic random variable<br />

having that distribution. Here we are allowing the Xi and X <strong>to</strong> be random vec<strong>to</strong>rs, though they<br />

won’t play much explicit role anyway.<br />

Let A and B be events associated with X. If for example X is a random vec<strong>to</strong>r (U,V), we might<br />

have A and B being the events U > 12 and U-V < 5. The question <strong>of</strong> interest here will be <strong>to</strong> what<br />

extent we can predict A from B.<br />

One measure <strong>of</strong> that might be the quantity ν = P (A|B)−P (A). The larger ν is (in absolute value),<br />

the stronger the ability <strong>of</strong> B <strong>to</strong> predict A. (We could look at variations <strong>of</strong> this, such as the quotient<br />

<strong>of</strong> those two probabilities, but will not do so here.)<br />

Let’s use the delta method <strong>to</strong> derive an approximate 95% confidence interval for ν. To that end,<br />

think <strong>of</strong> four categories—A and B; A and not B; not A and B; and not A and not B. Each Xi falls<br />

in<strong>to</strong> one <strong>of</strong> those categories, so the four-component vec<strong>to</strong>r Y consisting <strong>of</strong> counts <strong>of</strong> the numbers <strong>of</strong><br />

Xi falling in<strong>to</strong> the four categories has a multinomial distribution with r = 4.<br />

To use the theorem, set R = Y/n, so that R is the vec<strong>to</strong>r <strong>of</strong> the sample proportions. For instance,<br />

R1 will be the number <strong>of</strong> Xi satisfying both events A and B, divided by n. The vec<strong>to</strong>r η will then<br />

be the corresponding population proportion, so that for instance<br />

We are interested in<br />

η2 = P (A and not B) (16.37)<br />

ν = P (A|B) − P (A) (16.38)<br />

=<br />

P (A and B)<br />

− [P (A and B) + P (A and not B)]<br />

P (A and B) + P (not A and B)<br />

(16.39)<br />

=<br />

η1<br />

η1 + η3<br />

− (η1 + η2) (16.40)<br />

By the way, since η4 is not involved, let’s shorten R <strong>to</strong> (R1, R2, R3) ′ .<br />

What about Σ? Since Y is multinomial, Equation (8.114) provides us Σ:<br />

Σ = 1<br />

⎛<br />

⎝<br />

n<br />

η1(1 − η1)<br />

−η2η1<br />

−η1η2<br />

η2(1 − η2)<br />

−η1η3<br />

−η2η3<br />

⎞<br />

⎠ (16.41)<br />

−η3η1 −η3η2 η3(1 − η3)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!