20.03.2013 Views

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

15.6. ESTIMATING THAT RELATIONSHIP FROM SAMPLE DATA 289<br />

i ↓, j → 0 1 2 3<br />

0 0.002 0.024 0.036 0.008<br />

1 0.162 0.073 0.048 0.004<br />

2 0.012 0.024 0.006 0.000<br />

Now keep in mind that since mY ;B(t) is the conditional mean <strong>of</strong> Y given B, we need <strong>to</strong> use conditional<br />

probabilities <strong>to</strong> compute it. For our example here <strong>of</strong> mY ;B(2), we need the probabilities<br />

P (Y = k|B = 2). For instance,<br />

P (Y = 1|B = 2) = pY,B(1, 2)<br />

pB(2)<br />

(15.2)<br />

=<br />

0.048<br />

0.036 + 0.048 + 0.006<br />

(15.3)<br />

= 0.533 (15.4)<br />

The other conditional P (Y = k|B = 2) are then found <strong>to</strong> be 0.400 for k = 0 and 0.067 for k = 2.<br />

We then have<br />

mY ;B(2) = 0.400 · 0 + 0.533 · 1 + 0.067 · 2 = 0.667 (15.5)<br />

15.6 Estimating That Relationship from Sample Data<br />

As noted, though, mW ;H(t) is a population function, dependent on population distributions. How<br />

can we estimate this function from sample data?<br />

Toward that end, let’s again suppose we have a random sample <strong>of</strong> 1000 people from <strong>Davis</strong>, with<br />

(H1, W1), ..., (H1000, W1000) (15.6)<br />

being their heights and weights. We again wish <strong>to</strong> use this data <strong>to</strong> estimate population values. But<br />

the difference here is that we are estimating a whole function now, the whole curve mW ;H(t). That<br />

means we are estimating infinitely many values, with one mW ;H(t) value for each t. 4 How do we<br />

do this?<br />

One approach would be as follows. Say we wish <strong>to</strong> find mW ;H(t) (note the hat, for “estimate <strong>of</strong>”!)<br />

at t = 70.2. In other words, we wish <strong>to</strong> estimate the mean weight—in the population—among all<br />

people <strong>of</strong> height 70.2. What we could do is look at all the people in our sample who are within, say,<br />

1.0 inch <strong>of</strong> 70.2, and calculate the average <strong>of</strong> all their weights. This would then be our mW ;H(t).<br />

4 Of course, the population <strong>of</strong> <strong>Davis</strong> is finite, but there is the conceptual population <strong>of</strong> all people who could live in<br />

<strong>Davis</strong>.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!