20.03.2013 Views

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

From Algorithms to Z-Scores - matloff - University of California, Davis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

17.2. THE CLASSIFICATION PROBLEM 337<br />

not contain “NIH,” and make five claims would have the value<br />

1<br />

1 + e −(β0+β1+5β3)<br />

17.2.3.1 The Logistic Model: Motivations<br />

The logistic function itself,<br />

1<br />

1 + e −u<br />

(17.10)<br />

(17.11)<br />

has values between 0 and 1, and is thus a candidate for modeling a probability. Also, it is mono<strong>to</strong>nic<br />

in u, making it further attractive, as in many classification problems we believe that mY ;X(t) should<br />

be mono<strong>to</strong>nic in the predic<strong>to</strong>r variables.<br />

But there are additional reasons <strong>to</strong> use the logit model, as it includes many common parametric<br />

models for X. To see this, note that we can write, for vec<strong>to</strong>r-valued discrete X and t,<br />

P (Y = 1|X = t) =<br />

= P (Y = 1)P (X = t|Y = 1)<br />

=<br />

=<br />

P (Y = 1 and X = t)<br />

P (X = t)<br />

P (X = t)<br />

P (Y = 1)P (X = t|Y = 1)<br />

P (Y = 1)P (X = t|Y = 1) + P (Y = 0)P (X = t|Y = 0)<br />

1<br />

1 +<br />

(1−q)P (X=t|Y =0)<br />

qP (X=t|Y =1)<br />

(17.12)<br />

(17.13)<br />

(17.14)<br />

(17.15)<br />

where q = P (Y = 1) is the proportion <strong>of</strong> members <strong>of</strong> the population which have Y = 1. (Keep in<br />

mind that this probability is unconditional!!!! In the patent example, for instance, if say q = 0.12,<br />

then 12% <strong>of</strong> all patents in the patent population—without regard <strong>to</strong> words used, numbers <strong>of</strong> claims,<br />

etc.—are publicly funded.)<br />

If X is a continuous random vec<strong>to</strong>r, then the analog <strong>of</strong> (17.15) is<br />

P (Y = 1|X = t) =<br />

1<br />

1 + (1−q)f X|Y =0(t)<br />

qf X|Y =1(t)<br />

(17.16)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!