14.03.2014 Views

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

Basic Analysis and Graphing - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 7 Performing Simple Logistic Regression 225<br />

Logistic Platform Options<br />

Lift Curve<br />

Save Probability Formula<br />

Script<br />

Produces a lift curve for the model. A lift curve shows the same<br />

information as a ROC curve, but in a way to dramatize the richness<br />

of the ordering at the beginning. The Y-axis shows the ratio of how<br />

rich that portion of the population is in the chosen response level<br />

compared to the rate of that response level as a whole. See the<br />

Modeling <strong>and</strong> Multivariate Methods book for details about lift curves.<br />

Creates new data table columns that contain formulas. See “Save<br />

Probability Formula” on page 226.<br />

This menu contains options that are available to all platforms. They<br />

enable you to redo the analysis or save the JSL comm<strong>and</strong>s for the<br />

analysis to a window or a file. For more information, see Using JMP.<br />

ROC Curves<br />

Suppose you have an x value that is a diagnostic measurement <strong>and</strong> you want to determine a threshold value<br />

of x that indicates the following:<br />

• A condition exists if the x value is greater than the threshold.<br />

• A condition does not exist if the x value is less than the threshold.<br />

For example, you could measure a blood component level as a diagnostic test to predict a type of cancer.<br />

Now consider the diagnostic test as you vary the threshold <strong>and</strong>, thus, cause more or fewer false positives <strong>and</strong><br />

false negatives. You then plot those rates. The ideal is to have a very narrow range of x criterion values that<br />

best divides true negatives <strong>and</strong> true positives. The Receiver Operating Characteristic (ROC) curve shows<br />

how rapidly this transition happens, with the goal being to have diagnostics that maximize the area under<br />

the curve.<br />

Two st<strong>and</strong>ard definitions used in medicine are as follows:<br />

• Sensitivity, the probability that a given x value (a test or measure) correctly predicts an existing condition.<br />

For a given x, the probability of incorrectly predicting the existence of a condition is 1 – sensitivity.<br />

• Specificity, the probability that a test correctly predicts that a condition does not exist.<br />

A ROC curve is a plot of sensitivity by (1 – specificity) for each value of x. The area under the ROC curve is<br />

a common index used to summarize the information contained in the curve.<br />

When you do a simple logistic regression with a binary outcome, there is a platform option to request a<br />

ROC curve for that analysis. After selecting the ROC Curve option, a window asks you to specify which<br />

level to use as positive.<br />

If a test predicted perfectly, it would have a value above which the entire abnormal population would fall<br />

<strong>and</strong> below which all normal values would fall. It would be perfectly sensitive <strong>and</strong> then pass through the<br />

point (0,1) on the grid. The closer the ROC curve comes to this ideal point, the better its discriminating<br />

ability. A test with no predictive ability produces a curve that follows the diagonal of the grid (DeLong, et al.<br />

1988).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!