[U] User's Guide

More documents

Recommendations

Info

360 [ U ] 26 Overview of Stata estimation commands26.4 Generalized linear modelsThe generalized linear model isg{E(y j )} = x j β,y j ∼ Fwhere g() is called the link function and F is a member of the exponential family, both of whichyou specify before estimation. glm fits this model.The GLM framework encompasses a surprising array of models known by other names, includinglinear regression, Poisson regression, exponential regression, and others. Stata provides dedicatedestimation commands for many of these. Stata has, for instance, regress for linear regression,poisson for Poisson regression, and streg for exponential regression, and that is not all of theoverlap.glm by default uses maximum likelihood estimation and alternatively estimates via iteratedreweighted least squares (IRLS) when the irls option is specified. For each family, F , there isa corresponding link function, g(), called the canonical link, for which IRLS estimation producesresults identical to maximum likelihood estimation. You can, however, match families and link functionsas you wish, and, when you match a family to a link function other than the canonical link,you obtain a different but valid estimator of the standard errors of the regression coefficients. Theestimator you obtain is asymptotically equivalent to the maximum likelihood estimator, which, insmall samples, produces slightly different results.For example, the canonical link for the binomial family is logit. glm, irls with that combinationproduces results identical to the maximum-likelihood logit (and logistic) command. The binomialfamily with the probit link produces the probit model, but probit is not the canonical link here. Hence,glm, irls produces standard error estimates that differ slightly from those produced by Stata’smaximum-likelihood probit command.Many researchers feel that the maximum-likelihood standard errors are preferable to IRLS estimates(when they are not identical), but they would have a difficult time justifying that feeling. Maximumlikelihood probit is an estimator with (solely) asymptotic properties; glm, irls with the binomialfamily and probit link is an estimator with (solely) asymptotic properties, and in finite samples, thestandard errors differ a little.Still, we recommend that you use Stata’s dedicated estimators whenever possible. IRLS—thetheory—and glm, irls—the command—are all encompassing in their generality, meaning thatthey rarely use the right jargon or provide things in the way you wish they would. The narrowercommands, such as logit, probit, and poisson, focus on the issue at hand and are invariablymore convenient.glm is useful when you want to match a family to a link function that is not provided elsewhere.glm also offers several estimators of the variance–covariance matrix that are consistent, even whenthe errors are heteroskedastic or autocorrelated. Another advantage of a glm version of a modelover a model-specific version is that many of these VCE estimators are available only for the glmimplementation. You can also obtain the ML–based estimates of the VCE from glm.26.5 Binary-outcome qualitative dependent-variable modelsThere are many ways to write these models, one of which isPr(y j ≠ 0) = F (x j β)
[ U ] 26.5 Binary-outcome qualitative dependent-variable models 361where F is some cumulative distribution. Two popular choices for F () are the normal and logistic, andthe models are called the probit and logit (or logistic regression) models. A third is the complementarylog–log function; maximum likelihood estimates are obtained by Stata’s cloglog command.The two parent commands for the maximum likelihood estimator of probit and logit are probitand logit, although logit has a sibling, logistic, that provides the same estimates but displaysresults in a slightly different way. There is also an exact logistic estimator; see [U] 26.9 Exactestimators.Do not read anything into the names logit and logistic. Logit and logistic have two interchangeddefinitions in two scientific camps. In the medical sciences, logit means the minimum χ 2 estimator,and logistic means maximum likelihood. In the social sciences, it is the other way around. From ourexperience, it appears that neither reads the other’s literature, because both talk (and write books)asserting that logit means one thing and logistic the other. Our solution is to provide both logitand logistic, which do the same thing, so that each camp can latch on to the maximum likelihoodcommand under the name each expects.There are two slight differences between logit and logistic. logit reports estimates in thecoefficient metric, whereas logistic reports exponentiated coefficients—odds ratios. This is inaccordance with the expectations of each camp and makes no substantial difference. The otherdifference is that logistic has a family of postlogistic commands that you can run to explore thenature of the fit. Actually, that is not exactly true because all the commands for use after logisticcan also be used after logit.If you have not already selected logit or logistic as your favorite, we recommend that youtry logistic. Logistic regression (logit) models are more easily interpreted in the odds-ratio metric.In addition to logit and logistic, Stata provides glogit, blogit, and binreg commands.blogit is the maximum likelihood estimator (the same as logit or logistic) but applied ondata organized in a different way. Rather than having individual observations, your data are organizedso that each observation records the number of observed successes and failures.glogit is the weighted-regression, grouped-data estimator.binreg can be used to model either individual-level or grouped data in an application of thegeneralized linear model. The family is assumed to be binomial, and each link provides a distinctparameter interpretation. Also, binreg offers several options for setting the link function accordingto the desired biostatistical interpretation. The available links and interpretation options areOption Implied link Parameteror logit Odds ratios = exp(β)rr log Risk ratios = exp(β)hr log complement Health ratios = exp(β)rd identity Risk differences = βRelated to logit, the skewed logit estimator scobit adds a power to the logit link function and isestimated by Stata’s scobit command.Turning to probit, you have two choices: probit and ivprobit. probit fits a maximum-likelihoodprobit model. ivprobit fits a probit model where one or more of the regressors are endogenouslydetermined.
Page 1 and 2:
A Stata Press PublicationStataCorp
Page 3 and 4:
iTable of contentsStata basics1 Rea
Page 5 and 6:
iii8.2 The return message for obtai
Page 7 and 8:
v14 Matrix expressions . . . . . .
Page 9 and 10:
vii18.11.6 Writing online help . .
Page 11 and 12:
ix25.2.15 Treatment of empty cells
Page 13:
Stata basics1 Read this—it will h
Page 16 and 17:
4 [ U ] 1 Read this—it will helpA
Page 18 and 19:
6 [ U ] 1 Read this—it will helpW
Page 20 and 21:
8 [ U ] 1 Read this—it will helpW
Page 22 and 23:
10 [ U ] 1 Read this—it will help
Page 24 and 25:
Page 26 and 27:
Page 28 and 29:
Page 30 and 31:
Page 32 and 33:
Page 34 and 35:
Page 36 and 37:
Page 38 and 39:
Page 40 and 41:
Page 42 and 43:
Page 44 and 45:
Page 46 and 47:
Page 48 and 49:
36 [ U ] 2 A brief description of S
Page 50 and 51:
38 [ U ] 2 A brief description of S
Page 52 and 53:
40 [ U ] 3 Resources for learning a
Page 54 and 55:
Page 56 and 57:
Page 58:
Page 61:
[ U ] 4.3 help: Stata’s help syst
Page 64 and 65:
52 [ U ] 4 Stata’s help and searc
Page 66 and 67:
Page 68 and 69:
Page 70 and 71:
58 [ U ] 5 Flavors of Stata5.2 Stat
Page 72 and 73:
60 [ U ] 5 Flavors of StataThe diff
Page 74 and 75:
62 [ U ] 6 Setting the size of memo
Page 76 and 77:
Page 78 and 79:
Page 80 and 81:
Page 83 and 84:
8 Error messages and return codesCo
Page 85:
[ U ] 8.2 The return message for ob
Page 88 and 89:
76 [ U ] 9 The Break key9.2 Side ef
Page 90 and 91:
78 [ U ] 10 Keyboard usesave is the
Page 92 and 93:
80 [ U ] 10 Keyboard use10.4 Editin
Page 94 and 95:
82 [ U ] 10 Keyboard useAnother met
Page 97 and 98:
11 Language syntaxContents11.1 Over
Page 99 and 100:
[ U ] 11.1 Overview 8711.1.2 by var
Page 101 and 102:
[ U ] 11.1 Overview 89. summarize m
Page 103 and 104:
[ U ] 11.1 Overview 91Example 7Nega
Page 105 and 106:
11.1.7 options[ U ] 11.1 Overview 9
Page 107 and 108:
[ U ] 11.1 Overview 9511.1.8 numlis
Page 109 and 110:
[ U ] 11.2 Abbreviation rules 97by
Page 111 and 112:
[ U ] 11.3 Naming conventions 99The
Page 113 and 114:
[ U ] 11.4 varlists 101You may use
Page 115 and 116:
[ U ] 11.4 varlists 103Consider a v
Page 117 and 118:
[ U ] 11.4 varlists 105Here are som
Page 119 and 120:
[ U ] 11.4 varlists 107You can sele
Page 121 and 122:
[ U ] 11.4 varlists 109. use http:/
Page 123 and 124:
[ U ] 11.5 by varlist: construct 11
Page 125 and 126:
[ U ] 11.5 by varlist: construct 11
Page 127 and 128:
[ U ] 11.6 File-naming conventions
Page 129 and 130:
12 DataContents12.1 Data and datase
Page 131 and 132:
[ U ] 12.2 Numbers 119. display 2/0
Page 133 and 134:
[ U ] 12.2 Numbers 121Technical not
Page 135 and 136:
[ U ] 12.4 Strings 12312.4 StringsA
Page 137 and 138:
[ U ] 12.5 Formats: Controlling how
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
[ U ] 12.6 Dataset, variable, and v
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
[ U ] 12.8 Characteristics 139You c
Page 153:
[ U ] 12.10 References 141The Varia
Page 156 and 157:
144 [ U ] 13 Functions and expressi
Page 158 and 159:
Page 160 and 161:
Page 162 and 163:
Page 164 and 165:
Page 166 and 167:
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
162 [ U ] 14 Matrix expressionsHere
Page 176 and 177:
164 [ U ] 14 Matrix expressionsWe c
Page 178 and 179:
166 [ U ] 14 Matrix expressions. ma
Page 180 and 181:
168 [ U ] 14 Matrix expressionsRega
Page 182 and 183:
170 [ U ] 14 Matrix expressionsEith
Page 184 and 185:
172 [ U ] 14 Matrix expressionsThe
Page 186 and 187:
174 [ U ] 14 Matrix expressions7. W
Page 188 and 189:
176 [ U ] 15 Saving and printing ou
Page 190 and 191:
Page 192 and 193:
Page 194 and 195:
182 [ U ] 16 Do-filesYou then enter
Page 196 and 197:
184 [ U ] 16 Do-files* a sample ana
Page 198 and 199:
186 [ U ] 16 Do-files16.1.4 Error h
Page 200 and 201:
188 [ U ] 16 Do-filesMany users inc
Page 202 and 203:
190 [ U ] 16 Do-files5. If Stata is
Page 204 and 205:
192 [ U ] 16 Do-filesyou could writ
Page 207 and 208:
17 Ado-filesContents17.1 Descriptio
Page 209 and 210:
[ U ] 17.5 Where does Stata look fo
Page 211 and 212:
17.6 How do I install an addition?A
Page 213 and 214:
18 Programming StataContents18.1 De
Page 215 and 216:
[ U ] 18.2 Relationship between a p
Page 217 and 218:
[ U ] 18.3 Macros 205That problem i
Page 219 and 220:
[ U ] 18.3 Macros 207For the right
Page 221 and 222:
[ U ] 18.3 Macros 209To a programme
Page 223 and 224:
[ U ] 18.3 Macros 211Because Stata
Page 225 and 226:
[ U ] 18.3 Macros 213Above we have
Page 227 and 228:
[ U ] 18.3 Macros 21518.3.10 Advanc
Page 229 and 230:
[ U ] 18.4 Program arguments 217You
Page 231 and 232:
[ U ] 18.4 Program arguments 219Let
Page 233 and 234:
[ U ] 18.4 Program arguments 221Equ
Page 235 and 236:
[ U ] 18.4 Program arguments 223pro
Page 237 and 238:
[ U ] 18.4 Program arguments 22518.
Page 239 and 240:
[ U ] 18.7 Temporary objects 22718.
Page 241 and 242:
[ U ] 18.8 Accessing results calcul
Page 243 and 244:
[ U ] 18.9 Accessing results calcul
Page 245 and 246:
[ U ] 18.10 Saving results 2333. Co
Page 247 and 248:
[ U ] 18.10 Saving results 2355. Yo
Page 249 and 250:
[ U ] 18.11 Ado-files 237e(sample)
Page 251 and 252:
[ U ] 18.11 Ado-files 239If you wri
Page 253 and 254:
[ U ] 18.11 Ado-files 241duck is co
Page 255 and 256:
[ U ] 18.11 Ado-files 243begin fvar
Page 257 and 258:
[ U ] 18.11 Ado-files 245begin fvar
Page 259 and 260:
Technical note[ U ] 18.11 Ado-files
Page 261 and 262:
[ U ] 18.11 Ado-files 249{phang}{op
Page 263 and 264:
[ U ] 18.11 Ado-files 2515. Include
Page 265:
[ U ] 18.13 References 253Technical
Page 268 and 269:
256 [ U ] 19 Immediate commands19.1
Page 270 and 271:
258 [ U ] 19 Immediate commands19.1
Page 272 and 273:
260 [ U ] 20 Estimation and postest
Page 274 and 275:
Page 276 and 277:
Page 278 and 279:
Page 280 and 281:
Page 282 and 283:
Page 284 and 285:
Page 286 and 287:
Page 288 and 289:
Page 290 and 291:
Page 292 and 293:
Page 294 and 295:
Page 296 and 297:
Page 298 and 299:
Page 300 and 301:
Page 302 and 303:
Page 304 and 305:
Page 306 and 307:
Page 308 and 309:
Page 310 and 311:
Page 312 and 313:
Page 314 and 315:
Page 316 and 317:
Page 318 and 319:
Page 320 and 321:
Page 322 and 323: 310 [ U ] 20 Estimation and postest
Page 324 and 325: 312 [ U ] 20 Estimation and postest
Page 326 and 327: 314
Page 328 and 329: 316 [ U ] 21 Inputting data21.2 Det
Page 330 and 331: 318 [ U ] 21 Inputting dataM,Joe Sm
Page 332 and 333: 320 [ U ] 21 Inputting dataTechnica
Page 334 and 335: 322 [ U ] 21 Inputting dataAnother
Page 337 and 338: 22 Combining datasetsYou have two d
Page 339 and 340: 23 Working with stringsContents23.1
Page 341 and 342: [ U ] 23.4 Complex strings 329The p
Page 343 and 344: 24 Working with dates and timesCont
Page 345 and 346: [ U ] 24.2 Inputting dates and time
Page 347 and 348: [ U ] 24.2 Inputting dates and time
Page 349 and 350: [ U ] 24.6 Converting between date
Page 351 and 352: 25 Working with categorical data an
Page 353 and 354: [ U ] 25.1 Continuous, categorical,
Page 355 and 356: [ U ] 25.2 Estimation with factor v
Page 369 and 370: 26 Overview of Stata estimation com
Page 371: [ U ] 26.3 ANOVA, ANCOVA, MANOVA, a
Page 375 and 376: [ U ] 26.9 Exact estimators 363In t
Page 377 and 378: [ U ] 26.14 Models with time-series
Page 379 and 380: [ U ] 26.15 Panel-data models 367xt
Page 381 and 382: 26.15.6 Random-coefficient models w
Page 383 and 384: [ U ] 26.21 Multivariate and cluste
Page 385: [ U ] 26.25 Reference 373should try
Page 388 and 389: 376 [ U ] 27 Commands everyone shou
Page 390 and 391: 378 [ U ] 28 Using the Internet to
Page 402 and 403: 390 Subject and author indexcoeffic
Page 404 and 405: 392 Subject and author indexIif exp
Page 406 and 407: 394 Subject and author indexPiantad
Page 408: 396 Subject and author indexUnix,ke
show all

[U] User's Guide

Create successful ePaper yourself

Delete template?

Save as template?