13.11.2012 Views

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

Introduction to Categorical Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

314 RANDOM EFFECTS: GENERALIZED LINEAR MIXED MODELS<br />

<strong>to</strong> have more-alike observations than students from different schools. This could be<br />

because students within a school tend <strong>to</strong> be similar on various socioeconomic indices.<br />

Hierarchical models contain terms for the different levels of units. For the example<br />

just mentioned, the model would contain terms for the student and for the school.<br />

Level 1 refers <strong>to</strong> measurements at the student level, and level 2 refers <strong>to</strong> measurements<br />

at the school level. GLMMs having a hierarchical structure of this sort are called<br />

multilevel models.<br />

Multilevel models usually have a large number of terms. To limit the number of<br />

parameters, the model treats terms for the units as random effects rather than fixed<br />

effects. The random effects can enter the model at each level of the hierarchy. For<br />

example, random effects for students and random effects for schools refer <strong>to</strong> different<br />

levels of the model. Level 1 random effects can account for variability among students<br />

in student-specific characteristics not measured by the explana<strong>to</strong>ry variables. These<br />

might include student ability and parents’ socioeconomic status. The level 2 random<br />

effects account for variability among schools due <strong>to</strong> school-specific characteristics<br />

not measured by the explana<strong>to</strong>ry variables. These might include the quality of the<br />

teaching staff, the teachers’ average salary, the degree of drug-related problems in the<br />

school, and characteristics of the district for which the school enrolls students.<br />

10.4.1 Example: Two-Level Model for Student Advancement<br />

An educational study analyzes fac<strong>to</strong>rs that affect student advancement in school from<br />

one grade <strong>to</strong> the next. For student t in school i, the response variable yit measures<br />

whether the student passes <strong>to</strong> the next grade (yit = 1) or fails. We will consider a<br />

model having two levels, one for students and one for schools. When there are many<br />

schools and we can regard them as approximately a random sample of schools that<br />

such a study could consider, we use random effects for the schools.<br />

Let {xit1,...,xitk} denote the values of k explana<strong>to</strong>ry variables that have values<br />

that vary at the student level. For example, for student t in school i, perhaps xit1<br />

measures the student’s performance on an achievement test, xit2 is gender, xit3 is race,<br />

and xit4 is whether he or she previously failed any grades. The level-one model is<br />

logit[P(yit = 1)] =αi + β1xit1 + β2xit2 +···+βkxitk<br />

The level-two model provides a linear predic<strong>to</strong>r for the level-two (i.e., school-level)<br />

term in the level-one (i.e., student-level) model. That level-two term is the intercept,<br />

αi. The level-two model has the form<br />

αi = ui + α + γ1wi1 + γ2wi2 +···+γtwiℓ<br />

Here, {wi1,...,wiℓ} are ℓ explana<strong>to</strong>ry variables that have values that vary only at the<br />

school level, so they do not have a t subscript. For example, perhaps wi1 is per-student<br />

expenditure of school i. The term ui is the random effect for school i.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!