04.06.2013 Views

Using Applied Multilevel Modeling for MCH EPI Data Analysis

Using Applied Multilevel Modeling for MCH EPI Data Analysis

Using Applied Multilevel Modeling for MCH EPI Data Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>MCH</strong> Epidemiology 2010 Preconference <strong>Data</strong> Skills Training Workshop<br />

San Antonio, TX<br />

Training Description and Curriculum<br />

December 13 -14, 2010<br />

8am-5pm<br />

Topic<br />

<strong>Using</strong> <strong>Applied</strong> <strong>Multilevel</strong> <strong>Modeling</strong> <strong>for</strong> <strong>MCH</strong> <strong>EPI</strong> <strong>Data</strong> <strong>Analysis</strong><br />

Background<br />

<strong>Multilevel</strong> modeling has become a mainstay of public health analysis where subjects are often<br />

observed clustered within larger units (e.g., clinics, providers, service areas) or where subjects<br />

are repeatedly measured across time. Traditional approaches are insufficient <strong>for</strong> accounting <strong>for</strong><br />

the clustering of the data, resulting in underestimation of measures of uncertainly, and there<strong>for</strong>e<br />

invalid statistical conclusions. Recently, the introduction of multilevel modeling has given<br />

epidemiologists a new tool <strong>for</strong> producing adjusted measures that can be used to lay out a set of<br />

risk factors according to their impact on a health outcome, and there<strong>for</strong>e in<strong>for</strong>m the prioritization<br />

of prevention and intervention strategies. In this workshop, attendees will learn about the use of<br />

multilevel models (aka mixed or hierarchical models) <strong>for</strong> analysis of clustered and longitudinal<br />

datasets, continuous and dichotomous outcomes, and other related types of multilevel datasets.<br />

Training Goal<br />

The purpose of this training is to provide an in depth conceptual and methodological overview of<br />

data analysis using multilevel modeling, along with hands-on experience in calculating and<br />

interpreting measures using public health datasets.<br />

Training Objectives<br />

Participants in this training will:<br />

1. Gain an understanding of the different perspectives of collecting and analyzing <strong>MCH</strong><br />

datasets using multilevel modeling;<br />

2. Become familiar with the multilevel logistic regression models <strong>for</strong> analysis of clustered and<br />

longitudinal datasets, continuous and dichotomous outcomes, and other <strong>for</strong>ms of multilevel<br />

datasets used in common epidemiological studies;<br />

3. Gain experience in using extensions of the multilevel logistic model <strong>for</strong> ordinal and nominal<br />

outcomes;<br />

4. Become familiar with methods <strong>for</strong> dealing with missing data in longitudinal studies.<br />

1


Target Audience and Prerequisites<br />

This workshop is aimed at public health professionals such as epidemiologists and other<br />

statisticians who work in public health agencies. Attendees should have a good working<br />

knowledge of regression analysis, both <strong>for</strong> continuous and dichotomous outcomes. Registration<br />

will be limited to 30 participants.<br />

Training Methods and Approaches<br />

The training presentations will be interactive and will include the application of multilevel<br />

models, with direct application illustrated using standard statistical software (i.e., SAS). The<br />

training participants will be asked to share state examples of identified best practices <strong>for</strong> statemultilevel<br />

<strong>MCH</strong><strong>EPI</strong> datasets. Participants will be provided with a laptop <strong>for</strong> the duration of the<br />

training.<br />

Course Curriculum<br />

Day one<br />

Lecture – <strong>Multilevel</strong> models <strong>for</strong> clustered continuous outcomes<br />

• Description and structure of multilevel data<br />

• <strong>Multilevel</strong> model description, meaning of random cluster effects and intraclass<br />

correlation (ICC)<br />

• 2- and 3-level examples, model comparisons, R squared measures<br />

Exercise I – running 2- and 3-level models with SAS PROC MIXED<br />

Lecture – <strong>Multilevel</strong> models <strong>for</strong> clustered dichotomous outcomes<br />

• Extension of logistic regression<br />

• Latent variable <strong>for</strong>mulation, meaning of regression coefficients, ICC<br />

• 2- and 3-level examples, comparison of models<br />

Exercise II – running 2- and 3-level models <strong>for</strong> clustered dichotomous data with SAS<br />

PROC GLIMMIX and SAS PROC NLMIXED<br />

Lecture – <strong>Multilevel</strong> models <strong>for</strong> clustered ordinal and nominal outcomes<br />

• Proportional odds models <strong>for</strong> ordinal outcomes<br />

• Non proportional odds models <strong>for</strong> ordinal outcomes<br />

• Nominal models with reference cell<br />

Exercise III - running 2- and 3-level models <strong>for</strong> clustered ordinal and nominal data with<br />

SAS PROC GLIMMIX and SAS PROC NLMIXED; testing the proportional odds<br />

assumption<br />

Wrap-up <strong>for</strong> Day One: Discussion of Conceptual and Technical Issues in Analyzing<br />

Clustered <strong>Data</strong><br />

2


Day two<br />

Lecture – <strong>Multilevel</strong> models <strong>for</strong> longitudinal continuous outcomes<br />

• Population and subject-specific time trends<br />

• <strong>Multilevel</strong> model description, meaning of random effects and variance parameters<br />

• Treatment of time-varying covariates<br />

Exercise IV – running longitudinal models with SAS PROC MIXED<br />

Lecture – <strong>Multilevel</strong> models <strong>for</strong> longitudinal dichotomous outcomes<br />

• Extension of logistic regression<br />

• Latent variable <strong>for</strong>mulation, subject-specific and marginal regression coefficients<br />

• Examples, comparison of models, fitting marginal proportions<br />

Exercise V – running models <strong>for</strong> longitudinal dichotomous data with SAS PROC GLIMMIX and<br />

SAS PROC NLMIXED<br />

Lecture – Missing data in longitudinal studies<br />

• Missing data mechanisms<br />

• Testing missing completely at random assumption<br />

• Sensitivity analysis to missing data assumptions (pattern-mixture and selection models)<br />

Exercise VI - running pattern-mixture models with SAS PROC MIXED<br />

Wrap-up <strong>for</strong> Day Two: Discussion of Conceptual and Technical Issues in Analyzing Longitudinal<br />

<strong>Data</strong><br />

Trainer In<strong>for</strong>mation<br />

Don Hedeker, PhD<br />

Professor<br />

Division of Epidemiology and Biostatistics<br />

School of Public Health<br />

University of Illinois at Chicago<br />

1603 W. Taylor St., room 955<br />

Chicago, IL 60612-4336<br />

Phone: (312) 996-4896<br />

Fax: (312) 996-0064<br />

Email: hedeker@uic.edu<br />

Donald Hedeker, PhD., is a Professor of Biostatistics in the School of Public Health. He received<br />

his Ph.D. in Quantitative Psychology from The University of Chicago, IL.<br />

Don's main expertise is in the development and use of advanced statistical methods <strong>for</strong> clustered<br />

and longitudinal data, with particular emphasis on mixed-effects models. He is the primary<br />

author of four freeware computer programs <strong>for</strong> mixed-effects analysis: MIXREG <strong>for</strong> normaltheory<br />

models, MIXOR <strong>for</strong> dichotomous and ordinal outcomes, MIXNO <strong>for</strong> nominal outcomes,<br />

and MIXPREG <strong>for</strong> counts. These programs and manuals are available at<br />

http://www.uic.edu/~hedeker/mix.html.<br />

Don has received numerous awards including named a Fellow of the American Statistical<br />

Association, the highest honor in his field in 2000. He was also recognized as a University<br />

Scholar by the University of Illinois that same year. Don is a past Associate Editor <strong>for</strong> Journal of<br />

3


Educational and Behavioral Statistics and is a Consulting Editor <strong>for</strong> Statistics <strong>for</strong> Bipolar<br />

Disorders. He has also been the PI, co-PI, or co-I on over a dozen NIH or CDC research grants.<br />

Samples of Background Reading Material<br />

Hedeker, D., Gibbons, R. D., & Flay, B. R. (1994). Random-effects regression models <strong>for</strong><br />

clustered data: with an example from smoking prevention research. Journal of Consulting and<br />

Clinical Psychology, 62, 757-765.<br />

Hedeker, D., McMahon, S. D., Jason, L. A., & Salina, D. (1994). <strong>Analysis</strong> of clustered data in<br />

community psychology: with an example from a worksite smoking cessation project. American<br />

Journal of Community Psychology, 22, 595-615.<br />

Hedeker, D. & Gibbons, R. D. (1997). Application of random-effects pattern-mixture models <strong>for</strong><br />

missing data in longitudinal studies. Psychological Methods, 2, 64-78.<br />

Hedeker, D. (2004). An introduction to growth modeling. In D. Kaplan (Ed.), the Sage<br />

Handbook of Quantitative Methodology <strong>for</strong> the Social Sciences (pp. 215-234). Sage<br />

Publications, Thousand Oaks, CA.<br />

Hedeker, D. (2005). Generalized linear mixed models. In B. Everett & D. Howell (Eds.),<br />

Encyclopedia of Statistics in Behavioral Science. Wiley, New York.<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!