Modè<strong>le</strong> nonlinéaire <strong>à</strong> processus latent 66Biometrics 62, 1014–1024December 2006DOI: 10.1111/j.1541-0420.2006.00573.xA Nonlinear Model with Latent Process for Cognitive EvolutionUsing Multivariate Longitudinal DataCéci<strong>le</strong> Proust, 1,∗ Hélène Jacqmin-Gadda, 1 Jeremy M. G. Taylor, 2 Julien Ganiayre, 1 and DanielCommenges 11 INSERM E0338, Université de Bordeaux 2, 146 rue Léo Saignat, 33076 Bordeaux Cedex, France2 Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, Michigan 48109,U.S.A. ∗ email: Ceci<strong>le</strong>.Proust@isped.u-bordeaux2.frSummary. Cognition is not directly measurab<strong>le</strong>. It is assessed using psychometric tests, which can beviewed as quantitative measures of cognition with error. The aim of this artic<strong>le</strong> is to propose a model todescribe the evolution in continuous time of unobserved cognition in the elderly and assess the impactof covariates directly on it. The latent cognitive process is defined using a linear mixed model includinga Brownian motion and time-dependent covariates. The observed psychometric tests are considered asthe results of parameterized nonlinear transformations of the latent cognitive process at discrete occasions.Estimation of the parameters contained both in the transformations and in the linear mixed model is achievedby maximizing the observed likelihood and graphical methods are performed to assess the goodness of fit ofthe model. The method is applied to data from PAQUID, a French prospective cohort study of ageing.Key words: Cognitive ageing; Mixed model; Multip<strong>le</strong> outcomes; Random effects.1. IntroductionIn cognitive ageing studies, cognition is generally evaluatedthrough a battery of psychometric tests, which are quantitativemeasures of various dimensions of cognition. Describingcognitive evolution and assessing the impact of covariates onthis evolution is an interesting approach to help us understandthe process of cognitive ageing. As the various psychometrictests are highly correlated, multivariate longitudinal analysesof several psychometric tests are often performed usingmultivariate linear mixed models (Hall et al., 2001; Harvey,Beckett, and Mungas, 2003; Sliwinski, Hofer, and Hall, 2003).These models highlight both the differences in the shapes ofevolution for each dimension and the strong correlation betweenthe dimensions.The idea of a latent cognitive process explaining the cognitivedecline in the elderly is hypothesized in neuropsychology.This latent cognitive process can be viewed as a common cognitivefactor across all the psychometric tests (Salthouse et al.,1996; Fabrigou<strong>le</strong> et al., 1998) and is supposed to be a betterpredictor of dementia and cognitive decline. As a consequence,it would be of substantial interest to focus the analysis on thislatent process by describing its evolution and evaluating theimpact of covariates directly on it.In a cross-sectional framework, Sammel and Ryan (1996)proposed a latent variab<strong>le</strong> model in which covariates couldaffect directly the latent variab<strong>le</strong>, and the multip<strong>le</strong> outcomeswere assumed to be measures of the underlying latentvariab<strong>le</strong> with error. In a longitudinal framework, Gray andBrookmeyer (1998) proposed a marginal regression model,with estimation via generalized estimating equations, toassess an overall treatment effect on several continuous and repeatedoutcomes. Roy and Lin (2000) also extended the linearlatent variab<strong>le</strong> model of Sammel and Ryan (1996) to repeatedmultivariate data. In practice, the assumption of a linear relationshipbetween the outcomes and a Gaussian latent variab<strong>le</strong>is frequently too strong, because the psychometric tests oftenhave non-Gaussian distributions due to different metrologicalproperties and different behaviors with ageing (Hall et al.,2001; Amieva et al., 2005). For instance, some tests may bemore sensitive to changes at high <strong>le</strong>vels of cognition than atlow <strong>le</strong>vels of cognition, whi<strong>le</strong> others may have the same sensitivityat high and low <strong>le</strong>vels of cognition. Thus, we proposeto introduce parameterized f<strong>le</strong>xib<strong>le</strong> nonlinear transformationsto link the quantitative tests with the latent process. The latentprocess is defined in continuous time by a linear mixedmodel including a Brownian motion, and nonlinear transformationsof the psychometric tests are noisy measures of thelatent process at discrete occasions, the shapes of the estimatednonlinear transformations giving information on themetrological properties of each test.This extension of mixed models to latent variab<strong>le</strong> modelsis related to structural equation models (SEM), mainlydeveloped in psychometrics, because in both approaches thequantity of interest cannot be measured directly and is evaluatedinstead by a set of outcomes or items (Muthén, 2002;Dunson, 2003; Rabe-Hesketh, Skrondal, and Pick<strong>le</strong>s, 2004).Thus the formulation of the model has two components, ameasurement model which links the latent variab<strong>le</strong>s with theobservations and a structural model which explains the latentvariab<strong>le</strong> structure. In the last decade, there have been1014 C○ 2006, The International Biometric Society
Modè<strong>le</strong> nonlinéaire <strong>à</strong> processus latent 67Nonlinear Model with Latent Process for Cognitive Evolution 1015major improvements in SEM (Sánchez et al., 2005). Theseinclude (i) to hand<strong>le</strong> clustered or repeated data (Longfordand Muthén, 1992; Dunson, 2003; Rabe-Haseketh et al., 2004;Skrondal and Rabe-Hesketh, 2004; Song and Lee, 2004), (ii)to allow mixture of count, ordinal, and dichotomous outcomes(Dunson, 2003; Lee and Song, 2004; Rabe-Haskethet al., 2004), (iii) to relax linearity of the relationship betweenthe latent variab<strong>le</strong>s by using nonlinear structural models(Jöreskog and Yang, 1996; Arminger and Muthén, 1998;Wall and Amemiya, 2000; Lee and Song, 2004; Song and Lee,2004), and (iv) to relax linearity between the continuous responsesand the latent variab<strong>le</strong>s (Yalcin and Amemiya, 2001).Our modeling approach differs in a number of ways. First,we focus on the change over time of a sing<strong>le</strong> common latentprocess, whi<strong>le</strong> the main interest of SEM lies in the relationshipbetween several latent variab<strong>le</strong>s. Moreover, when dealingwith quantitative outcomes, SEM generally assumes aGaussian or a Poisson distribution for the outcomes. Exceptfor threshold models for ordinal data (Dunson, 2003; Lee andSong, 2004; Rabe-Hesketh et al., 2004), when nonlinear transformationslink the latent variab<strong>le</strong>s and the outcomes, theydo not depend on parameters to be estimated. As thresholdmodels are not appropriate for quantitative scores withmany possib<strong>le</strong> values, we estimate the shape of the transformationsby using parameterized nonlinear functions. Finally,our model includes a continuous-time latent process; this givesa description of the evolution of the latent cognitive <strong>le</strong>vel forall times in the range of the observations and furthermore,it can easily hand<strong>le</strong> data where the number and times ofthe observations are different for each subject and for eachoutcome.Nonlinearity in SEM either in the structural model or in therelationship between observed outcomes and latent variab<strong>le</strong>srequires the development of suitab<strong>le</strong> estimation methods. Formodels including products of latent variab<strong>le</strong>s, Jöreskog andYang (1996) proposed a frequentist approach based on themaximization of the likelihood, whi<strong>le</strong> Arminger and Muthén(1998) proposed a Bayesian approach using a Markov chainMonte Carlo (MCMC) algorithm. For models with nonlinearrelationships between the responses and the latent variab<strong>le</strong>s,Yalcin and Amemiya (2001) proposed to compute a quadraticapproximation of the nonlinear transformations, and thenmaximized the approximate likelihood. In contrast, to hand<strong>le</strong>the nonlinear relationships between the responses and thelatent process, we propose to maximize the exact likelihoodof the observed data, which is a product of the likelihood ofthe transformed data (the transformed data are multivariateGaussian in our model) and the Jacobian of the nonlineartransformations.The main characteristics of our methodology can be summarizedas follows:(a) it can be applied to multivariate longitudinal non-Gaussian quantitative outcomes;(b) it can study the evolution of a continuous-time latentprocess representing the common factor across all theoutcomes;(c) it can estimate the shape of the transformations linkingthe quantitative outcomes and the underlying latentprocess;(d) it can hand<strong>le</strong> any type of unbalanced data (number andtime of measurements, covariates, ...) and missing atrandom data;(e) it can estimate impact of covariates on both the latentprocess and the observed outcomes.The next section focuses on the formulation of the modelfor the latent process and the outcomes on the parameterizednonlinear transformations. Section 3 is devoted to maximumlikelihood estimation (MLE). In Section 4 we discuss goodnessof fit and Section 5 focuses on an application of the methodto data from the French prospective cohort study PAQUID(Letenneur et al., 1994).2. Methodology2.1 The Latent Process: Structural ModelConsider the continuous-time latent process Λ i =(Λ i (t)) t≥0representing the common cognitive factor for individual i withi =1,..., N. Λ i is defined at time t, t ∈ R + according to alinear mixed model,Λ i (t) =X 1i (t) T β + Z i (t) T u i + σ w w i (t), t ≥ 0, (1)where X 1i (t) istheq 1 vector of time-dependent covariates associatedwith the vector of fixed effects β. The (p + 1) vectorZ i (t) = (1, t, ..., t p ) T is a time polynomial of degree p (orany vector of functions of time) and the vector of randomeffects at subject <strong>le</strong>vel u i ∼ N(μ, D), where D is an unstructuredpositive definite matrix. The process w i =(w i (t)) t≥0 isa standard Brownian motion; w i (t) models local variation anddeparture from the polynomial trend whi<strong>le</strong> the random effectsaccount for the variability of the trend across the subjects. Noindependent error is added because this latent process is assumedto represent the actual cognition in continuous time.Note that the linearity in β or in the covariates is not crucial.Any function of time could be included in the model, becausethe model is still linear in the random effects, to ensure thenormality of the latent process. Moreover, the Brownian motionalso adds f<strong>le</strong>xibility to the parametric function of time.2.2 The Measurement ModelNow consider K quantitative outcomes. Each outcome couldbe an individual psychometric test, or the sum of scores froman itemized test. For subject i and outcome k, we observe then ik vector of measurements y ik =(y i1k ,...,y ijk ,...,y inik k) T ,where y ijk is the score of subject i at occasion j for test k. Thenumber and times of measurements may be comp<strong>le</strong>tely differentfor each subject and each outcome. In the spirit of latentgrowth curve modeling (Muthén, 2002) and SEM (Yalcin andAmemiya, 2001), we assume that this measurement y ijk is relatedto the latent process at time t ijk through the followingf<strong>le</strong>xib<strong>le</strong> model,g k (y ijk ; η k )=ỹ ijk =Λ i (t ijk )+α ik + X 2i (t ijk ) T γ k + ɛ ijk , (2)where the function g k comes from a family of nonlinear transformationsG depending on the vector of parameters η k , whichwill be estimated; the random effects α ik are independentlydistributed according to an N(0, σ 2 α k) distribution; the vectorsX 2i (t ijk ) and γ k are, respectively, a q 2 vector of timedependentcovariates and the associated vector of contrasts
- Page 1 and 2:
Université Victor Segalen Bordeaux
- Page 3 and 4:
3RemerciementsA Monsieur Jean-Louis
- Page 5 and 6:
5Un immense merci à tous ceux que
- Page 7 and 8:
7A Delphine et ses mille et une his
- Page 9 and 10:
TABLE DES MATIÈRES 92.3.2 Estimati
- Page 11 and 12:
TABLE DES MATIÈRES 116.3.1 Adéqua
- Page 13 and 14:
Introduction 13ans atteintes d’un
- Page 15 and 16: Introduction 15Mais, pour l’insta
- Page 17 and 18: Introduction 171.2 Problèmes méth
- Page 19 and 20: Introduction 191.2.3 Association en
- Page 21 and 22: Introduction 21QUID suggère que le
- Page 23 and 24: Chapitre 2Etat des connaissancesCe
- Page 25 and 26: Etat des connaissances 25Plusieurs
- Page 27 and 28: Etat des connaissances 27Prise en c
- Page 29 and 30: Etat des connaissances 29Extensions
- Page 31 and 32: Etat des connaissances 31données s
- Page 33 and 34: Etat des connaissances 33variables
- Page 35 and 36: Etat des connaissances 35(2000) ne
- Page 37 and 38: Etat des connaissances 372.3 Modél
- Page 39 and 40: Etat des connaissances 39chapitre 3
- Page 41 and 42: Etat des connaissances 41de mélang
- Page 43 and 44: Etat des connaissances 43Hawkins et
- Page 45 and 46: Etat des connaissances 45tique de d
- Page 47 and 48: Etat des connaissances 47L’estima
- Page 49 and 50: Etat des connaissances 49normales s
- Page 51 and 52: Etat des connaissances 512.4 Modél
- Page 53 and 54: Etat des connaissances 53en tant qu
- Page 55 and 56: Etat des connaissances 55interactio
- Page 57 and 58: Etat des connaissances 57décrite p
- Page 59 and 60: Etat des connaissances 59entre l’
- Page 61 and 62: Etat des connaissances 612.4.3 Cas
- Page 63 and 64: Etat des connaissances 63l’inform
- Page 65: Modèle nonlinéaire à processus l
- Page 69 and 70: Modèle nonlinéaire à processus l
- Page 71 and 72: Modèle nonlinéaire à processus l
- Page 73 and 74: Modèle nonlinéaire à processus l
- Page 75 and 76: Modèle nonlinéaire à processus l
- Page 77 and 78: Modèle nonlinéaire à processus l
- Page 79 and 80: Modèle nonlinéaire à processus l
- Page 81 and 82: Modèle nonlinéaire à processus l
- Page 83 and 84: Modèle nonlinéaire à processus l
- Page 85 and 86: Modèle nonlinéaire à processus l
- Page 87 and 88: Modèle nonlinéaire à processus l
- Page 89 and 90: Modèle nonlinéaire à processus l
- Page 91 and 92: Modèle nonlinéaire à processus l
- Page 93 and 94: Chapitre 4Modèle nonlinéaire à c
- Page 95 and 96: Modèle nonlinéaire à classes lat
- Page 97 and 98: Modèle nonlinéaire à classes lat
- Page 99 and 100: Modèle nonlinéaire à classes lat
- Page 101 and 102: Modèle nonlinéaire à classes lat
- Page 103 and 104: Modèle nonlinéaire à classes lat
- Page 105 and 106: Modèle nonlinéaire à classes lat
- Page 107 and 108: Modèle nonlinéaire à classes lat
- Page 109 and 110: Modèle nonlinéaire à classes lat
- Page 111 and 112: Modèle nonlinéaire à classes lat
- Page 113 and 114: Modèle nonlinéaire à classes lat
- Page 115 and 116: Modèle nonlinéaire à classes lat
- Page 117 and 118:
Modèle nonlinéaire à classes lat
- Page 119 and 120:
Modèle nonlinéaire à classes lat
- Page 121 and 122:
Modèle nonlinéaire à classes lat
- Page 123 and 124:
Modèle nonlinéaire à classes lat
- Page 125 and 126:
Modèle nonlinéaire à classes lat
- Page 127 and 128:
Modèle nonlinéaire à classes lat
- Page 129 and 130:
Modèle nonlinéaire à classes lat
- Page 131 and 132:
Modèle nonlinéaire à classes lat
- Page 133 and 134:
Modèle nonlinéaire à classes lat
- Page 135 and 136:
Modèle nonlinéaire à classes lat
- Page 137 and 138:
Modèle nonlinéaire à classes lat
- Page 139 and 140:
Modèle nonlinéaire à classes lat
- Page 141 and 142:
Modèle nonlinéaire à classes lat
- Page 143 and 144:
Modèle nonlinéaire à classes lat
- Page 145 and 146:
Modèle nonlinéaire à classes lat
- Page 147 and 148:
Modèle nonlinéaire à classes lat
- Page 149 and 150:
Chapitre 6Discussion et perspective
- Page 151 and 152:
Discussion et perspectives 151la pr
- Page 153 and 154:
Discussion et perspectives 153Varia
- Page 155 and 156:
Discussion et perspectives 1556.2 M
- Page 157 and 158:
Discussion et perspectives 157à cl
- Page 160 and 161:
Discussion et perspectives 160effet
- Page 162 and 163:
Discussion et perspectives 162Dans
- Page 164 and 165:
Chapitre 7BibliographieAmieva, H.,
- Page 166 and 167:
Bibliographie 1661, S19-25.Brown, E
- Page 168 and 169:
Bibliographie 168Sons, New-York.Fol
- Page 170 and 171:
Bibliographie 170Hogan, J. W. et La
- Page 172 and 173:
Bibliographie 172Lin, H., McCulloch
- Page 174 and 175:
Bibliographie 174Park, CA.Muthén,
- Page 176 and 177:
Bibliographie 176Schlattmann, P. (2
- Page 178 and 179:
Bibliographie 178in the random-effe
- Page 180 and 181:
Chapitre 8Annexes8.1 Liste des publ
- Page 182 and 183:
Annexes 182Pau (France)Communicatio
- Page 184 and 185:
Annexes 184166 C. Proust, H. Jacqmi
- Page 186 and 187:
Annexes 186168 C. Proust, H. Jacqmi
- Page 188 and 189:
Annexes 188170 C. Proust, H. Jacqmi
- Page 190 and 191:
Annexes 190172 C. Proust, H. Jacqmi
- Page 192 and 193:
AnnexesAmerican Journal of Epidemio
- Page 194 and 195:
Annexes 194Psychometric Tests’ Se
- Page 196 and 197:
Annexes 196Psychometric Tests’ Se
- Page 198 and 199:
Annexes 198Psychometric Tests’ Se
- Page 200 and 201:
Annexes 200Abstract :When investiga
- Page 202 and 203:
Annexes 202educated subjects. This
- Page 204 and 205:
Annexes 204(iii) the recognition fo
- Page 206 and 207:
Annexes 206Explanatory variablesIn
- Page 208 and 209:
Annexes 208IST15 (median=28, IQR=24
- Page 210 and 211:
Annexes 210and on the mean evolutio
- Page 212 and 213:
Annexes 212These findings should be
- Page 214 and 215:
Annexes 214Appendix : model specifi
- Page 216 and 217:
Annexes 216References[1] Amieva H,
- Page 218 and 219:
Annexes 218[19] Letenneur L, Commen
- Page 220 and 221:
Annexes 220Figure 1 : (A) Predicted
- Page 222 and 223:
Annexes 222Table 1: demographic and
- Page 224:
Annexes 224MMSE 0.0037 * 0.0013,0.0