12.07.2015 Views

Télécharger le texte intégral - ISPED-Enseignement à distance

Télécharger le texte intégral - ISPED-Enseignement à distance

Télécharger le texte intégral - ISPED-Enseignement à distance

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Modè<strong>le</strong> nonlinéaire <strong>à</strong> processus latent 67Nonlinear Model with Latent Process for Cognitive Evolution 1015major improvements in SEM (Sánchez et al., 2005). Theseinclude (i) to hand<strong>le</strong> clustered or repeated data (Longfordand Muthén, 1992; Dunson, 2003; Rabe-Haseketh et al., 2004;Skrondal and Rabe-Hesketh, 2004; Song and Lee, 2004), (ii)to allow mixture of count, ordinal, and dichotomous outcomes(Dunson, 2003; Lee and Song, 2004; Rabe-Haskethet al., 2004), (iii) to relax linearity of the relationship betweenthe latent variab<strong>le</strong>s by using nonlinear structural models(Jöreskog and Yang, 1996; Arminger and Muthén, 1998;Wall and Amemiya, 2000; Lee and Song, 2004; Song and Lee,2004), and (iv) to relax linearity between the continuous responsesand the latent variab<strong>le</strong>s (Yalcin and Amemiya, 2001).Our modeling approach differs in a number of ways. First,we focus on the change over time of a sing<strong>le</strong> common latentprocess, whi<strong>le</strong> the main interest of SEM lies in the relationshipbetween several latent variab<strong>le</strong>s. Moreover, when dealingwith quantitative outcomes, SEM generally assumes aGaussian or a Poisson distribution for the outcomes. Exceptfor threshold models for ordinal data (Dunson, 2003; Lee andSong, 2004; Rabe-Hesketh et al., 2004), when nonlinear transformationslink the latent variab<strong>le</strong>s and the outcomes, theydo not depend on parameters to be estimated. As thresholdmodels are not appropriate for quantitative scores withmany possib<strong>le</strong> values, we estimate the shape of the transformationsby using parameterized nonlinear functions. Finally,our model includes a continuous-time latent process; this givesa description of the evolution of the latent cognitive <strong>le</strong>vel forall times in the range of the observations and furthermore,it can easily hand<strong>le</strong> data where the number and times ofthe observations are different for each subject and for eachoutcome.Nonlinearity in SEM either in the structural model or in therelationship between observed outcomes and latent variab<strong>le</strong>srequires the development of suitab<strong>le</strong> estimation methods. Formodels including products of latent variab<strong>le</strong>s, Jöreskog andYang (1996) proposed a frequentist approach based on themaximization of the likelihood, whi<strong>le</strong> Arminger and Muthén(1998) proposed a Bayesian approach using a Markov chainMonte Carlo (MCMC) algorithm. For models with nonlinearrelationships between the responses and the latent variab<strong>le</strong>s,Yalcin and Amemiya (2001) proposed to compute a quadraticapproximation of the nonlinear transformations, and thenmaximized the approximate likelihood. In contrast, to hand<strong>le</strong>the nonlinear relationships between the responses and thelatent process, we propose to maximize the exact likelihoodof the observed data, which is a product of the likelihood ofthe transformed data (the transformed data are multivariateGaussian in our model) and the Jacobian of the nonlineartransformations.The main characteristics of our methodology can be summarizedas follows:(a) it can be applied to multivariate longitudinal non-Gaussian quantitative outcomes;(b) it can study the evolution of a continuous-time latentprocess representing the common factor across all theoutcomes;(c) it can estimate the shape of the transformations linkingthe quantitative outcomes and the underlying latentprocess;(d) it can hand<strong>le</strong> any type of unbalanced data (number andtime of measurements, covariates, ...) and missing atrandom data;(e) it can estimate impact of covariates on both the latentprocess and the observed outcomes.The next section focuses on the formulation of the modelfor the latent process and the outcomes on the parameterizednonlinear transformations. Section 3 is devoted to maximumlikelihood estimation (MLE). In Section 4 we discuss goodnessof fit and Section 5 focuses on an application of the methodto data from the French prospective cohort study PAQUID(Letenneur et al., 1994).2. Methodology2.1 The Latent Process: Structural ModelConsider the continuous-time latent process Λ i =(Λ i (t)) t≥0representing the common cognitive factor for individual i withi =1,..., N. Λ i is defined at time t, t ∈ R + according to alinear mixed model,Λ i (t) =X 1i (t) T β + Z i (t) T u i + σ w w i (t), t ≥ 0, (1)where X 1i (t) istheq 1 vector of time-dependent covariates associatedwith the vector of fixed effects β. The (p + 1) vectorZ i (t) = (1, t, ..., t p ) T is a time polynomial of degree p (orany vector of functions of time) and the vector of randomeffects at subject <strong>le</strong>vel u i ∼ N(μ, D), where D is an unstructuredpositive definite matrix. The process w i =(w i (t)) t≥0 isa standard Brownian motion; w i (t) models local variation anddeparture from the polynomial trend whi<strong>le</strong> the random effectsaccount for the variability of the trend across the subjects. Noindependent error is added because this latent process is assumedto represent the actual cognition in continuous time.Note that the linearity in β or in the covariates is not crucial.Any function of time could be included in the model, becausethe model is still linear in the random effects, to ensure thenormality of the latent process. Moreover, the Brownian motionalso adds f<strong>le</strong>xibility to the parametric function of time.2.2 The Measurement ModelNow consider K quantitative outcomes. Each outcome couldbe an individual psychometric test, or the sum of scores froman itemized test. For subject i and outcome k, we observe then ik vector of measurements y ik =(y i1k ,...,y ijk ,...,y inik k) T ,where y ijk is the score of subject i at occasion j for test k. Thenumber and times of measurements may be comp<strong>le</strong>tely differentfor each subject and each outcome. In the spirit of latentgrowth curve modeling (Muthén, 2002) and SEM (Yalcin andAmemiya, 2001), we assume that this measurement y ijk is relatedto the latent process at time t ijk through the followingf<strong>le</strong>xib<strong>le</strong> model,g k (y ijk ; η k )=ỹ ijk =Λ i (t ijk )+α ik + X 2i (t ijk ) T γ k + ɛ ijk , (2)where the function g k comes from a family of nonlinear transformationsG depending on the vector of parameters η k , whichwill be estimated; the random effects α ik are independentlydistributed according to an N(0, σ 2 α k) distribution; the vectorsX 2i (t ijk ) and γ k are, respectively, a q 2 vector of timedependentcovariates and the associated vector of contrasts

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!