Estimating the covariance function with functional data - Statistics ...

More documents

Recommendations

Info

248 Sik-Yum Lee et al. important contributions of Ramsay (1982) and Besse and Ramsay (1986), there are very few publications concerned with theoretical developments or applications in the psychometric literature. One reason may be the highly technical nature of the required statistical and mathematical background knowledge associated with existing methods. The main objective of this paper is to propose a two-step procedure for estimating the covariance function with functional data as a non-technical complement to the work cited above. We calculate the raw estimates of the eigenfunctions via the standard principal components method in multivariate analysis, and then obtain smooth estimates of the eigenfunctions and eigenvalues via a one-dimensional smoothing technique. Hence, the proposed procedure is simple to understand and easy to implement. In this paper, we will use the local polynomial approach (see Cleveland, 1979; Ruppert & Wand, 1994) to complete the second step. This choice is motivated by its nice properties; for example, it is highly intuitive and simple to implement (Fan & Marron, 1994), achieves automatic boundary correction and possesses certain important optimal properties (Cheng, Fan, & Marron, 1997), as well as good empirical performance (Fan & Gijbels, 1996; Fan &Zhang, 1999). However, we emphasize that standard nonparametric methods, such as spline smoothing or cross-validation, can be applied. The paper is organized as follows. The motivation for our method is given in Section 2. In Section 3, we propose a two-step procedure which applies local polynomial tting to estimate the covariance function, its eigenfunctions and eigenvalues. In Section 4, the results of a simulation and three real examples are presented to illustrate the empirical performance of the proposed method. A discussion given in Section 5. 2. Motivation First consider a random sample of multivariate data from a population with mean zero and covariance matrix S. The classical statistical inference on S is based on the sample covariance matrix. Since S is symmetric and positive denite, we have the following orthogonal expansion: S = Xp i = 1 l i a i a T i , (1) where l 1 $ . . . $ l p $ 0 are the eigenvalues of S, and a i = (a 1i , . . . , a pi ) T is the normalized eigenvector corresponding to the eigenvalue l i . Hence, S is determined by l 1 , . . . , l p and a 1 , . . . , a p . In particular, the (i, j )th element of S is given by j i j = Xp k = 1 l k a i k a j k . (2) In addition, we have the following decomposition on the corresponding random vector X: X = Xp i = 1 a i y i , (3) where y 1 , . . . , y p are uncorrelated random variables with zero mean and variances l 1 , . . . , l p respectively. It is well known that the decomposition (3) is not unique and is not identiable. Now consider the situation with functional data, where we have a univariate stochastic process X(t) and the data are curves. Without loss of generality, we assume
$ $ $ E(X(t)) = 0. Let n(u , v) = Cov(X(u), X(v)) be the covariance function. Viewing the random function X(t) as a vector with innite dimension and tracing the idea behind of (2.1) and (2.2), it is natural to impose the following condition on n(u , v): there exists a series orthonormal functions f 1 ( · ), f 2 ( · ), . . . and m 1 m 2 . . . 0, such that the covariance function n(u , v) is given by n(u , v) = X¥ i = 1 m i f i (u)f i (v), (4) (see Loeve, 1963). Here the m i play the role of l i , and the f i ( · ) play the role of elements in a i . Moreover, if f i ( · ) and m i , i = 1, 2, . . . , satisfy (4), we have a similar decomposition on X(t) as in (2.3). More specically, X(t) = X¥ i = 1 á X, f i ñ f i (t) = X¥ i = 1 h i f i (t), (5) where á f , gñ = „ f (t)g(t) dt. For i Þ j, and under some regularity conditions, we have … … ¼ … … Eh i h j = E X(t)f i (t) dt X(t)f j (t) dt = EfX(t)X(s)g f i (t)f j (s) dt ds = = X¥ … … n(t, s)f i (t)f j (s) dt ds = X¥ k = 1 k = 1 … … m k f k (t)f i (t) dt f k (s)f j (s) ds = 0, m k … … f k (t)f k (s)f i (t)f j (s) dt ds and Eh i = 0, so Cov(h i , h j ) = 0. This implies that h 1 , h 2 , . . . are uncorrelated. Similarly, for each h i , Var(h i ) = X¥ … … m k f k (t)f i (t) dt f k (s)f i (s) ds = m i . k = 1 Estimating the covariance function with functional data 249 Hence, (5) can be regarded as an extension of (3). Similarly to the multivariate case, the representations in (4) and (5) are neither unique nor identiable. If, for any i Þ j, m i Þ m j , then f i ( · ) is identiable, except for the change of sign. Our development is not hindered by the non-identication of (5), because our interest is in how to estimate n(u , v) and how to nd the orthonormal functions f 1 ( · ), f 2 ( · ), . . . that satisfy (4), from the observed functional data. We will call these orthonormal functions f 1 ( · ), f 2 ( · ), . . . the eigenfunctions, and the corresponding m 1 , m 2 , . . . the eigenvalues of n(u , v). 3. A two-step estimation method First, let us motivate our method with a p-dimensional random vector X. Let ˆl 1 $ . . . $ ˆl p be the eigenvalues of the sample covariance matrix and â i = (â 1i , . . . , â pi ) T be the eigenvector corresponding to ˆl i . An estimate of the (i, j )th element of S is given by ĵ i j = Xp k = 1 ˆl k â i k â j k . The idea will be used to handle functional data as follows.
Page 1: 247 British Journal of Mathematical
Page 5 and 6: ( ˆb 1 j , t 1 ), . . . , ( ˆb n
Page 7 and 8: pThe truep covariancep function p
Page 9 and 10: Estimating the covariance function
Page 15: Ramsay, J. O., & Silverman, B. W. (

Estimating the covariance function with functional data - Statistics ...

Create successful ePaper yourself

Delete template?

Save as template?