20.04.2013 Views

advanced spectral methods for climatic time series - Atmospheric ...

advanced spectral methods for climatic time series - Atmospheric ...

advanced spectral methods for climatic time series - Atmospheric ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3-8 ● Ghil et al.: CLIMATIC TIME SERIES ANALYSIS 40, 1 / REVIEWS OF GEOPHYSICS<br />

C X k k k. (7)<br />

The eigenvalue k equals the partial variance in the<br />

direction k, and the sum of the k, i.e., the trace of C X,<br />

gives the total variance of the original <strong>time</strong> <strong>series</strong> X(t).<br />

[42] An equivalent <strong>for</strong>mulation of (7), which will<br />

prove useful further on, is given by <strong>for</strong>ming the M M<br />

matrix E X that has the eigenvectors k as its columns and<br />

the diagonal matrix X whose elements are the eigenvalues<br />

k, in decreasing order:<br />

E X t CXE X X; (8)<br />

t<br />

here EX is the transpose of EX. Each eigenvalue k gives<br />

the degree of extension, and hence the variance, of the<br />

<strong>time</strong> <strong>series</strong> in the direction of the orthogonal eigenvector<br />

k. [43] A slightly different approach to computing the<br />

eigenelements of CX was originally proposed by Broomhead<br />

and King [1986a]. They constructed the N M<br />

trajectory matrix D that has the N augmented vectors<br />

X˜(t) as its rows and used singular-value decomposition<br />

(SVD) [see, <strong>for</strong> instance, Golub and Van Loan, 1996] of<br />

C X 1<br />

N Dt D (9)<br />

to obtain the square roots of k. The latter are called the<br />

singular values of D and have given SSA its name.<br />

[44] Allen and Smith [1996] and Ghil and Taricco<br />

[1997] have discussed the similarities and differences<br />

between the approaches of Broomhead and King [1986a]<br />

and Vautard and Ghil [1989] in computing the eigenelements<br />

associated in SSA with a given <strong>time</strong> <strong>series</strong> X(t).<br />

Both the Toeplitz estimate (6) and the SVD estimate (9)<br />

lead to a symmetric covariance matrix C X. In addition,<br />

the eigenvectors k of a Toeplitz matrix are necessarily<br />

odd and even, like the sines and cosines of classical<br />

Fourier analysis. The Toeplitz approach has the advantage<br />

of better noise reduction when applied to short <strong>time</strong><br />

<strong>series</strong>, as compared with the SVD approach. This advantage<br />

comes at the price of a slightly larger bias when<br />

the <strong>time</strong> <strong>series</strong> is strongly nonstationary over the interval<br />

of observation 1 t N [Allen, 1992]. Such biasversus-variance<br />

trade-offs are common in estimation<br />

problems.<br />

[45] To obtain S/N separation, one plots the eigenvalue<br />

spectrum illustrated in Figure 3. In this plot an<br />

initial plateau that contains most of the signal is separated<br />

by a steep slope from the noise; the latter is<br />

characterized by much lower values that <strong>for</strong>m a flat floor<br />

or a mild slope [Kumaresan and Tufts, 1980; Pike et al.,<br />

1984; Vautard and Ghil, 1989].<br />

[46] As the M M matrix C X is symmetric, standard<br />

algorithms [Press et al., 1988] will per<strong>for</strong>m its <strong>spectral</strong><br />

decomposition efficiently, as long as M is not too large.<br />

The choice of M is based on a trade-off between two<br />

considerations: quantity of in<strong>for</strong>mation extracted versus<br />

the degree of statistical confidence in that in<strong>for</strong>mation.<br />

Figure 3. Singular spectrum of the SOI <strong>time</strong> <strong>series</strong>. The<br />

eigenvalues are plotted, as usual, in decreasing order. The<br />

embedding dimension <strong>for</strong> Figures 3–7 is M 60, which<br />

represents a <strong>time</strong> window of 5 years. The error bars are based<br />

on the default option of the singular spectrum analysis (SSA)<br />

multitaper method (MTM) Toolkit (http://www.atmos.ucla.edu/tcd/ssa/).<br />

This option uses the ad hoc estimate of variance<br />

k of an eigenvalue k given by k (2/Nˆ ) k. The key<br />

element of this estimate is the number Nˆ of independent<br />

degrees of freedom (DOF) in the <strong>time</strong> <strong>series</strong>. Vautard and Ghil<br />

[1989], who first proposed this estimate in SSA, used Nˆ N,<br />

which is very optimistic, while Ghil and Mo [1991a] used Nˆ <br />

N/M, which is too conservative. The Toolkit’s current Version<br />

4.0, following Unal and Ghil [1995], uses Nˆ N/, where is<br />

the <strong>time</strong> <strong>series</strong>’ estimated decorrelation <strong>time</strong> (see equations<br />

(15) and (16)) and is an empirical constant, 1.5.<br />

The <strong>for</strong>mer requires as wide a window as possible, i.e., a<br />

large M, while the latter requires as many repetitions of<br />

the features of interest as possible, i.e., as large a ratio<br />

N/M as possible. The choice of M 60 in Figure 3<br />

allows us to capture periodicities as long as 5 years, since<br />

t 1 month, and thus the dimensional window width is<br />

Mt 5 years; on the other hand, N/M 11 is fairly<br />

safe, and the diagonalization of C X <strong>for</strong> this moderate<br />

value of M does not introduce large numerical errors<br />

either.<br />

[47] In Figure 3, there is a clear grouping of the first<br />

five eigenvalues, followed by a very steep slope of three<br />

additional eigenvalues. The latter are well separated<br />

from the first five, as well as from the remaining 52<br />

eigenvalues, which <strong>for</strong>m the mildly sloping and flattening<br />

out “tail” of the SSA spectrum.<br />

[48] The S/N separation obtained by merely inspecting<br />

the slope break in a “scree diagram” of eigenvalues<br />

k or singular values k 1/2 versus k works well when the<br />

intrinsic noise that perturbs the underlying deterministic<br />

system and the extrinsic noise that affects the observations<br />

are both white, i.e., uncorrelated from one <strong>time</strong><br />

step to the next (see definition of (t) in equation (1)).<br />

This rudimentary separation works less well when either

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!