06.09.2021 Views

Linear Algebra, Theory And Applications, 2012a

Linear Algebra, Theory And Applications, 2012a

Linear Algebra, Theory And Applications, 2012a

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

326 SELF ADJOINT OPERATORS<br />

This part follows fairly easily from taking the ln and then setting partial derivatives equal to<br />

0. The estimation of Σ is harder. However, it is not too hard using the theorems presented<br />

above. I am following a nice discussion given in Wikipedia. It will make use of Theorem<br />

7.5.3 on the trace as well as the theorem about the square root of a linear transformation<br />

given above. First note that by Theorem 7.5.3,<br />

(x i −m) ∗ Σ −1 (x i −m) = trace ( (x i −m) ∗ Σ −1 (x i −m) )<br />

= trace ( (x i −m)(x i −m) ∗ Σ −1)<br />

Therefore, the thing to maximize is<br />

n∏<br />

(<br />

1<br />

det (Σ) exp − 1 1/2 2 trace ( (x i −m)(x i −m) ∗ Σ −1))<br />

i=1<br />

= det ( Σ −1) n/2<br />

exp<br />

(<br />

− 1 n<br />

)<br />

2 trace ∑<br />

(x i −m)(x i −m) ∗ Σ −1<br />

⎛<br />

S<br />

⎞<br />

{ }} {<br />

= det ( Σ −1) n/2 n∑<br />

exp ⎜<br />

⎝ −1 2 trace (x i −m)(x i −m) ∗ Σ −1 ⎟<br />

⎠<br />

i=1<br />

i=1<br />

≡ det ( Σ −1) n/2<br />

exp<br />

(<br />

− 1 2 trace ( SΣ −1))<br />

where S is the p × p matrix indicated above. Now S is symmetric and has eigenvalues which<br />

are all nonnegative because (Sy, y) ≥ 0. Therefore, S has a unique self adjoint square root.<br />

Using Theorem 7.5.3 again, the above equals<br />

det ( Σ −1) (<br />

n/2<br />

exp − 1 (<br />

2 trace S 1/2 Σ −1 S 1/2))<br />

Let B = S 1/2 Σ −1 S 1/2 and assume det (S) ≠0. Then Σ −1 = S −1/2 BS −1/2 .<br />

equals<br />

det ( S −1) det (B) n/2 exp<br />

(− 1 )<br />

2 trace (B)<br />

The above<br />

Of course the thing to estimate is only found in B. Therefore, det ( S −1) can be discarded<br />

in trying to maximize things. Since B is symmetric, it is similar to a diagonal matrix D<br />

which has λ 1 , ··· ,λ n down the diagonal. Thus it is desired to maximize<br />

( p∏<br />

) n/2 ( )<br />

λ i exp − 1 p∑<br />

λ i<br />

2<br />

i=1<br />

i=1<br />

Taking ln it follows that it suffices to maximize<br />

n<br />

p∑<br />

ln λ i − 1 2<br />

2<br />

i=1<br />

Taking the derivative with respect to λ i ,<br />

n<br />

2<br />

p∑<br />

i=1<br />

1<br />

− 1 λ i 2 =0<br />

λ i

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!