14.02.2013 Views

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

86 Chapter 3. Signal Process<strong>in</strong>g 84(5):951-956, 2004<br />

the preced<strong>in</strong>g section—here dependence is allowed<br />

between real-valued couples of random variables.<br />

Let k; n ∈ N such that k divides n. We call an<br />

n-dimensional random vector Y k-<strong>in</strong>dependent if the<br />

k-dimensional random vectors<br />

⎛ ⎞ ⎛ ⎞<br />

⎜<br />

⎝<br />

Y1<br />

.<br />

Yk<br />

⎟<br />

⎠ ;:::;<br />

⎜<br />

⎝<br />

Yn−k+1<br />

.<br />

Yn<br />

⎟<br />

⎠<br />

are mutually <strong>in</strong>dependent. A matrix W ∈ Gl(n; R) is<br />

called a k-multidimensional ICA of an n-dimensional<br />

random vector X if WX is k-<strong>in</strong>dependent. If k =1,<br />

this is the same as ord<strong>in</strong>ary ICA.<br />

Obvious <strong>in</strong>determ<strong>in</strong>acies are, similar to ord<strong>in</strong>ary<br />

ICA, <strong>in</strong>vertible transforms <strong>in</strong> Gl(k; R) <strong>in</strong> each tuple<br />

as well as the fact that the order of the <strong>in</strong>dependent<br />

k-tuples is not xed. So, de ne for r; s =1;:::;n=k<br />

the (r; s) sub-k-matrix of W =(wij) tobethek × k<br />

submatrix<br />

(wij) i=rk; :::; rk+k−1<br />

j=sk; :::; sk+k−1<br />

that is the k × k submatrix of W start<strong>in</strong>g at position<br />

(rk; sk). A matrix L ∈ Gl(n; R) is said to be a k-scal<strong>in</strong>g<br />

and permutation matrix if for each r=1;:::;n=kthere<br />

exists precisely one s with the (r; s) sub-k-matrix of<br />

L to be nonzero, and such that this submatrix is <strong>in</strong><br />

Gl(k; R), and if for each s=1;:::;n=kthere exists only<br />

one r with the (r; s) sub-k-matrix satisfy<strong>in</strong>g the same<br />

condition. Hence, if Y is k-<strong>in</strong>dependent, also LY is<br />

k-<strong>in</strong>dependent.<br />

Two matrices A and B are said to be k-equivalent,<br />

A ∼k B, if there exists such a k-scal<strong>in</strong>g and permutation<br />

matrix L with A=BL. As stated above, given two<br />

matrices W and V with W −1 ∼k V −1 such that one of<br />

them is a k-multidimensional ICA of a given random<br />

vector, then so is the other. We will show that there are<br />

no more <strong>in</strong>determ<strong>in</strong>acies of multidimensional ICA.<br />

As usual multidimensional ICA can solve the multidimensional<br />

BSS problem<br />

X = AS;<br />

where A ∈ Gl(n; R) and S is a k-<strong>in</strong>dependent<br />

n-dimensional random vector. F<strong>in</strong>d<strong>in</strong>g the <strong>in</strong>determ<strong>in</strong>acies<br />

of multidimensional ICA then shows that<br />

A can be found except for k-equivalence (separability),<br />

because if X = AS and W is a demix<strong>in</strong>g matrix<br />

F.J. Theis / Signal Process<strong>in</strong>g 84 (2004) 951 – 956 955<br />

such that WX is k-<strong>in</strong>dependent, then WA ∼k I, so<br />

W−1 ∼k A as desired.<br />

However, for the proof we need one more condition<br />

for A: We call A k-admissible if for each r; s =<br />

1;:::;n=kthe (r; s) sub-k-matrix of A is either <strong>in</strong>vertible<br />

or zero. Note that this is not a strong restriction—<br />

if we randomly choose A with coe cients out of a cont<strong>in</strong>uous<br />

distribution, then with probability one we get<br />

a k-admissible matrix, because the non-k-admissible<br />

matrices ⊂ Rn2 lie <strong>in</strong> a submanifold of dimension<br />

smaller than n2 .<br />

Theorem 5.1 (Separability of multidimensional<br />

BSS): Let A ∈ Gl(n; R) and S a k-<strong>in</strong>dependent<br />

n-dimensional random vector hav<strong>in</strong>g no Gaussian<br />

k-tuple (Srk;:::;Srk+k−1) T . Assume that A is<br />

k-admissible.<br />

If AS is aga<strong>in</strong> k-<strong>in</strong>dependent, then A is k-equivalent<br />

to the identity.<br />

For the case k = 1 this is l<strong>in</strong>ear BSS separability<br />

because every matrix is 1-admissible.<br />

Proof. Denote X := AS. Assume that A k I.<br />

Then there exist <strong>in</strong>dices r1;r2 and s such that<br />

the (r1;s) and the (r2;s) sub-k-matrices of A are<br />

non-zero (hence <strong>in</strong> Gl(k; R) by k-admissability).<br />

Apply<strong>in</strong>g Corollary 3.3 to the two random vectors<br />

(Xr1k;:::;Xr1k+k−1) T and (Xr2k;:::;Xr2k+k−1) T then<br />

shows that (Ssk;:::;Ssk+k−1) T is Gaussian, which is a<br />

contradiction.<br />

Note that we could have used whiten<strong>in</strong>g to assume<br />

that A is orthogonal; however there does not seem to<br />

be a direct way to exploit this <strong>in</strong> order to allow one<br />

fully Gaussian k-tuple, contrary to the complex ICA<br />

case, see Theorem 4.1.<br />

6. Conclusion<br />

Uniqueness and separability results play a central<br />

role <strong>in</strong> solv<strong>in</strong>g BSS problems s<strong>in</strong>ce they allow algorithms<br />

to apply ICA <strong>in</strong> order to uniquely (except for<br />

scal<strong>in</strong>g and permutation) nd the orig<strong>in</strong>al mix<strong>in</strong>g matrices.<br />

We have used a multidimensional version of the<br />

Skitovitch–Darmois theorem <strong>in</strong> order to calculate the

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!