Mathematics in Independent Component Analysis
Mathematics in Independent Component Analysis
Mathematics in Independent Component Analysis
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
86 Chapter 3. Signal Process<strong>in</strong>g 84(5):951-956, 2004<br />
the preced<strong>in</strong>g section—here dependence is allowed<br />
between real-valued couples of random variables.<br />
Let k; n ∈ N such that k divides n. We call an<br />
n-dimensional random vector Y k-<strong>in</strong>dependent if the<br />
k-dimensional random vectors<br />
⎛ ⎞ ⎛ ⎞<br />
⎜<br />
⎝<br />
Y1<br />
.<br />
Yk<br />
⎟<br />
⎠ ;:::;<br />
⎜<br />
⎝<br />
Yn−k+1<br />
.<br />
Yn<br />
⎟<br />
⎠<br />
are mutually <strong>in</strong>dependent. A matrix W ∈ Gl(n; R) is<br />
called a k-multidimensional ICA of an n-dimensional<br />
random vector X if WX is k-<strong>in</strong>dependent. If k =1,<br />
this is the same as ord<strong>in</strong>ary ICA.<br />
Obvious <strong>in</strong>determ<strong>in</strong>acies are, similar to ord<strong>in</strong>ary<br />
ICA, <strong>in</strong>vertible transforms <strong>in</strong> Gl(k; R) <strong>in</strong> each tuple<br />
as well as the fact that the order of the <strong>in</strong>dependent<br />
k-tuples is not xed. So, de ne for r; s =1;:::;n=k<br />
the (r; s) sub-k-matrix of W =(wij) tobethek × k<br />
submatrix<br />
(wij) i=rk; :::; rk+k−1<br />
j=sk; :::; sk+k−1<br />
that is the k × k submatrix of W start<strong>in</strong>g at position<br />
(rk; sk). A matrix L ∈ Gl(n; R) is said to be a k-scal<strong>in</strong>g<br />
and permutation matrix if for each r=1;:::;n=kthere<br />
exists precisely one s with the (r; s) sub-k-matrix of<br />
L to be nonzero, and such that this submatrix is <strong>in</strong><br />
Gl(k; R), and if for each s=1;:::;n=kthere exists only<br />
one r with the (r; s) sub-k-matrix satisfy<strong>in</strong>g the same<br />
condition. Hence, if Y is k-<strong>in</strong>dependent, also LY is<br />
k-<strong>in</strong>dependent.<br />
Two matrices A and B are said to be k-equivalent,<br />
A ∼k B, if there exists such a k-scal<strong>in</strong>g and permutation<br />
matrix L with A=BL. As stated above, given two<br />
matrices W and V with W −1 ∼k V −1 such that one of<br />
them is a k-multidimensional ICA of a given random<br />
vector, then so is the other. We will show that there are<br />
no more <strong>in</strong>determ<strong>in</strong>acies of multidimensional ICA.<br />
As usual multidimensional ICA can solve the multidimensional<br />
BSS problem<br />
X = AS;<br />
where A ∈ Gl(n; R) and S is a k-<strong>in</strong>dependent<br />
n-dimensional random vector. F<strong>in</strong>d<strong>in</strong>g the <strong>in</strong>determ<strong>in</strong>acies<br />
of multidimensional ICA then shows that<br />
A can be found except for k-equivalence (separability),<br />
because if X = AS and W is a demix<strong>in</strong>g matrix<br />
F.J. Theis / Signal Process<strong>in</strong>g 84 (2004) 951 – 956 955<br />
such that WX is k-<strong>in</strong>dependent, then WA ∼k I, so<br />
W−1 ∼k A as desired.<br />
However, for the proof we need one more condition<br />
for A: We call A k-admissible if for each r; s =<br />
1;:::;n=kthe (r; s) sub-k-matrix of A is either <strong>in</strong>vertible<br />
or zero. Note that this is not a strong restriction—<br />
if we randomly choose A with coe cients out of a cont<strong>in</strong>uous<br />
distribution, then with probability one we get<br />
a k-admissible matrix, because the non-k-admissible<br />
matrices ⊂ Rn2 lie <strong>in</strong> a submanifold of dimension<br />
smaller than n2 .<br />
Theorem 5.1 (Separability of multidimensional<br />
BSS): Let A ∈ Gl(n; R) and S a k-<strong>in</strong>dependent<br />
n-dimensional random vector hav<strong>in</strong>g no Gaussian<br />
k-tuple (Srk;:::;Srk+k−1) T . Assume that A is<br />
k-admissible.<br />
If AS is aga<strong>in</strong> k-<strong>in</strong>dependent, then A is k-equivalent<br />
to the identity.<br />
For the case k = 1 this is l<strong>in</strong>ear BSS separability<br />
because every matrix is 1-admissible.<br />
Proof. Denote X := AS. Assume that A k I.<br />
Then there exist <strong>in</strong>dices r1;r2 and s such that<br />
the (r1;s) and the (r2;s) sub-k-matrices of A are<br />
non-zero (hence <strong>in</strong> Gl(k; R) by k-admissability).<br />
Apply<strong>in</strong>g Corollary 3.3 to the two random vectors<br />
(Xr1k;:::;Xr1k+k−1) T and (Xr2k;:::;Xr2k+k−1) T then<br />
shows that (Ssk;:::;Ssk+k−1) T is Gaussian, which is a<br />
contradiction.<br />
Note that we could have used whiten<strong>in</strong>g to assume<br />
that A is orthogonal; however there does not seem to<br />
be a direct way to exploit this <strong>in</strong> order to allow one<br />
fully Gaussian k-tuple, contrary to the complex ICA<br />
case, see Theorem 4.1.<br />
6. Conclusion<br />
Uniqueness and separability results play a central<br />
role <strong>in</strong> solv<strong>in</strong>g BSS problems s<strong>in</strong>ce they allow algorithms<br />
to apply ICA <strong>in</strong> order to uniquely (except for<br />
scal<strong>in</strong>g and permutation) nd the orig<strong>in</strong>al mix<strong>in</strong>g matrices.<br />
We have used a multidimensional version of the<br />
Skitovitch–Darmois theorem <strong>in</strong> order to calculate the