14.02.2013 Views

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1.2. Uniqueness issues <strong>in</strong> <strong>in</strong>dependent component analysis 9<br />

f 1, f 2<br />

2<br />

1.5<br />

1<br />

0.5<br />

-2 -1.5 -1 -0.5<br />

-0.5<br />

0.5 1 1.5 2<br />

-1<br />

-1.5<br />

-2<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

f (As ) A − 1 f (As )<br />

0.5 1 1.5<br />

0.5 1 1.5<br />

1<br />

0.5<br />

0<br />

-0.5<br />

-1<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0<br />

0.2 0.4 0.6 0.8<br />

0.2 0.4 0.6 0.8<br />

Figure 1.3: Example of a non-trivial postnonl<strong>in</strong>ear transformation us<strong>in</strong>g an absolutely degenerate<br />

matrix A and <strong>in</strong> [0, 1] 2 uniform sources s. Both sources s and recovered sources A −1 f(As)<br />

have support <strong>in</strong> [0, 1] 2 , but A is not a permutation and scal<strong>in</strong>g.<br />

(Babaie-Zadeh, 2002). There, he already discussed identifiability issues, albeit explicitly only <strong>in</strong><br />

the two-dimensional analytic case.<br />

Postnonl<strong>in</strong>ear BSS is a generalization of l<strong>in</strong>ear BSS, so the <strong>in</strong>determ<strong>in</strong>acies of postnonl<strong>in</strong>ear<br />

ICA conta<strong>in</strong> at least the <strong>in</strong>determ<strong>in</strong>acies of l<strong>in</strong>ear ICA: A can only be reconstructed up to<br />

scal<strong>in</strong>g and permutation. Here of course additional <strong>in</strong>determ<strong>in</strong>acies come <strong>in</strong>to play because of<br />

translation: fi can only be recovered up to a constant. Also, if L ∈ Gl(n) is a scal<strong>in</strong>g matrix,<br />

then f(As) = (f ◦ L) � (L −1 A)s � , so f and A can <strong>in</strong>terchange scal<strong>in</strong>g factors <strong>in</strong> each component.<br />

Another <strong>in</strong>determ<strong>in</strong>acy could occur if A is not mix<strong>in</strong>g, i.e. at least one observation xi conta<strong>in</strong>s<br />

only one source; <strong>in</strong> this case fi can obviously not be recovered. For example if A equals the<br />

unit matrix I, then f(s) is already aga<strong>in</strong> <strong>in</strong>dependent, because <strong>in</strong>dependence is <strong>in</strong>variant under<br />

component-wise nonl<strong>in</strong>ear transformation; so f cannot be found us<strong>in</strong>g this method.<br />

A not so obvious <strong>in</strong>determ<strong>in</strong>acy occurs if A is absolutely degenerate, which essentially means<br />

that the normalized columns differ only by the signs of the entries (Theis and Gruber, 2005, def<strong>in</strong>ition<br />

6). Then only the matrix A but not the nonl<strong>in</strong>earities can be recovered by consider<strong>in</strong>g the<br />

edges of the support of the fully-bounded random vector as illustrated <strong>in</strong> figure 1.3. Nonetheless<br />

this is no <strong>in</strong>determ<strong>in</strong>acy of the model itself, s<strong>in</strong>ce A −1 f(As) is obviously not <strong>in</strong>dependent. So by<br />

look<strong>in</strong>g at the boundary alone, we sometimes cannot detect <strong>in</strong>dependence if the whole system<br />

is highly symmetric.<br />

If A is mix<strong>in</strong>g and not absolutely degenerate, then for all fully-bounded sources s no more<br />

<strong>in</strong>determ<strong>in</strong>acies than <strong>in</strong> the aff<strong>in</strong>e l<strong>in</strong>ear case exist, except for scal<strong>in</strong>g <strong>in</strong>terchange between f<br />

and A:<br />

Theorem 1.2.3 (Separability of bounded postnonl<strong>in</strong>ear BSS). Let A, W ∈ Gl(n) and one of<br />

them mix<strong>in</strong>g and not absolutely degenerate, h : R n → R n be a diagonal <strong>in</strong>jective analytic function<br />

such that h ′ i �= 0 and let s be a fully-bounded <strong>in</strong>dependent random vector. If also W� h(As) � is<br />

<strong>in</strong>dependent, then h is aff<strong>in</strong>e l<strong>in</strong>ear.<br />

1<br />

0.8<br />

0.6<br />

0.4<br />

0.2<br />

0

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!