14.02.2013 Views

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

180 Chapter 12. LNCS 3195:718-725, 2004<br />

Postnonl<strong>in</strong>ear overcomplete bl<strong>in</strong>d source separation us<strong>in</strong>g sparse sources 5<br />

the one used <strong>in</strong> the identifiability proof of theorem 3, also see figure 1. In the<br />

second stage, the mix<strong>in</strong>g matrix A and then the sources s are reconstructed<br />

by apply<strong>in</strong>g the l<strong>in</strong>ear algorithms from [8], section 1, to the l<strong>in</strong>earized mixtures<br />

f −1 x. So <strong>in</strong> the follow<strong>in</strong>g it is enough to reconstruct f.<br />

3.1 Geometrical preprocess<strong>in</strong>g<br />

Let x(1), . . . , x(T ) ∈ R m be i.i.d.-samples of the random vector x. The goal of geometrical<br />

preprocess<strong>in</strong>g is to construct vectors y(1), . . . , y(T ) and z(1), . . . , z(T ) ∈<br />

R m us<strong>in</strong>g cluster<strong>in</strong>g or <strong>in</strong>terpolation on the samples x(t) such that f −1 (y(t)) and<br />

f −1 (z(t)) lie <strong>in</strong> two l<strong>in</strong>early <strong>in</strong>dependent l<strong>in</strong>es of R m . In figure 1 they are to span<br />

the two thick l<strong>in</strong>es which already determ<strong>in</strong>e the postnonl<strong>in</strong>earities.<br />

Algorithmically, y and z can be constructed <strong>in</strong> the case m = 2 by first choos<strong>in</strong>g<br />

far away samples (on different ’non-opposite’ curves) as <strong>in</strong>itial start<strong>in</strong>g po<strong>in</strong>t<br />

and then advanc<strong>in</strong>g to the known data set center by always choos<strong>in</strong>g the closest<br />

samples of x with smaller modulus. Such an algorithm can also be implemented<br />

for larger m but only for sources with at most one non-zero coefficient at each<br />

time <strong>in</strong>stant, but it can be generalized to sources of sparseness m − 1 us<strong>in</strong>g more<br />

elaborate cluster<strong>in</strong>g.<br />

3.2 Postnonl<strong>in</strong>earity estimation<br />

Given the subspace vectors y(t) and z(t) from the previous section, the goal is<br />

to f<strong>in</strong>d C1-diffeomorphisms gi : R → R such that g1 × . . . × gm maps the vectors<br />

y(t) and z(t) onto two different l<strong>in</strong>ear subspaces.<br />

In abuse of notation, we now assume that two curves (<strong>in</strong>jective <strong>in</strong>f<strong>in</strong>itely<br />

differentiable mapp<strong>in</strong>gs) y, z : (−1, 1) → Rm are given with y(0) = z(0) = 0.<br />

These can for example be constructed from the discrete sample po<strong>in</strong>ts y(t) and<br />

z(t) from the previous section by polynomial or spl<strong>in</strong>e <strong>in</strong>terpolation. If the two<br />

curves are mapped onto l<strong>in</strong>es by g1 × . . . × gm (and if these are <strong>in</strong> sufficiently<br />

general position) then gi = λif −1<br />

i for some λ �= 0 accord<strong>in</strong>g to theorem 3. By<br />

requir<strong>in</strong>g this condition only for the discrete sample po<strong>in</strong>ts from the previous<br />

section we get an approximation of the unmix<strong>in</strong>g nonl<strong>in</strong>earities gi. Let i �= j be<br />

fixed. It is then easy to see that by project<strong>in</strong>g x, y and z onto the i-th and j-th<br />

coord<strong>in</strong>ate, the problem of f<strong>in</strong>d<strong>in</strong>g the nonl<strong>in</strong>earities can be reduced to the case<br />

m = 2, and g2 is to be reconstructed, which we will assume <strong>in</strong> the follow<strong>in</strong>g.<br />

A is chosen to be mix<strong>in</strong>g, so we can assume that the <strong>in</strong>dices i, j were chosen<br />

such that the two l<strong>in</strong>es f −1 ◦ y, f −1 ◦ z : (−1, 1) → R 2 do not co<strong>in</strong>cide with<br />

the coord<strong>in</strong>ate axes. Reparametrization (¯y := y ◦ y −1<br />

1<br />

) of the curves lets us<br />

further assume that y1 = z1 = id. Then after some algebraic manipulation, the<br />

condition that the separat<strong>in</strong>g nonl<strong>in</strong>earities g = g1 × g2 must map y and z onto<br />

l<strong>in</strong>es can be written as g2 ◦ y2 = ag1 = a<br />

b g2 ◦ z2 with constants a, b ∈ R \ {0},<br />

a �= ±b.<br />

So the goal of geometrical postnonl<strong>in</strong>earity detection is to f<strong>in</strong>d a C 1 -diffeomorphism<br />

g on subsets of R with<br />

g ◦ y = cg ◦ z (3)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!