14.02.2013 Views

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

Mathematics in Independent Component Analysis

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

48 Chapter 1. Statistical mach<strong>in</strong>e learn<strong>in</strong>g of biomedical data<br />

components <strong>in</strong> digitized section images. First, a so-called cell classifier was tra<strong>in</strong>ed with cell and<br />

non-cell patches us<strong>in</strong>g s<strong>in</strong>gle- and multi-layer perceptrons as well as unsupervised <strong>in</strong>dependent<br />

component analysis with correlation comparison. In order to account for a larger variety of cell<br />

shapes, a directional normalization approach is proposed. The cell classifier can then be used <strong>in</strong><br />

an arbitrary number of sections by scann<strong>in</strong>g the section and choos<strong>in</strong>g maxima of this classifier<br />

as cell center locations. This is illustrated us<strong>in</strong>g a toy example <strong>in</strong> figure 1.25. A flow-chart with<br />

the basic segmentation setup is shown <strong>in</strong> figure 1.26.<br />

ZANE was successfully applied to measure neurogenesis <strong>in</strong> adult rodent bra<strong>in</strong> sections, where<br />

we showed that the proliferation of neurons is substantially stronger (340%) <strong>in</strong> the dentate gyrus<br />

of an epileptic mouse than <strong>in</strong> a control group. When compar<strong>in</strong>g the count<strong>in</strong>g result with manual<br />

counts, the mean ZANE classification rate is 90% of all (manually detected) cells; this was with<strong>in</strong><br />

the error bounds of a perfect count, s<strong>in</strong>ce manual counts by different experts varied by roughly<br />

10% themselves (Theis et al., 2004b).<br />

1.6.3 Surface electromyograms<br />

In sections 1.2 and 1.4, we presented bl<strong>in</strong>d<br />

data factorization models based on statistical<br />

<strong>in</strong>dependence, explicit sparseness and nonnegativity.<br />

It is known that all three approaches<br />

tend to <strong>in</strong>duce a more mean<strong>in</strong>gful,<br />

often more sparse representation of the multivariate<br />

data set. However, develop<strong>in</strong>g explicit<br />

applications and more so perform<strong>in</strong>g mean<strong>in</strong>gful<br />

comparisons of such methods is still of considerable<br />

<strong>in</strong>terest.<br />

In Theis and García (2006), see chapter<br />

20, we analyzed and compared the above<br />

models, but not from a theoretical po<strong>in</strong>t of<br />

view but rather based on a real-world example,<br />

namely the analysis of surface electromyogram<br />

(sEMG) data sets. An electromyogram<br />

(EMG) denotes the electric signal generated<br />

by a contract<strong>in</strong>g muscle (Basmajian and Luca,<br />

1985). In general, EMG measurements make<br />

use of <strong>in</strong>vasive, pa<strong>in</strong>ful needle electrodes. An<br />

alternative is to use sEMG, which is measured<br />

us<strong>in</strong>g non-<strong>in</strong>vasive, pa<strong>in</strong>less surface electrodes.<br />

However, <strong>in</strong> this case the signals are rather<br />

more difficult to <strong>in</strong>terpret due to noise and Figure 1.26: ZANE image segmentation.<br />

the overlap of several source signals. Direct<br />

application of the ICA model to real-world noisy sEMG turns out to problematic (García et al.,<br />

2004), and it is yet unknown if the assumption of <strong>in</strong>dependent sources holds well <strong>in</strong> the sett<strong>in</strong>g<br />

of sEMG.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!