05.01.2013 Views

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

132 <strong>Perceptual</strong> <strong>Coherence</strong><br />

light-sensitive units. The receptors learn not by being right or wrong, but<br />

by maximizing mutual information under the sparseness constraint. What<br />

we are mathematically trying to find is a set of independent basis functions<br />

that when added together reproduce the image of the natural scene. 11<br />

We impose the sparse-coding restriction that the output of each basis<br />

function is zero for most inputs but strongly excitatory or inhibitory for a<br />

small number of inputs. In terms of the mathematical analysis (principal<br />

components), the basis functions are the causes of the visual scenes so that<br />

if added together they will reproduce the natural scenes. If the causal basis<br />

functions resemble the space-time receptive fields of retinal or cortical receptor<br />

cells that decode the brightness patterning of visual scenes, then it is<br />

possible to argue, somewhat indirectly, that the physiological receptive<br />

fields have evolved to maximize mutual information by matching the<br />

causes.<br />

Field (1987), following such a strategy, found that the basis functions<br />

indeed resembled Gabor functions similar to the receptive fields of cortical<br />

neurons (see figures 2.7 <strong>and</strong> 2.14 for cortical receptive fields). Compared to<br />

normal (Gaussian) probability distributions, the Gabor-like functions had<br />

sharper probability peaks at zero response <strong>and</strong> higher probabilities of extreme<br />

response rates. Field (1987) further demonstrated that by maximizing<br />

the sparseness of the distribution, he was able to match the b<strong>and</strong>width<br />

(approximately 0.8–1.5 octaves) <strong>and</strong> vertical-horizontal aspect ratio (2 to1)<br />

of cortical cells. This outcome supported the notion of a sparse code: Individual<br />

neurons would be active only when a particular brightness pattern<br />

(i.e., a causal basis) occurred at a specific location <strong>and</strong> orientation. Any single<br />

scene would stimulate only a small set of these receptors, but each possible<br />

scene would activate a different set of receptors.<br />

Olshausen <strong>and</strong> Field (1996, 1998) “trained” a set of basis functions using<br />

image patches r<strong>and</strong>omly selected from natural scenes to minimize the<br />

difference between the image patch <strong>and</strong> the reconstruction using the addition<br />

of basis functions. Minimizing the number of basis functions by<br />

adding a cost for each additional basis function imposed sparseness. The<br />

resulting set of basis functions was overcomplete, so that there were more<br />

basis functions than the dimensionality of the input. 12 The theoretical advantage<br />

of an overcomplete basis set is that the visual (<strong>and</strong> auditory) worlds<br />

11. The set of sine <strong>and</strong> cosine functions for the Fourier analysis is one basis set. A set of<br />

Gaussian functions, as illustrated in figure 3.6, is another basis set. We can produce any arbitrary<br />

curve by multiplying the height of each Gaussian by a constant <strong>and</strong> then summing the resulting<br />

Gaussian functions together point by point.<br />

12. For example, an overcomplete set might have 200 basis functions to describe a 12<br />

pixel × 12 pixel image that contains only 144 independent points. Because the set is overcomplete,<br />

there are an infinite number of solutions to describing the 144 image points. The sparseness<br />

criterion is necessary to force a single solution.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!