05.01.2013 Views

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

Perceptual Coherence : Hearing and Seeing

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

134 <strong>Perceptual</strong> <strong>Coherence</strong><br />

cell as portrayed in figure 3.10 <strong>and</strong> the corresponding basis function represents<br />

the input that maximally stimulates that filter. Passing an image<br />

through any filter yields the strength of the corresponding basis function.<br />

Given that primary cells have spatial-temporal receptive fields (see figures<br />

2.18 <strong>and</strong> 2.19), Hateren <strong>and</strong> Ruderman (1998) utilized sequences of<br />

natural images to more closely simulate normal vision. They performed independent<br />

component analyses on sequences of 12 images (roughly 0.5 s<br />

total) taken from television shows. The basis functions resembled bars or<br />

edges that moved as a unit perpendicular to their orientation. The majority<br />

respond to edges moving in one direction only, exactly what would occur<br />

with nonseparable spatial-temporal receptive fields (also found by Olshausen,<br />

2002). Generally speaking, there is a negative correlation between<br />

spatial <strong>and</strong> temporal tuning; filters centered at higher spatial frequencies<br />

are more likely to be tuned to lower temporal frequencies <strong>and</strong> vice versa.<br />

Hateren <strong>and</strong> Ruderman (1998) suggested that the space-time properties<br />

of the filters explain the sparse response distribution. The visual world consists<br />

of rigid homogeneous objects moving at different speeds that occlude<br />

one another. A localized space-time filter would typically measure small<br />

brightness differences within a single homogeneous object <strong>and</strong> therefore<br />

would have a high probability of not responding at all (i.e., peaky at zero).<br />

Such a space-time filter would respond strongly to the brightness contrasts<br />

found at edges that move into or out of the receptive field, <strong>and</strong> those responses<br />

make up the long, low probability but high firing rate tails of the<br />

response distribution.<br />

Up to this point, the analyses have not made use of any properties of the<br />

nervous system. All of the outcomes have been based on an analysis of sets<br />

of images (although physiological considerations act to impose constraints).<br />

A second strategy begins with a hypothetical but naive nervous<br />

system. It is hypothetical because, based on current knowledge, you build<br />

in a plausible set of interconnections <strong>and</strong> cortical units, a model of how the<br />

interconnections combine, a model of how the outputs of the cortical cells<br />

are integrated, a model of learning, a measure of error to drive the learning<br />

procedure, <strong>and</strong> so on. It is naive because at the beginning of the simulation<br />

all of the interconnections <strong>and</strong> units have equal strength (although the procedural<br />

“software” is in place). Then you present a series of images <strong>and</strong><br />

measure the difference between the original image <strong>and</strong> the reproduction of<br />

the image by the hypothetical model. 13 On the basis of the learning <strong>and</strong> error<br />

models, the strengths of the connections <strong>and</strong> units are changed across<br />

the set of images to yield the closest matches. The resulting structure of the<br />

interconnections <strong>and</strong> units is then thought to be a model for how the actual<br />

13. The term reproduction is used in a least-squares error sense. The visual system obviously<br />

does not create a replica of the image.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!