08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.1 HMAX AS A BAYESIAN NETWORK<br />

4.1 HMAX as a Bayesian network<br />

4.1.1 HMAX model summary<br />

Tile HMAX mode! (Riesenhuberand Poggio 1999, Serre el ai. 2007b), which captures the basic<br />

principles <strong>of</strong> I'eedforward hierarchical object recognition in the visual system, has already been<br />

described in some detail in Section 2.1.2. This model was chosen as a starting point, firstly<br />

because it reproduces many anatomical, physiological and psychophysical data from regions<br />

VI, V4 and IT. The second reason is that it has been repeatedly argued that the main limitation<br />

<strong>of</strong> the HMAX model is that it docs not account for the extensive feedback projections found<br />

in the visual cortex (Serre 2006, Walther and Koch 2007). Our proposed methodology, namely<br />

Bayesian nelworks and belief propagation, is ideal to tackle this problem and provide such an<br />

exiension. Below is a brief technical outline <strong>of</strong> the different layers and operations in the original<br />

HMAX model, which will faciUtate understanding <strong>of</strong> the proposed model. I'igure 2.4 provides<br />

a graphical representation <strong>of</strong> the dilTerent layers in HMAX.<br />

SI layer - Units in this layer implement Oabor filters, which have been extensively used lo<br />

model simple cell receptive fields (RJ-). and have been shown lo fit well the physiological daia<br />

from striate cortex (Jones and Palmer 1987). There are 64 types <strong>of</strong> units or fillers, one for each<br />

<strong>of</strong>the KsK^ 4) orientations (0°, 45°.90°. 135") xA/Vsi(^ 16) sizes or peak spatial frequencies<br />

(ranging from 7x7 pixels to M x 37 pixels, in steps <strong>of</strong> 2 pixels). The four different orientations<br />

and 16 different sizes, although an oversimplification, have been shown to be sufficient lo pro­<br />

vide rotation and size invariance at the higher levels. Phases are approximated by centring the<br />

Gabor filters at all locations. The RF size range is consistent with primate visual cortex (0.2" lo<br />

1 °). The input image, a gray-valued image (160 x 160 pixels Rs 5''j.'i° <strong>of</strong> visual angle) is filtered<br />

at every IcKaiion by each <strong>of</strong> the 64 Ciabor fillers described by the following equation;<br />

Gij. = expl--i ^- '-—^ •• '-\ xcosi 2;r-(j:cose + .vsine)-|-iJ<br />

The paraniciers in the equation, that is, the orientation 6, the aspect ratio 7. the effective width<br />

140<br />

(4.1)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!