08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.1. OBJECT RECOGNITION<br />

2.1.2.1 HMAX / 'The Standard Model'<br />

In 1999, Riesenhuber and Poggio presented a landmark paper describing the fundamental ele­<br />

ments <strong>of</strong> object recognition models in the visual .system (Riesenhuberand Poggio 1999). These<br />

principles were exemplified in a computalional model, HMAX, also known as the .-iiandani<br />

model. It was labelled standard as it attempts to consolidate in a single model many <strong>of</strong> the<br />

widely accepted facts and observations in the primate visual system (more specifically, the ven­<br />

tral path). It has subsequently been employed to simulate other phenomena such as attention<br />

(Walther and Koch 2007). biological motion (Giese and Poggio 2003) and learning using spike-<br />

lime dependent plasticity (STDP) (Masquelier and Thorpe 2010)- it is also the backbone <strong>of</strong> the<br />

architecture used for the model in this thesis, and for that reason il will be described in greater<br />

detail in this section.<br />

The HMAX model attempts to reproduce activity and functionality observed along the ventral<br />

visual pathway, comprising areas VI, V2, V4 and IT. Tlte model is based upon widely accepted<br />

basic principles such as the hierarchical arrangement <strong>of</strong> these areas, with a progressive increase<br />

in receptive field size and complexity <strong>of</strong> preferred stimuli, as well as a gradual build-up <strong>of</strong><br />

invariance lo position and scale as we move further up the hierarchy. These concepts have been<br />

described in Section 2.1.<br />

Several versions <strong>of</strong> the model have been published, although they all share the same underlying<br />

structure. It usually comprises three different levels representing V1. V2/V4 and FT. which are<br />

subdivided into two layers, simple and complex. Figure 2.4 shows a schematic representation<br />

<strong>of</strong>lhcHMAX model including Ihc different types <strong>of</strong> units and operations, and the mapping onto<br />

the visual cortex.<br />

Two operations are performed in aliemaling layers <strong>of</strong> the hierarchy: the invariancc operation,<br />

which occurs between layers <strong>of</strong> the same level (e.g. from SI to CI); and the selectivity operalion<br />

implemented between layers <strong>of</strong> different levels (e.g. from CI lo S2}.<br />

Invariance is implemented by applying the max function over a set <strong>of</strong> afferenis selective lo the<br />

same feature but with slightly different positions and sizes. Thus, the respimse <strong>of</strong> a complex<br />

16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!