08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.4. EXISTING MODELS<br />

ihL- output <strong>of</strong> the standard HMAX model (Serre el al. 2007b), described in Section 2.1.2, as the<br />

input to a Bayesian network which simulates the effects <strong>of</strong> spatial and feature-based attention<br />

(modelling the prefrontal cortex and the lateral intraparietal regions). The network consists <strong>of</strong><br />

a node L. encoding the location and scale <strong>of</strong> the target object; a node O. encoding the identity<br />

<strong>of</strong> the object; and a set <strong>of</strong> nodes X, that c(xle the different features and their locations. The<br />

feature nodes receive evidence from the HMAX-based preprocessing network, which extracts<br />

a set <strong>of</strong> high-level features (roughly corresponding to V2/V4 receptive fields) from the image.<br />

At the same time, they receive top-down feedback from the object location (L) and identity<br />

(O), using conditional probability distribution P{X,\0,L). This distribution is constructed based<br />

on whether the object contains a given feature (obtained from the HMAX parameters), and<br />

whether Ihe feature location matches the spatial attention location (Gaussian centred around<br />

that location).<br />

The model is successful at capturing several attenlional effects such as the pop-out effect and<br />

fealure-based and spatial attention, and predicts eye fixaiions during free viewing and visual<br />

search tasks. However, it cannot be considered a generative model <strong>of</strong> the visual system as it<br />

cannot produce input images, i.e. the model relies on the HMAX framework to analyze and ex­<br />

tract features. This means the effects <strong>of</strong> attention on lower visual areas cannot be modelled. The<br />

Bayesian network is limited to a relatively abstract implemenlalion <strong>of</strong> the high-level interactions<br />

between the ventral and dorsal pathways, lixact inference can be performed using a single up<br />

and down pass <strong>of</strong> the belief propagaiion algorithm due to the simplicity <strong>of</strong> the network, where<br />

only the feature layer has more than one node.<br />

Another interesting architecture, and one which takes into account temporal as well as spatial<br />

information, is the Hierarchical Temporal Memory (HTM) proposed by George and Hawkins<br />

(2009). The model assumes that images are generated by a hierarchy <strong>of</strong> causes, and that a<br />

particular cause at one level unfolds into a sequence <strong>of</strong> causes ai a lower level. An HTM can<br />

be considered a special type <strong>of</strong> Bayesian network which contains a variable coding the spatial<br />

patterns, and a second variable coding sequences <strong>of</strong> those spatial patterns (represented using a<br />

Markov chain).<br />

123

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!