08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.4. EXISTING MODELS<br />

The model computes the similarity between each low-level feature and the image at H different<br />

ItKations. which is used as input evidence for the network (similar to dummy nodes in Bayesian<br />

networks). A simple bottom-up sweep <strong>of</strong> the belief propagation algorithm then obtains the<br />

probability distribution for each variable (i.e. presence/location <strong>of</strong> each fragment), including<br />

that <strong>of</strong> the root node, which represents the class. Note, unlike conventional feedforward meth­<br />

ods, the model computes the relative likelihoods <strong>of</strong> all cla.ss sub-hierarchies given the stimuli<br />

(i.e. there is a graph for each class <strong>of</strong> objects), leading to multiple alternatives at each level <strong>of</strong><br />

the model. Later, a top-down cycle obtains the optimal value for all the object parts given the<br />

state/location <strong>of</strong> the root/class node, correcting most <strong>of</strong> the errors made during the bottom-up<br />

pass. This provides not only object recognition, bui a detailed interpretation <strong>of</strong> the image ai<br />

different scales and levels <strong>of</strong> detail. The model was tested a large number <strong>of</strong> natural images<br />

belonging to three different object classes.<br />

Unlike most related models (Riesenhuber and Poggio 1999, George and Hawkins 2009, Murray<br />

and Kreutz-Delgado 2007, Lewicki and Sejnowski 1997), where nodes represent locations and<br />

states represent features, the model by lipshtein et al. (200B) uses nodes to represent features and<br />

stales to represent locations. In essence, the network includes a fixed hierarchical representation<br />

<strong>of</strong> ail the possible combinations <strong>of</strong> features and subfealures <strong>of</strong> a class <strong>of</strong> objects. The graph is a<br />

singly connected tree (no loops and a single parent per node) which makes tractable the use <strong>of</strong><br />

belief propagation to perform exact inference.<br />

However, the previous properties imply that features are not shared within the same object (each<br />

feature can only be present at one given location), amongst different objects <strong>of</strong> the same class<br />

(the graph for each class is singly connected), or within objects <strong>of</strong> different classes (there is an<br />

independent network for each object class). This lack <strong>of</strong> overlap between features speaks for<br />

an inefficient coding strategy, as low-level features <strong>of</strong> distinct objects are likely to be similar.<br />

Additionally, the model is restricted to a set <strong>of</strong> informative learned fragments, which, for exam­<br />

ple, limit its ability to explain retinotopic contour completion at an arbitrary (less informative)<br />

object region.<br />

A second model falling into this category is that proposed by Chikkerur et al. (2010). It uses<br />

122

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!