08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.3. LEARNING<br />

4.3.2 S1-C1 CRTs<br />

The belief propagalton operations between SI and CI need to approximate the max operation<br />

implemented in the original HMAX model. To do this wc propose increasing the number <strong>of</strong><br />

states <strong>of</strong> the CI layer so that for each SI orienlalion there are Kcigmup states coding different<br />

spatial arrangements <strong>of</strong> SI units. In this way all the CI states corresponding to the same SI<br />

orienlalion can be grouped together and treated as a single state during the generation <strong>of</strong> the<br />

outpui A mcs-sage. The operation to compute the output A message implements the sum over<br />

all the stales <strong>of</strong> each CI group. In other words. CI nodes provide a dislribulion over SI features<br />

and locations, which after marginalizing (summing) over tlie locations during the gcneraiion <strong>of</strong><br />

ihc outpui A message, provides an approximation to the max operation.<br />

However, ihe number <strong>of</strong> different possible spatial arrangements <strong>of</strong> SI units converging on a<br />

CI unit i.s given by the number <strong>of</strong> ^-combinations <strong>of</strong> the n-eiemenl set equal to the binomial<br />

coefficient, ("). where n — ANa • ANc\ • 2 bands and it is the number <strong>of</strong> active units (assuming<br />

binary values). For example, for n = 8 • 8 • 2 = 128 and k = .12, the number <strong>of</strong> possible spatial<br />

arrangements is ('^2} ^ \(i^^. This is just a lower bound on Ihereal number <strong>of</strong> combinations, as<br />

we would need to sum over all the diffcrenl values <strong>of</strong> k, and ihe weight values at each location<br />

are not necessarily binary, but range from 0 to I. Creating a distribution for each CI node<br />

containing/Tc] =^51 -Kngroup — 4- lO" slates, is obviously intractable.<br />

Tor this reason, the value Kc\grouii is limited lo include only the most common arrangements<br />

<strong>of</strong> S1 units for each orientation. Figure 4.7 portrdys a toy example <strong>of</strong> this method where the<br />

number<strong>of</strong>SI units is n —.3'3-9. the number <strong>of</strong> SI stales is A'51 = 4 (orientations), the number<br />

<strong>of</strong> features per group is li(:\g„,up — 3 and the resulting number <strong>of</strong> C1 stales is Kci —4-3= 12.<br />

The equation to calculate A(C1), which combines the bottom-up evidence, and Ac](S2), which<br />

sends the bottom-up evidence lo S2, arc shown in the diagram.<br />

Noie that, as illusiraied in the toy example, the weights are learned for each lixed CI state-i a.s<br />

a function <strong>of</strong> the n afferent SI units and the j SI slates per node. This yields a weight matrix<br />

(shown ontheboliom left) for each <strong>of</strong> the CM f states. However, theCITs<strong>of</strong> a Bayesian network<br />

are defined as a function <strong>of</strong> the child and the parent stales, j and / respectively, for each rtxed<br />

\58

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!