08.02.2013 Views

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

Bernal S D_2010.pdf - University of Plymouth

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

5.1. FEEDFORWARD PROCESSING<br />

5.1.2 Object categorization<br />

This section describes the performance <strong>of</strong> the model during feedforward categorization, based<br />

on the feedforward processing conslraints defined in Section 4.4. The network waslrained using<br />

60 object silhouette images, shown in I'igure 5.^, from which the S2 and S3 prototypes were<br />

learned. The trained network was then tested on different transformations <strong>of</strong> the same images<br />

including occluded, translated and scaled versions.<br />

Throughout this section, I have used correct categorization to mean that the state with the<br />

maximum value <strong>of</strong> the model's lop layer response corresponds to the input image, for the<br />

3-level and 4-leve! architectun;s, the distributions <strong>of</strong> the four top layer nodes, corresponding<br />

to each <strong>of</strong> the four S2 RF sizes, are averaged, resulting in a single distribution with 60 states.<br />

Additionally, for the alternative 3-Ievel architecture, the values <strong>of</strong> the four prototypes learned<br />

for each object category are also averaged, leading again to a single dislribulion with 60 slates.<br />

The model's performance is measured as a percentage <strong>of</strong> correctly categorized images for each<br />

datasel <strong>of</strong> 60 images.<br />

For the occluded lest set an average <strong>of</strong> 30% <strong>of</strong> the image's black pixels are deleted using a<br />

reclangular while patch. The rectangle is placed in a position thai leaves the image identifiable<br />

to a human observer In the translated test-set. the object is moved to a new position within<br />

the available image frame <strong>of</strong> 160x160 pixels. The displacement will be near to the maximum<br />

permitted in both directions but will depend on the original object size, i.e. small objects allow<br />

for bigger displacements. Two different scale sets have been used; scale ±10%, where the<br />

image is scaled to either 90% or 110% <strong>of</strong> the original size and centred; and scale ±20%. where<br />

ihe image is scaled to either 80% or 120% <strong>of</strong> the original size and centred. An example <strong>of</strong> the<br />

different transformations for five arbitrary images is shown in Figure 5.9.<br />

200

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!