10.08.2013 Views

Bachelor Thesis - Computer Graphics Group

Bachelor Thesis - Computer Graphics Group

Bachelor Thesis - Computer Graphics Group

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

The data delivered to the neural network is encoded as a sequence of cosines<br />

and sinuses of the angles between the subsequent segments [7]. Hence, the<br />

number of inputs equals twice the amount of lines in the normalized shape.<br />

Each output corresponds to a single recognizable gesture pattern. Thus, the<br />

number of outputs is the same as the size of the pattern list. Therefore, every<br />

time a new gesture is added to or removed from the list, the neural network<br />

has to retrain from scratch.<br />

To recognize a gesture, we transform the normalized sequence of points into the<br />

proposed input format and propagate it through the neural network. We find<br />

the maximum value, which signals the corresponding gesture pattern, when it<br />

is above a defined threshold value. On overview of the neural network can be<br />

seen in figure 2.4.<br />

cos α<br />

sin α<br />

cos β<br />

sin β<br />

cos γ<br />

sin γ<br />

Figure 2.4: Neural network overview<br />

The training is performed by repeating the standard back-propagation algorithm.<br />

Preprocessed and transformed pattern samples are used as the training<br />

input. Expected output samples are constructed by taking a vector filled with<br />

zeros except for a one on the index, which is assigned to the given gesture<br />

pattern. The training process is finished when either the error rate reaches the<br />

target value, or the number of cycles exceeds the maximum count.<br />

2.3 K-nearest neighbors<br />

The k-nearest neighbors [15] is the second available gesture classifier. Big<br />

advantage over the neural network is the lack of the learning phase. The<br />

idea of the algorithm is very simple. To recognize a gesture, we compare<br />

it with all pattern samples from the list. By applying a specified distance<br />

14

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!