30.12.2012 Views

European Journal of Scientific Research - EuroJournals

European Journal of Scientific Research - EuroJournals

European Journal of Scientific Research - EuroJournals

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Feature Selection Based on Statistical Analysis 429<br />

KG-rule retains only those PCs whose variances, i.e. eigenvalues that are ≥ 1. Nevertheless, for<br />

large variable spaces p, the KG-rule usually retains too many PCs [4], [8][10].<br />

c) Cumulative Variance utters the criterion for choosing m is to select a cumulative variance<br />

threshold, t where t is at certain percentage <strong>of</strong> the total variance that the first m PCs should account<br />

for. The required number <strong>of</strong> PCs is then the smallest value <strong>of</strong> m for which the chosen percentage is<br />

exceeded [7][10]. From PCA theory, the variance <strong>of</strong> the i-th PC (eigenvector) is equal to its<br />

corresponding eigenvalues λi [6].<br />

Three different methods have been elucidated on how to retain the optimal number <strong>of</strong> PCs. In<br />

this study, we investigated the required number <strong>of</strong> PCs to be retained using these three methods.<br />

Statistical Analysis<br />

ANOVA is a standard technique for measuring the statistical significance <strong>of</strong> a set <strong>of</strong> independent<br />

variables. It takes a single feature and the class associated with the data samples and measures the<br />

significance <strong>of</strong> the class variables in predicting the means <strong>of</strong> the feature. The measure that ANOVA<br />

produces is the p-value for the feature set. On the other hand, Multiple Comparison Procedure (MCP)<br />

test is one that can be used to determine which means amongst a set <strong>of</strong> means differ from the rest. The<br />

ANOVA leads to a conclusion that there is evidence that the group means differ but the main goal is to<br />

determine which <strong>of</strong> the means are different. Therefore, the MCP is applied.<br />

The MCP test compares the difference between pair <strong>of</strong> means with appropriate adjustment for<br />

the multiple testing. The results <strong>of</strong> MCP present the p-value or confidence interval for each pair. In this<br />

study, after performing ANOVA on the eigenpostures, MCP is applied to assess which ‘eigenpostures’<br />

means are significantly different. Further, multiple range test is realized for testing homogeneous<br />

subsets <strong>of</strong> groups based on their group means. In doing so, the groups that differ significantly are<br />

revealed. Finally, we will determine the optimized number <strong>of</strong> eigenpostures that will act as inputs to<br />

the ANN for classification <strong>of</strong> the four main postures.<br />

Artificial Neural Network (ANN) as Classifier<br />

Artificial Neural Networks (ANN) are known for their ability to express highly nonlinear decision and<br />

makes them appropriate for recognition <strong>of</strong> complex pattern and the ability to maintain accuracy even<br />

when some input data are inapt. In this study, the multilayer perceptron is chosen for recognition<br />

purpose. A multilayer perceptron is a feed forward network structure in which neurons are connected<br />

only between two succeeding layers [9][12]. A feed forward neural net that performs the recognition<br />

part consist <strong>of</strong> one input layer with one neuron per feature, two hidden layers with four and three nodes<br />

in each layer and a single output.<br />

Experiments and Results<br />

A collection <strong>of</strong> 400 images <strong>of</strong> various human postures constitutes the database to generate the<br />

eigenpostures for this study. The various postures include all four main postures namely standing,<br />

sitting, bending and lying position for both gender with the subjects are either facing front or facing<br />

either side and no restriction impose on the type <strong>of</strong> clothing being worn. Initially, each image has m x n<br />

pixels, but eventually reshaped to a column vector <strong>of</strong> x mn. Then, the eigenvectors and eigenvalues are<br />

computed according to [3][11]. Implementing the three rules mentioned previously, we select the most<br />

suitable eigenpostures required as inputs to the classification system. In other words, we select the<br />

most relevant eigenvalues or PCs to be retained in this study.<br />

Figure 3 illustrates the results <strong>of</strong> the Scree test. The decrease in magnitude for successive<br />

eigenvalues implies that the first few principal components can approximate a large part <strong>of</strong> the original<br />

data’s variance. In this case, decision to retain the first thirty-five PCs is appropriate and they<br />

reasonably represent good approximation <strong>of</strong> the original data set. Next, from the PCA results, applying

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!