01.03.2013 Views

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

Applied Statistics Using SPSS, STATISTICA, MATLAB and R

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

346 8 Data Structure Analysis<br />

A: Only the first four eigenvalues satisfy the Kaiser criterion. The first two<br />

eigenvalues are responsible for about 58% of the total variance; therefore, when<br />

discarding the remaining eigenvalues, we are discarding a substantial amount of<br />

the information from the dataset (see Exercise 8.12).<br />

We can conveniently interpret the data by using a graphic display of the<br />

st<strong>and</strong>ardised data projected onto the plane of the first two principal components,<br />

say F1 <strong>and</strong> F2, superimposed over the correlation plot. In <strong>STATISTICA</strong>, this<br />

overlaid graphic display can be obtained by first creating a datasheet with the<br />

projections (“factor scores”) <strong>and</strong> the correlations (“factor loadings”). For this<br />

purpose, we first extract the scrollsheet of the “factor scores” (click with the right<br />

button of the mouse over the corresponding “factor scores” sheet in the workbook<br />

<strong>and</strong> select Extract as st<strong>and</strong> alone window). Then, secondly, we join<br />

the factor loadings in the same F1 <strong>and</strong> F2 columns <strong>and</strong> create a grouping variable<br />

that labels the data classes <strong>and</strong> the original variables. Finally, a scatter plot with all<br />

the information, as shown in Figure 8.10, is obtained.<br />

By visual inspection of Figure 8.10, we see that F1 has high correlations with<br />

chemical features, i.e., reflects the chemical composition of the rocks. We see,<br />

namely, that F1 discriminates between the silica-rich rocks such as granites <strong>and</strong><br />

diorites from the lime-rich rocks such as marbles <strong>and</strong> limestones. On the other<br />

h<strong>and</strong>, F2 reflects physical properties of the rocks, such as density (MVAP),<br />

porosity (PAOA) <strong>and</strong> water absorption (AAPN). F2 discriminates dense <strong>and</strong><br />

compact rocks (e.g. marbles) from less dense <strong>and</strong> more porous counterparts (e.g.<br />

some limestones).<br />

1.0<br />

0.5<br />

0.0<br />

-0.5<br />

-1.0<br />

-1.5<br />

-2.0<br />

-2.5<br />

F2<br />

Granite<br />

Diorite<br />

Marble<br />

Slate<br />

Limestone<br />

F1-type variable<br />

F2-type variable<br />

Fe2O3<br />

Al2O3Na2O<br />

SiO2 K2O<br />

MVAP<br />

F1<br />

PAOAAAPN<br />

-1.5 -1.0 -0.5 0.0 0.5 1.0 1.5<br />

Figure 8.10. Partial view of the st<strong>and</strong>ardised rock dataset projected onto the F1-F2<br />

principal component plane, overlaid with the correlation plot.<br />

CaO

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!