14.03.2014 Views

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

Modeling and Multivariate Methods - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 19 Analyzing Principal Components <strong>and</strong> Reducing Dimensionality 483<br />

Principal Components<br />

Principal Components<br />

If you want to see the arrangement of points across many correlated variables, you can use principal<br />

component analysis to show the most prominent directions of the high-dimensional data. Using principal<br />

component analysis reduces the dimensionality of a set of data. Principal components is a way to picture the<br />

structure of the data as completely as possible by using as few variables as possible.<br />

For n original variables, n principal components are formed as follows:<br />

• The first principal component is the linear combination of the st<strong>and</strong>ardized original variables that has<br />

the greatest possible variance.<br />

• Each subsequent principal component is the linear combination of the variables that has the greatest<br />

possible variance <strong>and</strong> is uncorrelated with all previously defined components.<br />

Each principal component is calculated by taking a linear combination of an eigenvector of the correlation<br />

matrix (or covariance matrix or SSCP matrix) with a variable. The eigenvalues show the variance of each<br />

component.<br />

Principal components representation is important in visualizing multivariate data by reducing it to<br />

dimensionalities that are graphable.<br />

Launch the Platform<br />

Select Analyze > <strong>Multivariate</strong> <strong>Methods</strong> > Principal Components to launch the platform. JMP presents<br />

you with a dialog box to specify the variables involved in the analysis. In this example, we use all the<br />

continuous variables from the Solubility.jmp data set.<br />

The Estimation Method list provides different methods for calculating the correlations. For details on the<br />

methods, see the “Correlations <strong>and</strong> <strong>Multivariate</strong> Techniques” chapter on page 441.<br />

Principal components analysis is also available in the <strong>Multivariate</strong> <strong>and</strong> Scatterplot 3D platforms.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!