Professional Documents
Culture Documents
•To select a subset of variables from a larger set, based on which original variables have the
highest correlations with the principal component.
Example
A census provided information, by tract, on five socioeconomic variables for the Madison,
Wisconsin, area. The data from 61 tracts are listed in a table. Can the sample variation be
summarized by one or two principal components?
Procedure
1. Choose Stat > Multivariate > Principal Component.
2. In variables, enter C1-C5.
3. In graphs, select Scree plot and Biplot.
4. Click OK
Coefficients for the principal components
Governament
Value
Percentage of
Total variance
Interpretation
The first principal component explains 67.7% of the total sample variance .The first two
principal components, explain 92.8% of the total sample variance .Consequently,
sample variation is summarized very well by two principal components and a reduction
in the data from 61 observations on 5 observations to 61 observations on 2 principal
components is reasonable.
Scree Plot
100
80
Eigenvalue
60
40
20
1 2 3 4 5
Component Number
Interpretation
An elbow occurs in the plot at about i=3. That is, the eigen values after ̂2 are all relatively small
and about same size.
Biplot
30
Govt. employment(%)
Second Component
20
10
Professional degree(%)
Total pop.(thousands)
Median home value($100000)
0
-10
-20
Interpretation
In this plot first two principal components explain 92.8% of the total sample variance.