1. How many different factors are needed to explain the pattern of relationship among these variables?
2. What is the nature of those factors?
3. How well do the hypothesized factors explain the observed data?
3. Click on Graphs
1. Select “Scree Plot”
2. Click OK
3. Click Results
Check “Sort Loading”. (Sorting is done by the maximum absolute loading for any factor. Variables that have their highest absolute loading on factor 1 are printed first, in sorted order. Variables with their highest absolute loadings on factor 2 are printed next, in sorted order, and so on.)
Click OK till the end.
INTERPRETING OUTPUT
Unrotated Factor Loadings and Communalities
Variable   Factor1   Factor2   Factor3   Factor4   Factor5   Communality
Pop          0.972     0.149    -0.006     0.170     0.067         1.000
School       0.545     0.715     0.415    -0.140    -0.001         1.000
Employ       0.989     0.005    -0.089     0.083    -0.085         1.000
Home        -0.303     0.797    -0.523     0.005    -0.002         1.000
Health       0.847    -0.352    -0.344    -0.200     0.022         1.000
Variance    3.0289    1.2911    0.5725    0.0954    0.0121        5.0000
% Var        0.606     0.258     0.114     0.019     0.002         1.000
Since we did not specify the number of factors to be extracted, Minitab extracted 5 factors by default (because no. of variables = 5).
All five factors together explain the variability in the data perfectly (every communality is 1.000).
Factors 1, 2 and 3 together explain 97.8% of the variability in the data.
Factors 4 and 5 explain only 2.1% of the variability, which is very little, hence we can ignore Factors 4 and 5.
The next step would be to compare the models with 2 and 3 factors, in case the two-factor model falls short of explaining the data.
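The Variance, % Var and Communality figures can be checked by hand: a factor's variance is the sum of its squared loadings down a column, and a variable's communality is the sum of its squared loadings across a row. The following is a minimal sketch in plain Python that recomputes them from the loadings printed in the table. The variable names are illustrative, and dividing by the number of variables assumes the total variance is 5, as the table reports. It is not Minitab's own computation.

```python
# Unrotated loadings copied from the output table
# (rows: variables, columns: Factor1..Factor5).
loadings = {
    "Pop":    [0.972, 0.149, -0.006, 0.170, 0.067],
    "School": [0.545, 0.715, 0.415, -0.140, -0.001],
    "Employ": [0.989, 0.005, -0.089, 0.083, -0.085],
    "Home":   [-0.303, 0.797, -0.523, 0.005, -0.002],
    "Health": [0.847, -0.352, -0.344, -0.200, 0.022],
}
n_factors = 5

# Variance explained by each factor = sum of squared loadings in its column.
variance = [sum(row[j] ** 2 for row in loadings.values()) for j in range(n_factors)]

# Assumption: total variance equals the number of variables (5.0000 in the table).
total = len(loadings)
pct_var = [v / total for v in variance]

# Running total of the proportion of variance explained.
cumulative = []
running = 0.0
for p in pct_var:
    running += p
    cumulative.append(running)

# Communality of each variable = sum of squared loadings in its row.
communality = {name: sum(l ** 2 for l in row) for name, row in loadings.items()}

for j in range(n_factors):
    print(f"Factor{j + 1}: variance={variance[j]:.4f} "
          f"%var={pct_var[j]:.3f} cumulative={cumulative[j]:.3f}")
```

Because the printed loadings are rounded to three decimals, the recomputed values agree with the table only up to rounding.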
HOW MANY FACTORS TO CONSIDER
[Scree plot: Eigenvalue versus Factor Number, factors 1 to 5]
% Variance Criteria: Consider the factors which explain a large proportion of the variability in the data.
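One way to make the % variance criterion concrete is to pick the smallest number of leading factors whose cumulative share of variance reaches a chosen target. The sketch below uses the eigenvalues from the unrotated output. The function name `factors_needed` and the target values are illustrative assumptions, not a rule stated in this material.

```python
def factors_needed(eigenvalues, target=0.90):
    """Smallest number of leading factors whose cumulative share
    of the total variance reaches `target` (a proportion in 0..1)."""
    total = sum(eigenvalues)
    running = 0.0
    for count, value in enumerate(eigenvalues, start=1):
        running += value / total
        if running >= target:
            return count
    # Target not reached (e.g. due to rounding): keep all factors.
    return len(eigenvalues)


# Eigenvalues (Variance row) from the unrotated factor output.
eigenvalues = [3.0289, 1.2911, 0.5725, 0.0954, 0.0121]

print(factors_needed(eigenvalues, target=0.85))
print(factors_needed(eigenvalues, target=0.95))
```

A lower target favours the two-factor model and a stricter one favours three factors, which is why the two models are worth comparing.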
MAXIMUM LIKELIHOOD EXTRACTION METHOD
Stat > Multivariate > Factor Analysis
1. Select Variables
2. Select No. of factors to extract
3. Select “Maximum likelihood”
4. Select “Varimax” Rotation
5. Click on “Graphs”
EXAMPLE
[Loading Plot of Pop, ..., Health: vertical axis is Second Factor; points are labelled School, Pop, Employ, Home and Health]
[Scree Plot of Pop, ..., Health: vertical axis is Eigenvalue]
Further, the loading plot clearly shows that Factor 1 has high positive loadings on the variables Pop, Health and Employ, and a negative loading on Home.
From the scree plot, you can conclude that the first two factors account for most of the total variability in the data (given by the eigenvalues). The remaining factors account for a very small proportion of the variability (close to zero) and are likely unimportant.
TIPS
Which Matrix to use When?
Use the correlation matrix if the variables are measured on different scales and you want to standardize them, or if the variances differ widely among variables. You can use either the covariance or the correlation matrix in all other scenarios.
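The reason scale matters can be seen in a small sketch: changing the units of a variable changes its covariance with other variables, but leaves the correlation untouched. The data values below are made up purely for illustration, and the helper functions are written out by hand rather than taken from any library.

```python
from math import sqrt


def covariance(x, y):
    """Sample covariance of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)


def correlation(x, y):
    """Pearson correlation: covariance rescaled by both standard deviations."""
    return covariance(x, y) / sqrt(covariance(x, x) * covariance(y, y))


# Made-up illustrative values: the same quantity y expressed in two units.
x = [2.0, 4.0, 5.0, 7.0, 9.0]
y_units = [1200.0, 1900.0, 2500.0, 3100.0, 4300.0]
y_thousands = [v / 1000 for v in y_units]

# Covariance changes by the unit factor; correlation does not.
print(covariance(x, y_units), covariance(x, y_thousands))
print(correlation(x, y_units), correlation(x, y_thousands))
```

With a covariance matrix, the variable recorded in the larger units would dominate the extracted factors, which is the situation the correlation matrix avoids.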
Which Extraction method to use?
If the factors and errors obtained after fitting the factor model cannot be assumed to follow a normal distribution, use the PCA method. If they follow a normal distribution, use the Maximum Likelihood Extraction Method.
However, it is advisable to first conduct Factor Analysis using PCA as the extraction method, as it gives you an idea of how many factors explain the variation in the data you are interested in studying.
Which Rotation method to use?
Equimax (Gamma = No. of factors/2): This is in between the two methods Quartimax and Varimax.
Varimax (Gamma = 1): On each factor, some variables have high loadings (close to ±1) and the rest have low loadings (close to 0). This provides a clear positive or negative association.
Quartimax (Gamma = 0): It maximizes the loading of each variable on one factor and minimizes its loadings on the other factors.
If you use a method with a low value of gamma, the rotation will tend to simplify the rows of the loadings; if you use a method with a high value of gamma, the rotation will tend to simplify the columns of the loadings.
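The gamma values above can be read as the single parameter of the orthomax family of rotations. The sketch below maps each method name to its gamma and evaluates the textbook raw orthomax criterion for a loading matrix. The function names are illustrative, and the exact normalization Minitab applies internally is not given in this material, so treat the criterion as an assumption used only to show the role of gamma.

```python
def orthomax_gamma(method, n_factors):
    """Gamma for each named rotation, as listed in the tip above."""
    gammas = {
        "quartimax": 0.0,
        "varimax": 1.0,
        "equimax": n_factors / 2,
    }
    return gammas[method.lower()]


def orthomax_criterion(loadings, gamma):
    """Raw orthomax criterion for a loading matrix given as a list of rows.

    Textbook form (assumed here): sum over factors of
    sum(l^4) - (gamma / p) * (sum(l^2))^2, with p = number of variables.
    A rotation in this family seeks loadings that maximize this value.
    """
    p = len(loadings)        # number of variables (rows)
    m = len(loadings[0])     # number of factors (columns)
    value = 0.0
    for j in range(m):
        column = [row[j] for row in loadings]
        sum_fourth = sum(l ** 4 for l in column)
        sum_square = sum(l ** 2 for l in column)
        value += sum_fourth - (gamma / p) * sum_square ** 2
    return value


# Two hypothetical 2x2 loading patterns: one simple, one evenly mixed.
simple = [[1.0, 0.0], [0.0, 1.0]]
mixed = [[0.5 ** 0.5, 0.5 ** 0.5], [0.5 ** 0.5, -(0.5 ** 0.5)]]

gamma = orthomax_gamma("varimax", n_factors=2)
print(orthomax_criterion(simple, gamma), orthomax_criterion(mixed, gamma))
```

The simple pattern scores higher than the mixed one, which is the sense in which a rotation "simplifies" the loadings.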
What is the threshold of Loading value to be considered as significant
Loadings represent how much a factor influences a variable. High loadings (positive or negative) indicate that the factor strongly influences the variable. Low loadings indicate that the factor has a weak influence on the variable. A large positive loading indicates that the contribution of the variable increases with increasing values in that dimension, and a negative loading indicates a decrease.
Examine the loading pattern to determine to which factor each variable belongs. In the unrotated loading table, the loadings are difficult to interpret, so examine the rotated loading table to interpret the loading pattern.
Factor Loading   Sample size
0.30             350
0.35             250
0.40             200
0.45             150
0.50             120
0.55             100
0.60             85
0.65             70
0.70             60
0.75             50
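The loading and sample-size table can be turned into a small lookup. The sketch below returns the smallest tabulated loading that a given sample size supports. The function name is illustrative, and treating sample sizes that fall between two rows as belonging to the smaller tabulated size is a conservative assumption not stated in the table.

```python
# (sample size, loading threshold) pairs copied from the table,
# ordered from the largest sample size down.
THRESHOLDS = [
    (350, 0.30), (250, 0.35), (200, 0.40), (150, 0.45), (120, 0.50),
    (100, 0.55), (85, 0.60), (70, 0.65), (60, 0.70), (50, 0.75),
]


def significant_loading(sample_size):
    """Smallest tabulated loading supported by `sample_size`.

    Returns None when the sample is smaller than the smallest
    tabulated size (50), where the table gives no guidance.
    """
    for min_n, loading in THRESHOLDS:
        if sample_size >= min_n:
            return loading
    return None


print(significant_loading(130))
print(significant_loading(40))
```

For example, with 130 observations only loadings of at least 0.50 in absolute value would be treated as significant under this reading of the table.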
When to use PCA? When to use Factor Analysis?
Though the techniques are similar, they are actually designed for different purposes.
Principal components analysis is used to reduce data into a smaller number of components.
Factor analysis is used to understand what constructs underlie the data.
The two analyses are often performed on the same data. For example, you can conduct a principal components analysis to determine the number of factors to extract in a factor analytic study.
THANK YOU