Week 10 Factor Analysis Principle Components Analysis (PCA) Flashcards
To provide an overview of Principle Components Analysis (PCA), which is a type of Factor Analysis
How does Hills define Factor Analysis?
“Factor Analyses … is a generic term that covers a number of different but related analysis techniques, most importantly Principal Components Analysis (PCA) and Factor Analysis”
What do we need to bear in mind with Exploratory Factor Analysis?
Exploratory Factor Analysis (EFA) can only be undertaken in SPSS – Confirmatory Factor Analysis (CFA) requires another package for eg; Lisrel, AMOS, etc.
What are the main differences between PCA, EFA and CFA?
*EFA just looks at shared variance
*PCA is a simple form of Factor Analysis that analyses all the items clustered together, & identifies as much variance as possible
*CFA is more complex looks at unexplained variance and
forces the unexplained constructs into the model - we put all items in and the AMOS program determines which load onto each construct
What do Tabachnick & Fidell say is the goal of research that uses PCA and Factor Analysis?
The goal of research using PCA or FA is to concisely describe, & perhaps understand, the relationships among observed variables or to test theory about underlying processes
What are the uses for Principle Components Analysis (PCA) and Factor Analysis (FA)?
*PCA and FA have considerable utility in reducing numerous variables down to a few factors. *Mathematically, PCA and FA produce several linear combinations of observed variables, each linear combination a factor”
What is the point of Exploratory Factor Analysis (EFA)?
*EFA is intended to describe and summarize data by grouping correlated variables together
Tabachnick & Fidell tell us that FA and PCA differ on the variance that is analysed, how does each analyse the variance?
- In PCA, all the variance in the observed variables is analysed.
- In FA, only shared variance is analyzed; attempts are made to estimate and eliminate variance due to error & variance that is unique to each variable
So, just to clarify, why is Confirmatory Factor Analysis (CFA) so kick-arse?
- CFA tests theoretical & conceptual underpinnings of a theoretical model with items loading on specific factors.
- It measures both the amount of variance explained & the unexplained variance not accounted for within the model.
- CFA identifies the residuals at each level of the model
- CFA cannot be performed through SPSS.
In SPSS there are several Factor Analysis extractions available (except CFA), what are they?
- PCA – the mathematically determined solution with the common, unique and error variances mixed into the components.
- Principal Factor Extraction (PFE) – Estimates communalities in an attempt to eliminate unique and error variances from variables – only shared variance is evaluated.
Model Rotation is required because without rotation it would be difficult to interpret the results. What are the 2 main rotation methods used in SPSS?
*Orthogonal Rotation = axes are maintained at 90 degrees (Orthogonal means right angle
Most common is Varimax)
*Oblique Rotation = axes are not maintained at 90 degrees (Oblique rotations do not need to be at right angles. This is a good for things that are closely associated but not strong correlations)
Tell me more about Orthogonal Rotation
Orthogonal rotation – Varimax rotation is orthogonal rotation that simplifies the factors by setting levels on a simplicity criterion & is the default option in SPSS.
*The goal of Varimax is to maximize the variance of the factor loadings by making high loadings higher and low ones lower for each factor.
Tell me more about Oblique Rotation
Oblique rotation – uses the orthogonally rotated solution on rescaled factor loadings, therefore the solution may be oblique with respect to the original factor loadings. *Note that the factors often do not correlate in Oblique rotation.
What should I be wary of when considering undertaking Factor Analysis?
- You should have a good spread of scores to produce enough variance in the inter-correlations.
- Beware of factors that are defined by only 2 variables. You have a saturated model with 3 or less variables and the question should be asked would one question also be representative – some packages won’t run with 2 variables.
- Relying on statistical analyses alone to produce results should not be done. Remember GIGO – garbage in, garbage out – your data and measures should be theoretically driven
What do Tabachnick and Fidell (2007) suggest a factor matrix should include?
- A matrix that is factorable should include several sizeable correlations.
- The expected size of correlation depends, to some extent, on the sample size
- but if correlations do not exceed .30 then the use of FA is questionable
What do we we need to know about Bartlett’s test of sphericity?
Bartlett’s test of sphericity is very sensitive and with a large sample size it may yield significant results when the sample is >5 (greater than) per variable.
What is Kaiser’s measure of sampling adequacy?
Kaiser’s measure of sampling adequacy is a ratio of the sum of squared correlations to the sum of squared correlations plus sum of squared partial correlations. Values >= (greater han or equal to) .6 are required for good FA