Factor Analysis Flashcards
Factor analysis
Ways of chunking data into theoretically reverent sub-scales or factors
SPSS lists it as ‘data reduction method’
Way of summarising large amount of info in terms of its underlying dimensions or factors
Like a quantitative thematic analysis- runs correlations between all items in large data set
Then generates themes (components/factors) based on the patterns (which items correlation more/less)
When would you use factor analysis/principal component analysis
When developing new scale/questionnaire-identify sub-scales & superfluous/irrelevant items
When trying to reduce number of items on existing questionnaire
When testing underlying structure of dataset
To see if there might be common theoretical underpinning or set of separate measures
Individual differences literature
Not when you want to chunk groups of pp together- cluster analysis
(Exploratory) factor analysis vs PCA
SPSS treats them like they’re the same
Most often give same results so terms used interchangeably
Confirmatory factor analysis is different & SPSS can’t do it
Stages of exploratory factor analysis
1) Make correlation matrix (of all variables)
2) extract principal components
3) rotate principal components
4) interpret components/factors
Is it quantitative?
Yes in as far as calculations, rotations & sorting factors- based on numerical data & statistical transformations
But no critical values so no significant/non-significant or effect sizes
Factors named & interpreted far more like qualitative analysis
Types of factor analysis
Exploratory= looks for emergent structure in current dataset based on patterns of correlations between variables (bottom-up)
Confirmatory= tests whether factor structure in current dataset fits specified model (top-down)
Stage 1: correlation matrix
Straightforward correlation matrix of variables but rather than correlating mean or sum scales, this is a correlation matrix of every item in dataset
Factor analysis tries to identify underlying patterns or dimensions that explain patterns of correlations
Stage 2: extracting components
Extracting first component is like finding a line of best fit in regression
Trying to explain as much variance between variables as possible with single factor
This new component will be as highly correlated as possible with as many items as possible- explains most variance
Second component will try to explain as much variable as possible while controlling for the first component & so on- explains less variance
Stage 3; rotation
After components/factors extracted, they are rotated in statistical space
Puts number of stats constraints on what final factors look like
These different rotations (patterns of constraints) may make more or less sense to use depending on data & questions asked
Types of rotation
Orthogonal= factors resulting from analysis effectively cannot correlate with one another, each factor is theoretically distinct
Oblique= factors can correlate with each other, so different factors can account for some of same underlying variability
Factor loadings
A component/factor is way of reducing set of related variables into simpler, single score
But each of these items will have different strengths of relationships with component factor
Relationship of each item to given component/factor is called factor loading
We care about factor loadings of 0.3 or higher
Orthogonal analyses
VARIMAX= maximises variance explained
QUARTIMAX= variables will load highly on one factor but low on others
EQUAMAX= seeks more equal variance explained by each factor
Oblique analyses
DIRECT OBLIMIN= gives higher Eigen values but can be difficult to interpret due to overlapping constructs
PROMAX= same
Deciding number of factors: Kaiser Criterion
Kaiser Criterion= Eigenvalues>1
Eigenvalues represent total (squared) factor loadings onto single factor
So higher factor loadings mean factor explains more of variance in sample
Ratio of total variance explained to number of items
So if Eigenvalues=1 for a factor, it explains same amount of variance as individual variables do by themselves
Deciding number of factors: Scree plot
Look for knee or elbow of plot
Sometimes may be ambiguous