Factor Analysis Flashcards
What are you trying to do in a factor analysis?
Trying to build a table - solid, dependable, everyone uses them, easy to understand. But when you look closer, it is hard to construct a table - legs, what it is made up of etc
First one won’t be very good
How do you make a questionnaire?
Items: the part people interact with - see if they are good quality
Factors: the structure which holds up the items, want a few but provide the main support
What is the main idea?
Reduce a large number of variables to a smaller set of representative, meaningful variables while keeping as much information as possible - identify factors from a large set of correlated items
What do you get out of a FA?
A set of statistically identified factors - clusters of items which all measure the same characteristics / data - use these as variables for future analyses
What are factors?
Clusters of items which all measure the same characteristic
What is the goal?
Identify how many factors you have and what characteristic those factors each represent
What are the stages of FA?
- Identify variables and design
- Check data and assumptions
- Rotation
- Interpret the results
Identify variables and design: what are the initial checks?
Check the data
is there any missing?
what scale is the data measured on? - what do each end mean?
how many items and participants are there?
remove ppts who are incomplete cases or make invalid answers
Checking data and assumptions: what are the initial checks?
Normality and standard deviations
check the items are normally distributed (all of them)
check SD’s are between 0.5 and 1.5
identify the worst offenders - if all the data is skewed, can’t chuck them all out, identify the worst ones, may want to exclude them
What should the SD’s be?
Between 0.5 and 1.5
Checking data and assumptions: what are the second checks?
Correlations
Sphericity
Sampling adaquacy
Checking the correlations
You get a massive correlation - you expect them to correlate in FA, we are looking for underlying factors that explain groups of items so want correlations
What are the possible problems with correlations?
- items that don’t correlate with anything else - might indicate that an item doesn’t measure the construct, so not valid
look for items with r < .3 or p > .05 (not just one, has to be many) - items that correlate too highly - too much overlap, measuring the same thing, not valid
singularity r > .9
problems with multicollinearity
Check the determinant - should be greater than 0.00001 - no problems with multicollinearity
What should you do with correlations?
Identify the worst offenders - can’t discard them all if lots of WO, pick the ones which don’t correlate with loads of items, if it is just one item, then it is fine, report with justification
At this point, run the analysis with the final set of items
How do you check for suitability of the data?
Kaiser-Meyer-Olkin Measure of sampling adequacy KMO
Do you have a sufficient sample to extract the factors?
Values range between 0 (inappropriate) and 1 (go for it)
marvellous - bigger than .9
middling - bigger than .7
miserable - above 0.5
if below 0.5, you should stop and collect more data or do something else
Report it and cite the data