SPRING Factor Analysis Flashcards
what is the purpose of factor analysis
data reduction - understand the variables and their relationships and reduce to their underlying factors
aim to explain variance between individuals and certain variables- account for as much variance as possible
what is a variable
score/measure of an underlying construct
what is a factor
clusters of variables
a dimension/component of a construct ie introversion/extravarsion of personality
beliefs/attitudes etc
some corr more highly than others
what is a latent variable
what youre expecting to measure
want to identify clusters of factros that accurately measure what you want them to
what are pure factors/clusters of factors
uncorrelated - not too strongly with some and weaky with others
how does factor analysis mean to account for as much variance as possible
gather small no factors that explain a good proportion of the vairance
not always able to explain all variables but can explain majoirty well
what is a principle component analysis PCA
simplest factor analysis (factor = component)
when can you use PCA
correlational (interval/ordinal NOT categorical)
sample size 300+
min 5 but preferaby 10+ subjects per variable
cases w/missing data omitted - need to know how many actually analysed
why might you have missing data
too sensitive
missed by accident
didnt understand the question
code data dependent on diff reasons
how many factors need to explain the majority of vairance?
only keep those that explain the most
eigenvalue > 1+ + work out cumulative variance
what is cumulative variance
total variance that all the variables explain together
communalities
proportion of factors in each variable explained by all the extracted factors/components
want variables to explain more that half (0.5) of communalities (extraction collumn in spss) BUT subjective cut off
therefore variables w/communialities 0.5+ are reasonably well explained by the extracted factors
component analysis
shows the initial factors and their loadings- the correlation between the variables and the factors
a strong loading (+-) means variable is strongly representative of a factor
loading values
- 6 = strong
- 4 = mod
- 3 = weak
rotating extracted factors
difficult to interpret
factors in a graph tend to clster together
rotate axic so that loads better on one axis - easier
shows how each variable is represented on each of the 4 factors