Week 5 - Structural Equation Modelling 1 (Confirmatory factor analysis) Flashcards
What is it about Structural Equation Modelling (SEM) that makes is particularly useful, relative to all the other techniques we’ve been taught (like regression, ANOVA, etc)
All those techniques are limited to looking at a single relationship at a time, where as SEM allows you to look at multiple relationships simultaneously.
Confirmatory Factor Analysis and Path Analysis are both examples of what?
Structural Equation Modelling
A Structural Equation Model typically has two ‘parts’. What are they?
- The measurement model
2. The structural model
What is the difference between Exploratory Factor Analysis and Confirmatory Factor Analysis?
EFA can impose two kinds of restriction:
- number of factors
- constrain the factor loadings to be uncorrelated (ie orthogonal rotation)
CFA can restrict factor loadings (or factor correlations or variances) to take certain values
- a common value: zero
- if a factor loading was set to zero the hypothesis sis that the observed variable score was not due to the factor
- we get a measure of model fit
Also the sense that CFA contains a hypothesis whereas EFA does not
Confirmatory Factor Analysis is ultimately a measurement model, T/F
TRUE
What are the seven issues that Geoff discussed in relation to CFA?
- Sample size
- Significance testing
- Distribution
- Identification
- Estimation (methods of)
- Assessment of fit
- Specify the model
In CFA, how should we think about SAMPLE SIZE?
In short, it depends. There are various opinions. But at very least, you need a decent sample size.
It’s ropey, but you can work with the idea of wanting more than 20 participants per survey item, and being in trouble if you have less than 10 per item.
In CFA, how should we think about SIGNIFICANCE TESTING?
It’s not such a big deal in CFA.
CFA adopts a ‘higher level’ view.
Also significance testing is pretty suspect just generally
In CFA, how should we think about DISTRIBUTION?
The default technique (maximum likelihood) assumes MULTIVARIATE normality
Widaman (2012): maximum likelihood estimation appears relatively robust to moderate violations of distributional assumptions.
CFA assumes CONTINUOUS variables
In CFA, it is important that your model is IDENTIFIED.
What are the requirements for this?
- Model degrees of freedom must be greater or qual to 0
- All latent variables must be assigned a scale (don’t know what this means atm)
- A number of equations must be solved
With CFA, would we prefer our model to be UNidentified, JUST identified or OVERidentified?
Over
For EFA, what are the three most commonly used methods of estimation?
And which one is preferred
- Unweighted Least Squares
- Generalised Least Squares
- Maximum Likelihood**
In EFA, when assessing model fit, do you want small or large p value?
LARGE!
With EFA, what is the main model fit test statistic used?
Chi squared
What’s the eccentric thing about using Chi Squared to test model fit for EFA?
(Hint: sample size)
It punishes large samples and rewards small samples