Confirmatory Factor Analysis Flashcards
Confirmatory factor analysis purposes
Validate the model
- is a non-iterative procedure
- applies the model developed in previous steps, to a new independent sample
- assess how well the model fits the new sample
- conclude adequate or inadequate
Constraints
to make the model estimable we need to fix one parameter among items on each latent variable to a constant 1.0. This only applies to the unstandardised paths.
CMIN
A measure of residual Chi-Square, the equivalent of the residual SS in regression. Smaller indicates the model ‘fits’ better.
CMIN/DF: values <2 or <5 are often used as desireable rules of thumb. Over this can indicate poor fit of the common factor model.
Independence model:
Equivalent of the null model in regression, assumes the variables are all unrelated.
Saturated model
Equivalent of the full additive model in regression, assumes every variable is related to every other variable.
Default model
The model we have specified, means that the associations exists as per the arrows we include.
NPAR
Number of parameters estimated in the model. Includes variances, covariance and path coefficients.
CFI:
Comparative Fit Index. Assesses improvement in fit in the default model compared with the independence model (IM). Values >0.95 are desirable. Since the independence model cannot be an improvement over itself it’s CFI must be zero. The saturated model must be a complete improvement over the IM so it’s value must be 1.0. The value 0.765 indicates a poor fit.
TLI
Tucker-Lewis Index. A variation on CFI that penalizes models for complexity. Relatively independent of sample size. Values >0.95 are thought of as desireable. The value of CFI=0.742 indicates a poor fit.
RMSEA
Root mean square error of approximation. Assesses error in fit and hence small values are desirable. A common rule-of-thumb is RMSEA<0.05 is desirable. RMSEA has a standard error and hence confidence intervals and hypothesis tests can be constructed.
Modification indices
Is a measure of how much CMIN will reduce by the inclusion of a path or covariance not currently included.
The MI has an associated statistical significance so we can determine the relative value and significance of the improvement.
Each path/covariance uses 1df. A Chi-Square test on 1df needs to exceed 3.8 for p<0.05.
As a general rule:
- Large MIs for covariances between observed variables suggest there is an association between them that is in addition to the specified latent variable structure.
- Large MIs for covariances between latent variables suggest those constructs are related (cf non-orthogonal rotation)
- Large MIs for paths suggest a structural deficiency, such as cross loadings.
CFA - reliability formula
true score variable/total variance
(sum paths)sqrd x factor / ((sum paths)sqrd x factor) + error = reliability
- A model needs to be:
- Identified means the total df:
- Just identified means total df:
- Under identified means total df:
- The total df is calculated as:
- Parameters estimated are:
- Error terms in CFA measure:
- identified for us to obtain SEs and measures of model ‘fit’
- > number of parameters estimated
- = number of parameters estimated
- < number of parameters estimated
- sample moments - parameters estimated
- are all path coeffieicents, variances and covariances
- The amount of variance in the observed variables that is not related to the latent variable.