M5 - CFA Flashcards

1
Q

Part A - Question: Which of the following statements is correct?

  1. Whereas exploratory factor analysis uses dimensions/factors to explain shared variance among variables in the analysis, confirmatory factor analysis uses dimensions/factors to account for all variance (shared and unique) in the model.
  2. Confirmatory factor analysis allows us to test a proposed model about how variables relate to each other.
  3. Factor analysis suggests that two variables are related, but doesn’t propose a shared third-variable to explain this association.
  4. Factor analysis takes a small number of variables and combines into a large number of factors
A
  1. Confirmatory factor analysis allows us to test a proposed model about how variables relate to each other.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Part E - Question: Based on the information here, which model would you prefer?

Model 1:
Chi square = 44.59, p = .002
CFI = .876
RMSEA = .12
SRMR = .08
Model 2:
Chi square = 123.45, p < .001
CFI = .622
RMSEA = .37
SRMR = .12
  1. Model 1.
  2. Model 2.
  3. Neither.
  4. Both.
A
  1. Neither.

Although Model 1 has better fit, neither is a good fit to the data and hence we would not be satisfied with either.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is confirmatory factor analysis?

A

What is CFA?

  • a Data Reduction method
  • useful to summarise large number of variables into a smaller number of factors (latent variables).
  • tests plausibility of the model –> more control over result
  • Occurs when 2 variables are related but it is due to an underlying cause. Factors explain variance in the variables and correlations amongst them (third variable scenario)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does a Confirmatory factor analysis imply?

A

It implies:

  • there is a latent variable
  • correlations between measured variables are due to a shared underlying factor
  • after controlling for the underlying factor, the variables should no longer correlate (no residual relationship between the variables remain)
  • with two+ latent factors, associations between variables from different latent factors are entirely accounted for by factor correlation (oblique model)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain scaling.

A

Scaling helps to fix the scores so that we can ascertain the relationship between the variables and the factors

Two methods:
1 - Constrain factor variance to 1 (eg standardised variables into z scores - where SD is 1)
2 - Constrain one of the factor loadings to 1 (most common approach)

Need to do this with
- a path from each latent variable to a related measured variable
- for every error term
Otherwise AMOS won’t run

Having these both set to 1 allows us to have the sense of total variance being 100% (partly latent –> measured, partly measured to error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are five problems with chi square?

A
  1. Favours complexity
    - harder to get p > .05 good fit no sig. diff with parsimonious model
  2. Overly sensitive to non-normality
    - inflates chi2
    - more likely to reject the model
  3. sensitive to sample size
    - harder to achieve p> .05 with larger sample
  4. sensitive to size of correlation
    - model fit punished more by omission of big correlations than small
  5. sensitive to measurement error
    - measurement error reduces power by reducing possible size of correlation
    - bias in favour of accepting your model (this is why its important to measure internal consistency)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the alternative fit indices beyond chi square.

A

Global Fit Statistics
Absolute fit –> comparing against perfectly fitting model
- SRMR is the Standardised Root Mean Residual. It is the average error of all things to be reproduced by our model
- RMSEA is the Root Mean Square Error of Approximation = average error accounting for model complexity and sample size. It punishes a model that is overly complex, compared to chi-2 which favours complexity

Incremental fit –> comparing against worst fitting model (null)
- CFI is Comparative Fit Index
expressed as:
x2 - df (null) - x2 - df (proposed) / x2 - df (null)
- TLI is Tucker Lewis Index
punishes for model complexity
expressed as:
x2 /df (null) - x2 / df (proposed) / [(x2/ (df (null) - 1)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are oblique and orthogonal factors?

A

In a confirmatory factor analysis where there are multiple factors, the researcher must specify whether they are oblique or orthogonal

Oblique - expected to correlate
orthogonal - expected not to correlate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why use CFA?

A
  • Useful for working out construct validity of research measures like self report scales
  • Focus on separating unique from shared variance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When should you use CFA?

A
  • Test in SPSS AMOS when we have an idea of the how the variables are related
  • Test after researching the literature, designing the theoretical model and collecting the data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How should the fit statistics for CFA be interpreted?

A
Global good fit
Chi2 = CMIN closer to zero and non-significant
Normative chi2 <3
RMSEA < .08
SRMR < .06
CFI >.95 
TLI > .95

Local Fit
covariance = unstandardised correlation - are the variables related to the factor proposed?)
correlation - standardised
Use Cohen’s 0.1, 0.3, 0.5 guidelines for assessing strength of correlation
R2 communalities
If a communality is < .1 (< 10% of variance) then it might be an outlier and it should be considered to remove it. Then the remaining factor will likely increase in their relationship (but the scale can no longer be compared with other

  • One way around this would be to split the sample it half and try to recreate the results research with other samples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain what the CFA Global Fit Output CMIN, NPAR and DF relate to

What is meant by Default, Saturated and Independence models?

A

CMIN is Chi-Sq minimum - is a representation of Chi Square

NPAR - the number of parameters that have the be estimated in the model

DF - degrees of freedom - # additional paths that we could draw in the model than we currently have drawn

Models

  • Default - Proposed
  • Saturated - Perfect fitting - all relationship between variables identified
  • Independence - Worst fitting - no relationships between models identified
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a good fit for global fit indices?

A

Good fit is:

X2 low and not significant
normative chi2 < 3
RMSEA < .08
SRMR < .06
CLO and TFI > .95
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain CFA local fit output

A

Covariance is unstandardised
Correlation is standardised
Use Cohen’s guidelines for weak, moderate strong correlation (.1 .3 .5)

C.R critical ratio - estimate / SE = T test
if there is an even more important predictor in the true model that hasnt been identified or drawn, the predictor may look better than they actually are, so should be taken with a grain of salt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Outline the Key Output for CFA

A
Global Fit Statistics 
Unstandardised Factor loadings
Standardised Factor Loadings
Covariances and Correlations
R Squared Estimate (communalities)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Interpreting CFA

  • what happens when a correlation remains between measured variables after controlling for the latent factor
A

this suggests other unidentified factor share or explain the relationship and the model fit will be poor

17
Q

Explain the strength of the relationship 1) between variables and 2) between measured variable and latent factor and how it relates to explained and unexplained variance

A

1) stronger relationship between variables means stronger FA and that those variables more closely represent the factor
2) stronger relationship between latent factor and measured variable also means strong FA. More variance in the measured variance is explained and less variance remains unexplained (error term)

18
Q

What is communalities?

A
  • Communalities is the R-squared estimate

- helps determine the amount of variance in a variable that is explained by all factors included in the model