Model Estimation & Fit Flashcards
what is estimation
we need to find values for the model parameters θ such that the model-implied covariance matrix Σ(θ) and the sample covariance matrix S are as similar as possible → using Maximum Likelihood estimation
the discrepancy between Σ(θ) and S is operationalized by
the fit function
the fit function's expression is derived based on the assumption of
multivariate normality: y ∼ N(0, Σ)
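Under that assumption, the ML fit function has the standard closed form (p = number of observed variables, S = sample covariance matrix, Σ(θ) = model-implied covariance matrix):

```latex
F_{ML} = \ln\lvert\Sigma(\theta)\rvert + \operatorname{tr}\!\bigl(S\,\Sigma(\theta)^{-1}\bigr) - \ln\lvert S\rvert - p
```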
When does the fit function yield the lowest value
when the model-implied and sample covariance matrices are identical
A value of 0 for the fit function means
that the model implied covariance matrix reproduces the observed covariance matrix perfectly
→ this means our model is just identified!
→ which means there is no meaningful way to assess model fit!
the Maximum Likelihood (ML) approach is
- fairly robust to violations of multivariate normality
- parameter estimates will still be correctly obtained
- but standard errors and model fit may be affected
we have two alternative estimation approaches related to ML
Satorra-Bentler → MLM
why is ML not enough
the optimization can only provide the best set of values it can find for our model parameters
- we still need to assess the quality of the model specification and estimated parameter values
- to determine how well they explain the observed covariance matrix
- this is where model fit and fit indices come into play
model fit =
- evaluate the fit of the specified model given the estimated parameter values
- checking whether there is evidence of model misspecification
The $F_{ml}$ is proportional to the likelihood ratio…
- the likelihood of the specified (hypothesized) model divided by the likelihood of the saturated model
- i.e., the difference of their log-likelihoods
The $F_{ml}$ tells us
- how well the specified model fits compared to the best possible fit → i.e., the saturated model
- we translate it into a summary test statistic that is central to model fit
a value of $F_{ml}$ = 0 is unlikely since we
- do not know the population covariance matrix → we only have a sample S
- even if our model is correctly specified, $F_{ml}$ ≠ 0 due to sampling error
- are not interested in the saturated model → we usually want positive degrees of freedom
T test statistic formula
T = n · $F_{ml}$
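A minimal numerical sketch of $F_{ml}$ and T (assuming numpy; some software uses n - 1 instead of n as the multiplier):

```python
import numpy as np

def f_ml(S, Sigma):
    # F_ml = ln|Sigma| + tr(S Sigma^{-1}) - ln|S| - p
    p = S.shape[0]
    _, logdet_sigma = np.linalg.slogdet(Sigma)
    _, logdet_s = np.linalg.slogdet(S)
    return logdet_sigma + np.trace(S @ np.linalg.inv(Sigma)) - logdet_s - p

S = np.array([[1.0, 0.3], [0.3, 1.0]])
n = 200
F_perfect = f_ml(S, S)        # identical matrices: F_ml is zero
T = n * f_ml(S, np.eye(2))    # a misfitting model: T > 0
```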
T test follows a chi-square distribution if
- the model-implied covariance matrix Σ(θ) = the population (true) covariance matrix
- then, T has a central $χ^2$ distribution with as many degrees of freedom as the specified (hypothesized) model
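A small sketch of the degrees-of-freedom count and the resulting p-value (assuming scipy; the function name `model_df` and the example numbers are illustrative):

```python
from scipy.stats import chi2

def model_df(p, q):
    # df = unique elements in a p x p covariance matrix minus q free parameters
    return p * (p + 1) // 2 - q

# illustrative: 4 observed variables, 8 free parameters -> df = 2
df = model_df(4, 8)
# probability of observing T >= 5.99 under the central chi^2 distribution
p_value = chi2.sf(5.99, df)
```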
Types of model fit
Exact, close
For exact fit, we want the chi-square value to be…
LOW, we do not want to reject H0
caveats of exact fit
- for small sample sizes we have a poor approximation of the $χ^2$ distribution
- for large sample sizes the test is overpowered
Two models we compare our model with
baseline model = only item variances (all covariances fixed to zero)
saturated model = df = 0, T = 0
types of fit indices
Incremental (compare to baseline) - how much better does the specified model fit compared to the baseline model? - CFI
absolute (compare to saturated) - how close is the specified model Σ(θ) to the saturated model? - RMSEA
incremental fit indices rules
CFI, NFI, -> the higher the better!
between 0 and 1
.95 = good
.90 = acceptable
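One common definition of CFI (1 minus the ratio of noncentrality estimates, floored at zero) can be sketched as follows; the function name and input values are illustrative:

```python
def cfi(T_m, df_m, T_b, df_b):
    # noncentrality estimates of the specified (m) and baseline (b) models,
    # floored at zero
    d_m = max(T_m - df_m, 0.0)
    d_b = max(T_b - df_b, 0.0)
    if max(d_m, d_b) == 0.0:
        return 1.0  # both models fit at least as well as their df expect
    return 1.0 - d_m / max(d_m, d_b)

# illustrative values: a well-fitting model against a badly fitting baseline
fit = cfi(30.0, 24.0, 900.0, 36.0)  # close to 1
```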
absolute fit indices
RMSEA = measures the amount of misfit per degree of freedom; it quantifies whether the specified model is close to the saturated model
- smaller values are better
proposed benchmarks, i.e., rules of thumb
- < .05 → very good fit or close fit
- .05 – .08 → good fit or fair fit
- .08 – .10 → mediocre fit
- > .10 → poor or unacceptable fit
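A sketch of the usual RMSEA point-estimate formula (illustrative function; conventions differ on whether n or n - 1 appears in the denominator):

```python
import math

def rmsea(T, df, n):
    # point estimate: sqrt(max(T - df, 0) / (df * (n - 1)))
    return math.sqrt(max(T - df, 0.0) / (df * (n - 1)))

# illustrative: T = 20 with df = 10 and n = 201 -> sqrt(10 / 2000)
est = rmsea(20.0, 10, 201)
```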
Two RMSEA tests
Exact fit - H0: RMSEA = 0 (same as the T test statistic)
close fit - H0: RMSEA ≤ .05
significant → not good
poor fit - H0: RMSEA ≥ .08
significant → good!
SRMR
- compares Σ(θ) and S based on the residuals
- it is the square root of the average of the squared values in the residual correlation matrix
a cutoff value < 0.08 is considered a good fit
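A sketch of this computation (assuming numpy; exact standardization conventions vary across software, so treat this as illustrative):

```python
import numpy as np

def srmr(S, Sigma):
    # standardize the residuals with the sample standard deviations,
    # then take the root mean square over the unique (lower-triangle) elements
    d = np.sqrt(np.diag(S))
    resid = (S - Sigma) / np.outer(d, d)
    idx = np.tril_indices_from(resid)  # p(p+1)/2 unique elements
    return np.sqrt(np.mean(resid[idx] ** 2))

S = np.array([[1.0, 0.4], [0.4, 1.0]])
Sigma = np.array([[1.0, 0.3], [0.3, 1.0]])
value = srmr(S, Sigma)  # small residual of 0.1 in one off-diagonal cell
```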
Goodness of fit
how much variance in S is explained by Σ(θ)…
the proportion of variance in the sample covariance matrix S accounted for by the model-implied covariance matrix Σ(θ)
- takes values in the range 0 to 1
- higher values are better
- > .90 is considered acceptable fit
Select competing models based on…
Nested -> Likelihood ratio test
NOT nested -> information criteria
Types of models to compare
Simple model (more restrictions)
vs.
complicated model (more general, less restrictions)
complicated model has
an additional parameter, so one degree of freedom less.
Which model always fits better? simple or complicated (general) model?
general model!
what if simple and complicated model have similar fit
If they indicate similar model fit → we prefer the simpler model
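The likelihood ratio test for nested models can be sketched as follows (assuming scipy; the T and df values are illustrative):

```python
from scipy.stats import chi2

def lr_test(T_simple, df_simple, T_general, df_general):
    # the simpler (restricted) model can never fit better, so the
    # difference in T is nonnegative and has df_simple - df_general df
    delta_T = T_simple - T_general
    delta_df = df_simple - df_general
    return delta_T, delta_df, chi2.sf(delta_T, delta_df)

# illustrative values: freeing one parameter improves T by 2 points
dT, ddf, p = lr_test(T_simple=32.0, df_simple=25, T_general=30.0, df_general=24)
# if p > .05, the restriction is not rejected -> keep the simpler model
```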
what are information criteria
Information criteria balance model fit and complexity
by weighing the log-likelihood function against model complexity → AIC, BIC
what do we want AIC and BIC to be?
we prefer models that have lower information criteria values
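Minimal sketches of the two criteria (standard formulas; the log-likelihood and parameter counts are illustrative):

```python
import math

def aic(loglik, k):
    # AIC = -2 log-likelihood + 2k, where k = number of free parameters
    return -2.0 * loglik + 2.0 * k

def bic(loglik, k, n):
    # BIC = -2 log-likelihood + k log(n): heavier complexity penalty as n grows
    return -2.0 * loglik + k * math.log(n)

# illustrative values; the model with the lower criterion is preferred
a = aic(-1234.5, 10)
b = bic(-1234.5, 10, 300)
```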
model modification
- MI = modification index → the expected decrease in the T test statistic
- EPC = expected parameter change → the approximate value for the added model parameter
- we can think of θ as
- a vectorized version of the full model specification of Σ
- θ contains
- only the free model parameters for which we want to estimate values