model estimation & fit Flashcards

1
Q

what is estimation

A

we need to find values for the model parameters such that 𝚺 and 𝑺 are as similar as possible β†’ using Maximum Likelihood estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

the discrepancy between 𝚺(𝜽) and 𝑺 is operationalized by

A

he fit function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

fit function his expression is derived based on the assumption of

A

multivariate normality y ∼ N (0 ,Σ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When does the fit function yield the lowest value

A

when the model-implied and sample covariance matrices are identical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

A value of 0 for the fit function means

A

that the model implied covariance matrix reproduces the observed covariance matrix perfectly
β†’ this means our model is just identified!
β†’ which means no meaningful way to asses model fit!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

the Maximum Likelihood (ML) approach is

A
  • robust to violations
    • parameter estimates will be correctly obtained
    • but standard errors and model fit may be affected
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

we have to alternatives estimation approaches related to ML

A

Satorra-Bentler β†’ MLM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

why is ML not enough

A

the optimization can only provide the best set of values it can find for our model parameters

  • we still need to assess the quality of the model specification and estimated parameter values
    • to determine how well they explain observed covariance matrix
    • this is where model fit and fit indices come into play
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

model fit =

A
  • evaluate the fit of the specified model given the estimated parameter values
  • checking whether there is evidence of model misspecification
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The $F_{ml}$ is proportional to the likelihood ratio…

A
  • the likelihood of the specified (hypothesized) model divided by the likelihood of the saturated model
    • i.e., difference in the log case
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The $F_{ml}$ tells us

A
  • how well the specified model fits compared to the best possible fit β†’ i.e., the saturated model
    • we translate it into a summary test statistic that is central to model fit
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

a value of $F_{ml}$ = 0 is unlikely since we

A
  1. do not know the population covariance matrix β†’ we only have a sample 𝑺
    even if our model is correctly specified β†’
    $F_{ml}$ β‰  0 due to sampling error
  2. are not interested in the saturated model β†’ we usually want positive degrees of freedom
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

T test statistic formula

A

T = n*Fml

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

T test follows chi square distribution if

A
    • if the model-implied covariance matrix 𝚺 = the population (true) covariance matrix
    • then, 𝑇 has a central $πœ’^2$distribution with as many degrees of freedom as the
      specified (hypothesized) model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Types of model fit

A

Exact, close

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

For exact fit, we want Chi square value to be…

A

LOW, we do not want to reject H0

17
Q

caveats of exact fit

A
  • for small sample sizes we have a poor approximation of the $πœ’^2$distribution
  • large sample size test is overpowered
18
Q

Two models we compare our model with

A

baseline, only item variances
Saturated = DF = 0, T = 0

19
Q

types of fit indices

A

Incremental (compare to baseline) - How much better does the specified model fits compared to the baseline model? - CFI

absolute (compare to saturated) - how close the specified model $𝑇_m$ is to the saturated model - RMSEA

20
Q

incremental fit indices rules

A

CFI, NFI, -> the higher the better!
between 0 and 1
.95 = good
.90 = acceptable

21
Q

absolute fit indices

A

RMSEA = measures the amount of misfit per degree of freedom. allows us to quantify if the specified model is close to the saturated model

  • smaller values are better
    proposed benchmarks β†’ i.e., rules of thumb
    • < 0.5 Γ  very good fit or close fit
    • . 05 βˆ’ .08 Γ  good fit or fair fit
    • . 08 βˆ’ .1 Γ  mediocre fit or good
    • . 05 βˆ’ .08 Γ  good fit or fair fit
    • > .10 Γ  poor or unacceptable
22
Q

Two RMSEA tests

A

Exact fit - RMSEA = 0 ( same as T test statistic)

close fit - RMSEA = good 0.05
sig -> not good

poor fit - RMSEA = 0.08
sig -> good!

23
Q

SRMR

A
  • compares 𝑺 and 𝚺 based on the residuals
  • it is the average of the squared values in the residual correlation matrix

a cutoff value < 0.08 is considered a good fit

24
Q

Goodness of fit

A

how much variance in 𝑺 is explained by πšΊβ€¦

the proportion of variance in the sample covariance matrix 𝑺 accounted for by the model-implied covariance matrix 𝚺

  • takes values in the range 0 to 1
    • higher values are better
    • > .90 is considered acceptable fit
25
Q

Select competing models based on…

A

Nested -> Likelihood ratio test
NOT nested -> information criteria

26
Q

Types of models to compare

A

Simple model (more restrictions)
vs.
complicated model (more general, less restrictions)

27
Q

complicated model has

A

additional parameter, so 1 DF less.

28
Q

Which model always fits better? simple or complicated (general) model?

A

general model!

29
Q

what if simple and complicated model have similar fit

A

If they indicate similar model fit β†’ we prefer the simpler model

30
Q

what are information criteria

A

Information criteria balance model fit and complexity

by weighing the log-likelihood function with model complexity

AIC, BIC ->

31
Q

what do we want AIC and BIC to be?

A

we prefer models that have lower information criteria values

32
Q

model modification
- mi =
- epc =

A

β†’ modification index β†’ the expected decrease in the 𝑇 test statistic

expected parameter change β†’ the approximate value for the added model parameter

33
Q
  • we can think of 𝜽 as
A
  • a vectorized version of the full model specification of 𝚺
34
Q
  • 𝚺(𝜽) contains
A
  • only the free model parameters for which we want to estimate values