model estimation & fit Flashcards
what is estimation
we need to find values for the model parameters such that πΊ and πΊ are as similar as possible β using Maximum Likelihood estimation
the discrepancy between πΊ(π½) and πΊ is operationalized by
he fit function
fit function his expression is derived based on the assumption of
multivariate normality y βΌ N (0 ,Ξ£)
When does the fit function yield the lowest value
when the model-implied and sample covariance matrices are identical
A value of 0 for the fit function means
that the model implied covariance matrix reproduces the observed covariance matrix perfectly
β this means our model is just identified!
β which means no meaningful way to asses model fit!
the Maximum Likelihood (ML) approach is
- robust to violations
- parameter estimates will be correctly obtained
- but standard errors and model fit may be affected
we have to alternatives estimation approaches related to ML
Satorra-Bentler β MLM
why is ML not enough
the optimization can only provide the best set of values it can find for our model parameters
- we still need to assess the quality of the model specification and estimated parameter values
- to determine how well they explain observed covariance matrix
- this is where model fit and fit indices come into play
model fit =
- evaluate the fit of the specified model given the estimated parameter values
- checking whether there is evidence of model misspecification
The $F_{ml}$ is proportional to the likelihood ratioβ¦
- the likelihood of the specified (hypothesized) model divided by the likelihood of the saturated model
- i.e., difference in the log case
The $F_{ml}$ tells us
- how well the specified model fits compared to the best possible fit β i.e., the saturated model
- we translate it into a summary test statistic that is central to model fit
a value of $F_{ml}$ = 0 is unlikely since we
-
do not know the population covariance matrix β we only have a sample πΊ
even if our model is correctly specified β $F_{ml}$ β 0 due to sampling error - are not interested in the saturated model β we usually want positive degrees of freedom
T test statistic formula
T = n*Fml
T test follows chi square distribution if
- if the model-implied covariance matrix πΊ = the population (true) covariance matrix
- then, π has a central $π^2$distribution with as many degrees of freedom as the
specified (hypothesized) model
Types of model fit
Exact, close