SEM Flashcards

Question

Total effects are...

Answer 1

- These represent the total causal effect (DIRECT AND INDIRECT) of one variable on another.
- Calculated by summing all direct and indirect effects.
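A minimal sketch of that sum, assuming each indirect effect is the product of the coefficients along its path. The function name and the coefficients are hypothetical, chosen only for illustration.

```python
def total_effect(direct, indirect_paths):
    """Add the direct effect to every indirect effect.

    Each indirect effect is the product of the path coefficients
    along one route from the predictor to the outcome.
    """
    total = direct
    for path in indirect_paths:
        product = 1.0
        for coefficient in path:
            product *= coefficient
        total += product
    return total


# Hypothetical standardised coefficients (not from any real data set):
# X -> Y direct = 0.20; X -> M = 0.50 and M -> Y = 0.40 (one route via M).
effect = total_effect(0.20, [[0.50, 0.40]])
print(round(effect, 2))
```

Here the indirect effect is 0.50 * 0.40 = 0.20, so the total effect is 0.20 + 0.20 = 0.40.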

Answer 2

- You cannot enter and exit a variable on an arrowhead.
- You cannot enter a variable twice on the same trace.

Answer 3

It is important to remember that finding a good model fit does not exclude the possibility that another model will explain the data better.

Answer 4

Paths omitted are as important to the model as paths included. Their absence makes a theoretical statement (even if not explicitly expressed); e.g. ‘I hypothesise there are no direct effects of ethnicity and family background on grades’.

Answer 5

For model parsimony. The model is therefore simpler, or more parsimonious, than a full model in which all possible paths must be estimated.
Parsimonious models (if plausible) have several advantages:
- The simplest (but sufficient) models are preferred in science. Occam's razor: ‘all other things being equal, the simplest model is the most preferred’.
- It is easy for a reduced model to be a statistically worse fit than the full model; if it survives this test of fit, it has more credibility as a plausible model.
Explaining more with less: a saturated model would explain everything, but a model that uses just 2 variables and explains 85% of the variance would be a great parsimonious model.

Answer 6

Constraining the model in various ways is a way of testing a theory or a particular set of research questions: we want to test our data as rigorously as we can by having the most parsimonious model.

Answer 7

unmeasured

Answer 8

close to saturation

Answer 9

The basic notion: the difference between the observed (sample) correlation, which the saturated full model reproduces, and the correlation implied by the reduced model is the RESIDUAL, and we want this to be as small as possible.

Answer 10

1) Chi-square test (as a minimum; CMIN in AMOS)
2) Standardised root mean square residual (SRMR)
3) Root mean square error of approximation (RMSEA)
4) Goodness of fit index (GFI)
Mnemonic: come in sr. mr. Ramseay: go fuck it

Answer 11

- A χ² of 0 means it is the saturated model.

Answer 12

goes up: with fewer paths estimated (more degrees of freedom), error increases

Answer 13

High. Significance of χ² is a measure of bad fit, so we are looking for non-significance: we want to show that the results are not significantly different from the saturated model despite having fewer paths.

Answer 14

- A residual correlation is the difference between a sample correlation and the implied correlation.
- The SRMR is based on the average absolute value of the residual correlations.
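A small sketch of the residual idea, using hypothetical 3 x 3 correlation matrices. It reports both the mean absolute residual described on the card and a root-mean-square summary, which is one common form of the SRMR. The exact formula a given SEM package uses is an assumption here and may differ, for example in whether diagonal elements are counted.

```python
import math


def residual_correlations(sample, implied):
    """Residual = sample correlation minus implied correlation,
    taken over the unique off-diagonal pairs of variables."""
    size = len(sample)
    return [sample[i][j] - implied[i][j]
            for i in range(size) for j in range(i)]


def mean_absolute_residual(residuals):
    return sum(abs(r) for r in residuals) / len(residuals)


def root_mean_square_residual(residuals):
    return math.sqrt(sum(r * r for r in residuals) / len(residuals))


# Hypothetical matrices, for illustration only.
sample = [[1.00, 0.50, 0.40],
          [0.50, 1.00, 0.30],
          [0.40, 0.30, 1.00]]
implied = [[1.00, 0.50, 0.35],
           [0.50, 1.00, 0.20],
           [0.35, 0.20, 1.00]]

residuals = residual_correlations(sample, implied)
mean_abs = mean_absolute_residual(residuals)
rms = root_mean_square_residual(residuals)
print(round(mean_abs, 3), round(rms, 3))
```

If the implied matrix reproduced the sample matrix exactly, every residual and both summaries would be zero, which is the perfect-fit case.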

Answer 15

- An SRMR of zero would equal perfect fit (no residual).
- SRMR

Answer 16

- Popular fit measure.
- Designed to assess the approximate fit of a model while rewarding parsimony: of two models with similar explanatory power, the simpler model (fewer paths, so more df) will be favoured.

Answer 17

Browne and Cudeck (1993) suggested:
- RMSEA < .05: good fit
- RMSEA < .08: reasonable fit
- RMSEA above .10: poor fit
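A hedged sketch of how an RMSEA value might be computed and read against these cut-offs. The formula used, sqrt(max(χ² - df, 0) / (df * (N - 1))), is a commonly cited point estimate and is an assumption here, since the card gives only the cut-offs (some software divides by N rather than N - 1). The function names and input values are hypothetical.

```python
import math


def rmsea(chi_square, df, n_cases):
    """Commonly cited RMSEA point estimate (assumed form, see lead-in)."""
    return math.sqrt(max(chi_square - df, 0.0) / (df * (n_cases - 1)))


def describe_rmsea(value):
    """Apply the Browne and Cudeck (1993) cut-offs quoted on the card."""
    if value < 0.05:
        return "good fit"
    if value < 0.08:
        return "reasonable fit"
    if value > 0.10:
        return "poor fit"
    # The card does not label the .08 to .10 band.
    return "between the quoted cut-offs"


# Hypothetical model: chi-square = 36 on 20 df with 201 cases.
value = rmsea(36.0, 20, 201)
print(round(value, 3), describe_rmsea(value))
```

Because χ² - df is divided by df, a model that reaches the same χ² excess with more degrees of freedom (fewer estimated paths) gets a lower RMSEA, which is how the index rewards parsimony.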

Answer 18

- Analogous to R² (estimates the total variance accounted for by the model)

Answer 19

- Values closer to 1 indicate better fit.
- Hu & Bentler (1999): > .95 = good; > .90 = adequate

Answer 20

The purpose of model fitting is to rule out bad models; it cannot prove a model is good.
- Bad model fit: the model doesn’t explain the data as well as other models might (e.g. a model with paths dropped/added). Refine or discard the model.
- Good model fit: fails to disconfirm your model, so you may have a good model. But ‘fit’ is with reference to the variables in your model. Your model is not the one and only model: (i) an alternative model with a different specification of paths might be even better, so it is still worth testing alternative models; (ii) maybe there is a more complete model (more variables).
But the status of ‘not yet disconfirmed’ is powerful in science.

Answer 21

Confirmatory Factor Analysis

Answer 22

- Helps to confirm a structure and test a theoretically driven model of psychological measures.
- I.e. once we have an EFA-derived measure, we can administer it to a new sample and see if we can confirm the original measurement model.
- Provides important information on how a measurement tool is structured and/or how latent factors are related to each other.

Answer 23

- In CFA we CONSTRAIN factor loadings (usually to 0), i.e. we do NOT allow observed items/indicators to load freely on all of the other factors (cutting off some data from some factors). EFA is saturated, as items can load freely on all factors without any constraints; CFA is about CONFIRMING.
- So the CFA model is more constrained than the EFA model.

Answer 24

Factor loadings: estimate the relationship between an indicator and its factor. Can be thought of as correlations. Need to be > .50.

Answer 25

Factor covariances: estimate the relationship between latent factors. Used to examine the convergent and discriminant validity of factors.

Answer 26

- Model the variation in the indicator variable not accounted for by the factor, e.g. anything that accounts for word vocabulary excluding verbal IQ.
- These error terms are usually uncorrelated with each other, but you could model error correlations if you expected that responses across indicators would be caused by something other than the factors, e.g. method effects.

Answer 27

1) Specify the model
2) Model identification
3) Model estimation
4) Testing model fit
5) Interpret model effects
6) Modifying models
7) Reporting results
Mnemonic: sperm molesting inmates effect modest reporter

Answer 28

(SETTING UP STRUCTURE)
- Cannot know the variance of unmeasured variables.
- Fix the error variances to 1 in model specification.
- The factor is also unmeasured, so again its variance is unknown.
- Set to 1 again, but only one factor loading per factor is needed.
- IMPORTANT FOR IDENTIFICATION.
- Software does it for you.

Answer 29

- Knowns: calculate the number of observed variances and covariances, i.e. v * (v + 1) / 2, where v equals the number of variables.
- Unknowns: count up the number of free paths and variances.
- Calculate knowns minus unknowns for the model df.
- If the model df is greater than or equal to 0, then proceed. If not, the model needs to be re-specified.
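The counting rule above can be sketched as a short calculation. The function names and the example counts (6 observed variables, 13 free parameters) are hypothetical, chosen only to show the arithmetic.

```python
def count_knowns(n_variables):
    """Number of observed variances and covariances: v * (v + 1) / 2."""
    return n_variables * (n_variables + 1) // 2


def model_df(n_variables, n_free_parameters):
    """Model degrees of freedom = knowns - unknowns."""
    return count_knowns(n_variables) - n_free_parameters


def can_proceed(n_variables, n_free_parameters):
    """The card's rule: proceed only if the model df is >= 0."""
    return model_df(n_variables, n_free_parameters) >= 0


# Hypothetical model: 6 observed variables, 13 free paths and variances.
print(count_knowns(6), model_df(6, 13), can_proceed(6, 13))
```

With 6 variables there are 21 knowns, so 13 unknowns leave 8 degrees of freedom; asking for 22 or more free parameters would give a negative df and the model would need re-specifying.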

Answer 30

- Estimate model parameters (factor loadings and covariances).
- Test global model fit (and against alternative models).

Answer 31

- We use the same model fit indices from earlier, e.g. model chi-square, RMSEA etc.
- There is no gold-standard fit index.
- There is a lot of debate about golden rules (and otherwise) for various fit indices.
- Need to consider and report a range of fit indices.
- Think about the fit indices in the context of your specific model, rather than blindly applying rules of thumb.

Answer 32

- A residual correlation is the difference between a sample correlation and the implied correlation.
- The SRMR is based on the average absolute value of the residual correlations.
- An SRMR of zero would equal perfect fit (no residual).
- SRMR

Answer 33

- A χ² of 0 means it is the saturated model.
- Error increases as paths are reduced, and χ² goes up.
- Significance of χ² is a measure of bad fit, so we are looking for non-significance: we want to show that the results are not significantly different from the saturated model despite having fewer paths.

Answer 34

- Analogous to R² (estimates the total variance accounted for by the model).
- Values closer to 1 indicate better fit.
- Hu & Bentler (1999): > .95 = good; > .90 = adequate

Answer 35

- Popular fit measure.
- Designed to assess the approximate fit of a model while rewarding parsimony: of two models with similar explanatory power, the simpler model (fewer paths, so more df) will be favoured.
- Browne and Cudeck (1993) suggested: RMSEA < .05 is a good fit; RMSEA < .08 is a reasonable fit; RMSEA above .10 is a poor fit.

Answer 36

We can test different factor models against each other: the hunt for parsimony.

Answer 37

building or trimming - i.e. adding or deleting paths to or from original model

Answer 38

- Starts with a bare-bones model, then adds path(s).
- If extra paths significantly improve fit, these are added to the model.

Answer 39

- Typically starts with a saturated model and simplifies it by eliminating paths.
- If the model fit does not significantly deteriorate, then paths can be removed (the model is no worse but is simpler).
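One common way to judge whether fit "significantly deteriorates" between nested models is a chi-square difference test; that this is the test intended on the card is an assumption. The sketch below compares the change in χ² against standard .05 critical values for 1 to 3 df. The function name and the fit values are hypothetical.

```python
# Chi-square critical values at alpha = .05 for 1, 2 and 3 df.
CRITICAL_05 = {1: 3.841, 2: 5.991, 3: 7.815}


def fit_significantly_worse(chi_full, df_full, chi_reduced, df_reduced):
    """Chi-square difference test for a reduced model nested in a fuller one.

    Removing paths raises both chi-square and df; the increase in
    chi-square is compared with the critical value for the increase in df.
    """
    delta_chi = chi_reduced - chi_full
    delta_df = df_reduced - df_full
    return delta_chi > CRITICAL_05[delta_df]


# Hypothetical fit values: dropping one path raises chi-square from 10.0 to 12.5.
worse = fit_significantly_worse(10.0, 5, 12.5, 6)
print(worse)
```

Here the increase of 2.5 is below 3.841, so the fit has not significantly deteriorated and the simpler model would be kept under this rule.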

Answer 40

re-specify the model as a path model

Answer 41

DOUBLE-HEADED ARROWS are re-specified as PATHS. A measurement model (CFA) turns into a full SEM by changing double-headed arrows into direct single-headed arrow paths.

Answer 42

structural equation model SEM.

Answer 43

Deletion/adding of paths can be theoretically or empirically driven

Answer 44

- Model trimming/building is guided by theoretical a priori considerations, e.g. ‘I hypothesise that ethnicity & family background have no direct effect on grades (effects are likely to be indirect ones) and therefore adding them as direct paths will not result in a significantly improved model’.

Answer 45

- Paths are added to or deleted from the model purely on the basis of statistical criteria.
- In model building, Modification Indices (MI) are another route: the MI (improvement in χ² value) for all paths are examined to see which ones would significantly improve the model.
- Can capitalise on chance correlations.
- This type of SEM is more exploratory (you cannot claim you are ‘confirming’ theory).
- The credibility of the model is improved if the model structure is replicated in another sample.

Answer 46

- Categorical variable = multiple-group SEM (testing SEM across a categorical variable like gender)
- Hierarchical data = multi-level SEM for data with a hierarchical structure
- Repeated measures = latent growth modelling
- Categorical latent variables = mixture modelling

Answer 47

The assumptions largely follow from those for correlation/regression analyses (see the appropriate lecture).
- Linearity: dependent (endogenous) variables should be linearly related to independent variables. SEM programmes can handle continuous and categorical variables, but check the coding of categorical variables and make sure the programme knows what codes are being used.
- Normality: residuals should be normally distributed and homoscedastic.
- Identification: models cannot be under-identified.
- Adequate sample size: Kline recommends at least 10 times as many cases as parameters (paths), ideally 20 times; 5 times as many cases is often insufficient.
- Proper model specification: specification error occurs when common causal variables are left out of the model.
- Disturbances uncorrelated with endogenous variables: same as MR (errors uncorrelated with independent variables).
- No multicollinearity.
- Exogenous variables are reliably measured.

Answer 48

7 steps to setting up a CFA model:
1) Specify the model
2) Model identification
3) Model estimation
4) Testing model fit
5) Interpret model effects
6) Modifying models
7) Reporting results

Answer 49

The comparative fit index (CFI) analyzes the model fit

Answer 50

by examining the discrepancy between the data and the hypothesized model

Answer 51

for the issues of sample size inherent in the chi-squared test of model fit, and the normed fit index

Answer 52

god fucks in 1 huge bentley

Answer 53

CFI values range from 0 to 1, with larger values indicating better fit. Previously, a CFI value of .90 or larger was considered to indicate acceptable model fit. However, recent studies have indicated that a value greater than .90 is needed to ensure that misspecified models are not deemed acceptable (Hu & Bentler, 1999).
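A minimal sketch of a CFI calculation, assuming the commonly cited form CFI = 1 - max(χ²_M - df_M, 0) / max(χ²_M - df_M, χ²_B - df_B, 0), where M is the hypothesised model and B is the baseline (independence) model. The formula is not stated on the cards, and the function name and fit values are hypothetical.

```python
def comparative_fit_index(chi_model, df_model, chi_baseline, df_baseline):
    """CFI from the chi-square and df of the hypothesised and baseline models.

    Uses the commonly cited form described in the lead-in (an assumption).
    """
    model_excess = max(chi_model - df_model, 0.0)
    largest_excess = max(model_excess, chi_baseline - df_baseline, 0.0)
    if largest_excess == 0.0:
        # Neither model shows misfit beyond its df, so CFI is at its ceiling.
        return 1.0
    return 1.0 - model_excess / largest_excess


# Hypothetical values: model chi-square 36 on 20 df, baseline 420 on 28 df.
cfi_value = comparative_fit_index(36.0, 20, 420.0, 28)
print(round(cfi_value, 3))
```

Subtracting df from each χ² is what ties the index to the discrepancy between the data and the hypothesised model, relative to how badly the baseline model fits.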

Answer 54

run d-m-c cued the brown note on the decks in 1993

Answer 55

Browne and Cudeck (1993) suggested RMSEA fit: • RMSEA < .05 – good fit * RMSEA < .08 – reasonable fit * RMSEAs above .10 poor fit

Answer 56

1) You get an overall test of model fit that can disconfirm whether your model fits the data.
2) You also get indices of approximate fit. Parameters are better estimated in one go, if possible, than in multiple steps, as unnecessary multiple-step estimation introduces bias.
3) It is possible to estimate the unreliability of the composite measures and its impact on the regression coefficients; this is a key advantage of SEM.

Answer 57

However, we’re still fascinated by the idea of bundling different variables together into a single causal effect, and maybe evaluating the relative contribution of each of those variables within a model. In SEM, this is known as the creation of a Composite Variable. This composite is still an unmeasured quantity – like a latent variable – but with no error variance, and with “indicators” actually driving the variable, rather than having the unmeasured variable causing the expression of its indicators.

Answer 58

Sample size, Standard errors of estimates, whether estimates are standardized or not.

Answer 59

- Exogenous variables are specified to have no causal predictor in the model, although they can co-vary with other exogenous variables.
- Endogenous variables are predicted by exogenous variables and other endogenous variables included in the model, as well as unspecified variables (via an error term).

Answer 60

1) Indirect pathways
2) But also a mediator's relationship to another variable which also leads to the DV = three-way multiplication of the constituent paths
3) Add up all the pathways


(84 cards)