SEM Flashcards
What is path analysis typically used for?
Examine the size and direction of direct and indirect effects between multiple variables
Examine the goodness of model fit between the researchers hypothesised model and the observed data
Compare the observed model fit of competing theoretical models
Path analysis is what to structural equation modelling?
Path analysis is a very simple form of structural equation modelling
When typically is the term ‘path analysis’ used?
When we are modelling observed variables
This means we have a single measure of the construct e.g. Word vocabulary test
When is the term SEM used?
When we have multiple indicators of a construct and we create latent variables
How is confirmatory factor analysis used in SEM?
Confirmatory factor analysis is used to create a measurement model
In SEM we then examine the relationship between these latent variables
Outline a full SEM model
A full SEM is simply a combination of a measurement model (confirmatory factor analysis) and a structural model (path analysis)
Give an example of a simple path model?
A mediated regression is a simple path model
In a mediated model, the relationship between an iv and out on is accounted for or ‘mediated’ by a third variable
Mediation implies a causal chain series of relationships between the three variables. (The researcher must have clear theoretical or logical grounds for choosing the mediator and iv variables.
What are the requirements for mediation?
- Predictor (X) must predict mediator (Z)
- Mediator (Z) must predict criterion (Y)
- Predictor (X) must predict criterion (Y)
- The X and Y relationship must shrink in the presence of (Z)
When assessing a mediation effect of the relationship between the predictor and the criterion shrinks (beta weight gets smaller) but is still sig. What does this mean?
Possibly partial mediation
What needs to happen in order to conclude that full mediation has occurred?
The x and y beta weight should be 0 (or at least non sig.) for full mediation
When do researchers argue for partial mediation?
If the beta weight drops substantively but does not reach 0
Sobel test of the indirect effects
State the 6 steps that should be undertaken when conducting a path analysis
Specify the model
Model identification
Model estimation
Interpret model effects
Evaluate model fit
Modifying the model (examining alternative models)
(So If Emma Interviewed Everyone’s Mum)
What arethe advantages of running a SEM over multiple ordinal least squares regressions?
When testing a large number of effects the analysis of multiple regressions can become very complex and SEM use maximum likelihood based methods to calculate the effects simultaneously.
Advantages of path analysis
- simpler and quicker estimation of model effects
- obtain global model fit indices
- encourages researcher to specify causal relationships between variables beforehand
- more direct and easier to tests of alternative theoretical models and their fit
The first step in path analysis is to specify the model, how should this be done?
Using theory and/or previous research, as well as logical relations between variables, to justify your path model:
The path model can then be drawn using a path diagram
In a path diagram what are typically represented by squares?
Observed variables
In a path diagram what are typically represented by circles or ellipses?
Latent (unobserved) variables
What do single- headed arrows represent in a path diagram?
Causal relationships between variables
What is an exogenous variable?
These variables are considered as IV’s in the model
They have no specified predicted cause in the model, genes they have no single-headed arrow going into them
You can have multiple exogenous variables in the model; these are usually free to correlate with rah other, although you can specify that they be uncorrelated (correlations between two or more exogenous variables are represented by a double headed arrow between variables)
In a path diagram what do a double-headed arrow represent?
Correlations
What are endogenous variables?
These variables are considered DV’s in the model
They will have a directional arrow coming into them & may also have one or more directional arrows moving away if it is a mediator variable
Basically these are downstream variables caused by exogenous variables
Which variables typically have an error or disturbance term associated with them?
Endogenous variables
This reflects that there are also u measure and unspecified causal effects on these variables
These disturbance terms are usually modelled as latent variables, hense they are represented by circles
What does a path model need to be in order to be analysed?
It needs to be identified
There needs to be sufficient unique pieces of information (i.e. Correlations in the observed data) to allow mathematical estimation of the model given the model that has been specified.
Identification can become tricky when dealing with complex latent variables and non-recursive models, but there are some shorthand methods for checking identification in observed variable path models.
What is the basic rule for model identification?
Maximum number of single connections between observed variables must equal or exceed the number of paths specified in the model
What is the formulae to calculate the maximum number of single connections between observed variables?
(V*V-1)/2
Where V = number of variables
Using the formulae to calculate the maximum number of single connections between observed variables what do you have to compare this number to to check model identification?
Count all of the model pathways (ignoring disturbance/error terms)
And then compare these two numbers
The maximum number must equal or exceed the number of paths counted in the model
There are three outcomes when checking model identification, name and explain these & state which outcomes enable models to then be estimated?
Over-identified model (more correlations than free paths in the model)
Just-identified model (saturated model) (correlations equal the number free paths in the model)
Under-identified model (fewer correlations than free paths - model cannot be estimated)
-only over or just identified models can be estimated
What is a recursive model?
This is a model where all causal pathways are moving in the same direction i.e. Effects are uni-directional. (This is the most common form of model and is always identifiers
What is a non-recursive model?
This is where there are reciprocal relationships between variables
- more complex to analyse
- identification issues can be very problematic in complex non-recursive models
- not as common in the psychology literature
After the model has been specified and constructed what happens?
Model estimation
The are two primary interests estimated by the model, what are they?
The direct and indirect effects between variable
Global model fit
This in the context of regression would by
-regression coefficients for individual predictors
Test of overall regression model fit i.e. ANOVA for R squared
Paths in models can be decomposed into what?
Direct and indirect effects (&error)
Explain direct paths
The oath regression coefficients reflect direct relations between one variable and another (controlling for the effect of any other variable also effecting the endogenous variable).
These are the same as the beta weights in normal MR (we can obtain these by simply running separate OLS regression models)
The number next to a path in a path diagram is what?
They are standardised regression coefficients the beta weights from a regression output
How do you interpret direct effect results from a path diagram?
You can fast the significance of (unstandardised direst effects)
However, you should consider the magnitude of direct effects not just the sig.
(Use last research as a guide, consider substantive real-world meaning of effects, use cohens rule of thumb .1 = small, .3 = medium and .5 = large)