Week 12 S.E.M. Structural Equation Modelling Flashcards

To provide S.E.M. revision for the exam Based on Kate's slides and tutorial content

1
Q

What is S.E.M. (Structural Equation Modelling)?

A

S.E.M. is a combination of Regression and PCA (Principle Component Analysis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

S.E.M. can be defined as…..

A

a family of statistical models that seek to explain the relationships among multiple variables.
*by examining the structure of interrelationships expressed in a series of equations, similar to a series of multiple regression equations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What do the equations produced via S.E.M. depict?

A

The equations depict all of the relationships among constructs (DV’s & IV’s) involved in the analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does S.E.M. define a construct?

A

Constructs are unobservable (aka ‘latent’) factors that are represented by multiple variables
e.g. DASS variables represent the unobservable constructs of Depression, Anxiety, & Stress

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the 4 distinguishable features of SEM?

A
  • it provides an estimation of multiple and interrelated relationships
  • it represents unobservable (AKA latent) concepts & corrects for measurement error
  • It produces a model to explain an entire set of relationships
  • it defines error terms for each parameter
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What exactly is SEM?

A
  • SEM is a stats package the presents models detailing structure and measurement
  • It is included in AMOS, LISREL, SAS & MPlus
  • SEM allows testing of a theoretical combination of variables that can represent a cause and effect relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Barbara Byrne is a bit of a SEM expert, I heard she conceptualises SEM as Confirmatory Factor Analysis, tell me more?

A
  • Byrne (2010) suggests SEM takes a confirmatory rather than exploratory approach to analyse the data for inferential aims.
  • Other multivariate procedures can not assess or correct for measurement error. SEM provides estimates of error variance parameters at every stage of the model.
  • SEM can incorporate both latent, observed variables, covariances and error terms.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does Byrne say about analysis that does not utilise SEM?

A

*Other methods ignore the errors in explanatory variables and therefore can lead to serious errors in interpretation. *Analyses using other methods are based on observed measurements only and don’t incorporate measurement and structure simultaneously.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does the ‘full latent variable model’ allow in terms of the regression structure among latent variables?

A

The full latent model allows specification of the regression structure among latent variables
*This allows the researcher to hypothesis the impact of one latent construct on another in the modelling of causal direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a full or complete model depict?

A

A full or complete model’s measurement depicts the links between the latent variables and their observed measures and the structural model depicts the links among the latent variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a recursive model?

A

a recursive model is one in which I specify the direction of causality from only one direction.
i.e. single direction arrow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a non-recursive model?

A

A model that allows reciprocal or feedback effects

i.e. double headed arrow

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What information does a researcher get from SEM to test the plausibility of the model?

A
  • The plausibility of the model is based on sample data that comprises all observed variables in the model
  • The goal is to determine the goodness of fit (GoF) between the hypothesised model & sample data to estimate the difference between the 2.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the residual?

A

The residual is the difference between the observed and hypothesised model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can the SEM model be summarised?

A

Data = Model + Residual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are Exogenous Constructs?

A

*IV
Exogenous constructs are the latent, multi-item equivalent to independent variables.
*They use a variate (linear combination) of measures to represent the construct, which acts as an IV in the model.
*Multiple measured variables (x) represent the exogenous constructs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are Exogenous variables synonymous with?

A

Exogenous variables are synonymous with IVs, may cause fluctuations in the values of other latent variables. Gender, age and SES are possible influences external to the model that cause change in the exogenous variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are Endogenous Constructs?

A

*DV
Endogenous constructs are the latent, multi-item equivalent to DVs.
(enDogenous = D for DV)
*These constructs are theoretically determined by factors within the model.
*Multiple measured variables (y) represent the endogenous constructs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are Endogenous Constructs synonymous with?

A

Endogenous latent variables are synonymous with DVs, which are influenced by exogenous variables and explained by the model because all latent variables that influence them are included.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the difference between Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA)?

A
  • In EFA the extent to which a group of items measure a given construct is evaluated through factor loadings.
  • In contrast CFA is driven by knowledge of the underlying latent construct structure, which tests the structure of the hypothesised model and the strength of the regression paths.
  • The CFA model focuses on the link between factors and their measured variables, within the framework of SEM, it represents what has been termed the measurement model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the 6 stages of SEM?

A

Stage 1: Defining Individual Constructs
Stage 2: Developing the Overall Measurement Model
Stage 3: Designing a Study that’s Empirically driven
Stage 4: Assessing the Measurement Model Validity
Stage 5: Specifying the Structural Model
Stage 6: Assessing Structural Model Validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How can SEM Models be represented?

A

SEM Models can be represented visually in a path diagram.

  • Dependence relationships are represented with single headed directional arrows.
  • Correlational (covariance) relationships are represented with two-headed arrows.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

When is SEM not allowed?

A

When there is no underlying theory

*The theory is required for the measurement model specification and the Structural model specification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What conditions need to be present for SEM to be possible?

A

There must be a genuine (i.e. non-spurious), unique covariance relationship between cause & effect variables which does not include other constructs
There must be theoretical support for the cause & effect relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What other conditions are necessary for using SEM?

A

Models developed with a developmental strategy should be cross-validated with an independent sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Other than using a Path Diagram to represent SEM, what other way can SEM appear?

A

Structural equations can be schematically presented or through a series of equations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What function does the measurement model perform?

A

The measurement model defines relations between the observed and unobserved variables.
*It provides the link between scores on a measuring instrument and underlying constructs, which they are designed to measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What function does the structural model perform?

A

The structural model defines relations among the unobserved variables.

  • It specifies the manner by which particular latent variables directly or indirectly influence changes in the values of certain other latent variables in the model.
  • CFA models can be analysed using AMOS with first and second order hierarchical models.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What do loadings or Lambda weights represent?

A

Loadings or Lambda Weights represent the relationships from constructs to variables as in factor analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What do path estimates represent?

A

Path estimates represent the relationships between constructs as does β in regression analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Which 3 types of relationships can a Measurement Model represent?

A

A. a relationship between a Construct and a Measured Variable
B. a Relationship between a Construct and Multiple Measured Variables
D. Correlational Relationships between Constructs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Which type of relationships can a Measurement Model not represent?

A

C. a Dependence relationship between two Constructs (A Structural Relationship)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Which type of relationship can the Structural Model include?

A

*all of them:
A. a relationship between a Construct and a Measured Variable
B. a Relationship between a Construct and Multiple Measured Variables
C. a Dependence relationship between two Constructs (A Structural Relationship)
D. Correlational Relationships between Constructs

34
Q

In SEM what is the Model?

A

the hypothesized structure linking the observed variables to the latent variable

35
Q

In SEM what is the Residual?

A

The Residual represents the discrepancy between the hypothesized model & observed data

36
Q

What 4 geometrical shapes schematically represent structural models?

A
  • A circle or ellipse represents observed latent variables.
  • A rectangle represents directly observed variables.
  • Single-headed arrows rep the impact of one variable on another.
  • Double-headed arrows representing covariances or correlations between pairs of variables
37
Q

How is a Path coefficient for regression of an observed variable onto an unobserved latent variable (or factor) represented?

A

by a rectangle joined by a single arrow to an ellipse

i.e. rectangle, single arrow, ellipse

38
Q

How is a Path coefficient for regression of one factor onto another factor represented?

A

by an ellipse, single arrow, joining second ellipse

39
Q

How is the Residual error in the prediction of an unobserved factor represented?

A

by a circle

40
Q

Structural Models can be just-identified, Over-identified, or Under-Identified. Which is ideal?

A

Over-identified is the ideal model: an Over-identified model is one where the number of estimable parameters < the number of data points. Allows for rejection of the model & results in a positive number of degrees of freedom – positive # of dfs.

41
Q

What’s the problem with a just-identified model?

A

Just-identified – one to one correspondence between the data & structural parameters. No df left so cannot be rejected!!

42
Q

What is the problem with an under-identified model?

A

Under-identified – one where the number of parameters to be estimated exceeds the number variances and covariances to be identified. Therefore, the model contains insufficient information for the purpose of attaining a determinate solution of parameter estimation – negative # of dfs.

43
Q

How do I work out whether I have sufficient degrees of freedom (df’s) for my model?

A

This is based on the # of Parameter Estimates:
With a model with 4 factors and 3 items loading on each (4 X 3 = 12), we can ascertain whether the model is identified or not.
How many parameters can you identify? Formula: p(p+1)/2, 12(12+1)/2=78

44
Q

There are 4 requirements for causal inference, what are they?

A
  1. Covariation – Statistically significant estimated paths
  2. Sequence – Change in one variable occurs in one direction.
  3. Non-spurious Covariance (i.e. genuine covariance) – the size and nature of the relationship between a cause and effect inference should not include other constructs. Should not be false or misleading.
  4. Theoretical Support – Requires a compelling rationale to support the cause & effect relationship.
45
Q

The first stage of SEM involves Defining Individual Constructs. What does this involve?

A
  • Operationalizing the Constructs
  • Scales from Prior Research
  • New Scale Development
  • Pretesting
46
Q

The second stage of SEM involves Developing the Overall Measurement Model. What do I need to consider to address this?

A
  • Can the validity and uni-dimensionality of the constructs be supported?
  • How many indicators for each construct?
  • Is the measurement model reflective or formative? (asking what causes what?)
47
Q

Part of the second stage involves determining whether the Measurement model is Reflective or Formative. What is important to remember?

A

Kate told us to learn that the Reflective Model Theory and the Formative Measurements Theory are opposites and to learn this for the exam :-)

48
Q

Tell me about The Reflective Model Theory

A

Reflective Model Theory: based on assumptions that

(1) the latent constructs cause measured variables.
(2) The measurement error results in an inability to fully explain these measures.
* It is the typical representation of a latent construct.

49
Q

Tell me about The Formative Measurements Theory

A

Formative Measurement Theory: based on assumptions that

(1) the measured variables cause the construct.
(2) The error in measurement is an inability to fully explain the construct.
* The construct is not latent in this case.

50
Q

In the second stage of SEM once I have determined whether the measurement is reflective or formative, what else do I need to do when Developing the Overall Measurement Model?

A
  • Make constructs from measured variables.

* Draw a path diagram for the measurement model

51
Q

What are the 3 questions that arise when specifying a measurement model?

A
  1. Can we empirically support the validity and uni-dimensionality of the constructs?
  2. How many indicators should we use and what’s the trade off for increasing or decreasing the number of indicators?
  3. Is it a descriptive or explanatory model being depicted? Interpretations will depend on answering this issue, exploratory or confirmatory?
52
Q

Stage 3 involves Designing a Study to Produce Empirical Results – how do I check assumptions?

A
  • Assess the adequacy of the sample size.

* Select the estimation method and missing data approach.

53
Q

What do I do if I find I have missing data?

A

Missing Data Options:

  • Complete Case or List-Wise Deletion.
  • All-Available or Pair-Wise Deletion.
  • Model-Based Deletion.
54
Q

What are the 6 issues related to Stage 3 (Designing a Study to Produce Empirical Results)?

A

Research Design:

  1. Type of data analysed: covariances or correlations.
  2. Missing data.
  3. Sample size.

Model Estimation:

  1. Model structure.
  2. Estimation techniques.
  3. Computer software used.
55
Q

What are the 5 sample size issues in SEM?

A
  1. multivariate distribution of the data
  2. estimation technique
  3. model complexity
  4. amount of missing data
  5. amount of average error variance among the reflective indicators.
56
Q

During stages 1-3 why might we use a pre-test or pilot study?

A

A pretest using respondents similar to those from the population to be studied is recommended to screen items for appropriateness is useful when a model has scales borrowed from various sources reporting other research

57
Q

What is the recommended method of handling missing data during stages 1-3?

A
  • Pair-wise deletion of missing cases (all available approach) is a good alternative for handling missing data when the amount of missing data is less than 10% & the sample size is approx 250+
  • As sample sizes become small or when missing data exceeds 10%, one of the replacement (imputation) methods for missing data becomes a good alternative.
  • When the amount of missing data becomes very high (15% or more), SEM may not be appropriate.
58
Q

Determining the minimum sample size for a particular SEM model depends on several factors including model complexity and communalities (average variance extracted among items) in each factor. If a SEM models containing 5 or fewer constructs, each with more than 3 items (observed variables), & with high Item communalities (.6 or more) what is considered an adequate sample size?

A

*SEM models containing 5 or fewer constructs, each with more than 3 items (observed variables), & with high Item communalities (.6 or more), can be adequately estimated with samples as low as 100-150.

59
Q

Determining the minimum sample size for a particular SEM model depends on several factors including model complexity and communalities (average variance extracted among items) in each factor. If a SEM models containing When the number of factors is larger than six, some of which have fewer than 3 measured items as indicators, & multiple low communalities are present, what is considered an adequate sample size?

A

*When the number of factors is larger than six, some of which have fewer than 3 measured items as indicators, & multiple low communalities are present, sample size requirements may be > 500.

60
Q

Determining the minimum sample size for a particular SEM model depends on several factors including model complexity and communalities (average variance extracted among items) in each factor. What is a general rule of thumb for SEM sample sizes?

A
  • The sample size must be sufficient to allow the model to run, but more important, it must adequately represent the population.
  • as with all studies, the greater the complexity of the model, the larger the sample size required
61
Q

What is the biggest limitation in SEM?

A

Mahalanobis Distance:

Measure the influence of a case by measuring the distance of cases from the mean of the predictor variable

62
Q

Stage 4 involves Assessing Measurement Model Validity, how do we do this?

A

We assess measurement of the model validity using Goodness of Fit (GoF) which evaluates the construct validity of the measurement model
*The number of parameters +1 & divided by 2 tells us how many ‘lines’ we can have in our matrix

63
Q

There are 3 types of Goodness of Fit, name them?

A
  • Absolute Fit Measures.
  • Incremental Fit Measures.
  • Parsimonious Fit Measures.
64
Q

SEM has no single statistical test that best describes the “strength” of the model. Instead, researchers have developed 3 different types of measures that in combination assess results from 3 perspectives: overall fit, comparative fit & model parsimony.
Tell me about Absolute Fit Measures.

A

Absolute (overall) = measures overall goodness-of-fit for both the structural and measurement models collectively. This type of measure does not make any comparison to a specified null model (incremental fit measure) or adjust for the number of parameters in the estimated model (parsimonious fit measure).

65
Q

SEM has no single statistical test that best describes the “strength” of the model. Instead, researchers have developed 3 different types of measures that in combination assess results from 3 perspectives: overall fit, comparative fit & model parsimony.
Tell me about Incremental Fit Measures.

A

Incremental (comparative) = measures goodness-of-fit that compares the current model to a specified “null” (independence) model to determine degree of improvement over the null model.

66
Q

SEM has no single statistical test that best describes the “strength” of the model. Instead, researchers have developed 3 different types of measures that in combination assess results from 3 perspectives: overall fit, comparative fit & model parsimony.
Tell me about Parsimonious Fit Measures.

A

Parsimonious = measures goodness-of-fit representing the degree of model fit per estimated coefficient. This measure attempts to correct for any “over-fitting” of the model & evaluates the parsimony of the model compared to the GoF. Selecting a rigid cut-off for fit indices is like selecting a minimum R2 for a regression equation. Almost any value can be challenged. Awareness of the factors affecting the values & good judgement are the best guides to evaluate the size of the GoF indices.

67
Q

Stage five involves specifying the structural model - how do we do this?

A
  • Stage five involves specifying the structural model by assigning relationships from one construct to another based on the proposed theoretical model. That is, the dependence relationships that exist among the constructs representing each of the hypotheses are specified.
  • The end result is to convert the measurement model to a structural model
68
Q

As models become more complex, the likelihood of alternative models with equivalent fit increases. To address this, multiple fit indices should be used to assess a model’s goodness of fit. What are the names?

A
  • The χ2 value and the associated df
  • One absolute fit index (like the GFI, RMSEA or SRMR)
  • One incremental fit index (like the CFI or TLI)
  • One goodness of fit index (GFI, CFI, TLI, . . . )
  • One badness of fit index (RMSEA, SRMR, . . . )
69
Q

There are several requirements for Goodness of Fit indices. What is the RMSEA? and how is it different to the other Goodness of Fit indices?

A

RMSEA = Root mean square error of approximation <.05
RMSEA is only GoF index that needs to be less than .05, all others need to be between 0-1 with the closer to 0.95 the better

70
Q

There are several requirements for Goodness of Fit indices. What is χ2 ?

A

Chi-square – Likelihood Ratio Test statistic. Appears as CMIN/DF. Large χ2 relative to df indicate a model that needs modification.

71
Q

There are several requirements for Goodness of Fit indices. What is CMIN?

A

CMIN – Minimum discrepancy – represents the discrepancy between the unrestricted sample covariance matrix S, and the restricted covariance matrix. Representing the Likelihood Ration Test statistic.

72
Q

There are several requirements for Goodness of Fit indices that need to be between 0-1 with the closer to 0.95 the better. What are they?

A

CFI – Comparative Fit Indices – 0-1, with .95 indicating good fit
GFI – Goodness of Fit Indices - Index – 0-1, with .95 indicating good fit. Closer to 1 the better
AGFI – Adjusted Goodness of Fit (adjusts for the # of df in the model)
TLI – Tucker-Lewis Index – 0-1, with .95 indicating good fit
AIC – Comparing two competing models.

73
Q

Stage 6 involves assessing the structural model validity & predictive accuracy, how do we do this?

A

No single “magic” value for the fit indices that separates good from poor models. It’s not practical to apply a single set of cut-off rules that apply for all measurement models.
The quality of fit depends heavily on model characteristics, including sample size and complexity

74
Q

Stage 6 involves assessing the structural model validity & predictive accuracy, how do we do this with simple models?

A

Simple models with small samples should be held to very strict fit standards. Even an insignificant p-value for a very simple model may not be very meaningful.

75
Q

Stage 6 involves assessing the structural model validity & predictive accuracy, how do we do this with complex models?

A

More complex models with larger samples should not be held to the same strict standards. Thus, when samples are large and the model contains a large number of measured variables and parameter estimates, cut-off values of .95 on key GoF measures may be unrealistic.

76
Q

How do we assess Normality in SEM?

A

Go to Analysis Summary - Assessment of Normality and check the Multivariate Normality, according to Bentler (2005, cited Byrne, 2010, p. 104) this figure should be <.5

77
Q

How do we assess for multivariate outliers?

A

An outlying case will have D2 (Mahalanobis squared) value that stands distinctively apart from all the other D2 values.

78
Q

What are Standardised Residuals and what is the cut off score for Standardised residuals in SEM?

A
  • Any difference between the hypothesized model & sample covariance is captured in the Standardized residuals
  • the cut off score for standard residuals in SEM is 2.58
79
Q

What can you tell me about Modification indices (MI)?

A
  • The M.I. value indicates how much the overall χ2 would drop by at least the indicated amount in the table and the Par Change you will find the value the estimated parameter will assume.
  • Large MI’s suggest cross loadings. We can modify the model by applying covariances to items that appear to cross-load. However, this must have a conceptual or theoretical basis for application. Look at including covariances to assess whether the model’s estimation is improved.
  • Can check the value in M.I. and Par Change.
80
Q

How does “Model Maximum Likelihood (ML)” treat true and estimated values of the model parameters?

A

*Model maximum likelihood (ML) estimates of the true values of the model parameters are considered to be fixed but unknown, whereas their estimates are considered to be random but known.

81
Q

How does the Bayesian Estimation differ in its treatment of true and estimated model parameter values?

A

*In contrast, from the Bayesian perspective, true model parameters are unknown and therefore considered to be random as Bayesian estimation considers any unknown quantity as a random variable and therefore assigns it a probability distribution