4 introduction to mediation analysis Flashcards

Question

How would the total effect be infered statistically?

Answer 1

the sum of the direct eﬀect of X on Y and indirect eﬀect of X on Y through M Although the total effect is the sum of two pathways of influence, it can be estimated by regressing Y on just X, without M in the model. null hypothesis test (H0: tc = 0) ”no association between X and Y”

Answer 2

The direct eﬀect quantiﬁes the estimated diﬀerence in Y between two cases that are equal on M but diﬀer by one unit on X . standard method used for inference for any regression coeﬃcient in a regression model null hypothesis testing about tc´ is X related to Y independent of M?

Answer 3

The indirect eﬀect quantiﬁes how much two cases that diﬀer by a unit on X are estimated to diﬀer on Y as a result of X ’s inﬂuence on M , which in turn inﬂuences Y . X → M → Y causal chain of events null hypothesis test about tatb or by constructing an interval estimate

Answer 4

product of coefficients, Sobel test, delta method = estimation of SE of ab and assuming the sampling distribution of ab is normal, a p-value for ab can be derived given a specific null hypothesized value of a, b, or an interval estimate

Answer 5

seab = the root of (a2seb2 + b2sea2 + sea2seb2) and the indirect effect would be Z = ab/seab

Answer 6

ab - Zci% * seab < tatb < ab + Zci% * seab Zci% for 95% is 1.96 -> fixed value, independent of sample data -> corresponds to 97.5th percentile in normal distribution

Answer 7

method assumes that sampling distribution of ab is normal and simulation research has shown it is one of the lowest in power and generates confidence intervals that are less accurate then other methods

Answer 8

bootstrap confidence interval resampling methods versatile method can be applied to many inferential problems when behaviour of statistic over repeated sampling is not known, too complicated or context-dependent

Answer 9

1. **Resampling:** From your original dataset of size *N*, create a "bootstrap sample" by randomly selecting *N* observations with replacement. This means some original observations may appear more than once in a bootstrap sample, while others may not appear at all. 2. **Calculate the Statistic:** Compute the statistic of interest (e.g., mean, median, correlation coefficient) for the bootstrap sample. 3. **Repeat:** Repeat steps 1 and 2 many times (usually thousands of times) to generate a distribution of the statistic. Each time, you'll likely end up with a slightly different bootstrap sample and hence a slightly different statistic. 4. **Confidence Interval Estimation:** Once you have a bootstrap distribution of your statistic, you can estimate its confidence interval. For a 95% confidence interval, you would typically take the 2.5th percentile and the 97.5th percentile of the bootstrap statistics as the lower and upper bounds, respectively.

Answer 10

- **Flexibility:** Bootstrap methods do not require strong assumptions about the data distribution (e.g., normality). - **Applicability:** They can be used with complex statistics where traditional methods for confidence interval estimation are not available or difficult to apply. - **Intuitive:** The process of resampling with replacement is conceptually straightforward and easy to implement with modern computing power.

Answer 11

unbiased sample -> quality of sample as representation of population assumption of good representation otherwise method will inherit biases large sample -> otherwise outliers -> normality is more likely number of resampling observation selection should be fixed

Answer 12

The sampling distribution of the indirect effect (a*b) is often not symmetric, which can lead to biased estimates if one assumes normality. This asymmetry arises because the product of two normally distributed variables (like a and b) does not follow a normal distribution itself.

Answer 13

2. **Monte Carlo Confidence Intervals (Simulation-Based):** - This approach involves generating a large number of simulated samples (via resampling techniques) from the observed data. For each simulated sample, the indirect effect is computed. By aggregating the indirect effects across all simulations, you can construct an empirical distribution and then determine the confidence interval based on the percentiles of this distribution. This method naturally accommodates the asymmetry in the distribution of the indirect effect. 3. **Distribution of the Product Approach:** This is a more mathematically complex method that involves approximating the sampling distribution of the product of a and b. The method acknowledges the non-normality and potential skewness of this distribution. By employing the distribution of the product approach, researchers can derive confidence intervals that better reflect the actual, skewed distribution of the indirect effect. 4. **Transformation of ab to a Standardized Metric:** - Sometimes, the product of a and b is transformed into a standardized metric to facilitate the estimation of confidence intervals. One common transformation is to standardize the indirect effect by its standard error before using a standard normal distribution to derive confidence intervals. However, this approach also needs to consider the underlying distribution's skewness and kurtosis. 5. **Upper and Lower Bounds:** - In the context of asymmetric confidence intervals, the upper and lower bounds are not equidistant from the point estimate of the indirect effect. This asymmetry in the bounds reflects the underlying distribution's shape, providing a more accurate and informative interval estimate for the indirect effect.

Answer 14

tend to produce same susbtantive inference about the indirect effect, sometimes they will not tho depends on relative concern about type 1 (claiming an indirect effect exists when it does not) and type 2 (failing to detect an indirect effect thats real) Sobel test - higher 2 bias correction can inflate 1 percentile bootstrap ci has become recommended

Answer 15

1. Conduct a simple linear regression, including the X and Y variables to see if a relationship exists between the two. 2. Conduct a Mediation Analysis, including the X, Y and M, to investigate if the relationship between X and Y is changed in the presence of variable M.

Answer 16

ANOVA + coefficient p-value -> identical is X on Y significant? yes, consider unstandardized coefficient -> direction of the relationship

Answer 17

1. If the p-value is no longer significant then this suggests the M variable is fully mediating the relationship between X and Y. 2. If the p-value is still significant then this does not mean that M does not mediate at all but that the effect of M on the X to Y relationship is not the only explanation for why changes in X lead to changes in Y.

Answer 18

1. To establish the significance, you need to look at the bootstrapped lower limit confidence interval (LLCI) and the upper limit confidence interval (ULCI). 1. If the range from LLCI to ULCI includes zero, then the indirect effect is not significant. 2. If the range from LLCI to ULCI does not include zero, then the indirect effect is significant. 3. The coefficient of the indirect effect can be calculated by multiplying the coefficient of X to M with the coefficient of M to Y. (ab)

Answer 19

To calculate the total effect, you need to add the coefficient for the direct and indirect effect together.

Answer 20

- require large sample sizes - assumption of no mediated moderation (constant across levels of the independent variable) - multicollinearity -> if mediator and X are highly correlated it might be difficult to disentangle separate effects on Y - mediator-outcome confounding, and other confounds -> failing to control for confounds to M-Y - assumption of linearity - temporal ambiguity X-M-Y -> what if that order is not always the case? - model specification statistical accuracy/power - measurement error - causality does not prove anything

4 introduction to mediation analysis Flashcards

(44 cards)