Midterm Flashcards

Lectures 1-5

1
Q

What is the goal of the counterfactual model?

A

To estimate a true causal contrast?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define confounding, selection bias, and information bias.

A

Confounding is a non-causal association
that is observed between a given exposure and an
outcome owing to a third variable. Lowers exchageability.

Selection bias is

Information bias is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

DAGs can be used to communicate hypothesized relationships and ________.

A

identify and understand potential sources of measured/unmeasured selection bias or confounding.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How is the line of best fit obtained?

A

By minimizing the sum of squared errors (i.e. the vertical distance between each point
and line)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The _______ is the
proportion of the total
variability explained by the model that includes the covariate x.

A

coefficient of determination (R^2)

1-SSy-SSxy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name the four assumptions of linearity.

A

Constant variance, normality of errors, X-Y relationship is linear, and errors are all independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What do we call the difference between the measured population mean of X (μx) and the true population mean of T (μT)? How is this different from precision?

A

Bias. Precision is about low variance of the measurement error itself.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When is measurement error non-differential?

A

When the sensitivity and
specificity of exposure assessment is equal for both groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which type of measurement error moves estimates closer to the null?

A

Non differential.

Differential moves estimates toward or away or change directions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If exposure measurement error is non-differential, the bias
in the effect estimate is a function of ________.

A

precision (random error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does measurement error on exposure differ from outcome?

A

Outcome measurement, because it’s the dependent variable, won’t change the slope estimate but will increase its standard error and widen the confidence interval. Exposure will bias toward the null.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The sample size needed under non-differential measurement error is proportional to ______________, which is the assumed common exposure variance among cases and controls.

A

standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When everyone is assigned the same exposure, true exposures vary normally around ____.

A

Group values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Berkson-type random exposure measurement error is not expected to bias effect estimates (i.e. slopes in regression models) but there is still a loss of ______ (i.e. wider confidence intervals) and __________.

A

precision; reduced power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Describe the Berkson model of bias.

A

Random error is attached to the true exposure value, independent of the observed; lowering precision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Increasing sample size can minimize the impact of measurement
error in continuous outcome variables (T/F).

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How does error in outcome measurement change categorical/continuous variables?

A

Categorical outcome measurement error will produce a bias toward the null. Continuous outcome error has no effect, if you recall, on slope and just reduces precision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Even when the
expected direction of bias is toward the null (because of non-differential exposure misclassification) bias away from the null can occur because ______.

A

Any study is just one realization, one sample which could be affected by random error in large/small ways each time and only evens out with multiple trials. So what we expect doesn’t always happen.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Misclassification and mixing levels leads to bias toward the null (T/F)

A

False, it can happen away!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Confounders are typically only CAUSALLY associated with the exposure (T/F).

A

False, the outcome!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Subject matter expertise is the best way to identify potential confounders (T/F).

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

How do we adjust for confounding through stratification?

A

Separate confounder into strata and calculate stratum-specific associations, then pool if homogenous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Name two limits of stratification as a method for controlling confounding.

A

Leaves room for residual confounding in continuous variables and adjusting for multiple variable requires specific estimates for every different combination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

The adjusted regression coefficient is the expected change in the mean value of ___ per unit change in X keeping _______.

A

Y; all other variables constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Residual confounding occurs when categories are _______, or when confounders are measured with _____ or ______.

A

too broad; error; unmeasured / left out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

The distribution of a confounder has to be different across groups (e.g. case/controls) to
cause confounding (T/F).

A

False, especially when it’s associated with the outcome.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

The overall impact of including multiple parameters in the
model depends on their _____ with X.

A

correlation

Variance of B1 is based on variance in X, error in Y, sample size, and the correlation between parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are 3 methods for modelling nonlinear relationships?

A

Dummy variables, quadratic terms, and splines.

29
Q

Selection bias occurs when exposure and ______ both
affect inclusion in the analysis.

30
Q

What does conditioning on a collider do?

A

Induces a spurious relationship between the variables leading into it.

31
Q

Confounding is the presence of common effects while selection bias is conditioning on common causes (T/F).

A

False, the opposite!

32
Q

Effect modification refers to the situation where the
strength of association between exposure and
outcome differs across ____________.

A

levels of a third variable

33
Q

Explain the difference between additive and multiplicative interaction.

A

Additive interaction is when the absolute risk changes across levels of a third variable, multiplicative interaction is present when the RR varies. You can have additive without multiplicative if the ratios are all the same.

34
Q

The interaction term is the excess change in the outcome not explained by the _____.

A

Sum of the individual effects of two independent predictor variables.

35
Q

What are two things to be aware of when interpreting interaction terms?

A

Heterogeneity due to small sample size and confounding/error across strata of the effect modifier.

36
Q

Case-crossover studies are used to examine the acute
health effects of ____________.

A

intermittent exposures with short induction times

37
Q

What kind of confounding is not present in case-crossover studies?

A

Confounding due to variables which do not change within single individuals across the reference/case periods.

Transient co-exposures may cause confounding, though.

38
Q

What are the three ways you can have selection bias in a case crossover study?

A

nonrepresentative case selection, differential case survival (exposure influences survival), control time not independent of exposure.

39
Q

Exposures during case and reference periods must be _______, to avoid ______________.

A

independent of each other; carryover effects

40
Q

We use a time-stratified design if experiencing the event doesn’t impact the likelihood of ___________, so control periods can
be selected before and after the event.

A

subsequent exposures.

41
Q

What analysis do we perform for case-crossover studies?

A

Logistic regression to get Odds Ratios

42
Q

Case-crossovers look at the ____ in exposure between two periods, rather than the absolute level.

43
Q

What’s the major con of case-crossover studies?

A

They only use information from a single point in time, so they capture prevalence rather than incidence. Also cannot determine relationships.

44
Q

Ecological studies are good for studying variability __________ and for prevention at a _____________.

A

between homogenous populations; population level

45
Q

Name the three types of ecological measures.

A

Aggregate measures (characteristics within
the group)

Environmental measures (physical
characteristics of a location)

Global measures (group characteristics not
reducible to characteristics at the individual level)

46
Q

What do you call it when within-group correlations are different from between-group and you try to conflate the two?

A

Cross-level inference (ecological bias)

47
Q

What three steps have to be at the group level in order to make an ecological study?

A

measurement, analysis, and inference

48
Q

What are the pros and cons of ecological studies?

A

Less expensive (good starting point + existing measurements) and increased generalizability potential.

Hard to control confounding, identify temporal relationships, and threat of ecological fallacy.

49
Q

The Poisson distribution is used for counts—if events
happen at a constant rate over time, the Poisson distribution
gives the probability of ___________.

A

X number of events occurring in time T

50
Q

Count frequencies are often positively skewed with ________________.

A

most values being low and relatively few high values

51
Q

In Poisson distribution, the mean event rate is supposed to be equal to the ______.

52
Q

In Poisson Regression the dependent variable is the log of ________.

A

the expected count

53
Q

How does logE[Yi] respond with each parameter in the model?

A

linear increase!

54
Q

exp(B1) in a Poisson model approximates to?

A

Rate Ratio for a one unit increase

55
Q

What is the offset parameter?

A

logTi (the
time that unit “i/p” was at risk)

The offset parameter is used to account for differences in exposure time or population size when modeling count data.

56
Q

The effect of over-dispersion is that the precision of effect
estimates is not correct, leading to ______.

A

too-narrow confidence intervals

57
Q

θ is the _____ over-dispersion parameter

A

Quasi-Poisson (more flexible variable function)

58
Q

Splines fit a number of different ______ (usually
cubic) over the range of the data and are joined smoothly at
_______ to examine nonlinear forms.

A

polynomial curves; knots

59
Q

Splines can be used to flexibly adjust for ______.

A

confounders

60
Q

What are two ways to optimize time-series models?

A

lag models to correct for delayed effects and sensitivity analyses related to spline knots. Autocorrelation (nonindependent days) is unclear though.

61
Q

_____ analysis is useful when a report makes policy recommendations/actions and draws inferences about
causality

62
Q

Non-participation always leads to selection biased results (T/F).

A

False, only if participation depends on exposure AND outcome (confounding)

We can know this by asking nonparticipants to fill out a short questionnaire

63
Q

Where do we get information on an unmeasured confounder?

A

A sub-sample of the study population OR educated guesses based on other studies

64
Q

______ can be conducted to collect
measurements using both the “gold standard” method and the
less accurate/reliable method

A

Internal validation studies

65
Q

What does a simple bias analysis leave out of the picture, other than the other types of biases?

A

uncertainty in the bias parameter itself

66
Q

What are the parameters for simple bias analysis for selection bias vs. confounding vs. measurement error?

A

Contingency table –> multiply OR by selection OR.

Specify association between confounder and the outcome for the unexposed, as well as the prevalence in both groups

Reshuffle contingency table based on sensitivity and specificity values for each cell.

67
Q

By repeatedly sampling from bias parameter distributions,
the output out probabilistic analysis is ________ which can be interpreted as a point estimate
and frequentist confidence interval.

A

a distribution of adjusted estimates of
association

68
Q

Multidimensional bias analysis is an extension of simple bias
analysis that assigns probability distributions to ________ at the same time

A

multiple bias analysis