Midterm Flashcards

Question 1

Q

What is the goal of the counterfactual model?

Answer

A

To estimate a true causal contrast?

Question 2

Q

Define confounding, selection bias, and information bias.

Answer

A

Confounding is a non-causal association
that is observed between a given exposure and an
outcome owing to a third variable. Lowers exchageability.

Selection bias is

Information bias is

Question 3

Q

DAGs can be used to communicate hypothesized relationships and ________.

Answer

A

identify and understand potential sources of measured/unmeasured selection bias or confounding.

Question 4

Q

How is the line of best fit obtained?

Answer

A

By minimizing the sum of squared errors (i.e. the vertical distance between each point
and line)

Question 5

Q

The _______ is the
proportion of the total
variability explained by the model that includes the covariate x.

Answer

A

coefficient of determination (R^2)

1-SSy-SSxy

Question 6

Q

Name the four assumptions of linearity.

Answer

A

Constant variance, normality of errors, X-Y relationship is linear, and errors are all independent.

Question 7

Q

What do we call the difference between the measured population mean of X (μx) and the true population mean of T (μT)? How is this different from precision?

Answer

A

Bias. Precision is about low variance of the measurement error itself.

Question 8

Q

When is measurement error non-differential?

Answer

A

When the sensitivity and
specificity of exposure assessment is equal for both groups

Question 9

Q

Which type of measurement error moves estimates closer to the null?

Answer

A

Non differential.

Differential moves estimates toward or away or change directions.

Question 10

Q

If exposure measurement error is non-differential, the bias
in the effect estimate is a function of ________.

Answer

A

precision (random error)

Question 11

Q

How does measurement error on exposure differ from outcome?

Answer

A

Outcome measurement, because it’s the dependent variable, won’t change the slope estimate but will increase its standard error and widen the confidence interval. Exposure will bias toward the null.

Question 12

Q

The sample size needed under non-differential measurement error is proportional to ______________, which is the assumed common exposure variance among cases and controls.

Answer

A

standard deviation

Question 13

Q

When everyone is assigned the same exposure, true exposures vary normally around ____.

Answer

A

Group values

Question 14

Q

Berkson-type random exposure measurement error is not expected to bias effect estimates (i.e. slopes in regression models) but there is still a loss of ______ (i.e. wider confidence intervals) and __________.

Answer

A

precision; reduced power

Question 15

Q

Describe the Berkson model of bias.

Answer

A

Random error is attached to the true exposure value, independent of the observed; lowering precision.

Question 16

Q

Increasing sample size can minimize the impact of measurement
error in continuous outcome variables (T/F).

Question 17

Q

How does error in outcome measurement change categorical/continuous variables?

Answer

A

Categorical outcome measurement error will produce a bias toward the null. Continuous outcome error has no effect, if you recall, on slope and just reduces precision.

Question 18

Q

Even when the
expected direction of bias is toward the null (because of non-differential exposure misclassification) bias away from the null can occur because ______.

Answer

A

Any study is just one realization, one sample which could be affected by random error in large/small ways each time and only evens out with multiple trials. So what we expect doesn’t always happen.

Question 19

Q

Misclassification and mixing levels leads to bias toward the null (T/F)

Answer

A

False, it can happen away!

Question 20

Q

Confounders are typically only CAUSALLY associated with the exposure (T/F).

Answer

A

False, the outcome!

Question 21

Q

Subject matter expertise is the best way to identify potential confounders (T/F).

Question 22

Q

How do we adjust for confounding through stratification?

Answer

A

Separate confounder into strata and calculate stratum-specific associations, then pool if homogenous.

Question 23

Q

Name two limits of stratification as a method for controlling confounding.

Answer

A

Leaves room for residual confounding in continuous variables and adjusting for multiple variable requires specific estimates for every different combination.

Question 24

Q

The adjusted regression coefficient is the expected change in the mean value of ___ per unit change in X keeping _______.

Answer

A

Y; all other variables constant

Question 25

Q

Residual confounding occurs when categories are _______, or when confounders are measured with _____ or ______.

Answer

A

too broad; error; unmeasured / left out

Question 26

Q

The distribution of a confounder has to be different across groups (e.g. case/controls) to
cause confounding (T/F).

Answer

A

False, especially when it’s associated with the outcome.

Question 27

Q

The overall impact of including multiple parameters in the
model depends on their _____ with X.

Answer

A

correlation

Variance of B1 is based on variance in X, error in Y, sample size, and the correlation between parameters.

Question 28

Q

What are 3 methods for modelling nonlinear relationships?

Answer

A

Dummy variables, quadratic terms, and splines.

Question 29

Q

Selection bias occurs when exposure and ______ both
affect inclusion in the analysis.

Question 30

Q

What does conditioning on a collider do?

Answer

A

Induces a spurious relationship between the variables leading into it.

Question 31

Q

Confounding is the presence of common effects while selection bias is conditioning on common causes (T/F).

Answer

A

False, the opposite!

Question 32

Q

Effect modification refers to the situation where the
strength of association between exposure and
outcome differs across ____________.

Answer

A

levels of a third variable

Question 33

Q

Explain the difference between additive and multiplicative interaction.

Answer

A

Additive interaction is when the absolute risk changes across levels of a third variable, multiplicative interaction is present when the RR varies. You can have additive without multiplicative if the ratios are all the same.

Question 34

Q

The interaction term is the excess change in the outcome not explained by the _____.

Answer

A

Sum of the individual effects of two independent predictor variables.

Question 35

Q

What are two things to be aware of when interpreting interaction terms?

Answer

A

Heterogeneity due to small sample size and confounding/error across strata of the effect modifier.

Question 36

Q

Case-crossover studies are used to examine the acute
health effects of ____________.

Answer

A

intermittent exposures with short induction times

Question 37

Q

What kind of confounding is not present in case-crossover studies?

Answer

A

Confounding due to variables which do not change within single individuals across the reference/case periods.

Transient co-exposures may cause confounding, though.

Question 38

Q

What are the three ways you can have selection bias in a case crossover study?

Answer

A

nonrepresentative case selection, differential case survival (exposure influences survival), control time not independent of exposure.

Question 39

Q

Exposures during case and reference periods must be _______, to avoid ______________.

Answer

A

independent of each other; carryover effects

Question 40

Q

We use a time-stratified design if experiencing the event doesn’t impact the likelihood of ___________, so control periods can
be selected before and after the event.

Answer

A

subsequent exposures.

Question 41

Q

What analysis do we perform for case-crossover studies?

Answer

A

Logistic regression to get Odds Ratios

Question 42

Q

Case-crossovers look at the ____ in exposure between two periods, rather than the absolute level.

Question 43

Q

What’s the major con of case-crossover studies?

Answer

A

They only use information from a single point in time, so they capture prevalence rather than incidence. Also cannot determine relationships.

Question 44

Q

Ecological studies are good for studying variability __________ and for prevention at a _____________.

Answer

A

between homogenous populations; population level

Question 45

Q

Name the three types of ecological measures.

Answer

A

Aggregate measures (characteristics within
the group)

Environmental measures (physical
characteristics of a location)

Global measures (group characteristics not
reducible to characteristics at the individual level)

Question 46

Q

What do you call it when within-group correlations are different from between-group and you try to conflate the two?

Answer

A

Cross-level inference (ecological bias)

Question 47

Q

What three steps have to be at the group level in order to make an ecological study?

Answer

A

measurement, analysis, and inference

Question 48

Q

What are the pros and cons of ecological studies?

Answer

A

Less expensive (good starting point + existing measurements) and increased generalizability potential.

Hard to control confounding, identify temporal relationships, and threat of ecological fallacy.

Question 49

Q

The Poisson distribution is used for counts—if events
happen at a constant rate over time, the Poisson distribution
gives the probability of ___________.

Answer

A

X number of events occurring in time T

Question 50

Q

Count frequencies are often positively skewed with ________________.

Answer

A

most values being low and relatively few high values

Question 51

Q

In Poisson distribution, the mean event rate is supposed to be equal to the ______.

Question 52

Q

In Poisson Regression the dependent variable is the log of ________.

Answer

A

the expected count

Question 53

Q

How does logE[Yi] respond with each parameter in the model?

Answer

A

linear increase!

Question 54

Q

exp(B1) in a Poisson model approximates to?

Answer

A

Rate Ratio for a one unit increase

Question 55

Q

What is the offset parameter?

Answer

A

logTi (the
time that unit “i/p” was at risk)

The offset parameter is used to account for differences in exposure time or population size when modeling count data.

Question 56

Q

The effect of over-dispersion is that the precision of effect
estimates is not correct, leading to ______.

Answer

A

too-narrow confidence intervals

Question 57

Q

θ is the _____ over-dispersion parameter

Answer

A

Quasi-Poisson (more flexible variable function)

Question 58

Q

Splines fit a number of different ______ (usually
cubic) over the range of the data and are joined smoothly at
_______ to examine nonlinear forms.

Answer

A

polynomial curves; knots

Question 59

Q

Splines can be used to flexibly adjust for ______.

Answer

A

confounders

Question 60

Q

What are two ways to optimize time-series models?

Answer

A

lag models to correct for delayed effects and sensitivity analyses related to spline knots. Autocorrelation (nonindependent days) is unclear though.

Question 61

Q

_____ analysis is useful when a report makes policy recommendations/actions and draws inferences about
causality

Question 62

Q

Non-participation always leads to selection biased results (T/F).

Answer

A

False, only if participation depends on exposure AND outcome (confounding)

We can know this by asking nonparticipants to fill out a short questionnaire

Question 63

Q

Where do we get information on an unmeasured confounder?

Answer

A

A sub-sample of the study population OR educated guesses based on other studies

Question 64

Q

______ can be conducted to collect
measurements using both the “gold standard” method and the
less accurate/reliable method

Answer

A

Internal validation studies

Answer 59

A

uncertainty in the bias parameter itself

Answer 60

A

Contingency table –> multiply OR by selection OR.

Specify association between confounder and the outcome for the unexposed, as well as the prevalence in both groups

Reshuffle contingency table based on sensitivity and specificity values for each cell.

Answer 61

A

a distribution of adjusted estimates of
association

Answer 62

A

multiple bias analysis

Midterm Flashcards

Lectures 1-5