Epi Bio Review Slides Flashcards

Question

3 criteria for confounding

Answer 1

Confounding occurs when there is an imbalance in risk factors across study arms. A confounder must be:
1. A risk factor for the outcome
2. Associated with the exposure under study (unevenly distributed among exposure groups)
3. Not a mediator in the causal pathway between exposure and outcome

Answer 2

1. Randomization: helps balance risk factors across study arms (RCT)
2. Restriction (e.g., restrict the study to women if worried about confounding by gender)
3. Matching (e.g., if worried about confounding by smoking, match each exposed subject who smokes to an unexposed subject who smokes)
4. Stratification of results (adjusting for confounders): calculate the association (RR or OR) within each stratum of the confounder
5. Multivariate analysis (adjustment for multiple confounders at once): a fancy form of stratification; key words are “adjustment” or “control”
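Item 4 (stratification) can be illustrated with a small sketch. The counts below are invented purely for illustration, and smoking is assumed to be the confounder; the point is only that a crude RR can differ from the RR within each stratum.

```python
def risk_ratio(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Risk in the exposed divided by risk in the unexposed."""
    return (exposed_cases / exposed_total) / (unexposed_cases / unexposed_total)

# Hypothetical counts per stratum:
# (exposed cases, exposed total, unexposed cases, unexposed total)
strata = {
    "smokers":     (40, 100, 8, 20),
    "non-smokers": (2, 20, 10, 100),
}

# Crude RR: pool the strata and ignore the confounder
totals = [sum(column) for column in zip(*strata.values())]
crude_rr = risk_ratio(*totals)

# Stratum-specific RRs: the association within each level of the confounder
stratum_rr = {name: risk_ratio(*counts) for name, counts in strata.items()}

print(f"crude RR = {crude_rr:.2f}")
for name, rr in stratum_rr.items():
    print(f"RR among {name} = {rr:.2f}")
```

In this made-up table the crude RR is well above 1 while both stratum-specific RRs equal 1, which is the pattern expected when the crude association is produced entirely by the confounder.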

Answer 3

The measure of association between the exposure and the outcome is not representative of the true association in the target population (example: depression --> stress and aging; *inflated risk ratio).
- To avoid selection bias, study subjects should be assembled in a manner that dissociates the exposure from the outcome; volunteers should not be aware of the study hypothesis.
- Primarily a problem for retrospective studies, as the outcome has already occurred.
- Randomized trials cannot have selection bias, because subjects join the study before they know their exposure status and before the outcome has occurred.

Answer 4

- Always biases toward the null (RR = 1)
- Happens to the same degree in the groups being compared, i.e., it affects the exposure groups or the outcome groups equally
- e.g., subjective measures, as in the yoga trial: exposure affected
- e.g., a disease that is difficult to diagnose accurately: outcome affected
- To avoid --> use well-defined and precise measures

Answer 5

- Can bias toward OR away from the null
- Affects only ONE of the groups
- e.g., moms with FAS babies lie about alcohol use --> outcome group affected
- e.g., investigators over-diagnose lung cancer (outcome) in smokers --> exposure group affected
- To avoid --> use blinding of investigators or volunteers

Answer 6

- e.g., in an RCT, if the sickest people leave the placebo group to get other treatment, the placebo group looks healthier --> the difference between the experimental treatment and placebo is underestimated
- If the loss to follow-up is not related to the exposure or the outcome, it occurs at random and does NOT cause bias
- To avoid --> stay in touch with volunteers

Answer 7

- NOT needed for RCTs
1. Strong association
2. Consistency across studies
3. Specific result (one cause --> one outcome)
4. Temporality (does the exposure precede the outcome?)
5. Biological gradient (direct relationship between the risk factor and people’s status on the outcome variable)
6. Biological plausibility
7. Coherence (between epidemiological and laboratory findings)
8. Experimental evidence
9. Analogy (e.g., induced smoking in lab rats --> lung cancer)

Answer 8

1. Nominal: unordered (hair color); binary if only 1 of 2 values is possible (HIV --> pos/neg)
2. Ordinal: ordered categories, e.g., breast cancer staging (0-IV)
3. Discrete: restricted to specific values, e.g., number of live births, number of joints with arthritis
4. Continuous: not restricted to any specific value (height, weight, LDL cholesterol) *use histograms or box plots

Answer 9

Median < mean (*more common)

Answer 10

Mean < median

Answer 11

Compute the difference between the 75th and 25th percentiles.
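A minimal sketch of that computation using Python's standard library. The data values are hypothetical, and note that software packages use slightly different percentile conventions, so the exact quartiles can differ a little from a hand calculation.

```python
from statistics import quantiles

values = [2, 4, 4, 5, 7, 9, 10, 12, 15]  # hypothetical measurements

# n=4 splits the data into quartiles: 25th, 50th, and 75th percentiles
q1, median, q3 = quantiles(values, n=4)
iqr = q3 - q1  # 75th percentile minus 25th percentile

print(f"Q1 = {q1}, Q3 = {q3}, IQR = {iqr}")
```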

Answer 12

In a normal distribution:
- 68% of the values lie within 1 standard deviation of the mean
- 95% lie within 2 standard deviations of the mean
- 99.7% lie within 3 standard deviations of the mean
- The mean, median, and mode are equal
- The curve is bilaterally symmetric
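The percentages on this card are rounded. As a quick check, the share of a normal distribution lying within k standard deviations of the mean equals erf(k / sqrt(2)), which the sketch below evaluates for k = 1, 2, 3 (the function name is just illustrative).

```python
from math import erf, sqrt

def share_within(k):
    """Proportion of a normal distribution within k SDs of the mean."""
    return erf(k / sqrt(2))

for k in (1, 2, 3):
    print(f"within {k} SD: {100 * share_within(k):.1f}%")
```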

Answer 13

z-score: the number of standard deviations that a value lies above or below the mean.

z = (Value – Mean) / Standard Deviation

- If a person’s z-score is 0, then you know their data value is average.
- If it is 1, then you know their data value is slightly higher than average, but not impressively so.
- If their z-score is 6, then you know their data value is WAY out in the upper tail of the distribution. This could be a good thing (if we are measuring high jump ability, for example) or a bad thing (if we are measuring LDL cholesterol).
- z-score of 0 --> at the median (50th percentile)
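The formula can be sketched in a few lines. The numbers are hypothetical, and converting a z-score to a percentile this way assumes the variable is normally distributed.

```python
from statistics import NormalDist

def z_score(value, mean, sd):
    """Number of standard deviations that value lies above or below the mean."""
    return (value - mean) / sd

def percentile(z):
    """Percentile of a z-score, assuming a normal distribution."""
    return 100 * NormalDist().cdf(z)

# Hypothetical example: a value of 130 when the mean is 100 and the SD is 15
z = z_score(130, mean=100, sd=15)
print(f"z = {z}, percentile = {percentile(z):.1f}")
```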

Answer 14

A range of values that we expect to include the true population parameter we are estimating (whereas a sample mean is our best single estimate of that parameter).
- Interpretation: we are 95% confident that the interval between the lower and upper bounds contains the true value we are trying to estimate, e.g., "95% certain that the true population mean glucose is between 105.7 and 107.9 mg/dl."
- This is because, in repeated samples, 95% of the confidence intervals will contain the unknown population parameter.
- Studies with a larger sample size --> narrower CI
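A minimal sketch of how such an interval might be computed, using the normal approximation (mean ± z × SD / √n). The summary statistics are hypothetical and the function name is illustrative; with small samples a t-based interval would normally be used instead.

```python
from math import sqrt
from statistics import NormalDist

def mean_ci(sample_mean, sd, n, confidence=0.95):
    """Normal-approximation confidence interval for a population mean."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    margin = z * sd / sqrt(n)                       # z times the standard error
    return sample_mean - margin, sample_mean + margin

# Hypothetical glucose summary: mean 106.8 mg/dl, SD 10 mg/dl
low_100, high_100 = mean_ci(106.8, 10, n=100)
low_400, high_400 = mean_ci(106.8, 10, n=400)                  # larger sample
low_99, high_99 = mean_ci(106.8, 10, n=100, confidence=0.99)   # higher confidence

print(f"n=100, 95%: ({low_100:.1f}, {high_100:.1f})")
print(f"n=400, 95%: ({low_400:.1f}, {high_400:.1f})")
print(f"n=100, 99%: ({low_99:.1f}, {high_99:.1f})")
```

Quadrupling the sample size halves the width of the interval, while raising the confidence level widens it.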

Answer 15

• If a CI includes the null value, do not reject H0
- Example: RR = 1.23, CI = (0.95, 1.43) --> do not reject H0
• If a CI does not include the null value, reject H0
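The rule can be written as a small helper. The function name is illustrative; the null value is assumed to be 1 for ratio measures (RR, OR) and would be 0 for differences.

```python
def ci_decision(lower, upper, null_value=1.0):
    """Return the hypothesis-test decision implied by a confidence interval."""
    if lower <= null_value <= upper:
        return "do not reject H0"
    return "reject H0"

# The example from this card: RR = 1.23 with CI (0.95, 1.43)
print(ci_decision(0.95, 1.43))
```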

Answer 16

Standard error (SE): the standard deviation of many sample means
Standard deviation (SD): the variability of observations within a single sample
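A simulation sketch of this distinction, using an invented population (mean 100, SD 15) and samples of size 25. It compares the SD of many simulated sample means against the usual formula SE = SD / √n.

```python
import random
from math import sqrt
from statistics import mean, stdev

random.seed(0)  # fixed seed so the sketch is repeatable
POP_MEAN, POP_SD, N = 100, 15, 25  # hypothetical population and sample size

# Draw many samples and record the mean of each one
sample_means = [
    mean(random.gauss(POP_MEAN, POP_SD) for _ in range(N))
    for _ in range(2000)
]

observed_se = stdev(sample_means)   # SD of the sample means
theoretical_se = POP_SD / sqrt(N)   # SD / sqrt(n)

print(f"observed SE = {observed_se:.2f}, theoretical SE = {theoretical_se:.2f}")
```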

Answer 17

- Increasing the level of confidence, say from 95% to 99%
- More variability among the observations (i.e., a larger standard deviation, and therefore a larger standard error)
- A smaller sample size

Answer 18

- If p-value < .05: reject the null hypothesis
- If p-value >= .05: DO NOT reject the null hypothesis
- Definition: "Given the null hypothesis is true, the probability of obtaining a result at least as extreme as the one observed."

The p-value does NOT tell us:
- How likely it is that H0 is true
- How likely it is that Ha is true
- The clinical significance of the result
- Anything about the role of bias or confounding in the study
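As a sketch of the "at least as extreme" definition, the block below computes a two-sided p-value for a test statistic that is assumed to follow a standard normal distribution under the null hypothesis, then applies the .05 cutoff. The function names are illustrative.

```python
from statistics import NormalDist

def two_sided_p(z):
    """P(result at least as extreme as z), assuming z ~ N(0, 1) under H0."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

def decision(p_value, alpha=0.05):
    """Apply the significance cutoff to a p-value."""
    return "reject H0" if p_value < alpha else "do not reject H0"

for z in (0.5, 1.96, 3.0):
    p = two_sided_p(z)
    print(f"z = {z}: p = {p:.4f} --> {decision(p)}")
```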

Answer 19

Used to compare the means of two groups (the outcome is a continuous variable and the exposure is binary).
• Example: comparing the mean blood pressures of men and women
• In an abstract: "compared two means" and the outcome is continuous --> two-sample t-test
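A sketch of the test statistic for the equal-variance (pooled) version of the two-sample t-test, using invented blood pressure values. Only the t statistic and degrees of freedom are computed here; the p-value would come from a t distribution, which the standard library does not provide.

```python
from math import sqrt
from statistics import mean, variance

def pooled_t(group_a, group_b):
    """Two-sample t statistic and degrees of freedom, assuming equal variances."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    se_diff = sqrt(pooled_var * (1 / n_a + 1 / n_b))  # SE of the difference in means
    return (mean(group_a) - mean(group_b)) / se_diff, df

# Hypothetical systolic blood pressures (mmHg)
men = [120, 125, 130, 135, 140]
women = [110, 115, 120, 125, 130]

t_stat, df = pooled_t(men, women)
print(f"t = {t_stat:.2f} on {df} degrees of freedom")
```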

Answer 20

Appropriate when both exposure and outcome are nominal (often binary).
• Used to compare two or more proportions (percentages)
• Allows you to calculate a p-value associated with RRs and ORs
• Compares observed values to expected values (the values expected if H0 were true)
- If observed values are similar to expected values (small χ2), H0 is plausible
- If observed values are different from expected values (large χ2), H0 is rejected
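The observed-versus-expected comparison can be sketched for a 2x2 table as follows. The counts are hypothetical, no continuity correction is applied, and the p-value shortcut used here is valid only for 1 degree of freedom (i.e., a 2x2 table).

```python
from math import erfc, sqrt

def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value for a 2x2 table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if H0 (no association) were true
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (observed - expected) ** 2 / expected
    p_value = erfc(sqrt(chi2 / 2))  # upper tail of chi-square with 1 df
    return chi2, p_value

# Hypothetical counts: rows = exposed / unexposed, columns = diseased / not diseased
chi2, p_value = chi_square_2x2([[30, 70], [10, 90]])
print(f"chi-square = {chi2:.2f}, p = {p_value:.4f}")
```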

Answer 21

Type I error: rejecting H0 when H0 is actually true
- α = 0.05 means that there is a 5% chance of a type I error when H0 is true

Answer 22

Not rejecting H0 when H0 should be rejected in favor of Ha

Answer 23

Probability that a statistical test will result in a correct rejection of H0

Answer 24

Quantifies the strength (magnitude) and direction of the linear relationship between two continuous variables.
- Has no units
- Ranges from -1 to +1
- 0 = no linear relationship
- -1 = perfect inverse linear relationship
- +1 = perfect positive linear relationship

Problems: some variables may be related to each other, but not linearly; extreme data points may distort the coefficient.
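A small sketch of the Pearson correlation coefficient computed from its definition, using made-up data. The last example shows the first problem noted on this card: y depends entirely on x, but not linearly, so the coefficient is 0.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

x = [-2, -1, 0, 1, 2]
print(pearson_r(x, [2 * v + 1 for v in x]))   # perfect positive linear relationship
print(pearson_r(x, [-3 * v for v in x]))      # perfect inverse linear relationship
print(pearson_r(x, [v ** 2 for v in x]))      # related, but not linearly
```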

Answer 25

y = β0 + β1 X + ε
• Used to determine how the outcome variable (Y) changes as we change the level of exposure (X)
• Allows us to quantify the association (slope of the line, β1)
• Outcome must be continuous (e.g., blood pressure); exposure may be dichotomous, ordinal, discrete, or continuous
• The regression line minimizes the sum of the squared vertical distances between the data points and the best-fit line
• Slope (β1) is the change in Y for a one-unit change in X
• Intercept (β0) is the value of Y when X is zero
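A minimal least-squares sketch for a single exposure, using the closed-form estimates β1 = Sxy / Sxx and β0 = mean(y) − β1 × mean(x). The data points are invented for illustration.

```python
def fit_line(xs, ys):
    """Least-squares intercept (b0) and slope (b1) for y = b0 + b1 * x."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx                      # change in y per one-unit change in x
    intercept = mean_y - slope * mean_x    # value of y when x is zero
    return intercept, slope

# Hypothetical data
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

intercept, slope = fit_line(xs, ys)
print(f"y = {intercept:.1f} + {slope:.1f} x")
```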

Answer 26

Used when the outcome is binary (yes/no, e.g., mortality, MI); produces ADJUSTED odds ratios; can be used with one or multiple exposures
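For intuition only: with a single binary exposure and no other variables in the model, the odds ratio from this kind of model equals the cross-product odds ratio of the 2x2 table, and the model coefficient is its natural log. The sketch below uses hypothetical counts and does not fit a model; adjusted odds ratios require fitting with the additional variables included.

```python
from math import exp, log

def odds_ratio(exposed_cases, exposed_noncases, unexposed_cases, unexposed_noncases):
    """Cross-product (unadjusted) odds ratio from a 2x2 table."""
    return (exposed_cases * unexposed_noncases) / (exposed_noncases * unexposed_cases)

# Hypothetical counts
crude_or = odds_ratio(30, 70, 10, 90)
beta = log(crude_or)  # coefficient on the log-odds scale

print(f"OR = {crude_or:.2f}, log-odds coefficient = {beta:.2f}")
print(f"exp(coefficient) = {exp(beta):.2f}")
```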

Answer 27

Outcome is also binary (e.g., mortality) but INCLUDES A TIME ELEMENT; useful for adjusting survival analyses for extraneous variables (i.e., confounders and effect modifiers); yields hazard ratios (like relative risks)

Answer 28

assesses multiple exposures and potential confounding, used to identify effect modification (same as linear regression, but with at least two exposure variables)