Epi Flashcards

1
Q

Selection Bias

A

Error due to systematic differences in characteristics of those who do or do not participate in the study. This occurs bc of the action of researchers and/or participants.

  • -Study population is not representative of source population
  • -“Systematic” : associated with both exposure and outcome status.
  • -“Participate”: enrollment or retention

Can lead to incorrect measure of association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Evaluating The effects of Bias on the validity of effect estimate (OR RR)

A

Potential alternative explanation for observed effect (need to consider the direction and magnitude)

Effects of bias are difficult to quantify and also hard to remove once introduced. This is why we try to prevent it from the beginning and consider it as a possible variable when analyzing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Five Sources of Selection Bias

A

In case control studies

  1. Selection of control group
  2. Self selection
  3. Differential surveillance/diagnosis/referral patterns

In cohort studies

  1. selection of external comparison group
  2. losses to follow up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bias in Selection of Control Group

A

Bias can occur is control group is not representative of the underlying source population with respect to exposure.

This can happen when different criteria are used to select cases and controls, and those criteria are related to exposure i.e., difference between cases and controls that is related to exposure. Results in invalid OR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can selection Bias be minimized

A

Ensure that control condition does not share exposure as risk factor
–Based on external knowledge e.g., knowing about ulcers and Coffee

Use same selection criterion for cases and controls
–Do not impose explicit or implicit restrictions that you do not on the other

Use controls from same catchment area
–E.g., Restrict cases to diabetic patients who reside in general catchment area for hospital where injury controls and enrollment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Self Selection Bias

A

AKA: non-response, non-participation, or refusal bias

Occurs when there is a systematic difference in who participates (volunteers) and who does not participate

Can result in invalid OR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How to minimize self selection bias

A

Ensure high participation rates in all groups

  • -Motivation
  • -Ease/feasibility
  • -Incentives/rewarding
  • -Study staff
  • -Etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Differential diagnosis and/or referral (case control)

A

Participants (cases/controls) are made known to investigators and thus enrolled in a way that is differential related to exposure.

  • -Can be in how cases are detected
  • -Can be in how controls are IDed
  • -Also called detection bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Minimizing Differential detection bias

A

Use multiple control groups (i.e., population based and hospital based)

Selection of control condition with similar detection

Use a case definition that addresses detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Selection of Comparison Group (cohort study)

A

May occur if external comparison group is used

This is a type of selection bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Loss to follow up bias (cohort studies)

A

People who are lost to follow up may differ from those who remain in the study

A major threat to the internal validity in cohort and intervention studies

Losses to follow up may or may not result in biased estimates of relative risk depending on whether or not the difference is systematic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Differential and Non-Differential

A

Differential (systematic) : Related to both exposure and disease resulting in biased estimates of relative risk (RR), either toward or away from the null.

Non-Differential: Related only to exposure or disease but not both. Generally does not create problems with RR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Sources of Information Bias

A

Non- Differential Misclassification

Differential Misclassification

  • -Recall bias
  • -Interviewer bias
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Directionality of Bias

A

When working with relative risk that are <1.0 we can still think about the directionality of bias similarly

Is the biased/observed measure of association closer or further from the null (RR = 1.0?)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Information Bias

A

Error in classification of exposure and/or outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Non-Differential Misclassification

A
  • -Errors associated with exposure OR disease, but not both
  • -For ex: error in exposure is not related to disease
  • -Effect on OR/RR is biased towards the null (usually)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Differential (systematic) Misclassification

A
  • -Errors related to exposure and disease
  • -For ex: error in exposure is related to disease
  • -Effect on OR/RR is biased away form or toward the null (this is unpredictable)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Case-Control Studies + Misclassification

A
  • -Generally observe the same pattern

- -Case control studies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Confounding Variables

A

An unmeasured third variable that influences both the supposed cause and the supposed effect.

The observed measure of association is distorted
because the effect of the extraneous factor is mixed with
the effect of the exposure on the outcome
– Get an incorrect effect estimate
– Either overestimation (positive confounding) or underestimation
(negative confounding)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Association

A

Smoke asbestos/radon exposure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Predictors (risk factors)

A

Family history alcohol use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Associations

A

Carrying Matches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Like Bias

A
  • -Alternative explanation for observed effect
  • -Distorts the true measure of association
  • -Can overestimate, underestimate, or reverse direction of effect
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Unlike Bias

A
  • -Not systematic error committed by investigator or participants
  • -Reflects the nature of what we are studying
  • -Complex relationships between many variables
  • -Can be controlled in design and analysis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Assessing Confounding (1): Exposure-Confounder Relationship

A
  • -These two variables (exposure + Confounder) need to be associated in your data
  • -Not necessarily a risk factor or causally related
  • -Uneven distribution of confounder in exposed and unexposed groups for any reason (associated in your data or a true risk factor)
  • -So what is one good method to control confounding

(randomization helps control this)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Assessing confounding (2): Confounder-Disease Relationship

A
  • -Is the confounder a risk factor for the disease/outcome
  • -Association between confounder and disease exists independant of exposure
  • —Association is present among both the exposed and the unexposed
  • —-Can check this in your data
  • -This is a more stringent measure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Assessing confounding (3)

A

A confounder cannot be an intermediate step in the causal pathway between exposure and disease

28
Q

Randomization

A
  • -Random assignment to treatment condition in an intervention study
  • -If done properly with large enough sample size this has a high likelihood of producing an even distribution of all other variables between study groups.
  • -Comparison groups will have a similar distribution of all potential confounders (known and unknown)
  • -> Exposure is not associated with potential confounders
29
Q

Problems with Randomization

A

Not always feasible

Sometimes it does not work (can be checked in the analysis)

30
Q

Controlling Confounders

A

To remove distortion in observed measure of association
between exposure and outcome

– For valid interpretation effect estimates

• Design
– Randomization
– Restriction
– Matching

• Analysis
– Standardization
– Stratified analysis including adjustment
– (Matched analysis in case-control studies)
– (Multivariate regression)

31
Q

If Randomization is not possible

A

–Must ID potentially confounding variables prior to conducting study

–Consider all known or suspected risk factors for disease under study that might also be associated with exposure (often consider age, sex, and race to be potential confounders of many associations)

–Should also be done if randomization does not work

32
Q

Restriction (Confounders)

A

Limiting eligibility criteria to certain levels of suspected confounders

Advantages: Fairly simple approach

Disadvantages

  • -Generalizability
  • -Sample Size
  • -Multiple potential confounders
33
Q

Matching

A

Selecting study subjects so that potential confounders will be evenly distributed between study groups
–Can do this in case-control studies or cohort studies

Some situations, this is desirable

  • -small # of cases (limited statistical power)
  • -When confounders are complex to measure (e.g., SES )

Some potential disadvantages

  • -Can be logistically difficult (esp. with multiple confounders)
  • -Requires the use of different statistical analysis in case-control studies
  • -Cannot analyze matching variables
34
Q

Methods to Control Confounding Analysis

A
  • -Standardization
  • -Stratified analysis including adjustment
  • -(Matched analysis)
  • -(Multivariate regression)

General advantages

  • -Can do post-ho (if you’ve collected data)
  • -Data analysis is more informative
35
Q

Stratified Analysis

A

Conducting analysis within levels (strata) of the potentially confounding variable
–Each stratum is restricted

Objectives

  • -Evaluate the presence of confounding
  • -Control for confounding
  • -Learn more about complex relationships between variables of interest
36
Q

Steps in Stratified Analysis

A
  1. Examine crude measure of association between exposure and outcome
  2. Stratify data by level of confounder, and examine stratum-specific measures of association between exposure and outcome
  3. If stratum-specific estimates are similar to each other and different from crude estimate then confounding is present
  4. Report the stratum-specific estimates, OR
  5. Calculate a summary measure of association (adjusted estimate) using formulas provided in table
37
Q

Measurement Error

A

Error in measurement that are random and occur in an unpredictable way.

Can minimize by taking repeated measurements

38
Q

Precision

A

Lack of random error

39
Q

Random Error

A

Error that occurs due to chance (bad luck)

Uncontrollable forces that have no explanation

40
Q

Sampling Variability

A

A sample may be unrepresentative of the source population due to chance, In this way, it is unpredictable.

41
Q

Reducing Sampling Error

A

Increase Sample Size

  • -Small samples are more likely to produce erroneous results due to chance alone
  • -

Take Repeated samples

42
Q

Random vs Systematic Error

A
43
Q

Statistical Inference

A

Goal: To lear about the true exposure-outcome association

Reality

  • -You do a study, the results of which are dependent on who is in your study sample (sampling variability)
  • -You may select an unrepresentative sample that produce
44
Q

Two Methods to assess random error

A

Hypothesis testing (p testing)

  • -Quantifies the degree to which sampling variability may explain observed association
  • -The likelihood (probability) that chance is an explanation
45
Q

Hypothesis testing: 3 Steps

A
  1. Specify null and alternative hypothesis
  2. H(0) is assessed by a statistical test that gives you a p value after computation using known formulas
  3. Reject or fail to reject the null hypothesis based on p-value
46
Q

Statistical significance

A

By convention: If the probability of observing a results given that the null hypothesis is true is p

47
Q

P Values

A

Small P value can occur for small effects in large studies

  • -It is possible to have a statistically significant trivial risk increase when the sample size is large
  • -If you have a small p value is it bc of a strong association or a large study

Large P values can occur for large effects in small studies
– it is possible when the sample is small that a large risk increase is not statistically significant.

P values are confounded statistics: They mix effect size and

48
Q

Confidence Intervals

A

Definition of a 95% CI: You do a study 100 times and got 1– point estimates and 100 CIs in 95 of the 100 results, the true point estimate would lie within the given interval. In 5 sentences, the true point estimate would not lie within the given interval

Interpretation of single 95% CI: Range of plausible values within which the true

49
Q

Confidence intervals

A

Width of CI indicates: Amount of sampling variability (random error) in the data and therefore a level of certainty

All else being equal, a large sample size will produce a more narrow CI

50
Q

Why CI intervals are preferred over p values

A

–The p value and CI are calculated form the same info

Put unlike P values…
–CIs separate magnitude of the effect from sample size

51
Q

Hierarchy of Populations

A

External : Population to whom we hope our findings apply
——-> Generalizability
Target : Population of interest / feasible to study
——–> Selection Bias
Study : Population from whom we sample study participants
———> Sampling Error
Sample : Individuals who are enrolled in our study / provide study data

52
Q

Generalizability

A

Considerations for generalizability
–Knowledge of study population; comparison to external
populations
–Understanding of disease process (biology)
–Effect Modification is an important consideration

Largely a qualitative exercise

Note: Internal validity is a prerequ for generalizability

53
Q

Multiplicative Effect Modification

A

RR (11) = RR when both factors are present compared to neither
RR (10) = RR when only one factor is present compared to neither
RR(01) = RR when only the other factor is present compared to neither
RR (00) = 1

54
Q

Assessing Additive Effect Modification

A

R(11) = R(10) + R(01)- R(00) –> No additive effect

R(11) = Risk present in both factors 
R(10) = Risk Present in one factor alone
R(01) = Risk in the presence of the other factor alone 
R(00) = Risk in absence of both factors 

OR

RR(11) = RR(10) + RR(01) -1 (divide each term by RR(00))

55
Q

Screening

A

The presumptive identification of unrecognized disease or condition by application of test, exams, or procedures which can be applied rapidly to sort out apparently well persons who probably have a disease from those who probably do not

A screening test is performed to identify disease or precursors of disease at an earlier more treatable stage in asymptomatic individuals.

56
Q

Primary Prevention

A

Activities that prevent the disease process from Starting

Reduce incidence of disease

57
Q

Secondary Prevention

A

Early Detection (Screening)

  • -Activities that reduce the expression or severity of disease
  • -IDs asymptomatic individuals with disease
  • -Delay onset of clinical disease: Shorten duration; improve survival
  • -May see an increase in incidence
58
Q

Tertiary: Minimizing impairments and disabilities

A
  • -Activities that slow or stop progression of clinical disease
  • -Improve survival/quality of life
59
Q

When is it appropriate to screen?

A
  1. “important” health problem (prevalence, severity, morbidity, mortality)
  2. Detectable pre-clinical phase that is “long” and “prevalent”
  3. Appropriate test (simple, rapid, and convenient to conduct, inexpensive, accessible to population and safe, Accurate (valid).)
  4. “Acceptable” treatment for persons with condition that positively affects the natural history of disease
60
Q

Goal with Screening

A

It is to correctly id individuals with and without the disease

61
Q

Measurements using in screening

A

Sensitivity: % of persons with the disease who test positive (Probably that a test correctly classifies person with disease)

Specificity: % of persons w/o disease who test negative (probability that the test correctly classifies persons w/o disease.)

Positive predictive value: % of people with positive test who have the disease

Negative predictive value: % of persons with negative test who do not have the disease

62
Q

Why is PPV important for screening?

A
  • -Yield
  • -Detection of large number of people with disease among those who test positive
  • -Minimize the number of false positives (“b”)
  • —-Yield will be higher is false positives are lower
  • —-Costly, and harmful
63
Q

Determinants of PPV

A
  • -Prevalence of disease: Population characteristic
    i. e., Screening a high risk population will provide a higher PPV than screening a low risk population
  • -Test Characteristics
  • -> Specificity (more so) –> determines the # of false positives, which should be relatively low for high PPV
  • -> Sensitivity (less so)
64
Q

Consequences of False Positives (b cell)

A
  • -Monetary costs of morbidity associated with diagnostic test
  • -Psychological costs of believing one may have a serious disease; stigma
  • -Discomfort, cost to patient
  • -Burden on health care system
65
Q

Consequences of False Negatives (C cell)

A
  • -Disease will be missed at an early, treatable stage

- -Seriousness of disease

66
Q

Healthy worker effect

A

type of selection bias that occurs in occupational epidemiology and generally occurs in cohort studies when comparing internal to external

people who are working are generally healthier than those who are not…