Biostats/Epi Flashcards

1
Q

What is sensitivity?

A

Indicates how well a test can SCREEN for a disease
A higher sensitivity means taht negative results are less likely to be FNs and more likely to be TNs, thus better able to rule disease out (SNOUT)
Important for screening tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is specificity?

A

Indicates how well a test can CONFIRM the diagnosis
High specificity means positive results are less likely to be FPs and more likely to be TPs thus positive results better able to rule the disease in (SPIN);
Important for confirmatory tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is accuracy?

A

true positives + true negatives/ (all results)
Depends on sensitivity and specificity as well as prevalence of condition in population
Increases as total area under the ROC curve increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

P values and confidence intervals

A

P value is the probability of observing a given (or more extreme) result due to chance alone, assuming the null hypothesis is true. A result is generally considered statistically significant when p<0.05.
For study results to be stat sig, the confidence interval must NOT contain the null value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is susceptibility bias (selection bias)?

A

When the treatment regimen selected for a pt depends on the severity of the pt’s condition, a form of selection bias known as susceptibility bias (confounding by indication) can result.
To avoid selection bias, pts are randomly assigned to treatments to minimize potential confounding variables and there is intention to treat analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Lead-time bias.

A

Happens when two disease interventions are compared and one diagnoses the disease earlier than the other w/o an effect on the outcome (survival).
This would make it appear that the intervention prolonged survival when it really just diagnosed the disease sooner.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Measurement bias.

A

Occurs from poor data collection with inaccurate results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Observer bias.

A

Occurs when the observer is influenced by prior knowledge or details of the study in a way that affects the results.
Avoided by blinding observers from knowing tx assignments and by measuring objective outcomes (mortality) that are less likely to be skewed by observers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Recall bias.

A

Occurs when a study participant’s answer to a question is affected by prior exposures.
More common in retrospective studies than prospective studies (RCTs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Precision.

A

The measure of random error.
The tighter the confidence interval, the more precise the result.
Increasing the sample size increases precision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is confounding?

A

Refers to the bias that results when the exposure-disease relationship is mixed with the effect of the extraneous factors (i.e., confounders)
Confounders influence both the exposure and outcome.
Methods to deal with confounding: matching of cases and controls based on the confounding factor, or stratification of the study population based on the confounding factor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is selection bias?

A

Results from the manner in which people are selected for the study, or from the selective losses from follow-up.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

P value and confidence interval.

A

Power of a study: represents ability to detect a different b/n 2 groups (exposed versus unexposed) when there truly is a difference
Increasing the sampel size increases the power of a study and narrows the CI surround the point estimate
CI express ss and are interrelated w/ p values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the null hypothesis?

A

Statement of no relationship/no association between the exposure and the outcome.
To state the null hypothesis correctly, the study design should be considered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Cohort study

A

A cohort study design is best for determining the incidence of a disease.
Comparing the incidence of the disease in 2 populations with and without a given risk factor allows for calculation of relative risk.
Can also calculate relative rate and median survival.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Case control study

A

Subjects w/ the disease of interest (cases) are compared to an otherwise similar group of disease free subjects (controls)
Informaton is then collected about exposure to risk factors.
CC study is retrospective and meant to determine assoc. b/n risk factors and disease occurrence.
An odds ratio can be calculated in a cc study but the incidence of a disease CANNOT be calculated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Ecological study

A

The unit of observation is a population
Disease rates and exposures are measured in 2 (or more) populations and the assoc. b/n disease rates and exposure is determined.
However, results about assoc/ at the population level may not translate to the individual level.
Study cannot be used to determine incidence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Randomization

A

Successful randomizatioin in a clinical trial allows a study to eliminate bias in treatment assignments.
An ideal randomizatioin process minimizes selection bias, results in near-equal treatment and control group sizes, and achieves a low probability of confounding variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Cross-sectional study

A

An observational study design that may be employed to estimate the prevalence of disease, or to examine associations between risk facotrs and diseae as they exist in a well-defined population at one particular time.
Takes a snapshot and measures prevalence of risk factor and outcome simultaneously.
Major limitation: temporal relationship b/n risk factor and disease is not always clear

20
Q

Randomized clinical trial

A

Experimental study that directly comparies >2= treatments or interventions
Subjects randomly assigned to an intervention (eg med) or placebo and then followed for development of outcome of interest (eg disease).

21
Q

Normal distribution.

A

Symmetric, bell-shaped.

All its measures of central tendency are equal. Mean=median=mode.

22
Q

What is median survival?

A

Calculated in cohort studies or clinical trials
Usually used to compare the median survival times in 2 or more groups of patients (eg receiving a new treatment or placebo)

23
Q

What is prevalence odds ratio?

A

Calculated in cross-sectional studies to compare the prevalence of a disease between different populations.

24
Q

Two sample t test?

A

Statistical method used to compare the means of two groups of subjects.

25
Q

Two sampe z test?

A

Can also be used to compare two means, but populations (not sample) variances are employed in the calculatoins.

26
Q

What is the ANOVA (analysis of variance)?

A

Compares 3 or more means.

27
Q

What is the chi square test?

A

Appropriate for categorical data and proportions.

28
Q

What is a meta-analysis?

A

An epidemiologic method of pooling data from several studies to do an analysis having a relatively big statistical power.

29
Q

What is atributable risk percent/etiologic fraction?

A

Represents the risk in the exposed populatoin that can be attributed to the risk factor.
= (RR-1)/RR

30
Q

Case control, matching, founding

A

Matching is an efficient method to control confounding. Frequently used in case-control studies.
Matching variables should always be the potential confounders of the study (eg age, race)
Caese and controls are then selected based on the matching variables, such that both groups have a similar distribution in accordance w/ the variables.

31
Q

What is the Hawthorne effect?

A

The tendency of the study population to affect the outcome since they are aware that they are being studied.

32
Q

What is sample distortion bias?

A

Seen when the estimate of exposure and outcome assoc. is biased b/c the study sample is not representative of the target population w/ respect to the joint distribution of exposure and outcome.

33
Q

What is information bias?

A

Occurs due to the imperfect assessment of assoc. b/n the exposure and ouotcome as a result of errors in the measurement of exposure and outcome status.
It can be minimized by using standardized techniques for surveillance and measurement of outcomes, as well as trained observers to measure the exposure and outcome.

34
Q

Negative predictive value

A

NPV is the probability of being free of a disease if the test result is negative.
NPV will vary w/ the pretest probability of a disease - a pt w/ a high probability of having a disease will have a low NPV, and a pt w/ a low probability of having a diseae will have a high NPV.

35
Q

What is an outlier?

A

An extreme and unusual value observed in a dataset.
Due to recording error, a measurement error, or a natural phenomenon
It can affect measures of central tendency (mean, median, mode) as well as measures of dispersion (eg standard deviation)
The mean is VERY sensitive to outliers and easily shifts toward them especially w/ a small sample size.
The median and mode are more RESISTANT to outliers.

36
Q

What is attrition bias?

A

Loss to follow-up in prospective studies creates a potential for attrition bias, a subtype of selection bias.
When a substantial number of subjects are lost to follow-up, the study may overestimate or underestimate the assoc. b/n the exposure and the disease.
Investigators try to achieve high rates of followup to reduce the potential for attrition bias.

37
Q

What is surveillance bias?

A

Occurs when the exposed group undergoes increased monitoring relative to the general population.
This tends to increase disease diagnoses compared to the general population.

38
Q

What is a factorial design?

A

Involves 2 or more experimental interventions, each w/ 2 or more variables that are studied independently.

39
Q

What is cluster analysis?

A

The grouping of different data point into similar categories.
Usually involves randomizaiton at the level of groups rather than at the level of individuals.

40
Q

What is a cross over study?

A

One in which group of participants is randomized to one treatment for a period of time and the other group is given an alternate treatment for the same time period.
At the end of the time period, the two groups then switch treatments for another set period of time.

41
Q

What is a parallel study?

A

Randomizes one treatment to one group and a different treatment to the other group, such as treatment drug to one group versus placebo to the other group.
There are usually no other variables measured.

42
Q

What is selective survival bias?

A

Occurs in case control studies when cases are selected from the entire disease population instead of just those that are newly diagnosed.
For instance, a study on cancer survival that is not limited to newly diagnosed pts will contain a high proportion of relatively benign malignancies as these pt generally live longer.

43
Q

What is post-hoc analysis?

A

Refers to performing unplanned statistical tests on patterns that were idenitified after the fact in data from a completed study.
This can lead to incorrect conclusions, particularly if the appropriate statistical measures have not been taken into account for these additional tests.
Post hoc analysis can be problematic w/ non-predefined subgroup analysis.
Randomization does not directly impact post hoc analysis, which can be conducted even on data from randomized samples.

44
Q

What is effect modification?

A

Results when an external variable positively or negatively impacts the effect of a risk factor on the disease of interest.
For example, the risk of venous thrombosis is increased w/ estrogen therapy, and this effect is augmented by smoking.

45
Q

Prevalence and incidence

A

An increasing prevalence but stable incidence of a disease can be attributed to factors that prolong the duration of the disease (eg improved quality of care and disease management)

Increased diagnositc accuracy would significantly increase both incidence and prevalence of disase.

46
Q

Raising cut off point

A

Raising the cut-off point (eg increasing the inclusion criterai) of a screening test result in an increase in specificity and decrease in sensitity.