Biostats Flashcards

1
Q

Prevalence

A

of existing cases of a disease at a specified time / # of people in base population at that time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Incidence

A

new cases occurring in a specific time period / # people initially at risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Case fatality

A

of people who die of a disease / total # people with the disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mortality

A

people dying of a disease in a specified time period / # people alive during that time period

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Years of Potential Life Lost (YPLL)

A

Calculated by multiplying the number of cause-specific deaths in an age group by the difference between the midpoint of the age group and the average age at death (assumed to be 75)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Reasons for an association betweeen a factor and a disease

A
Bias in the sampling of subjects
Bias in the measurement of the factor
Confounding 
Chance
Transposition of cause and effect
Causal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Relative Risk

A

Incidence of disease in the exposed / incidence of disease in the unexposed

Ex: 3.5x more likely to have cancer if you smoke

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Attributable Risk

A

The difference between incidence of the disease in individuals with a risk factor and in those without a risk factor

AR = risk in the exposed - risk in the unexposed

Ex: In population X, 500 per 1,000 cases are due to smoking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Number Needed to Treat (NNT)

A

NNT = 1 / attributable risk

If 10 cases out of 50 are due to risk factor X, then NNT = 1 / (10/50) = 5 needed to treat in order to gain 1 outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Population Attributable Risk (PAR)

A

The proportion of cases that would be prevented if the risk factor could be eliminated from the population

PAR = total incidence - incidence in unexposed / total incidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Cross-Sectional study

A

Usually large surveys of a representational sample group; assesses both risk factors and disease status in the present in order to draw correlations; can give information about the relative risk for certain diseases given exposure to different risk factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Case-control (retrospective study)

A

A sample of cases and controls are examined for past history of risk factors; association of the disease with the risk factor is assumed to correlate with an association between the risk factor and the disease

Susceptible to sampling bias and re-call bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Cohort Study

A

AKA Prospective, longitudinal study

A cohort (exposed and unexposed) is assembled and followed over time to determine who develops disease

Limited utility for very rare diseases or very long latencies, vulnerable to bias if loss-to-follow up is unequal in exposure vs. unexposure groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Randomized Controlled Trial (RCT)

A

Participants are randomized to trial arm (exposed) or control arm (unexposed) and followed to assess outcomes

High internal validity, lower external validity (groups were artificially designed and so are not representative)

Vulnerable to cross-over; must use an intent-to-treat analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Sensitivity

A

Describes how good a screening test is at detecting true positives.

Sensitivity = TP / (TP + FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Specificity

A

Describes how often a test shows true negatives.

Specificity = TN / (TN + FP)

17
Q

Positive Predictive Value (PPV)

A

Describes how often individuals with positive test results actually have the disease

PPV = TP / (TP + FP)

PPV increases with higher disease prevalence. Increased test specificity increases PPV.

18
Q

Negative Predictive Value (NPV)

A

Describes how often individuals with negative test results are actually disease-free

NPV = TN / (TN + FN)

NPV increases with lower disease prevalence and increased sensitivity.

19
Q

Lead-time bias

A

If a new screening test diagnoses a disease one year earlier but treatment has no effect on survival, data will show a false 1-year improvement in survival; this is an artifact of lead-time bias

20
Q

Length time bias

A

Screening tests are likely to pick up individuals with longer asymptomatic phases (because symptomatic patients have already been diagnosed, we assume); therefore, those cases detected by screening are more likely to have longer disease regardless of whether it is detected by screening or not

21
Q

2 methods for combining screening tests

A
  1. Parallel testing - The overall screening result is positive if any one test is positive; increases sensitivity at the expense of specificity
  2. Series testing - The overall screening result is positive only if all tests are positive; increases specificity at the expense of sensitivity
22
Q

2 types of quantitative variables

A
  1. Continuous, i.e. weight, BP, serum levels, etc.
    Continuous data are compared with t-test
  2. Categorical (ordinal), i.e. no disease vs. disease
    Categorical data are compared with Chi Square test
23
Q

Mode

A

The most commonly observed value

24
Q

Median

A

The middle value in a data set arranged lowest to highest

25
Q

Mean

A

The arithmetic average

Mean = sum(x) / n

26
Q

Range

A

The highest value minus the lowest value

27
Q

Variance

A

Variance = sum (x - mean)^2 / (n - 1)

28
Q

Standard deviation

A

The square root of the variance

29
Q

Characteristics of a normal distribution

A

Mean = median
67% of values lie within 1 standard deviation from the mean
95% of values lie within 2 standard deviations of the mean

30
Q

Type I error

A

False Positive

The investigator wrongly concludes that there is a difference, when the difference actually occurred as a result of chance or bias

31
Q

Type II error

A

False Negative

The investigator concludes that there is no difference, when actually there is a true difference obscured by chance or bias

32
Q

P value

A

The probability that chance alone caused the observed association

.05 is a conventional threshold

33
Q

Confidence Interval

A

The range of values within which you are X% confident that the true value lies

95% CI identical to P = .05

34
Q

Effect Modification

A

Occurs when the effect of a risk factor on an outcome is different at different levels of a third factor; the third factor is known as the effect modifier

Ex: age, gender

35
Q

Beta Value

A

Beta is the acceptable risk of making a type II (false negative) error, arbitrarily set at .2

Power = 1 - B x 100

Therefore, a study that accepts the null should be able to shower a power of 80% or higher