Biostatistics Flashcards
How is the sensitivity of a test defined?
What are highly sensitive tests used for clinically?
Sensitivity - ability of a test to detect disease
Mathematically: # of true positives divided by the number of people with the disease.
Tests with high sensitivity are used for disease screening.
False-positive results occur, but the test does not miss many people with the disease (low false-negative rate).
How is the specificity of a test defined?
What are highly specific tests used for clinically?
Specificity - ability of a test to detect health (or nondisease)
Mathematically: # of true negatives divided by the number of people without the disease
Tests with high specificity are used for disease confirmation.
False-negative results occur, but the test does not call anyone sick who is actually healthy (low false-positive rate).
The ideal confirmatory test must have high sensitivity and high specificity; otherwise, people with the disease may be called healthy
Explain the concept of a trade-off between sensitivity and specificity.
changing the cutoff value in screening tests (or changing the value of any of several screening tests) will change the number of true- and false-negative as well as true- and false-positive results.
- If the cutoff glucose value is raised, fewer people will be called diabetic (more false-negatives, fewer false-positives)
- If the cutoff glucose value is lowered, more people will be called diabetic (fewer false-negatives, more false-positives)
Define positive predictive value (PPV). On what does it depend?
When a test is positive for disease, the PPV measures how likely it is that the patient has the disease (probability of having a condition, given a positive test).
PPV is calculated mathematically by dividing the number of true-positive results by the total number of people with a positive test.
PPV depends on the prevalence of a disease (the higher the prevalence, the higher the PPV) and the sensitivity and specificity of the test (e.g., an overly sensitive test that gives more false-positive results has a lower PPV).
Define negative predictive value (NPV). On what does it depend?
When a test comes back negative for disease, the NPV measures how likely it is that the patient is healthy and does not have the disease (probability of not having a condition, given a negative test).
It is calculated mathematically by dividing the number of true-negative results by the total number of people with a negative test.
NPV also depends on the prevalence of the disease and the sensitivity and specificity of the test (the higher the prevalence, the lower the NPV). In addition, an overly sensitive test with lots of false-positive results makes the NPV higher.
Define attributable risk. How is it measured?
Attributable risk is the # of cases of a disease attributable to one risk factor (in other words, the amount by which the incidence of a condition is expected to decrease if the risk factor in question is removed).
Example, if the incidence rate of lung cancer is 1/100 in the general population and 10/100 in smokers, the attributable risk of smoking in causing lung cancer is 9/100
Ca incidence in smokers - Ca incidence in general population = incidence of Ca attributed to smoking
Given the 2 × 2 table below, define the formulas for calculating the following test values
Define relative risk. From what type of studies can it be calculated?
What is a clinically significant value for relative risk?
Relative risk compares the disease risk in people exposed to a certain factor with the disease risk in people who have not been exposed to the factor.
Relative risk can be calculated only after prospective or experimental studies; it cannot be calculated from retrospective data.
- If a question asks you to calculate the relative risk from retrospective data, the answer is “cannot be calculated” or “none of the above.”
Any value for relative risk other than 1 is clinically significant.
- If the relative risk is 1.5, a person is 1.5 x more likely to develop the condition if exposed to the factor in question.
- If the relative risk is 0.5, the person is only half as likely to develop the condition when exposed to the factor; in other words, the factor protects the person from developing the disease.
Define odds ratio. From what type of studies is it calculated?
What is a clinically significant value for odds ratio?
OR attempts to estimate relative risk with retrospective studies (remember that relative risk can be calculated only from prospective or experimental studies; not from retrospective studies). OR compares the
- incidence of disease in persons exposed to the factor and
- incidence of nondisease in persons not exposed to the factor with the
- incidence of disease in persons not exposed to the factor and
- incidence of nondisease in persons exposed to the factor
to see whether there is a difference between the two.
As with relative risk, values other than 1 are significant.
What do you need to know about standard deviation (SD) for the USMLE?
You need to know that with a normal or bell-shaped distribution:
- 1 SD holds 68% of the values
- 2 SD hold 95% of the values
- 3 SD hold 99.7% of the values
Define mean, median, and mode.
Mean is the average value
Median is the middle value once all the values are lined up
Mode is the most common value
What is a skewed distribution?
How does it affect mean, median, and mode?
A skewed distribution implies that the distribution is not normal; in other words, the data do not conform to a perfect bell-shaped curve. Because they are not normal distributions, SD and mean are less meaningful values.
Positive skew is an asymmetric distribution with an excess of high values, in other words, the tail of the curve is on the right (mean > median > mode) (Fig. 3-2).
Negative skew is an asymmetric distribution with an excess of low values, in other words, the tail of the curve is on the left (mean < median < mode).
Define test reliability.
How is it related to precision?
What reduces reliability?
Reliability measures the reproducibility + consistency of a test; synonymous with precision.
- For example, if the test has good interrater reliability, the person taking the test will get the same score if two different people administer the same test.
Random error reduces reliability and precision (e.g., limitation in significant figures).
Define test validity.
How is it related to accuracy?
What reduces validity?
Validity measures the trueness of measurement, in other words, whether the test measures what it claims to measure; synonymous with accuracy
- For example, if you give a valid IQ test to a genius, the test should not indicate that he or she is challenged. Systematic error reduces validity and accuracy (e.g., when the equipment is miscalibrated).
Define correlation coefficient.
What is the range of its values?
A correlation coefficient measures to what degree two variables are related. The value of the correlation coefficient ranges from −1 to +1.
True or false: A correlation coefficient of −0.6 is a stronger correlation coefficient than +0.4
True.
To determine the strength of the relationship between two variables, look at the distance of the value from zero.
- 0 correlation equals no association whatsoever; the two variables are totally unrelated.
- 1 equals a perfect positive correlation (when one variable increases, so does the other)
- 1 equals a perfect negative correlation (when one variable increases, the other decreases).
Therefore, use the absolute value to give you the strength of the correlation (e.g., −0.3 is equal to +0.3).