Biostats Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

How is the sensitivity of a test defined? What are highly sensitive tests used for clinically?

A

SNOUT SN detects Dz True POS/(true+false POS) Screening Sensitivity is defined as the ability of a test to detect disease, and mathematically as the number of true positives divided by the number of people with the disease. Tests with high sensitivity are used for disease SCREENING. False positives occur, but the test does not miss many people with the disease (low false-negative rate). One way to remember this is the word snout, written “Sn-N-out,” meaning with high sensitivity (Sn) a negative (N) test rules out (out) the disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is the specificity of a test defined? What are highly specific tests used for clinically?

A

SPIN SP detects Health True NEG/(false +true NEG) Confirmation Specificity is defined as the ability of a test to detect health (or nondisease), and mathematically as the number of true negatives divided by the number of people without the disease. Tests with high specificity are used for disease confirmation. False negatives occur, but the test does not identify anyone who is actually healthy as sick (low false-positive rate). The ideal confirmatory test must have high sensitivity and high specificity; otherwise, people with the disease may be identified as healthy. One way to remember this is the word SPIN written “Sp-P-in,” meaning that with high specificity (Sp) a positive (P) test rules in (in) the disease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain the concept of a trade-off between sensitivity and specificity.

A

The trade-off between sensitivity and specificity is a classic statistics question. For example, you should understand how changing the cutoff glucose value in screening for diabetes (or changing the value of any of several screening tests) will change the number of true- and false-negative and true- and false-positive results. If the cutoff glucose value is raised, fewer people will be identified as diabetic (more false negatives, fewer false positives), whereas if the cutoff glucose value is lowered, more people will be identified as diabetic (fewer false negatives, more false positives). As an example, if the diagnostic threshold for a fasting blood sugar for diabetes were raised from ≥125 mg/dL to ≥300 mg/dL, most people with diabetes would be missed (low sensitivity because a patient with blood sugar of 285 mg/dL would be negative for diabetes according to this criterion). In addition, the test would be very specific for patients with blood sugar ≥300 mg/dL (a patient would certainly have diabetes if he had a positive test).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define positive predictive value (PPV). On what does it depend?.

A

How likely diseased when test is positive NPV=True Pos/ All Pos Depends on prevalence and SN/SP When a test is positive for disease, the PPV measures how likely it is that the patient has the disease (probability of having a condition given a positive test). PPV is calculated mathematically by dividing the number of true positives by the total number of people with a positive test. PPV depends on the prevalence of a disease (the higher the prevalence, the higher the PPV) and the sensitivity and specificity of the test (e.g., an overly sensitive test that gives more false positives has a lower PPV).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define negative predictive value (NPV). On what does it depend?

A

How likely healthy when test is negative NPV=True Neg/ All Neg Depends on prevalence and SN/SP When a test is negative for disease, the NPV measures how likely it is that the patient is healthy and does not have the disease (probability of not having a condition given a negative test). It is calculated mathematically by dividing the number of true negatives by the total number of people with a negative test. NPV also depends on the prevalence of the disease and the sensitivity and specificity of the test (the higher the prevalence, the lower the NPV). In addition, an overly sensitive test with many false positives leads to a higher NPV.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define attributable risk. How is it measured?

A

Attributable risk is the number of cases of a disease attributable to one risk factor (in other words, the amount by which the incidence of a condition is expected to decrease if the risk factor in question is removed). For example, if the incidence rate of lung cancer is 1:100 in the general population and 10:100 in smokers, the attributable risk for smoking in causing lung cancer is 9:100 (assuming a properly matched control group).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Given the 2 × 2 table in the following table, define the formulas for calculating the test values indicated. + - + A|B - C|D

A

SN = A/A+C SP = D/B+D PPV = A/A+B NPV = D/C+D Odds Ratio = AD/BC Relative risk = (A/A+B)/(C/C+D) Attributed Risk = (A/A+B)-(C/C+D)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define relative risk. From what type of studies can it be calculated?

A

Relative risk compares the disease risk in people exposed to a certain factor with the disease risk in people who have not been exposed to the factor in question. Relative risk can be calculated only after prospective or experimental studies; it cannot be calculated from retrospective data. If a Step 3 question asks you to calculate the relative risk from retrospective data, the answer is “cannot be calculated” or “none of the above.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a clinically significant value for relative risk?

A

Any value for relative risk other than 1 is clinically significant. For example, if the relative risk is 1.5, a person is 1.5 times more likely to develop the condition if exposed to the factor in question. If the relative risk is 0.5, the person is only half as likely to develop the condition when exposed to the factor; in other words, the factor protects the person from developing the disease. RR >1 Risk factor RR <1 Protective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define odds ratio. From what type of studies is it calculated?

A

The odds ratio attempts to estimate relative risk with retrospective studies (e.g., case-control). An odds ratio compares two factors—(1) the incidence of disease in persons exposed to the factor and the incidence of nondisease in persons not exposed to the factor and (2) the incidence of disease in persons unexposed to the factor and the incidence of nondisease in persons exposed to the factor—to see whether there is a difference between the two. As with relative risk, values other than 1 are significant. The odds ratio is a less than perfect way to estimate relative risk (which can be calculated only from prospective or experimental studies).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do you need to know about standard deviation (SD) for the USMLE?

A

68-95-99.7 You need to know that for a normal or bell-shaped distribution, the mean ± 1 SD contains 68% of the values, the mean ± 2 SD contains 95% of the values, and the mean ± 3 SD contains 99.7% of the values. A classic question gives the mean and SD and asks what percentage of values will be above a given value. For example, if the mean score on a test is 80 and the SD is 5, 68% of the scores will be within 5 points of 80 (scores of 75 to 85) and 95% of the scores will be within 10 points of 80 (scores of 70 to 90). The question may ask what percentage of scores are over 90. The answer is 2.5% because 2.5% of the scores fall below 70 and 2.5% of the scores are over 90. Variations of this question are common.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Define mean, median, and mode.

A

The mean is the average value, the median is the middle value, and the mode is the most common value. A question may give several numbers and ask for their mean, median, and mode. For example, if the question gives the numbers 2, 2, 4, and 8: The mean is the average of the four numbers: (2 + 2 + 4 + 8)/4 = 16/4 = 4. The median is the middle value. Because there are four numbers, there is no true middle value. Therefore take the average between the two middle numbers (2 and 4), so the median = 3. The mode is 2, because the number 2 appears twice (more times than any other value). Remember that in a normal distribution, mean = median = mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a skewed distribution? How does it affect the mean, median, and mode?

A

A skewed distribution implies that the distribution is not normal; in other words, the data do not conform to a perfect bell-shaped curve. Positive skew is an asymmetric distribution with an excess of high values; in other words, the tail of the curve is on the right (mean > median > mode) (Fig. 1-3). Negative skew is an asymmetric distribution with an excess of low values; in other words, the tail of the curve is on the left (mean < median < mode). Because such distributions are not normal, the SD and mean are less meaningful values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define test reliability. How is it related to precision? What reduces reliability?

A

From a practical perspective, the reliability of a test is synonymous with its precision. Reliability measures the reproducibility and consistency of a test. For example, if the test has good interrater reliability, the person taking the test will get the same score if two different people administer the same test. Random error reduces reliability and precision (e.g., limitation in significant figures).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define test validity. How is it related to accuracy? What reduces validity?

A

From a practical perspective, the validity of a test is synonymous with its accuracy. Validity measures the trueness of measurement; in other words, whether the test measures what it claims to measure. For example, if a valid IQ test is administered to a genius, the test should not indicate that he or she has an intellectual disability. Systematic error reduces validity and accuracy (e.g., when the equipment is miscalibrated).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define correlation coefficient. What is the range of its values?

A

A correlation coefficient measures to what degree two variables are related. The value of the correlation coefficient ranges from −1 to +1.

17
Q

True or false: A correlation coefficient of −0.6 is stronger than a correlation coefficient of +0.4.

A

True. The important factor in determining the strength of the relationship between two variables is the distance of the value from zero. A correlation coefficient of 0 equates to no association whatsoever; the two variables are totally unrelated. A correlation coefficient of +1 equates to perfect positive correlation (when one variable increases, so does the other), whereas −1 corresponds to perfect negative correlation (when one variable increases, the other decreases). Therefore the absolute value indicates the strength of the correlation (e.g., the strength of −0.3 is the same as that of +0.3).

18
Q

Define confidence interval. Why is it used?

A

When you take a set of data from a subset of the population and calculate the mean, you may want to say that this is equivalent to the mean for the whole population. In fact, the two means are usually not exactly equal. A confidence interval of 95% (the value used in most medical literature before data are accepted by the medical community) indicates that there is 95% certainty that the mean for the entire population is within a certain range (usually 2 SD of the experimental or derived mean calculated for the subset of the population examined). For example, if the heart rate of 100 people is sampled and the mean is calculated as 80 beats per minute with an SD of 2, the confidence interval (also known as confidence limits) is written as 76 < X < 84 = 0.95. In other words, there is 95% certainty that the mean heart rate of the whole population (X) is between 76 and 84 (within 2 SD of the mean).

19
Q

When are a chi-squared test, t-test, and analysis of variance test used?

A

All of these tests are used to compare different sets of data. Chi-squared test: used to compare percentages or proportions (nonnumeric or nominal data) t-Test: used to compare two means Analysis of variance (ANOVA): used to compare three or more means

20
Q

What is the difference between nominal, ordinal, and continuous types of data?

A

Nominal data have no numeric value—for example, the day of the week. Ordinal data give a ranking but no quantification— for example, class rank, which does not specify how far number 1 is ahead of number 2. Most numerical measurements are continuous data— for example, weight, blood pressure, and age. This distinction is important because of question 19: chi-squared tests must be used to compare nominal or ordinal data, whereas a t-test or ANOVA test is used to compare continuous data.

21
Q

Birth rates

A

Live births/1000 population

22
Q

Fertility rate

A

Live births/1000 population

23
Q

Death rate

A

Deaths/1000 population

24
Q

Neonatal mortality rate

A

Neonatal deaths (first 28 days of life)/1000 live births

25
Q

Perinatal mortality rate

A

Neonatal deaths + stillbirths/1000 total births

26
Q

Infant mortality rate

A

Deaths (from 0 to 1 yr old)/1000 live births

27
Q

Maternal mortality rate

A

Maternal pregnancy-related deaths (deaths while pregnant or in the first 42 days after delivery)/100,000 live births

28
Q

What five types of studies should you know for the Step 3 exam?

A

From highest to lowest quality and desirability: (1) experimental studies, (2) prospective studies, (3) retrospective studies, (4) case series, and (5) prevalence surveys.

29
Q

What are experimental studies?

A

Experimental studies are the gold standard. They compare two equal groups in which one variable is manipulated and its effect is measured. Experimental studies use double blinding (or at least single blinding) and well-matched controls to ensure accurate data. It is not always possible to perform experimental studies because of ethical concerns.

30
Q

What are prospective studies? Why are they important?

A

Prospective studies (also known as observational, longitudinal, cohort, incidence, or follow-up studies) involve choosing a sample and dividing it into two groups based on the presence or absence of a risk factor and following the groups over time to see what diseases they develop. For example, individuals with and without asymptomatic hypercholesterolemia may be followed to determine if those with hypercholesterolemia have a higher incidence of myocardial infarction later in life. The relative risk and incidence can be calculated from this type of study. Prospective studies are time consuming and expensive but practical for common diseases.

31
Q

What are retrospective studies? Discuss their advantages and disadvantages.

A

Retrospective (case-control) studies choose population samples after the fact according to the presence (cases) or absence (controls) of disease. Information can be collected about risk factors. For example, you can compare individuals with lung cancer and individuals without lung cancer to determine if those with lung cancer smoked more before they developed lung cancer. In a retrospective study an odds ratio can be calculated, but true relative risk cannot be calculated and incidence cannot be measured. Compared with prospective studies, retrospective studies are less expensive, less time consuming, and more practical for rare diseases.

32
Q

What is a case series study? How is it used?

A

A case series study simply describes the clinical presentation of people with a certain disease. This type of study is good for extremely rare diseases (as are retrospective studies) and may suggest a need for a retrospective or prospective study.

33
Q

What is a prevalence survey? How is it used?

A

A prevalence (cross-sectional) survey looks at the prevalence of a disease and of risk factors. When used to compare two different cultures or populations, a prevalence survey may suggest a possible cause of a disease. The hypothesis then can be tested in a prospective study. For example, researchers have found a higher prevalence of colon cancer and a diet higher in fat in the United States versus a lower prevalence of colon cancer and a diet lower in fat in Japan.

34
Q

What is the difference between incidence and prevalence?

A

Incidence is the number of new cases of a disease in a unit of time (generally 1 year, but any time frame can be used). The incidence of a disease is equal to the absolute (or total) risk of developing a condition (as distinguished from relative or attributable risk). Prevalence is the total number of cases of a disease (new or old) at a certain point in time.

35
Q

If a disease can be treated only to the point that patients can be kept alive for longer without being cured, what happens to the incidence and prevalence of the disease?

A

This is the classic question about incidence and prevalence on the Step 3 exam. Nothing happens to the incidence (the same number of people contract the disease every year), but the prevalence will increase because individuals with the disease live longer. For short-term diseases (e.g., influenza) the incidence may be higher than the prevalence, whereas for chronic diseases (e.g., diabetes or hypertension) the prevalence is greater than the incidence.

36
Q

Define P-value.

A

The significance of the P-value is high yield in the Step 3 exam. If P < 0.05 for a set of data, there is a less than 5% chance (0.05 = 5%) that the data were obtained by random error or chance. If P < 0.01, the chance is less than 1%. For example, if the blood pressure in a control group is 180/100 mmHg but falls to 120/70 mmHg after drug X is given, P < 0.10 means that the chance that this difference is due to random error or chance is less than 10%. It also means, however, that the chance that the result is random and unrelated to the drug may be as high as 9.99%. A value of P < 0.05 is generally used as the cutoff for statistical significance in the medical literature.

37
Q

What three points about the P-value should be remembered for the Step 3 exam?

A
  1. A study with a value of P < 0.05 may still have serious flaws 2. A low P-value does not imply causation. 3. A study that has statistical significance does not necessarily have clinical significance. For example, if drug X can lower blood pressure from 130/80 to 129/80 mmHg with P < 0.0001, drug X is unlikely to be used because the result is not clinically important given the minimal blood pressure reduction, the costs, and probable side effects.