Trial Design and Biostatistics Flashcards
- A quality improvement (QI) initiative is implemented to decrease falls in patients recently discharged from the hospital. As the pharmacist champion at a pharmacy benefit manager (PBM), you have worked with the formulary team to ensure that medications with a high risk of falls have criteria for use. Which factor will be most important in showing the effectiveness of this QI initiative?
A. Obtaining informed consent for participation in
the QI initiative.
B. Identifying patients at risk for falls.
C. Creating a community advertising campaign to
bring awareness to the initiative.
D. Determining the social value of the QI initiative
- Answer: B
A QI study would likely show effectiveness using a pre/
post-intervention cohort design. In this type of study,
informed consent is generally not required because the
treatment is provided to all patients as a standard of care
(Answer A is incorrect). A community advertising campaign might improve the delivery of care and improve
patient adherence, but it is unnecessary to measure the
effectiveness of the QI initiative (Answer C is incorrect).
The QI initiative is believed necessary and has therefore
been deemed to have intrinsic social value (Answer D is
incorrect). To show the effectiveness of the intervention,
populations at risk for falls must be recognized before initiating an intervention (Answer B is correct).
- A study seeks to determine the impact of adverse drug
events on patient outcomes in FDA-approved drugs.
Which would be the best approach to conducting this
study?
A. A randomized controlled clinical trial (RCT) with
a test for continuous variables to determine the
difference in outcomes.
B. A retrospective case-control study with a test
for proportions to determine the difference in
outcomes.
C. A prospective observational study with survival
analysis to determine the difference between
cohorts.
D. A retrospective cohort study with a test for proportions to determine the difference in outcomes.
- Answer: C
To effectively determine the incidence and clinical impact
of adverse drug events on clinical outcomes, it would be
unethical to randomize patients to experience the event
(Answer A is incorrect). A retrospective design would
not be ideal because of the limitations in data extraction, assignment of events, and interpretation of causality
(Answers B and D are incorrect). A prospective observational design would allow the investigator team to identify
the incidence of events and sequential events and determine causality (Answer C is correct)
- A case-control study is conducted to determine
whether proton pump inhibitor (PPI) use is associated
with an increased risk of developing C. difficile infection (CDI). The final analysis shows the odds ratio (OR) for CDI with PPI exposure to be 1.3 (95% confidence interval [CI], 0.8–1.5). Which best describes the results?
A. PPI exposure increases the risk of CDI by 130%.
B. PPI exposure reduces the risk of CDI by 30%.
C. PPI exposure increases the risk of CDI by 30%.
D. PPI exposure is not associated with an increased
risk of CDI.
- Answer: D
A correct interpretation of the results is recognizing that
even though the OR suggests an associated increase of 30%
in the risk of being exposed to CDI, the 95% CI crosses 1,
meaning that the odds of exposure to a PPI are as likely to
increase the risk as to decrease the risk of developing CDI
(Answer D is correct; Answers A–C are incorrect)
Questions 4 and 5 pertain to the following case:
A retrospective case-control study was conducted to assess
peripheral neuropathy in patients taking fluoroquinolone
antibiotics. Risk factors for neuropathy were not assessed
(however, both the cases and the controls of this study had
identical diagnostic evaluations and were stratified according to the duration of quinolone use before the onset of
neuropathy).
- Which type of bias is this study design most susceptible to?
A. Confounding by indication
B. Recall bias
C. Diagnostic bias
D. Misclassification - Which factor will be most affected by the type of bias
likely to occur in this study?
A. External validity
B. Internal validity
C. Assessment of exposure
D. Number of patients needed for the study
- Answer: B
Recall bias is always a potential concern for case-control
studies because of the time that passes between the study
and the drug “ingestion.” Because risk factors were not
included in the study design, this is of concern (Answer B
is correct). Although a study may be susceptible to many
types of bias, the other choices would not pose as much risk
(if any) as recall bias (Answers A, C, and D are incorrect) - Answer: B
Internal validity is greatly jeopardized because the study
is not designed to protect against this possible bias.
Case-control studies are an inappropriate design to access
the frequency of an event. In a sense, this design flaw
eventually jeopardizes external validity (how well does a
study apply to other patients with this condition/disease?),
but a lack of internal validity is most essentially affected
(Answer B is correct). The other answers can adequately
be controlled for in the design and conduct of the study
(Answers A, C, and D are incorrect).
- When describing the results of an RCT, the investigators report using an intention-to-treat analysis to
analyze their data. The results of their investigation
comparing two antivirals for influenza-like illness
show no difference in the number of hospitalizations
for influenza between the treatment groups. Given
their method of data analysis, which statement is most
appropriate?
A. May be susceptible to issues regarding recall bias.
B. Provides a good measure of effectiveness under
typical clinical conditions.
C. Cannot estimate the method’s effectiveness.
D. May overestimate the actual treatment effect
- Answer: B
Intention-to-treat analysis is a comparison of the treatment
groups that includes all patients as originally allocated
after randomization. This is the recommended method
to estimate effectiveness under typical clinical conditions
and avoid any bias. Per-protocol analysis is a comparison
of treatment groups that includes only patients who completed the treatment originally allocated. If performed
alone, per-protocol analysis leads to bias (Answer B is
correct; Answer C is incorrect). Intention-to-treat analysis
is the most common approach to data analysis for RCTs
and may underestimate the treatment effect (Answer D is
incorrect). Recall bias is not a concern with RCTs (Answer
A is incorrect).
Questions 7 and 8 pertain to the following case:
A prospective randomized study compared furosemide
with hydrochlorothiazide when treating patients with newly
diagnosed hypertension. One of the study’s end points was
myocardial infarction one year after treatment initiation.
The following table summarizes myocardial infarction
rates in all patients.
Furosemide Hydrochlorothiazide
All patients,
n (%) 17 / 332 (5.1) 21 / 274 (7.7)
7. The 95% CI for the difference in myocardial infarction
rates between the two groups was −3.2% to 10.3%.
Which conclusion is most appropriate?
A. Furosemide is superior to hydrochlorothiazide.
B. Superiority of furosemide could not be established over hydrochlorothiazide.
C. Hydrochlorothiazide is inferior to furosemide.
D. No conclusion can be drawn because p values are
unavailable.
8. According to the data and the result obtained, which
best represents the number of patients who would need
to be treated with furosemide to prevent one case of
myocardial infarction?
A. Number needed to treat (NNT) would be 3.
B. NNT would be 39.
C. NNT would be 250.
D. NNT should not be calculated because the result
was nonsignificant
- Answer: B
The CI of the difference in myocardial infarction rate
between the two groups includes zero; thus, there is no
statistically significant difference between the two groups
(Answer B is correct). Answers A and C are incorrect
because the 95% CI contains zero and is therefore not
statistically significant. Answer C is incorrect because
insufficient information is provided. Answer D is incorrect
because all the previously stated information can be determined without the benefit of reported p values - Answer: D
Answers A and C are incorrect calculations. Calculating
the NNT to prevent one recurrence using furosemide is
as follows: 0.051 − 0.077 = 0.026 and 1/0.026= 38.46, or
around 39. However, the NNT should not be calculated
when the end point of interest is nonsignificant (Answer B
is incorrect; Answer D is correct).
- An RCT assesses the effects of different inhaled
medication regimens on pulmonary function in three
groups of subjects with chronic obstructive pulmonary
disease. Groups will be randomly assigned to 1) longacting muscarinic antagonist alone, 2) long-acting
muscarinic antagonist combined with a long-acting
beta-2 agonist, or 3) long-acting beta-2 agonist combined with an inhaled corticosteroid. Study outcomes
will be assessed 3 months after randomization.
Investigators wanted to assess pulmonary function
with the Global Initiative for Chronic Obstructive
Lung Disease (GOLD) system, which is an ordered
scale from 1 to 4. Which statistical test is most appropriate to assess differences in functional classification
between the groups?
A. Kruskal-Wallis test
B. Wilcoxon signed-rank test
C. Analysis of variance (ANOVA)
D. Analysis of covariance (ANCOVA
- Answer: A
The GOLD system is an ordinal scale from 1 (mild) to 4
(very severe). Neither ANOVA nor ANCOVA is appropriate for ordinal or noncontinuous data (Answers C and D are incorrect). The Wilcoxon signed-rank test is an appropriate nonparametric test to use for paired ordinal data,
such as the change in GOLD score over time on the same
person (Answer B is incorrect). The Kruskal-Wallis test
is the nonparametric analog of a one-way ANOVA and is
appropriate for this analysis (Answer A is correct).
- You are evaluating a randomized, double-blind, parallel group-controlled trial that compares four different
antihypertensive regimens for their effect on systolic
blood pressure in patients with refractory hypertension:
* Regimen 1: hydrochlorothiazide/diltiazem/
metoprolol
* Regimen 2: furosemide/diltiazem/metoprolol
* Regimen 3: diltiazem/metoprolol/enalapril
* Regimen 4: hydrochlorothiazide/metoprolol/
enalapril
The authors conclude that regimen 1 is better than
regimen 2 (p<0.05) and that regimen 4 is better than
regimen 1 (p<0.01), but no difference is observed
between any other regimens. The investigators used
an unpaired (independent sample) t-test to test the
hypothesis that each antihypertensive regimen was
equivalent. Which statement is most appropriate?
A. Investigators used the appropriate statistical test
to analyze their data.
B. Regimen 4 is the most effective of the tested antihypertensive regimens.
C. ANOVA would have been a more appropriate test.
D. A paired t-test is a more appropriate test
- Answer: C
You cannot determine which finding is more important
(in this case, the best antihypertensive regimen) based on
the p value (i.e., a lower p value does not mean it’s more
important; Answer B is incorrect). All statistically significant results are interpreted as significant without respect
to the size of the p value. This trial had four independent
samples, and the unpaired (independent samples) t-test is
not appropriate because it requires several unnecessary
tests (multiple comparisons) and increases the chances
of making a type I error (Answer A is incorrect). In this
setting, ANOVA is the correct test (Answer C is correct),
followed by a multiple comparisons procedure to determine where the actual differences between the groups lie.
A paired t-test is inappropriate because this is a parallelgroup trial (Answer D is incorrect). Using ANOVA in this
case assumes a normal distribution and equal variance in
each of the four groups
- In the results of a randomized, double-blind, controlled clinical trial, the difference in 10-year mortality
between the intervention group and the control group
is 6% (p=0.01), and it is concluded that there is a statistically significant difference between the groups.
Which statement is most consistent with this finding
and conclusion?
A. The chance of making a type I error is 5 in 100.
B. The trial does not have enough power.
C. There is a high likelihood of having made a type
II error.
D. The chance of making a type I error is 1 in 100
- Answer: D
The typical a priori error (type I) rate is 5% (i.e., when the
study was designed, the error rate was designed to be 5%
or less). The actual type I error rate is reported in the question as 0.01 (1%) (Answer A is incorrect). Information is
insufficient to select Answer B (Answer B is incorrect). A
type II error was not made because this error has to do with
not finding a difference when one truly exists (Answer C is
incorrect). In this question, the type I error rate is 1%, the
value of the p value (Answer D is correct)
- Researchers planned a study to evaluate the percentage of subjects who were cured from hepatitis C virus
(HCV) infection with direct-acting antivirals (DAAs)
compared with interferon. In the study of 1000 subjects, the DAA group (n=500) and the interferon group
(n=500) were compared. The investigators used HCV
cure as their primary end point. Which is the most
appropriate statistical test to answer such a question?
A. Independent-samples t-test
B. Chi-square or Fisher exact test
C. Wilcoxon signed-rank test
D. One-sample t-test
- Answer: B
The primary end point in this study, the percentage of
subjects cured of HCV, is nominal data. This type of data
requires either a chi-square test or a Fisher exact test,
depending on the sample size or, more accurately, the
number of counts in the individual contingency table cells.
(Answer B is correct). An independent-samples t-test is
not appropriate because continuous data (e.g., viral load)
is not being compared for the primary end point (Answer
A is incorrect). If we were comparing viral load between
the two groups, the test might be appropriate if parametric
assumptions were met. The Wilcoxon signed-rank test is
the appropriate nonparametric test for comparing paired
samples (usually in a crossover trial; Answer C is incorrect). Finally, a one-sample t-test is used to compare the
mean of a single group with the mean of a reference group.
This is also incorrect in this situation because two groups
are being compared (Answer D is incorrect)
- An investigational drug is being compared with an
existing drug for the treatment of diabetes in subjects
> 65 years of age. The study is designed to detect a
minimum 20% difference in response rates between
the groups, if one exists, with an a priori alpha (α)
of 0.05. The investigators believe that a smaller difference (10% difference vs 20% difference) would be
more clinically meaningful. In revising their study,
they decide they want to be able to detect a minimum
10% difference in response. Which change to the
study parameters is most appropriate?
A. Increase the sample size.
B. Change the study design to compare the investigational drug with placebo.
C. Select an alpha (α) of 0.01 as a cutoff for statistical significance.
D. Decrease the sample size.
- Answer: A
Detecting the smaller difference between the treatments
requires more power. Power can be increased in several
different ways. Answer A is correct because the most common approach is to increase the sample size. Answer D is
incorrect because smaller sample sizes diminish a study’s
ability to detect differences between groups. Power can
also be increased by increasing the α value, but doing so
increases the chances of a type I error. Answer C decreases
α, thus making it more difficult to detect differences
between groups. Although the subjects are older patients
with diabetes, Answer B is likely unethical because treatment options for diabetes are available and thus should not
be withheld from study subjects
- You are designing a new computer alert across your
clinic system for patients on anticoagulants at high
risk for bleeding. You want to develop a model to predict which patients are most likely to have a bleed.
You plan to retrospectively assess the presence or
absence of several different variables based on data
from a cohort across your clinic system. The comparison will consist of all patients on anticoagulants who
had bleeds compared with all other patients on anticoagulants over the last 3 years. Which technique will be
most useful in completing such an analysis?
A. Correlation
B. Kaplan-Meier curve
C. Regression
D. ANOVA
- Answer: C
Regression analysis is the most effective way to develop
models to predict outcomes or variables (Answer C is correct). There are many different types of regression, but all
share the ability to evaluate the impact of multiple variables
simultaneously on an outcome variable. Correlation analysis is used to assess the association between two (or more)
variables, not to make predictions (Answer A is incorrect).
Kaplan-Meier curves are used to graphically depict survival curves or time to an event (Answer B is incorrect).
Although ANOVA can adjust for covariates, it is limited
on the number of variables. In addition, ANOVA evaluates
continuous outcomes, and the outcome in the question is
categorical (Answer D is incorrect)