HYMR Biostatistics Flashcards
```
~~~
Which of the following best describes the proper use of an odds ratio (OR)?
A. Estimates the relative risk in retrospective studies
B. Estimates the absolute risk reduction in prospective studies
C. Use of the odds ratio in prospective studies will underestimate the risk
D. An odds ratio is used in a prospective study to show the strength of the relationship between 2 variables
Answer: A - Estimates the relative risk in retrospective studies
- OR is an estimate of the relative risk (RR) since the number of patients at risk for the condition is not always known, thus preventing the calculation of incidence
- OR are used in retrospective studies (ex. case-controlled studies), RR are used in prospective studies
A group of investigators have designed a study to determine if ezetimibe (Zetia) was more effective than cholestyramine (Questran) for the treatment of hyperlipidemia. They designed the study so that 500 patients would be randomly assigned to one of 2 groups: ezetimibe 10 mg once daily or cholestyramine 4 g by mouth twice a day with meals. At baseline, patients had fasting lipid profile done. These patients were then prospectively followed for 6 months. At the end of the study (6-months) the patients had a follow up visit where fasting lipid profiles completed. The primary endpoint was to determine the change in LDL-c (mg/dL) from baseline. The secondary endpoints included: 1-The proportion of patients developing cholestasis or needing a cholecystectomy. Which of the following statistical tests would you recommend for answering the primary objective (assume sample or population studied is normally distributed)?
A. Student’s t-test
B. Paired t-test
C. Chi Square
D. ANOVA
A. Student’s T-test
Which of the following best describe alpha?
A. The level or chance of making a Type II error
B. The ability to predict the effect of one variable on another
C. The level or chance of making a Type I error
D. The degree of external validity
C. The level/chance of making a Type I error
Type 1 error occurs when the study/investigators say there is a difference between two studied interventions, but in reality there is no difference at all
The alpha value is determined a-priori and is most commonly set to 0.05 (meaning there is a 5% chance the study will make a Type 1 error)
Answer choice A refers to beta. As beta decreases the power of the study increases and the chance of making a Type II error decreases.
Which of the following provides the best interpretation of a p=0.001 (assuming alpha= 0.05)?
A. I am 99.9% confident that the results of the is study are clinically significant
B. I am 0.1% confident that the results of this study are clinically significant
C. There is a 99.9% chance that the results of this study are due to chance
D. There is 0.1% chance that the results of this study are due to random error
D. There is 0.1% chance that the results of this study are due to random error
*P-value NEVER indicates “clinical” significance.
*P-value ONLY helps you know whether the results that were found in the study were due to random error or due to chance alone
*P-value tells you how much of the sample data supports the null hypothesis to be true (ie are results due random error or chance alone)
True/False: A “p-value” helps the investigator determine the degree of clinical significance found in the study?
A. True
B. False
B. False
Which one of the following best defines Power= 1- Beta?
A. The probability that you will make a Type 1 error
B. The probability that you NOT make a Type II error
C. To estimate the percentage of baseline risk that was removed because of the treatment used
D. The strength of the relationship between 2 variables
B. The probability that you NOT make a Type II error
- The first answer choice is what an alpha value determines. Note: most
clinicians/researchers will accept a 5% chance (or alpha = 0.05) that when we do a study that the results we found are not true or accurate.
*The second answer choice is correct because of the word “not” in the answer. Since Beta (B) by definition is the probability of a Type II error, then 1-Beta is the probability of NOT making a Type II error. Note: the larger the Beta the lower the power of study and greater chance of making a Type II error. Type II error occurs when a study states there is no difference between the groups assessed but in reality there is a difference. Any time a study fails to find a statistical difference between two groups the greater the chance that a Type II error has occurred.
- The third answer choice reflects the relative risk evaluation.
- The last answer choice reflects the correlation coefficient (r).
A simple linear regression can be utilized to predict the outcome of the following?
A. There is a single independent and single dependent variable
B. There is more than one independent and only one dependent variable
C. There is a single dichotomous independent variable
D. There are two unrelated independent variable
A. There is a single independent and single dependent variable
*Regression analysis is a mathematic description or equation that provides “predictability” of one variable on another variable
*For simple linear regression analysis, this usually analyzes the relationship of two variables (1 being an independent variable and other being dependent variable)
*Multi-linear regression: relationship between >1 independent variable and 1 dependent variable
A prospective, clinical trial comparing two treatment regimens and their ability to prevent the development of a DVT was done in post-operative patients who underwent a total hip replacement (THR). Patients were randomized to receive either warfarin at 5 mg by mouth daily (n=500) starting the day of surgery or enoxaparin 40 mg SC once daily (n=500) starting the day of surgery. Both treatment options were used for a total of 5 consecutive days. Fifty patients receiving warfarin developed a DVT, whereas only 25 patients receiving enoxaparin developed a DVT at the two week follow up evaluation. What is the absolute risk reduction for patients receiving enoxparin in this study?
A. 0.05
B. 0.10
C. 0.15
D. 0.25
A. 0.05
- First you must calculate the relative risk for each group. The warfarin group: 50/500 = 0.1. The enoxparin group: 25/500 = 0.05 (note this is half of the warfarin group, no calculation even needed).
- Now, subtract the two to find the difference giving an absolute risk reduction (ARR) = 0.1-0.05 -0.05.
*ARR is the difference in risk of an outcome between patients who received treatment and those of another treatment (or control group).
*Note: ARR is used to calculate number needed to treat (NNT), NNT = 1/ARR or 1/0.05= 20. You would need to treat 20 post-THR patients w/ enoxaparin for 5 days to prevent 1 patient from getting a DVT.
Which of following best describes a 95% confidence interval?
A. The width of the confidence intervals is not dependent on the standard error of the mean (SEM)
B. It provides you the predictive properties of a test
C. If you were to repeat the study under ideal conditions, you would be 95% confident that your result would fall within that confidence interval.
D. It helps you to determine if your result was due to chance or random error
C. If you were to repeat the study under ideal conditions, you would be 95% confident that your result would fall within that confidence interval.
*The wider the confidence interval, the more likely that the “true” population value will be anywhere within that interval. HOWEVER, a wide confidence interval may indicate a problem w/ the study or data obtained. One of the main reasons for a wide CI is insufficient sample size.
*A: dependent and influenced by the standard error of the mean (SEM)
B: reflects of positive or negative predictive value for a test result
D: describes what a p-value is helping you to determine
See Q10 picture.
A clinical trial was completed and a review of the data from the study is being analyzed to determine the right statistical test that should be used. Based on the graphic representation of the data, which of the following statements is most accurate?
A. The data is considered to be homogenous and can be analyzed by using parametric statistical analysis
B. The mean is greater than the median
C. The data is negatively skewed because the median is larger than the mean
D. The data is positively skewed because the median is smaller than the mean
C. The data is negatively skewed because the median is larger than the mean
Which of the following statistical tests would be considered non-parametric?
A. 1-way ANOVA
B. Student’s T-test
C. Fisher’s Exact
D. Paired T-test
C. Fisher’s Exact
See Q12 table
A group of investigators have designed a study to determine if rosuvastatin (Crestor) was more effective than atorvastatin (Lipitor) for the treatment hyperlipidemia. They designed the study so that a 1000 patients would be randomly assigned to one of 2 groups: rosuvastatin 40 mg once daily or atorvastatin 40 mg once daily. At baseline, patients had fasting lipid profile done and were assessed for myopathy (muscle aches/pain). These patients were then prospectively followed for 3 months. At 3 months the patients had a follow up visit where fasting lipid profiles and patient assessments on myopathy (none, mild, moderate, or severe pain) were completed. The primary endpoint was to determine the change in LDL-c (mg/dL) from baseline. The secondary endpoints included: The proportion of patients achieving their LDL-c goal per guidelines and the patient’s rating of myopthy. What type of data is the primary endpoint?
A. Nominal
B. Ordinal
C. Continuous
D. Categorical
C. Continuous
- This question is specifically asking about the primary endpoint and thus you should focus on that specific wording in the case.
In this case, the investigators specifically want to know the change in LDL-C or the amount of lowering in the LDL-c in mg/dL. This is a concentration and each mg/dL is the same in magnitude and thus considered to be continuous (note: the change can be to infinity, there is no ranking or scale here).- If the case had said the proportion of patients who achieved an LDL-c goal of < 100 mg/dL, then the investigators would be looking at this endpoint as a “yes” or “no” question (ie., the patient either achieved an LDL-c goal of less than 140 or they did not), thereby making it nominal or categorical.
- If the investigators had defined the LDL-c into a sense of order or ranking such as <100 (mild), 100-130 (moderate), or > 130 (severe) then the endpoint would be treated as ordinal data.
High-Yield Core Concept:
* It is always important to read the endpoint in question carefully to see the subtle difference in wording as it changes everything as it relates to the type of data it is assigned. Board exams like to do this so make sure you understand this concept.
High-Yield Fast Fact:
* Nominal data is considered non-numerical data but is also sometimes referred to as “binary” or “dichotomas” data since it is either one thing or another (i.e., two options). For example, the use of mortality as an endpoint of the study. At the end of the study the subject is either dead or alive. There is no in-between.
A prospective randomized controlled trial has set out to determine if warfarin caused more GI bleeds than placebo This study contained 2 groups independent of each other. Group A was assigned to take warfarin 2.5 mg by mouth once a day Group a was assigned to take placebo by mouth once a day. After 12 months of therapy, the incidence of Gl bleeds in Group A was 0.41 and Group B it was 0.28. Which of the following best describes the results of this data?
A. The relative risk is 0.68
B. About 32% of the risk for GI bleed was from the use of warfarin
C. The relative risk is 0.53
D. There was a 46% excess risk in GI bleeds with the use of warfarin
D. There was a 46% excess risk in GI bleeds with the use of warfarin
*This question requires that you calculate the relative risk (RR) which is the incidence of
the exposed group divided by the incidence non-exposed group. Therefore the RR
would be 0.41/0.28 = 146.
* Since the RR is greater than 1, the difference from or above “1.0’ is the “excess risk”,
which is 0.46 or 46%.
High-Yield Core Concept:
The relative risk (RR) is determined by calculating the incidence of the exposed group
divided by the incidence non-exposed group
A prospective trial has set out to determine if warfarin (Coumadin) caused more minor bleeding than placebo his study contained 2 groups independent of each other Group A took warfarin 2.5 mg by mouth daily. Group a took placebo by mouth once daily After 12 months of therapy, the incidence of bleeding in Group A was 0.41 and Group a it was 0.28. Which of the following best describes the results of this data?
A. The relative risk is 0.68
B. 32% of the risk for bleeding was from the use of warfarin
C. The relative risk is 0.53
D. There was a 46% excess risk in bleeding with the use of warfarin
D. There was a 46% excess risk in bleeding with the use of warfarin
*They gave you the incidence for the events in each group studied. Now all you have to do is calculate the relative risk (RR), which is the incidence of Group A divided by the incidence of Group B (RR = 0.41/028 = 1.46).
*A RR of 1 means there is no difference in risk at all. If the RR is greater 1 then there is an excess risk with that intervention (i.e., use of warfarin in this case).
- If the RR is less than 1, the intervention of interest will take away risk of the outcome.
Therefore, a RR of 1.46 means that patients taking warfarin have a 46% excess risk
compared to those taking placebo. Do not make the mistake of calculating the absolute risk reduction (AAR) on accident which is determined by 0.41 - 028 = 13.
Which of the following can have a significant impact on the power of a study?
A. Sample Size
B. Ethnicity of the subjects enrolled
C. Location of the study
D. Type of blinding method used
A. Sample Size
*Most common factors that impact the power of a study include: sample size and the a-priori setting of the alpha and beta values
A prospective study was done over 3 years to determine if a new antihypertensive (Drug A) would offer any benefit over placebo for the treatment of hypertension. 500 patients were randomized to one of 2 groups: Group A (250 patients total) Drug A 50 mg by mouth once daily or Group B (250 patients total) placebo by mouth once daily. The primary endpoint was the development of a stroke At the end of the trial, 88 patients in Group A had a stroke and 95 patients in Group B had a stroke What would be an appropriate statistical test for analyzing the primary endpoint of this study?
A. Fisher’s Exact
B. Chi-Square
C. Students t-test
D. 2-way ANOVA
B. Chi-Square
A study was completed and revealed a p-value between the two treatment groups to be p = 0.01. Assuming an alpha = 0.05, which of the following offers the best interpretation of this p value?
A. There is a 5% chance the study has made a type Il error
B. There is a 1% chance that the study results are due to random error
C. There is a 5% chance that the study results are due to chance alone
D. There is a 0.01% chance that the results are due to chance alone
B. There is a 1% chance that the study results are due to random error
A p-value only indicates the chance the results that were found are due to chance alone or random error. Thus the smaller the p-value, the more likely the results are real the p-value has nothing to do with *clinical significance’ or *clinical relevance”. That is determined by the researcher and/or reader of a study.