Statistics Flashcards
Levels of evidence: what is 1a?
Meta-analysis and RCTs
Levels of evidence: what is 1b?
At least 1 RCT
Levels of evidence: what is 2a?
One well-designed controlled trial which is not randomised
Levels of evidence: what is 2b?
One well-designed experimental trial
Levels of evidence: What is 3?
Evidence from case, correlation, and comparative studies
Levels of evidence: what is 4?
Evidence from a panel of experts
Define randomised control trial
Participants randomly allocated to intervention or control group (e.g. standard treatment or placebo)
Describe cohort study
Observational and prospective
Two (or more) are selected according to their exposure to a particular agent and followed up to see how many develop a disease or other outcome
The usual outcome measure is relative risk
Describe case-control study
Observational and retrospective
Patients with a particular condition are identified and matched with controls
Data is collected on a past exposure to a possible causal agent for the condition
The usual outcome measure is the odds ratio
Describe a cross-sectional study
Provides a snap shot in time, sometimes called a prevalence study
Provides weak evidence of cause and effect
Define relative risk (RR)
RR is the ratio of risk in the experimental group (experimental event rate, EER) to the risk in the control group (control event rate, CER)
Also sometimes called relative risk ratio
RR = EER / CER
A risk ratio of >1 means what?
The rate of an event is increased compared to the control
A risk ratio of <1 means what?
The rate of an event is decreased compared to control
Define incidence
Number of new cases per population in a given time period
Define prevalence
Total number of cases per population at a particular point in time
What is selection bias?
- Error in assigning individuals to groups leading to differences which may influence the outcome
Give some examples of selection bias
Sampling bias: subjects not representative of the population
Volunteer bias: people more at risk of a disease may volunteer for a study
Non-responder bias
What is recall bias?
Difference in the accuracy of the recollections retrieved by study participants
A particular problem in case-control studies
What is publication bias?
Failure to publish results as they showed a negative or uninteresting result
Define the P value
Probability of obtaining a result by chance at least as extreme as the one that was actually observed, assuming the null hypothesis is true
What is the null hypothesis?
Two treatments are equally effective
What is a type 1 error when testing the null hypothesis?
The null hypothesis is rejected when it is actually true (i.e. showing a difference between two groups when it doesn’t exist i.e. a false positive)
What is a type 2 error when testing the null hypothesis?
The null hypothesis is accepted when it is false (i.e. failing to spot a difference when one really exists i.e. a false negative)
What is the power of a study?
The probability of (correctly) rejecting the null hypothesis when it is false i.e. the probability of detecting a statistically significant difference
Power can be increased by increasing the sample size
What will influence which significance test you use?
Whether the data is parametric (something which can be measured, usually normally distributed) or non-parametric
Name two parametric tests
- Student’s t test (paired or unpaired)
- Pearson’s product-moment coefficient correlation
What is paired data?
Data obtained from a single group of patients e.g. measurement before and after an intervention
- Parametric and must be normally distributed
What is unpaired data?
Comes from two different groups of patients e.g. comparing response to different interventions in two groups
Name some non-parametric tests
- Mann-Whitney U test
- Wilcoxon signed-rank test
- Chi-squared test
- Spearman, kendall rank
When would you use Mann-Whitney U test?
- Non-parametric data which is unpaired
When would you use the wilcoxon signed-rank test?
- Non-parametric
- To compare two sets of observations on a single sample
When would you use the chi-squared test?
- Non-parametric
- Used to compare proportions or percentages
Define ‘numbers needed to treat’ (NNT)
A measure that indicates how many patients would require an intervention to reduce the expected number of outcomes by one
- Calculated by 1/absolute risk reduction
How do you calculate the absolute risk reduction?
The difference between the control event rate (CER) and the experimental event rate (EER)
EER = (number who had particular outcome with the intervention) / (total number who had the intervention)
CER = (number who had a particular outcome with the control) / (total number who had the control)
Define sensitivity
Proportion of patients with the condition who have a positive result
i.e. we need to know how often a test will be positive if a patient has the disease
True positives / (true positives + false negatives)
Define specificity
Proportion of patients without the condition who have a negative result
I.e. we need to know how often a test will be negative if the patient is healthy
True negatives / (true negatives + false positives)
Define positive predictive value
The chance that the patient has the condition if the diagnostic test is positive
True positives / (true positives + false positives)
Define negative predictive value
The chance that the patient does not have the condition if the diagnostic test is negative
True negatives / (true negatives + false negatives)
Define the likelihood ratio for a positive test result
How much the odds of the disease increase when a test is positive
Sensitivity / (1 - specificity)
Define the likelihood for a negative test result
How much the odds of a disease decrease when a test is negative
(1-sensitivity) / specificity
When is the mean used?
When the spread of the data is fairly similar
on each side of the mid point
e.g. when the data are “normally distributed”.
What is the disadvantage of using the mean as an average?
Skewed by extremes of data, so not giving a typical picture in this instance
The median is often better in these circumstances
When would you use the median?
When data is not symmetrical / i.e. is skewed distribution
When would you use the mode?
The most is the most common set of events, and so used when a label is needed for the most frequently occurring event
What does standard deviation mean?
- Used for data which is normally distributed
- Provides information on how much data varies around their mean
- How much a set of values is spread around the average
How much of the data values is within 1 standard deviation of the mean?
68.2%
How much of the data values is within 2 standard deviations of the mean?
95.4%
How much of the data values is within 3 standard deviations of the mean?
99.7%
What is a confidence interval?
Instead of simply wanting the mean value of a
sample, when you want a range that is likely to contain the true population value.
- A range (interval) in which we can be fairly sure (confident) that the “true value” lies.
What happens to the confidence intervals in larger studies?
The CI is narrower because of the larger sample size
What does a P value of 0.5 mean?
The probability of the difference having happened by chance is 0.5 in 1, or 50:50.
What does a P value of 0.05 mean?
The probability of the difference having happened by chance is 0.05 in 1, i.e. 1 in 20.
When do you use parametric tests?
To compare samples of normally distributed data
If the data is not normally distributed, do not use parametric tests
FSRH grading of evidence
- What is Grade A?
- Based on RCTs
- At least one meta-analysis or RCT or systematic review rated as 1++ and directly to the target population
FSRH grading of evidence
- What is Grade B?
- Based on other robust or experimental or observational studies
- Body of evidence including studies rated as 2++ directly applicable to the target population
FSRH grading of evidence
- What is Grade C?
- Evidence is limited but relies on expert opinion and endorsement of respected authorities
FSRH grading of evidence
- What is Grade D?
Evidence level 3 or 4 (case, correlation etc)
What are descriptive studies?
- Describing a population / giving a picture of what is happening
- e.g. case reports, case series, qualitative reports, surveys
What are analytic studies?
- Quantifies the relationship between two factors
- i.e. effect of an intervention (I) or exposure (E) on an outcome (O) in a population (P)
- To quantify this effect, we need a comparison group (C)
Was the intervention randomly allocated
- yes = RCT
- no = observational study (i.e. cohort, cross-sectional, case-control)
What are the 3 types of observational studies?
- Cohort (prospective: exposure -> outcome)
- Case-control (retrospective: exposure <- outcome)
- Cross-sectional (exposure and outcome simultaneously)
What are the advantaged of a cohort study?
- Ethically safe
- Subjects can be matched
- Can establish timings and directionality of events
- Administratively easier and cheaper than RCT
What are the disadvantages of a cohort study?
- Controls can be difficult to identify
- Exposure may be linked to a hidden confounder
- Blinding is difficult
- Randomisation not present
- For rare diseases, large sample sizes or long follow-up is necessary
What are the advantages of a case-control study?
- Quick and cheap
- Only feasible method for very rare disorders or those with long lad between exposure and outcome
- Fewer subjects needed
What are the disadvantages of a case-control study?
- Reliance on recall or records to determine exposure status
- Confounders
- Selection of control groups is difficult
- Potential bias: recall and selection
What are the advantages of a cross-sectional study?
- Cheap and simple
- Ethically safe
What are the disadvantages of a cross-sectional study?
- Establishes association at most, not causality
- Recall bias susceptibility
- Confounders may be unequally distributed
- Group sizes may be unequal
What are the clinical trial phases?
0 - human microdosing studies
1 - healthy people
2 - people with relevant illness in lab setting
3 - people with illness in clinical setting
Market authorisation
4 - post-marketing surveillance studies
Name descriptive statistics
- Mean, median, mode, standard deviation
- Confidence intervals
- P-values (parametric, non-parametric, chi-squared)
Name summary measures
- RR
- OR
- RD
- NNT
Name accuracy statistics
- Sensitivity, specificity, predictive values, likelihood values
What is the variance of a set of data?
- A measure of variability (SD is square root of variance)
- Calculated by taking the average of squared deviations from the mean
- Tells you the degree of spread in the data set
- The more spread the data, the larger the variance is in relation to the mean
How do you calculate risk ratio?
RR = risk in treated or exposed group / risk is unexposed or control group
How do you calculate the odds ratio?
OR = odds of having been exposed to a risk factor in case group / odds of having been exposed to a risk factor in control group
What is the absolute risk reduction? (ARR)
- Difference between the event rate in the intervention group and control group
= improvement rate in the intervention group - improvement rate in control group
What is the relative risk reduction? (RRR)
- The proportion by which the intervention REDUCES the event rate
RRR = ARR / control (placebo) no improvement event rate
What the number needed to treat?
The number of patients who need to be treated for one to get benefit
= 100/ARR
What is a student’s t-test?
- Tests a hypothesis on the basis of a difference between sample means
- i.e. the t test determines a probability that two populations are the same, with respect to the variable tested
- An example null hypothesis might be there is no difference in the mean BMI of patients undergoing vaginal and c-section deliveries
Describe a chi-squared test
- Chi-squared test of proportions
- Non-parametric test used to compare numerical or categorical data sets
- E.g. investigating the proportion of women taking pre conception folic acid in different socioeconomic groups
Describe ANOVA (analysis of variance)
- ANOVA tests the hypothesis that there is no difference between two or more population means (usually at least 3)
- Can test for differences without increasing the Type I error rate (which can happen if comparing multiple means by conducting multiple t-tests)