Stats Flashcards
Null hypothesis
A strawman set to argue against the data
-If your results prove your hypothesis, you reject the null hypothesis
Type 1 error
Rejecting the null hypothesis when it is in fact true
A False positive result.
You’re wrong, but you don’t realise it.
P value
The probability of a Type 1 error occurring
i.e. the chance that you’re wrong, but you don’t realist it and the null hypothesis is actually true
Usually <0.05 arbitrarily set
Type 2 Error
Accepting the null hypothesis when it is in fact false
A false negative result
You decided you weren’t right when you were
Power
Likelihood of finding an effect when it is present
Power = 1-p(Type 2 Error)
So If most studies aim for a power of 80%, then it means that 80% of the time if the effect is there it will be noted.
Alternatively, the type 2 error rate would be 20%
Modifiers of Power
Bigger is better:
- Size of effect
- Sample size
Lower is preferred:
- Desired significance
- Standard deviation
Risk
The chance of something occuring
E.g. A population has a 5% chance of dying when they present with PE
Odds
The chance of something occurring compared with it not occurring
E.g A population has a 5% chance of dying compared to a 95% chance of surviving when they present with a PE.
Therefore: 5/95 = 1/19
For every 1 that dies, 19 survive
Relative Risk
The chance of something occurring relative to the chance of it occurring under different circumstances.
E.G. A population has a 5% chance of dying when they present with PE and a 15% chance of dying when they present with PE + hypotension
Therefore: 5%/15% -> 1/3
Therefore: A person presenting with a PE without hypotension has 1/3 the risk of death relative to one presenting with a PE with hypotension
Absolute Risk Reduction
The absolute difference in chance of something occurring compared to the chance of it occurring under different circumstances
E.G. A population has a 5% chance of dying when they present with PE and a 15% chance of dying when they present with PE + hypotension
Therefore: 15% - 5% -> 10%
Therefore: the absolute risk of death has increased by 10%
Number Needed to Treat
NNT = 1/ARR
E.G ARR = 10%
Taking 10 people and treating them prevents 1 death
Odds Ratio
The Odds version of relative risk
The odds of something occurring relative to it occurring under different circumstances
E.G. A population has a 5% chance of dying when they present with PE and a 15% chance of dying when they present with PE + hypotension
Odds death nohypo = (1/20)/(19/20) = 1/19
Odds death hypo = (3/20)/(17/20) = 3/17
Odd ratio death hypo relative to death nonhypo = (3/17)/(1/19)
Therefore: (3/17)x(19/1)
Therefore: (3x19)/17 -> 57/17 -> 3.35
The odds of death are 3.3x higher with hypo than without
Note: it’s not 3x higher like with the relative risk
Which study uses odds ratio instead of relative risk
Case control
Cross Sectional Study
Measure the prevalence of disease and exposure in a random sample of a population at a time point
Pros
- Cheap and easy
- Questionnaires
Cons
- Recall bias from self reporting
- Can’t determine which came first (Temporality)
- Non response bias (Those who participate may differ from those who don’t)
- Bad for rare issues as it randomly samples a population
- Confounding
Case Control Study
Sample disease states and then ask retrospectively about exposure
Pros
- Efficient for rare diseases and outbreaks
Cons
- Hard to find matched controls
- Can’t determine which came first
- Cause/effect impossible to ascertain
- Can’t be used for prevalence, incidence, and risk
- You select a control for every case, so you can’t compare one to the other like that
- Have to use regression to spit out an odds ratio
- Recall bias
- Confounding
Prospective Cohort study
Measure risk factors in people disease free at baseline
Follow them over time, wait for them to develop the outcome, and calculate risk/rates of developing disease
Pros
- Exposure occurs prior to outcome
- Able to study multiple outcomes
- Can be used for rare exposures and multiple outcomes
- Can be used for prevalence and incidence
- Usually generalisable due to sampling from general community
- Avoid recall bias
Cons
- Expensive
- Take a long time
- Confounding
- Loss to follow-up
Retrospective cohort study
Cohort assembled after an outcome has occurred using stored data
Pros
- Exposure occurs prior to outcome
- Cheaper and faster than prospective
Cons
- Data quality may be limited