stats Flashcards
odds
a+c/ B+D (yes/no)
odds ratio
Odds treatment/Odds control
ad/bc
use in case control/cross section
risk
yes/Yes+no
A+C/ A+B+C+D
relative risk
Risk treatment/Risk control
a/a+b devide by c/c+d
Confidence interval
range of plausible values for some summary measure
In repeated studies 95% of the confidence intervals will cover the true value of the summary measure
sensitivity
proportion of true positive
A/A+C
specificity
proportion of true negative
D/D+B
Positive predictive value
probability that a subject with a positive test result actually has the disease
A/A+B
sensitivity x prevalence/ sensitivity x prevalence + (1-spec)x (1-prev)
depend on sensitivity and specificity
Depend on prevalence
Negative predictive value
probability that a subject with a negative test result does not have the disease
D/C+D
spec x (1-prev)/ (1-sen)xprev + spec x(1-prev)
depend on sensitivity and specificity
Depend on prevalence
If the prevalence is low
false positives are likely,
even if sensitivity is high
selection bias
2 groups in study different due to allocation flaws, drop out, chance
Information bias
info collected incorrect
e.g. recall bias
confounding
stratify
a variable that influences both the dependent variable and independent variable, causing a spurious association
But how do we stratify confounders that we don’t know
about yet?
Randomise
Intention-to-treat
- Analysing data according to the treatment assigned, rather than what treatment was actually administered
Dropouts are included in the analysis - Avoids selection bias due to unequal dropouts
May underestimate treatment efficacy, as the treatment
effect is diluted by
* Dropouts
* Crossover
* Drop-ins to other treatment