Statistics Flashcards
Definition of funnel plot
a scatterplot of treatment effect (e.g. OR on x axis) against a measure of study precision (e.g. SEM on y axis)
Commonly used in meta-analyses to assess publication bias
Definition of forest plot
graphical display of estimated results from a number of scientific studies addressing the same question
gives visual suggestion of the amount of heterogeneity
and can show the estimated common effect
Definition of standard deviation
measure of amount of variation or dispersion of a set of values
Definition of per-protocol analysis
only subjects who completed the entire protocol are included in the analysis of a randomised clinical trial
Definition of intention to treat analysis:
all subjects randomised are included in the analysis regardless of whether they completed the study
Definition of clinical equipoise
state of genuine uncertainty on relative value of two interventions being compared in a trial - requirement for RCTs to be ethical
Definition of MHRA Yellow Card scheme
provides early warning that safety of a medicine or medical device may require further investigation
Definition of standard error
SD / square root of (N)
looks at how accurate the mean of the study population is compared to the true population
(whereas standard deviation compares how participants in the study population compare to each other)
Definition of null hypothesis
hypothesis that there is no significant difference between specified populations
Definition of type I error
falsely rejecting a null hypothesis that is true in the population (false positive)
Definition of type II error
failing to reject null hypothesis that is false in the population (false negative)
Definition of power
probability of picking up a significant difference, if there is one (probability of not making type 2 error (false negative))
1 – probability of Type II error
Definition of p-value
probability of event happening by chance = probability of wrongful rejection of the null hypothesis = probability that the null hypothesis is true = type 1 error
Definition of confidence interval
range within which the true answer will lie 95% of the time
Definition of a priori
pre-specifying end-points & outcomes of a study to reduce reporting bias
Definition of surrogate endpoint
variable relatively easily measured that predicts a distant outcome of the intervention being tested
Definition of composite outcome
combination of two or more outcomes into single endpoint
Definition of Cohen’s kappa coefficient
statistic used to measure inter-rater reliability (degree of agreement between raters/observers )for qualitative variables
Definition of absolute risk
probability that an event will occur (incidence)
number of events/total number of people
Definition of absolute risk reduction
difference in rate of events between 2 groups
ARR = AR (C) – AR (T)
Incidence in control - incidence in intervention
Definition of relative risk
risk ratio = relative likelihood of an event occurring in the treatment vs control group throughout study period
RR = AR (T) / AR (C)
cumulative risk
Definition of relative risk reduction
reduction in rate of outcome in treatment group vs. control group
RRR = ARR / AR (C)
Definition of number needed to treat
number of pts needed to treat to prevent 1 additional bad outcome, e.g. death, stroke
NNT = 1 / ARR
Definition of hazard rate
probability of the event occurring in the next time interval divided by the length of that time interval
time-sensitive = instantaneous risk
Definition of hazard ratio
relative likelihood of an event occurring in the treatment vs control group at any given point
Definition of logistic regression
statistical analysis method to predict a binary outcome, such as yes or no, based on existing independent variables
Definition of linear regression
regression model that estimates relationship between one independent variable and one dependent variable using a straight line
Definition of chi-squared test
hypothesis test to determine whether observed frequencies are significantly different to expected frequencies if the null hypothesis was true
categorical variables
Definition of t-test
hypothesis test to determine whether means of two groups are significantly different from each other
continuous variables
Definition of ANOVA
hypothesis test to determine whether means of three or more groups are significantly different from each other
continuous variables
Definition of log-rank test
hypothesis test to compare the survival distributions of two samples
Definition of Kaplan Meier curves
probability of survival curves for categorical values
Definition of Cox proportional hazards regression analysis
survival analysis for both quantitative & categorical variables, which can simultaneously assess the effect of several risk factors on survival time
Definition of correlation coefficient
how closely 2 continuous variables move with each other
Types of correlation coefficient
parametric: Pearson’s R
non-parametric: Spearman’s rank correlation Rho
Definition of receiver-operating characteristic (ROC) curve
a graph showing the performance of a classification model at all classification thresholds. This curve plots two parameters: True Positive Rate. False Positive Rate
ROC curve axes
X axis: 1-specificity (false +ves)
Y axis: sensitivity (true +ves)
Funnel plot axes
X axis: study outcome, e.g. OR
Y axis: study precision, e.g. SEM
Measure of forest plot heterogeneity
I squared
How do you calculate relative risk?
RR = A/(A+B) / C/(C+D)
(those who got the disease in all exposed vs those who got the disease in all not exposed)
What types of studies can RR be used in?
Prospective studies
Odds ratio
Ratio of odds of something happening vs the odds of something not happening with a particular exposure