Research Studies - Analysis of Data Flashcards
Confidence intervals
-what is this
-how is it calculated
A range of values within which the true effect of intervention is likely to lie
If at 95%, the true effect of the intervention should lie within this range 95% of the time
Standard error of the mean = SD/square root of n
-as n increases, SEM decreases
Lower limit = mean - (1.96 x SEM)
Upper limit = mean + (1.96 x SEM)
Correlation
-what is this
-interpretation of r
How closely the points lie to a line drawn through the plotted data
r = 1 strong positive correlation
r = 0 no correlation
r = -r strong negative correlation
No info on cause and effect
No info on how much y will change if x is changed
Linear regression
-what is this and how does it differ from correlation
Can predict how y changes if x changes
Forest plots
-what is this used for
Graphical display of multiple results from different studies in a meta analysis
Shows
-CI
-mean
-line of no effect - studies that cross this line are insignificant
-diamond - result of the meta analysis
Funnel plot
-what is this used for
Demonstrate the existence of publication bias in meta analyses
Box and whisker plot
-what is this used for
Graphical representation of the sample minimum, lower quartile, median, upper quartile and sample maximum
Histogram
-what is this used for
Graphical display of continuous data where values have been categorised into multiple categories
Scatter plot
-what is this used for
Graphical representation using Cartesian coordinates to display values for 2 variables for a set of data
Kaplan-Meier survival plot
Plot estimating the survival function showing decreasing survival with time
Intention to treat analysis
-what is this
All patients randomly assigned to 1 treatment are analysed together, regardless of whether they completed the treatment
Avoid effects of crossover and drop out which may affect the randomisation to the treatment groups
Distributions
-what is normal distribution and the percentages
Symmetrical - mean = mode = median
1SD - 68.3% of values
1.96SD - 95% of values
2SD - 95.4% of values
3SD - 99.7% of values
SD = square root of variance
Variance = measure of spread of scores away from the mean
Positive skew and negative skew
-distribution of mean, median, mode
Positive skew
mean > median > mode
Negative skew
mean < median < mode
Significance test types
-definition of parametric vs non-parametric
Parametric - can be measured
Non-parametric - cannot be measured
Parametric
-Student’s t test - paired, unpaired
-Pearsons - correlation of linear data
Non-parametric
-Mann-Whitney U - compares ordinal, interval or ratio scales of unpaired data
-Wilcoxon signed-rank - 2 observations on 1 sample after an intervention
-Chi-squared - compare proportions, percentages after an intervention
-Spearman - correlation of non linear data
Significance tests
-H0, H1
-p value
-type 1, 2 errors
-power
H0 - 2 treatments are equally effective
H1 - there is a difference between the 2 treatments
p - probability of obtaining getting this result by chance if H0 is true
T1 error - H0 rejected when true (a)
T2 error - H0 accepted when false (b)
Power - probability of correctly rejecting H0
1-b
Can be increased by increasing sample size
Number needed to treat
-what is this
-how to calculate it
Absolute risk reduction
-what is it
-how to calculate it
How many patients would require intervention to reduce the expected number of outcomes by 1
1/absolute risk reduction
Absolute risk reduction - absolute difference between the control event rate and experimental event rate