Statistics, epidemiology and study design Flashcards
What formular should be used to calculate combined sensitivity or specificities for multiple diagnostic tests?
Combined probablity (specificty or sensitivity):
= 1 - (1- spec/sen 1) x (1 - spec/sen 2)
E.g. A test has a specificity of 60%. Another test has a specificity of 80%. If both tests are performed, the specificity of the result is:
= 1- (1-0.6) x (1-0.8)
= 1 - (0.4 x 0.2)
= 1 - 0.08
= 0.92
= 92%
What is the geometirc mean? It’s a measure of central tendency.
A special type of mean that corrects better for very large a very small numbers. It’s calculate by multiplying n values together, then taking the nth root of the results.
E.g. geometric mean of 2, 3 and 10 (n = 3)
= cubed root of(10 x 2 x 3)
= 3.91
Compare with the arithemetic mean:
(10+2+3)/3
= 5 (shifted much more greatly towards the 10)
What’s the arithemtic mean?
It’s the normal mean - (1+2+3+… + x) / n (where n = number of numbers)
What is the mode?
The value that occurs the most often in a set of numbers. It’s a meausre of central tendency.
What are non-parametric tests used for?
Statistical tests to be used when data is nor noramlly distributed.
What has more statistical power, parametric or non-parametric testing, for discovering significant effects?
Parametric tests on normally distributed date.
What might lead to data not being normally distributed and therefore requiring non-parametric stastical tests?
When the data is ranked (aka ordinal data, when central tendency is better represented by the median (many or large outliers), when the sample size is too small, when the outliers cannot be removed.
What some examples of parametric statistical tests?
T-test and ANOVA
What are you comparing with a T-test? How does this compare to ANOVA?
You are comparing the means of 2 normally distributed data sets to look for a meanful difference between the two. If you wanted to compare the means of more than 2 groups, you need to do a ANOVA.
What’s a paired t-test used for?
When each value in one dataset corresponds meaningully to a value in the the other dataset. E.g. when dataset 1 is a measurement before a durg is given, and dataset 2 is a measurement after the drug was given on the same set of people.
What is 1 tailed t-test ued for?
When you need to no if the mean you are interested in was higher (or lower) than the mean you are comparing to. A 2 tailed t-test will only tell you mean 1 is significantly different to mean 2 or not.
What are some examples of non-parametric statistical test?
Chi-square test, Kruskal Willis test (ANOVA equivalent for non-parametric datat), and the Mann Whitney tests (the equivalent to the
What are some examples of non-parametric statistical test?
Chi-square test, Kruskal Willis test (ANOVA equivalent for non-parametric datat), and the Mann Whitney tests.
What is the purpose of comparative statistical tests?
Comparing the same measure of central tendency (e.g. mean/median/mode) from 2 or more data sets to establish whether or not there is a significant difference between the two groups with respect to 1 or more outcomes.
When two data sets are normally distributed, and parametric testing is indicated to establish if there is a significantly different measure of central tendency present or not, what measure of central tendency should be compared?
Mean
What are the data types that are important before choosing a statistical test?
Quantitative (might be continuous or discrete) data is that which is made up of numbers on an infinite scale that can be added, subtrated and divivded. Categorical data is that which falls into discrete buckets - it can be nominal (e.g. yes, no, maybe,) Ordinal (ranked, e.g. unlikley, likely, very likely), interval (ranked and with fixed intervals - e.g. scores 1-10, 11-20, 21-30).
Which comparative statistical test can be used for non-normally distributed data that is catagorical to compare two datasets?
Chi-square test of independence, spearman’s r
What’s the name of the test used typically in table 1 to determine if a study population between two arms are heterogenous?
Chi-square test of best fit (or cochrane Q)
Which comparative statistical test can be used for non-normally distributed data that is catagorical to compare 3
or more datasets?
Kruskal-Wallia (the non-parametric test version of the ANOVA).
If you wanted to compare 3 parametric datasets with regards to 2 or more outcome variables, which statistical test would you use?
MANOVA (modfied ANOVA allows multiple outcomes to be tested)
If you have wanted to compare to non-parametric quantitative data sets that are un-paired, how would you do this?
Wilcocon Rank-Sum test. For paired data, do the paired Wilcoxon Signed-rank test.
What is the formular for number needed to treat?
The inverse of the absolute risk reduction.
So 1 / (risk 1- risk 2)
What are odds?
The number of times something happened compared to the number of times something didn’t happen to members of a population expressed as a ratio.
e.g. There was a population of 100 people who went for a walk. 10 fell over. What are the odds of falling over when walking if in this population?
10 fell over
90 did not fall over
Odds of falling over
=10:90
=1:9
What is relative risk?
The chance something will happen to a proprotion of the population expressed as percentage or a decimal. It is calculated by dividing the number of people who the event occured to by the total number in the population.
E.g. There was a population of 100 people who went for a walk. 10 fell over. What is the risk of falling over when walking if in this population?
Relative risk:
10 fell over
100 went walking (population)
= 10/100
= 1/10