EPIDEMIOLOGY - Biostatistics Flashcards
What are the two types of statistics?
Descriptive statistics
Inferential statistics
What are the two types of data?
Quantitative
Qualitative
Describe the two types of quantitative data
Continuous: data that can take any value within a range (e.g. height, blood pressure)
Discrete: data that can take only distinct, countable values (e.g. number of hospital admissions)
Describe the two types of qualitative data
Nominal: distinct, unordered categories of data
Ordinal: categories of data with some order or hierarchy
What are probabilistic outcomes?
Probabilistic outcomes are the possible results of an experiment or trial that are subject to randomness, so each can only be assigned a probability rather than predicted with certainty
What are the measures of central tendency?
Mean
Median
Mode
What are the measures of dispersion?
Range (maximum and minimum values)
Standard deviation
Interquartile range
In practice, which combinations of central tendency and dispersion would you typically report?
- Mean and standard deviation
- Median and interquartile range
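As a rough illustration of the measures above, the following sketch computes them with Python's standard `statistics` module. The blood pressure readings are made-up values used only for the example, and the quartiles use the module's default method, so other software may give slightly different interquartile ranges.

```python
import statistics

# Hypothetical sample: systolic blood pressure readings (mmHg).
readings = [118, 122, 122, 125, 130, 134, 141, 160]

# Measures of central tendency.
mean = statistics.mean(readings)
median = statistics.median(readings)
mode = statistics.mode(readings)

# Measures of dispersion.
value_range = (min(readings), max(readings))
sd = statistics.stdev(readings)  # sample standard deviation (divides by n - 1)

# quantiles(n=4) returns the three quartile cut points Q1, Q2 and Q3.
q1, _, q3 = statistics.quantiles(readings, n=4)
iqr = q3 - q1

print(f"mean = {mean:.1f}, SD = {sd:.1f}")
print(f"median = {median:.1f}, IQR = {iqr:.2f}")
```

The single high reading (160) pulls the mean above the median, which is the kind of skew where the median and interquartile range are usually the preferred pair.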
What is the purpose of frequency tables?
Frequency tables summarise how often each possible value occurs in a data set
What is the difference between cumulative and relative frequency?
Cumulative frequency: running total of frequencies in a frequency distribution
Relative frequency: frequency of a value as a proportion of the total number of observations (often given as a %)
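A frequency table with a running-total (cumulative) column and a relative-frequency column could be built along these lines. This is a minimal sketch using only the standard library; the visit counts are invented for the example.

```python
from collections import Counter
from itertools import accumulate

# Hypothetical data: number of GP visits per patient in one year.
visits = [0, 1, 1, 2, 2, 2, 3, 3, 4, 1]

counts = Counter(visits)
values = sorted(counts)
frequencies = [counts[v] for v in values]

# Cumulative frequency: running total of the frequencies.
cumulative = list(accumulate(frequencies))

# Relative frequency: each frequency as a percentage of all observations.
total = len(visits)
relative = [100 * f / total for f in frequencies]

print("value  freq  cum.freq  rel.freq(%)")
for v, f, c, r in zip(values, frequencies, cumulative, relative):
    print(f"{v:>5}  {f:>4}  {c:>8}  {r:>11.1f}")
```

The last cumulative frequency always equals the number of observations, and the relative frequencies always sum to 100%.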
Give six examples of graphs that can be used to visualise data
Box plots
Bar plots
Density plots
Pie charts
Scatter plots
Line plots
What is the null hypothesis?
The null hypothesis is the statement that there is no relationship between the two variables
What is the alternative hypothesis?
The alternative hypothesis is the statement that there is some relationship between the two variables
What is statistical hypothesis testing?
Statistical hypothesis testing is the use of data to determine the plausibility of a hypothesis
What is a test statistic?
A test statistic is a number calculated by a statistical test that describes how far the observed data are from what would be expected under the null hypothesis (the t-statistic from a t-test is one example)
What is the probability value (p-value)?
The probability value (p-value) is the probability of obtaining a test statistic at least as extreme as the one observed, assuming the null hypothesis is true
Describe how a 0.05 probability value (p-value) threshold works with regard to the null hypothesis
A p-value less than 0.05 is typically considered statistically significant, in which case the null hypothesis is rejected. A p-value of 0.05 or greater means that the deviation from the null hypothesis is not statistically significant, and the null hypothesis is not rejected (this does not prove that it is true)
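To make the idea concrete, here is a small sketch of an exact p-value calculation for a coin-flip style experiment, where the null hypothesis is that both outcomes are equally likely. The function name `two_sided_binomial_p` and the 9-out-of-10 example are assumptions chosen for illustration, not part of any particular library.

```python
from math import comb

def two_sided_binomial_p(successes: int, trials: int) -> float:
    """Exact two-sided p-value for H0: probability of success = 0.5."""
    # Under H0, P(X = k) = C(n, k) / 2^n for k = 0..n.
    probs = [comb(trials, k) / 2 ** trials for k in range(trials + 1)]
    observed = probs[successes]
    # Add up every outcome that is at least as unlikely as the observed one.
    return sum(p for p in probs if p <= observed + 1e-12)

# Hypothetical result: 9 of 10 patients improved on a new treatment.
p_value = two_sided_binomial_p(successes=9, trials=10)

alpha = 0.05  # conventional significance threshold
reject_null = p_value < alpha

print(f"p-value = {p_value:.4f}, reject null hypothesis: {reject_null}")
```

Here the outcomes at least as extreme as 9 successes are 0, 1, 9 and 10 successes, which together have probability 22/1024, so the null hypothesis would be rejected at the 0.05 level.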
What is the main difference between parametric and non-parametric tests?
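Parametric tests assume the data follow a particular distribution (usually the normal distribution) and typically compare mean values, whereas non-parametric tests make no such assumption and typically compare medians or ranks, so they are used for data that are not normally distributed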
What are four examples of parametric tests?
One sample t-test
Two sample t-test
Paired sample t-test
Analysis of variance (ANOVA) test
When would you use a one sample t-test?
To compare the mean value of a sample with an expected mean value
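The test statistic for a one sample t-test is the difference between the sample mean and the expected mean, divided by the standard error of the mean. A minimal sketch, with a hypothetical helper `one_sample_t` and made-up measurements:

```python
import math
import statistics

def one_sample_t(sample, expected_mean):
    """t = (sample mean - expected mean) / (sample SD / sqrt(n))."""
    n = len(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(n)
    return (statistics.mean(sample) - expected_mean) / standard_error

# Hypothetical fasting glucose values (mmol/L) compared with an expected mean of 5.0.
sample = [5.1, 4.9, 5.6, 5.8, 6.0, 5.4]
t_stat = one_sample_t(sample, expected_mean=5.0)

# The statistic is compared against a t distribution with n - 1 degrees of freedom.
degrees_of_freedom = len(sample) - 1

print(f"t = {t_stat:.3f} on {degrees_of_freedom} degrees of freedom")
```

Turning the statistic into a p-value needs the t distribution, which is not in Python's standard library; in practice a statistics package would handle that step.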
When would you use a two sample t-test?
To compare the mean values of two different samples
When would you use a paired sample t-test?
To compare the mean values of two paired samples
When would you use an analysis of variance (ANOVA) test?
To compare more than two mean values with each other
What is the corresponding non-parametric test to a one sample t-test?
Wilcoxon Signed Rank test
What is the corresponding non-parametric test to a two sample t-test?
Mann-Whitney test
What is the corresponding non-parametric test to a paired sample t-test?
Wilcoxon Signed Rank test
What is the corresponding non-parametric test to an analysis of variance (ANOVA) test?
Kruskal-Wallis test
When would you use a Chi-squared test?
To compare the proportions of categorised data
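The Chi-squared statistic compares the observed counts in each cell of a table with the counts expected if the row and column categories were unrelated. The sketch below works through a hypothetical 2x2 exposure/disease table; the function name and the counts are assumptions for illustration.

```python
def chi_squared_statistic(observed):
    """Sum of (observed - expected)^2 / expected over every cell of the table."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (obs - expected) ** 2 / expected
    return chi2

# Hypothetical counts: rows = exposed / unexposed, columns = disease / no disease.
table = [[30, 70],
         [10, 90]]

chi2 = chi_squared_statistic(table)
dof = (len(table) - 1) * (len(table[0]) - 1)

print(f"chi-squared = {chi2:.2f} on {dof} degree(s) of freedom")
```

With one degree of freedom the critical value at the 0.05 level is about 3.84, so a statistic of 12.5 would indicate that the proportions with disease differ between the two groups.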
What is a 95% confidence interval?
A 95% confidence interval is a range of values around the point estimate that is likely to contain the true value; if the study were repeated many times, about 95% of the intervals calculated this way would contain it
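One common way to get a 95% confidence interval for a mean is the point estimate plus or minus about 1.96 standard errors. The sketch below assumes the normal approximation, which is reasonable for large samples; for small samples a t distribution multiplier would normally be used instead. The helper name and the data are hypothetical.

```python
import math
import statistics

def mean_confidence_interval(sample, confidence=0.95):
    """Normal-approximation confidence interval for the mean of a sample."""
    # Multiplier from the standard normal distribution (about 1.96 for 95%).
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    mean = statistics.mean(sample)
    standard_error = statistics.stdev(sample) / math.sqrt(len(sample))
    return mean - z * standard_error, mean + z * standard_error

# Hypothetical resting heart rates (beats per minute).
sample = [62, 70, 68, 75, 71, 66, 73, 69, 64, 72]
lower, upper = mean_confidence_interval(sample)

print(f"mean = {statistics.mean(sample):.1f}, 95% CI = ({lower:.1f}, {upper:.1f})")
```

A wider confidence level or a smaller sample both make the interval wider.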
What is the correlation coefficient?
The correlation coefficient is a measure of the strength and direction of the linear relationship between two numerical variables
What is represented by the correlation coefficient value of 1?
1 = Perfectly correlated (as one variable increases, the other variable also increases)
What is represented by the correlation coefficient value of 0?
0 = No correlation (no association between the variables)
What is represented by the correlation coefficient value of -1?
-1 = Perfectly anti-correlated (as one variable increases, the other variable decreases)
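The three reference values can be reproduced with a short calculation of Pearson's correlation coefficient, which is one common choice of correlation coefficient. The function below is an illustrative sketch and the data sets are constructed to hit each case.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariation = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariation / math.sqrt(spread_x * spread_y)

xs = [1, 2, 3, 4, 5]

r_positive = pearson_r(xs, [2, 4, 6, 8, 10])  # y rises exactly in step with x
r_negative = pearson_r(xs, [10, 8, 6, 4, 2])  # y falls exactly in step with x
r_none = pearson_r(xs, [2, 1, 3, 1, 2])       # no linear trend in y

print(f"{r_positive:.2f}, {r_negative:.2f}, {r_none:.2f}")
```

Real data rarely give exactly 1, 0 or -1; values in between indicate weaker or stronger linear association.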
What is linear regression analysis?
Linear regression analysis is the prediction of the value of a variable based on the value of another variable
What are the two parameters estimated by linear regression analysis?
Intercept and gradient
What is extrapolation?
Extrapolation is the prediction of a new Y-value from an X-value outside the range covered by given data
What is interpolation?
Interpolation is the prediction of a new Y-value from an X-value within the range covered by given data
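The sketch below fits a straight line by ordinary least squares, which is the usual way the intercept and gradient are estimated, and then labels each prediction as interpolation or extrapolation depending on whether the X-value lies inside the observed range. The function names and the dose-response data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares estimates of the intercept and gradient."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    gradient = (
        sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
        / sum((x - mean_x) ** 2 for x in xs)
    )
    intercept = mean_y - gradient * mean_x
    return intercept, gradient

def predict(x, xs, intercept, gradient):
    """Predict Y for a new X and say whether X is inside the observed range."""
    kind = "interpolation" if min(xs) <= x <= max(xs) else "extrapolation"
    return intercept + gradient * x, kind

# Hypothetical data: drug dose (mg) against response score.
doses = [1, 2, 3, 4, 5]
responses = [3, 5, 7, 9, 11]

intercept, gradient = fit_line(doses, responses)

inside = predict(2.5, doses, intercept, gradient)  # X within 1..5
outside = predict(10, doses, intercept, gradient)  # X beyond the data

print(f"y = {intercept:.1f} + {gradient:.1f} * x")
print(inside, outside)
```

Extrapolated predictions should be treated with more caution, because there are no data to confirm that the linear relationship holds outside the observed range.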