Descriptive Comparisons Of Groups Flashcards
What descriptive statistics would be used for continuous data that was normally distributed?
Mean and standard deviation
What descriptive statistics would be used for continuous data that was not normally distributed?
Median and interquartile range
How would you determine if the outcome variable was normally distributed?
Draw a histogram (either for the whole sample or in two groups separately)
Our data shows men have a mean forced expiration volume (FEV) of 3.19 litres compared to that of 2.44 litres in women. What can we interpret from this data?
That women have a lower mean FEV compared to men
Constructing a confidence interval around the mean can help us determine whether there is a true difference between the groups. How does this work?
If the confidence intervals for both means do not overlap then you can be fairly confident there is a true difference
When the confidence intervals for your two means do overlap, what should you interpret instead?
The difference between the two means (mean difference) and the 95% CI for that difference
Our mean difference is 0.64 and our CI is 0.45-0.85. What does this tell us?
On average group1 have a higher variable than group2
The CI tells gives us an idea of how good the estimate is compared to our true mean difference. Here, because it is above zero it indicates this is likely to be a true difference
For normally distributed data we compute the difference in means. What do we compute for non-normally distributed data? What is the problem surrounding this?
We compute the difference in medians. However, the 95% CIs around medians or difference in medians cannot easily be computed
How would you calculate the mean difference and CIs for a situation with more than two exposure groups?
- decide which is the unexposed (baseline/reference)
- calculate the stats in the same way, relative to the baseline
Does a CI for the mean difference that spans zero always give a non-significant result?
Yes
When we have more than two exposure groups, CIs cannot be used to assess whether there is a true overall association/difference or not. What would need to be carried out instead?
An appropriate hypothesis test
Which statistics can be used for a descriptive comparison between two categorical groups?
- Risk statistics
- Odds of developing disease (odds ratio)
What type of study are risk statistics limited to?
Cross-sectional studies
- How do you calculate the overall risk (prevalence) in the study?
- How do you then calculate the risk ratio?
- Number of people with disease divided by total number of people in study population
- Risk of disease in exposed divided by the risk of disease in unexposed
How would you interpret the following risk ratios: 1, >1?
1 = no change in risk >1 = increase in risk