Stata Lecture 3 Flashcards
What 3 things must you consider when deciding how to analyse data?
Is the data normally distributed or skewed?
Are you comparing two groups or more?
Are the groups independent or are they paired?
What is the difference between independent groups vs paired groups?
Independent groups come from separate populations (e.g. how men vote vs how women vote)
Paired groups are drawn from the same population (e.g. womens views on politics vs womens views on art)
What are 3 possible reasons for incurring a high p value?
There truly is no difference between the two groups
The sample was atypical and not representative of the population
The sample was not large enough for the data to be deemed significant
3 Disadvantages of non-parametric statistical tests?
Do not provide a confidence interval
Based on rankings, not scores
Less sensitive than parametric test - best to use parametric test if assumptions hold for it
How do you check if data is normally distributed?
Box and Whisker plot for the data in each individual group
What are the assumptions of all parametric statistical tests?
Normally distributed data
Standard deviation is similar across all groups
3 reasons when could you potentially use a parametric test for skewed data?
When there are 50 observations in each group
The skewness is not very large
The standard deviations are similar across groups
How to calculate variance?
Standard deviation squared
How do you determine if standard deviation is similar across groups?
The variance (standard deviation squared) should be no more than 4 times larger the variance of any other group
When would you use a non-parametric test?
Data is skewed
Small sample size
Standard deviation differs across groups
Which test would you use for:
parametric
2 groups
independent
Two sample/unpaired t-test
Which test would you use for:
parametric
2 groups
paired
Paired t-test
Which test would you use for:
parametric
3 or more groups
independent
ANOVA (analysis of variance test)
Which test would you use for:
parametric
3 or more groups
paired
Repeated measures analysis of variance
Which test would you use for:
non-parametric
3 or more groups
independent
Kruskal-Wallis test