Week 7 day 2 Flashcards
When do we use independent samples t-test?
When we have data from two independent samples and we want to know whether they are from the same population or not.
What is the difference between a Student t-test and a Welch t-test?
The student t-test is dependent on homoscedasity between the two samples, i.e. they have the same variance/sd.
The Welch test can account for difference variance.
What influences the size of the t-statistic?
- How different the means are (obviously).
- The degrees of freedom/ sample size.
- The variance of the two sample. Increased variance decreases t.
Should you always say that you use a Welch independent samples t-test?
Yes.
What type of t-test do we use if the data of each sample is not independent?
A paired-samples t-test.
Are paired-samples t-tests always repeated-measures tests?
No.
A paired samples t-test would also be used if there were multiple responses from an individual/s in different contexts.
In other words, for every row there are two data points.
Is a repeated measures paired samples t-test essentially just a one sample t-test done on the mean difference in response at each time point?
Yes.
What is the null hypothesis for a paired-sample t-test?
That the difference is 0.
What are the assumptions required for doing t-tests?
- That the distribution of the each population is normal.
- The data are independent, unless you are doing a paired-samples t-test that allows for specific dependence.
- For Student tests, the assumption is that the two groups being tested have the same variance.
How do we test for normality of data?
One way is to use a quantile-quantile plot. This graphs the theoretical quantiles against the observed quantiles. If there is a straight line then we can assume normality.
Another way is to look at a histogram.
Both these require a qualitative judgement.
For a quantitative check, you can do a Shapiro-Wilk test.
What are some of the reasons why checking for normality is not necessary?
- Most statistical tests are quite robust against violations of normality.
- Tests for normality will say any set of data is not normal if the sample size is large enough.
What is the Shapiro-Wilk test?
It is a test for normality of data.
In R shapiro.test() generates a W stat and a p-value. A W less than 1 indicates data not normal.
The null for a shapiro-wilk test is that the data IS normally distributed. Therefore, a p value < .05 is saying that the data is NOT normally distributed.
If you are testing for normality, do you need to test normality for the distribution of each sample/group?
Yes.
When checking the normality of data in a paired-samples t-test, do we need to test normality of the difference or the two data sets?
A paired-samples t-test is a one-sample t-test on the difference between groups. This means that we need to check the normality of the difference variable, and not the distribution of each group.
What are non-parametric tests?
These tests do not make assumptions about the shape of the distribution of the data, i.e. does not require that they are normally distributed.