STATS- Lec 12 T tests Flashcards
1
Q
Normal distribution
A
- You can only use t tests that are normally distributed
- (68% = 1 SD, 95% = 2SD)
*
2
Q
Quick reminder
A
- Normal distribution
- 60% with 1 SD
- 95% within 2 SD
- Z scores
- -1 to 1: 60%
- -2 to 2: 95%
- Need to know
- True mean
- SD
3
Q
Guinness
A
- ‘Student’ alias Willian Gosset (1908) was an employee of Guinness breweries
- He was looking for a metric of quality control
4
Q
“Student” realised that
A
- What if true mean and SD are unknown
- Have to estimate mean (and variability) from a limited number of measurements
- Cannot calculate z scores
- Creates a “t distribution”
- NUMBER of measurements important
- A large number of estimates (>30) then t distribution is just like z distribution
- Number of measurements related to “Degrees of freedom” of an experiment
5
Q
Degree of freedom
A
- This is the difference between the number of measurements made the number of parameters estimated
- A parameter is a value that describes something
- For example
- Estimating the mean (1 parameter) from 5 numbers leaves 4 degrees of freedom in an estimate, the more reliable the estimate
6
Q
T and degree of freedom
A
- Unlike z, the t distribution depends on the number of degrees of freedom
- Hence t relies on the number of measurements made the number of parameters
- The more degrees of freedom, the closer t comes to z
- If t and z were exactly the same then 95% of data would fall between t = -2 and +2: This only happens when the number of degrees of freedom is 30 or more
- If they are smaller than 30 df than the critical value (t value between which 95% is held) will be greater than 2
- The concept of a CRITICAL VALUES: tcrit
7
Q
Is that t-value significant
A
- Depends on the Df
- [t] >3.18 Df=3, p<0.05; 5% probability it is not significant
- [t] >2.45 Df=6, p<0.05
- [t] >1.98 Df=30, p<0.05 (same as z)
- The fewer measurements, the larger the t values have to be to reach significance
- small t value unlikely to be significantly different from chance
8
Q
How to calculate a t values
A
- Independent measures
- For independent samples
- Between subjects design
- t=difference in means/ difference expected by chance
- t= Mean 1- Mean 2 / SD of the means
- Data HAS to be normally distributed
9
Q
The 2 sample t-test visualized
A
*
10
Q
For repeated measures design
A
- WITHIN subjects design
- 2 measurements made in EACH subject
- e.g. Before treatment and after
- Calculate CHANGE in EACH SUBJECT
- t= Mean change / Change expected by chance
- Measure the height of a person, measure the height of the same person with socks on
11
Q
The repeated measures t-tests
A
- Join up scores from the same subject
- Get a list of difference d1,d2…d100
12
Q
Reporting results
A
- The statistic value (t=1.95)
- Degrees of freedom (df=39)
- Significance level (p<0.025)
- There was a significant difference between the 2 groups (t=1.95, df=39, p<0.025)
- NB. Interpretation of the meaning of that difference depends on the research design
13
Q
Standard error and repeated measures t-test- Visualising data
A
14
Q
The sample mean
A
- In practice, you can’t survey everyone in a population
- You take a sample of that population and assume that it is representative
- The sample mean is an estimate of the true population mean
15
Q
But how accurate is your sample mean
A
- If you were to estimate the mean over and over again what would be the spread
- The standard error (known as the standard error of the mean)
- The standard error depends on two things: the standard deviation of the original distribution and the number of samples