Research Skills 10 : Student's t-test Flashcards
what are steps when studying the difference between two groups of observations?
- We first look at the effect size
- Then we compare the effect size to the variation
- Using raw data
- And s.e.m error bars
- To judge statistical confidence in the result
What is a more exact way to judge statistical confidence/ uncertainty ?
- to calculate the 95% confidence interval (C.I.) for the difference between the two groups
- The p-value can sometimes also be useful
How do w calculate these?
Open a statistics package on the computer
Type in the data
Request a t-test
What is students’ t test?
Commonest test for comparing the mean of two groups of observations
When do we use the t test?
Designed for cases where you have small numbers of observations
And you cannot tell the actual distribution of the data
Assumes normal distribution
But robust- gives a reasonable answer even when the data doesn’t exactly fit a normal distribution
How does t test work?
It compares effect size with variation
Uses this to calculate a 95% C.I.
And to calculate an NHST p-value
T test and Skewed data
What do t test assume?
The t-test mathematically assumes a normal distribution
But it is robust to variations from a normal distribution
And, with small numbers of observations, it is virtually impossible to be sure whether the data does or does not fit a normal distribution
When is it safe to use t test
- For large numbers of observations, you are safe to use the t-test
- For small numbers of observations, the t-test may not be accurate
- Although, in most cases the p-value will be raised, you are more likely to get a Type 2 error than a Type 1 error
Why is it not useful to use “non - paramric) test instead of the t test for skewed data?
- Non-parametric tests don’t work well with small numbers of observations (they have low statistical power)
- They do not provide a 95% C.I.
What can you do instead for skewed data?
you can make your data closer to a normal distribution by a mathematical transformation
paired data
eg. investigating a human population and a drug effect
each individual is different there fore looking at before and after results in a single individual ignores variation
study the effect and the variability of the effect itself
Paired data can also arise in laboratory based experiments
One example
- We perform an experiment on a cell line on three different days
- The values for the controls differ on each day, because of slight differences in the growth state of our starting cells
- If we analyse the treated and control readings for each day as paired data, we can minimise the effect of day to day variation