Exam Flashcards
What are measures of variability?
- Statistical procedures to show how spread out the data is around a point
- The difference between the mean and the data point tells us how far away each data point i from the mean
- Range, SD & variance
What is standard deviation?
Represents how far from the average each data point is
What is variance?
A number indicating how spread out the data is
What is the population?
The complete collection to be studied
What is a Sample?
A section of the population
Why do we need measures of variability?
You want the sample to reflect the overall population, so you use measures of variability to see how comparable your sample data is to the estimated population
How do you prepare the data before calculating variance/SD
- Take each value and square it
- Add all of the values together
- Add the squared values together
How do you calculate variance?
- Sample size x Calculated squared value
- Square the standard value
- subtract one from the other
- Divide this by sample size times sample size -1
n(sum of x squared) - (sum of x) squared
_______________________________
n(n-1)
How do you calculate the standard deviation?
Find the square root of the variance.
What is the standard error (of the means)?
Tells you how good your sample is
The standard deviation of the sample distribution
The average difference between our sample and the target population
Measure of uncertainty in the mean.
What is the sampling distribution?
This is used to estimate how much deviation we will find between our sample and the target population.
A distribution of something like the sample mean - not the raw data values
How do you calculate the standard error of the mean?
Standard deviation
______________
Square root of n
What does the SD (high/low) for the SE tell us?
SD high number = more variable = mean is not as good estimate of he population mean
SD low number = less variable = mean is a much more reliable estimate of the population mean
What does N (high/low) for the SE tell us?
Higher N = lower SE
Lower N = higher SE
What does the 95% confidence interval tell us?
That we can be 95% confident that the population mean will fall between the upper and lower boundaries
What do error bar graphs represent?
The 95% confidence interval
What does overlap of the error bars mean?
Less overlap indicates more effect.
Statistical significance is…
A difference that is most probably not due to chance.
This is affected by the sample size. A large enough sample will always result in statistical significance.
Effect size is…
The magnitude of the relationship or difference found.
Looks at whether the difference is enough to be of practical significance.
This is not affected by the sample size.
How so you calculate the Effect size (Cohen’s d)?
Mean of group 1 - Mean of group 2
___________________________
SD of the population
What does the Effect value (high/low) tell us?
Cohen’s convention for interpretation
Large Effect = .8
Medium Effect = .5
Small Effect = .2
What does Cohen’s d actually mean?
The differences in two groups based on the SD.
(can be thought of as the % of non-overlap between a condition and a control group on a bell curve - the two would sit bang on top of each other.)
Larger effect = less overlap
Smaller effect = more overlap
When would you use Cohen’s d?
- When comparing 2 means
- T-test
When would you use a partial eta squared?
When statistical tests are based on variance rather than SD
ANOVA
What is partial eta squared?
An indication of the proportion of variance in the dependant variable that is explained by the independent variable.
How do you interpret a partial eta squared value?
Ranges from 0 - 1
Small Effect = .01
Medium Effect = .06
Large Effect = .138
- The % of variance in the DV that is explained by the IV
What is the correlation coefficient?
Pearson’s r
The effect for hypothesis that look at relationship.
How do you interpret Pearson’s r?
correlation coefficient
No relationship = 0
Small = .10 - .29
Medium = .30 - .49
Large = .50 - 1
Positive or negative - still interpreted the same.
r squared = the % of variance in one that is explained by the other.n
What is the relationship between effect and error?
If you have more effect than error then you’re more likely to see the results in the target population
Effect
_____
Error
How do you Interpret the t value?
T = > 1 - There is more effect than error so it is significant
T = < 1 - There is more error than effect so it is NOT significant
What is the p value?
The probability as to how true our results are.
The probability of our null hypothesis being true.
A percentage
Reported to 3 decimal places.
What is a Type 1 error?
When it looks like there is an effect but THERE IS NOT AN EFFECT
We have selected a sample that was better overall, before out manipulation.
What is a Type 2 error?
Sampling from the correct group and thinking that there isn’t an effect but THERE IS AN EFFECT
How do you interpret the p value?
P<0.05 = significant
Bigger than the p value = accept the null hypothesis - no effect
Smaller than the p value = reject the null hypothesis and accept the directional hypothesis - effect.
What is a sampling error?
When you use the wrong population
What is the standard deviation error?
A measure of how much error we would expect.
What do you aim for with the sampling error and the standard deviation error?
The difference between the two to be low.
When would you use the Kruskal Wallace H test?
Violated assumptions of a 1 way ANOVA
NON parametric
Between subjects
What is the Kruskall Wallace H test?
The non parametric anova for between subjects design
What is the post hock test after a kruskall Wallace significant output
Man Whitney U
What output do you report for man Whitney U?
U and P
What is the non parametric anova for within participants design?
Friedman
What is the post hock test after a significant Friedman output?
Wilcoxon
What output do you report for wilcoxon
Z and P
When would you use a Friedman test
1 way ANOVA
NON parametric
within subjects design
How do you run a Kruskal Wallace test?
Analyse - non parametric tests - legacy dialogues - K independent samples - move variables - define range (amount of groups) - options - descriptives (and quartiles if you want medians) - continue - ok.
How do you run a Friedman test
Analyse - non parametric tests - legacy dialogues - K RELATED samples - move groups across - tick Friedman - click statistics - select quartiles - ok - continue
When do you use a one way ANOVA? // What are the 4 assumptions of an ANOVA?
1 factor with more than 2 levels
IV - Factor
DV - Levels
Between participants design
normally distributed data
equal sphericity
Why do we use the ANOVA?
Otherwise, we would need to do 3 (or more) paired t-tests each with a p value of .05 which makes an overall chance of error of 15% (or more) which is far too high.
What does the ANOVA find?
Whether the effect is bigger than the error.
How does the ANOVA find out whether the effect is bigger than the error?
F = Between groups variance
____________________
Within groups variance
What is the treatment effect?
The difference in something before and after manipulation.
What is the grand mean and how do we calculate it?
- The figure that is used to compare the individual group mean scores to which gives us the within group variance (measure of error)
- Add up the three groups means, then divide by the number of groups
How do you interpret the F value?
An F value greater than 1 means that there is more effect that error.
How do you report the ANOVA results?
A one way ANOVA showed that there is an effect between the (conditions)
F(between, within) = F value, P value.
What is the post hock test for an ANOVA?
Independent t test.
What is a Bonferonni correction?
Correcting the p value to reflect the amount of conditions.
3 conditions = 0.05/3 = new Bonferroni corrected p value of P = 0.17
What is a test of homogeneity?
A method of finding whether there is equal variance between the groups.
What test of homogeneity would you use for an independent t test?
Levene’s test
What does a significant Levene’s test result mean?
There is a difference in equal variance between the two groups so you’re violating one of the assumptions of the ANOVA.
This means you have to revert back to a non parametric ANOVA.
What does an insignificant Levene’s test result mean?
There is equal variance between the groups and you have met the assumption of the ANOVA
Continue with post hock tests.
When would you use a related ANOVA?
Within participants design.
What changes with a related ANOVA?
The between groups variance becomes the between conditions variance.
We don’t have to account for individual difference so we don’t compare the grand mean with condition means.
The f value uses the error and variance.
What is the Greenhouse geisser correction?
This works out the difference score and the mean variances of the differences for a related ANOVA
What does it mean when the mean variances of different scores are significantly different for a related ANOVA
We have violated the assumption of sphericity and therefore have to read the greenhouse geisser correction
When do we report the greenhouse geisser correction?
We always report this in front of the f value for related ANOVA’s