one-way anovas Flashcards
what are we comparing in a t-test?
two means
-> asking ‘are the means of the two groups significantly different when accounting for variance?’
what do t-values represent?
the difference between the mean as a function of variance (how spread out the data is)
what are we comparing in a regression?
two models of data and their relationship
what does a regression provide?
an accurate relationship between two data points
why do we use regressions instead of continuing to use t-tests?
- t-tests give you the same value as regression BUT can only be used to compare 2 data points / means while an ANOVA allows you to compare more complicated data then this
what are we looking for in a regression?
model that explains the difference between our variance / mean
* concept is the same as for a t-test
what do we do in regression?
compare more than two means -> general liner model
* (GLM: foundation for most inferential statistics -> all link into the concept of modelling our data and whether we can draw a line between our data which explains our data well
why can’t we just use multiple t-tests?
family-wise error
what is family-wise error?
with more groups / categories, the number of comparisons between groups increases (3 groups = 3 comparisons), the increase is not linear either (i.e. 4 groups = 6 comparisons, 10 = 45 comparisons etc.)
what is the issue with the number of comparisons between groups increasing called?
n choose k problem
which is the danger of multiple comparison?
we are looking at the probability of whether our results are due to chance (and when you add more comparisons, every single one of them has a 5% chance -> as you increase the comparisons, the chance level won’t stay at 5%)
what is an issue if more comparisons are made?
higher chance of type 1 error (false positive)
[accepting something which isn’t there in reality]
what is family wise error?
increased likelihood of type 1 error (due to multiple comparisons)
how can you calculate the family error rate?
for an alpha level of <.05, the probability of type one error can be calculated using:
1-(0.95)^k
*k is the number of comparisons
what are some examples of calculating a family wise error?
for 1 comparison, familywise error rate is 0.05 (5% chance of a type 1 error) -> 1-(0.95)^1
3 comparisons, increase error to 0.14 (14% chance of type 1 error) -> 1-(0.95)^3
14 comparisons, increases error rate to 0.51 (51% chance of a type 1 error) -> 1-(0.95)^14
*hence why conducting may t-tests is a bad idea
what issues does multiple comparisons cause?
- likely ‘find effects’ that are actually type 1 errors
- if you correct the alpha level, you miss real effects (type 2 errors)
what is the solution to issues with multiple comparisons?
ANOVAs -> allowing us to make more than one comparison within one test
what is a one-way ANOVA?
- one independent variable
- can have more than two levels (i.e. 4 age groups/8 nationalities etc.)
- one dependent variable (run a test per dependent variable)
what are the assumptions of a one-way independent ANOVA?
- Normality (K-S test or Shapiro-Walk)
- Homogeneity of Variance (Levene’s Test)
- Data is independent (Between-Subjects Design)
what to do if assumptions are violated?
non-parametric alternative
* but ANOVAs are pretty robust anyways -> if you violate the assumptions, there are non-parametric alternatives
how to structure your data on SPSS?
one row per subject
two columns
* IV i.e. (1-4 levels)
* DV
how do we inspect the data on SPSS?
Graph > Chart Builder
* Drag a simple bar chart into the canvas
* Drag the IV to the x-axis
* Drag the DV to the y-axis
* element properties; add error bars (+/- 1 SE)
How do we run a one-way ANOVA on SPSS?
Analyse > Compare Means > One-Way ANOVA
* drag IV to factor list
* drag DV to dependent list
Click Options
* Descriptive (Mean and SD)
* Homogeneity of Variance Test (Levene’s Statistic) [Brown-Forsyth and Welch -> non-parametric alternatives]
* Mean Plots (basic line graph)
* Exclude Cases by analysis