[L9] Analysis of Variance & [L10] The Kruskal-Wallis Test Flashcards
- Very flexible and general technique, and the principles
can be applied to a wide range of statistical tests.
ANOVA
ANOVA is a ___ test
parametric
Has a wide range of applications.
ANOVA
Many of applications make some tricky assumptions
about the data.
ANOVA
In ANOVA we measure an ___ variable (also called
a ___ variable).
* This outcome must be measured on a ___ scale.
outcome; dependent - continuous
It is called dependent because it depends on one or more
__ variables
predictor
__
_ variables can be Manipulated (Treatment) or
variables we simply measure (Sex).
Predictor
In ANOVA, predictor variables are mostly ___,
although continuous variables can also be used in the
same framework
categorical
When predictor variables are categorical, they are also
called “__“_
FACTORS or INDEPENDENT VARIABLES.
___ – measurement of differences
ANOVA
Differences happen for two reasons: ___
(a) because of the
effect of predictor variables (b) because of other reasons
In ANOVA, we want to know two things:
___
- How much of the variance (difference) between the
two groups is due to the predictor variable
- How much of the variance (difference) between the
- Whether this proportion of variance is statistically
significant, that is, it is larger than we would expect by
chance if the null hypothesis were true?
- Whether this proportion of variance is statistically
We can divide (statisticians sometimes say partition)
variance into three different types:
___
- The Total Variance
- Variance due to treatment, (Differences between Group)
- Variance due to Error (Differences within Group)
In ANOVA, the variance is conceptualized as sums of
_
__
squared deviations from the mean
In ANOVA, the variance is conceptualized as sums of
squared deviations from the mean.
* It is usually shortened to___ and denoted by
__
sum of squares; SS.
The 3 Sum of Squares
- Total Sum of Squares, SS total
- Between-groups Sum of Squares, SS between
- Error Sum of Squares, SS within
___– this is the variance
that represents the difference between the groups, and this
is called _
_
. Sometimes it refers to the betweengroups
sum of squares for one predictor, in which case it
is called SS predictor. Sometimes it is called___.
Between-groups Sum of Squares; SSbetween; SStreatment
The ___-groups variance is the variance that we are
actually interested in.
between
We are asking whether the difference between the groups
(or the effect of the predictor) is big enough that we could
say it is ___
not due to chance
_
_
_– also called within-groups sum
of squares.
Error Sum of Squares
It’s within the groups, because different people, who
have had the same treatment, have different scores.
Error Sum of Squares
They have different scores because of error. So this is
called either ___
SSwithin, or SSerror.
We need to calculate the three kinds of Sum of Squares,
___
TOTAL, WITHIN GROUPS, and BETWEEN
GROUPS.
_
_
_sum of squared differences between the mean
and each score.
SStotal –
___
* To know how large the effect of the treatment has been
Calculating the Effect Size:
The same as asking what
___ the treatment effect
has been responsible for.
proportion of the Total
Variance (or Total Sum of Squares)
Effect Size goes under two different names: these are ___.
RSquared
or eta-Squared
Mean Squares. Often written as MS.
* These are ___
MSbetween, MSwithin, MStotal
Three sets of degrees of freedom
- df total, df between, and df within
Finally we calculate the relative size of the two values,
by dividing MS between by MS within.
* This gives us the statistic for ANOVA, which is called F,
or sometimes the
__
__
F-ratio.
To find the probability value associated with F we need
to have two sets of degrees of freedom, the ___
between and
within.
__
_are exactly the same test.
* It is just a different way of thinking about the result (when we have two groups).
ANOVA and t-test
In fact, if we take the ___and square it. We get the
value of F.
value of t
This is a general rule when there are 2 groups:
___
F = t-squared.
Question: If we covered t-tests, why are we doing it
again?
_
- t-test – restricted to comparing 2 groups.
- ANOVA extends in a number of useful directions.
___ extends in a number of useful directions.
* Can be used to compare 3 groups or more, to calculate
the p-value associated with the Regression Line, and ina
wide range of ___ situations
ANOVA, other
When there are 2 groups, ANOVA is equivalent to a
___ and it therefore makes the same assumptions as
the t-test.
t-test,
same assumptions as
the t-test. and it makes these assumptions regardless of the number
of
_
__ that are being compared
groups
Assumptions in ANOVA
- Normal distribution within each group
- Homogeneity of Variance
We do not assume that the outcome variable is normally
distributed.
* What we do assume is that __ each group are
normally distributed.
data within; Normal distribution within each group
_
* As with the t-test, we assume that the standard deviation
within each group is approximately equal.
Homogeneity of Variance
the variance being the square of the
.
SD
- However, as with the t-test, we don’t need to worry
about this assumption, if we have approximately equal
__
_ in each group.
numbers of people
ANOVA comparing Three Groups
- Formulae are all the same.
__most elementary analysis of
variance
One way ANOVA –
One way ANOVA – Also called as __
simple-randomized groups design,
independent groups design, or the single factor
experiment, independent groups design.
_
__that is being investigated, there
are two or more levels or conditions of the IV, and
subjects are randomly assigned to each condition.
Only one IV (one factor); one-way anova
ANOVA – not limited to ___experiments.
single factor