Statistics And Data Analysis 21061 Flashcards
What is the difference between categorical level data and continuous data
Week 1
Catergorical data is nominal only (numbers, names gender only) whereas as continious data can be put on a continious scale
What two descriptive statistics do we typically use
Week 1
Central tendency & spread
What is the difference between how independent variables and dependent variables are measured
Week 1
The IV is ALWAYS measured on a categorical scale
The DV is IDEALLY measured on a discrete/continious scale
What is the benefit of measuring the DV on a continious scale
Week 1
So that we can use parametric statistics
What is the difference between a true-experimental vs a quasi-experimental design
Week 1
We actively manipulate the IVs in a true experimental design whereas the IVs in a quasi experimental design reflect fixed characteristics
Is handedness a quasi or true experimental IV
Week 1
Quasi - it is a fixed characteristic
What are the 3 main types of subject design
Week 1
Between subjects, within subjects, mixed design
What is a (2^ 3) mixed design
Week 1
Has two IVs, one between, one within.
Between IV has two levels, within IV has 3 levels
(e.g males and females preferences to horror, action and romance movies)
What does normally distributed data allow us to do
Week 1
Use parametric stats
What are the properties of normally distributed data
Week 1
Symmetrical about the mean
Bell shaped - mesokurtic
What is Platykurtic data
Week 1
Data which has more variations/spread than normally distributed data
(-ve kurtosis value)
What is leptokurtic data
Week 1
Data which has less variations/spread than normally disributed data (+ve kurtosis value)
What type of skew does normal data have
Week 1
normally distributed data has no skew
What is sampling error
Week 1
degree to which sample statistics differ from underlying population parameters
What are Z scores
Week 1
converted scores from normally distributed populations
What is sampling distribution
Week 1
Distribution of a stat across an infinite number of samples
What is the sampling distribution of the mean
Week 1
Distribution of all possible sample means.
What are standard error (SE) and estimated standard error (ESE)
Week 1
Standard deviation of sampling distribution
ESE is simply an estimate of the standard error based on our sample
What do we use sample statistics for
Week 1
to estimate the population parameters
What is a T-test
Week 2
Inferential statistic when we have 1 IV and 2 DVs that estimates whether population means under 2 IV levels are different
What contributes to variance between IV levels in an independent t-test
Week 2
- manipulation of IV (treatment effects)
- individual differences
- experimental error
* random error
* constant error
what contributes to variance within IV levels in an independent t-test
week 2
individual differences
random experimental error
What would happen if we continued to determine the mean of the difference for infinite samples
Week 2
it would essentially be like calculating the population mean difference
What is the null hypothesis when talking about sampling distribution of differences
Week 2
the sampling distribution of differences will have a mean of 0 as there is no difference between the sample means of 2 different samples
Why do we use estimated standard error instead of standard deviation in T-distribution
Week 2
Because it is a sampling distribution, instead of s.d we use s.e. This is because standard error is used to express the extent an individual sample mean difference deviates from 0
As we do not have all of the possible samples to calculate the standard error, we estimate the standard error , hence why we use e.s.e
What is the equation for t in an independent design
Week 2
Xd/ESEd
AKA
Mean of the difference / estimated standard error of the difference
AKA
variance between IV levels/variance within IV levels
What does the distance to 0 of the t value indicate?
Week 2
If t value is closer to 0, smaller variance between IV levels relative to within
If t value is further from 0 , large variance between IV levels relative to within IV levels
What does it mean if the null hypothesis is true for t-dist
Think CI
week 2
If the null hypothesis is true - 95% of sampled t-values will fall within the 95% bounds of the t-dist
If the null hypothesis is true, only 5% of sampled t-values will fall outside the 95% bounds
What are degrees of freedom and how are they calculated
Week 2
the differences between the number of measurements (sample size) made & number of parameters estimated (usually one, mean)
(Sample size - # of parameters)
N-2 for independent t-test
n-1 for paired t-test
What happens to the degrees of freedom value the larger they get
Week 2
They tend to 1.96, the original value
What are some of the assumptions we make for an independent t-test
Week 2
- Normality: the DV should be normally distributed under each level of the IV
- Homogeneity of variance: The variance in the DV, under each level of the IV should be reasonably equivalent
-
Equivalent sample size: sample size under each level of IV should be roughly equal ( matters more with smaller samples)
* Independence of observations: scores under each level of the IV should be independent
What test do we use when the asumptions for the independent t-test are violated
Week 2
we use the non-parametric equvalent: Mann-Whitney U test
What is Levenes test
Week 2
A test for equality of variance –> homogeneity of variances
what does levenes test tell us and what does it not tell us
Week 2
Tells us: Whether theres a diff in variances under the IV levels
doesn’t tell us:if our means are different or IV manipulation
What is the null hypothesis of levenes test
Week 2
no diff between the variance under each level of the IV (i.e homogeneity in variance)
If we reject Levene’s test, what does this mean
Week 2
There is heterogeneity in variance - the way in which the data varies under both IVs is different
What assumptions do we want when it comes to variance between IV levels?
Week 2
equal variance and homogeneity
What contributes to variance between IV levels in a paired t test
Week 2
- Manipulation of IV (treatment effects)
- Experimental error
what contributes to variance within IV levels in a paired t test
Experimental error
(RM designs - can discount the variance due to individual differences (leaving only variance due to error))
What assumptions do we make during a paired t-test
-
Normality - distribution of difference scores between the IV levels should be approximately normal
* Assume ok if n> 30 - Sample size - sample size under each IV level should be roughly equal
What do we do when our assumptions are violated during a paired t-test
Week 2
We use the non-parametric equivalent - Wilcoxon test
How do we interpret 95% Confidence intervals for repeated measure designs
Week 2
we can’t determine if result is likely to be significant by looking at 95% CI plot therefore we need to look at the influence of the IV in terms of size & consistency of effect
For a repeated measures design, what would happen if the confidence intervals cross 0 (lower value is negative and higher value is positive)
Week 2
you cannot reject the null hypothesis as you cannot conclude that the true population mean difference is different from 0
What is Cohen’s D
Week 2
The magnitude of difference between two IV level means, expressed in s.d units
I.e - a standardised value expressing the diff between the IV level means
What are the values for effect size of Cohen’s d
week 2
Effect size d
Small 0.2
Medium 0.5
Large 0.8
How does cohen’s d differ from T? Define both.
week 2
D = magnitude of difference between two IV level means, expressed in s.d units
T = magnitude of diff between two IV level means, expressed in ESE units
T takes sample size into account - qualifies the size of the effect in the context of the sample size .
When do we use a One way anova
Week 3
When we have 1 IV with more than 2 levels
What does a one way anova do?
Week 3
Estimate whether the population means under the diff the levels of the IV are different
What is an ANOVA like (think of t-tests)
Week 3
an extension of the t-test –> if you conducted a one-way anova on an IV w/ 2 levels, you’d obtain the same result (F = t^2)
Why do we use ANOVA instead of running multiple t-tests
Week 3
the more we draw from a population, the more likely we are to encounter a type I error and reject the null hyothesis, even if it true
What is the familywise error rate and what does ammedning it provide
Week 3
Probability that at least one of a ‘family’ of comparisons run on the same data, will result in a type I error
Provides a corrected significance level (a) reducing the probability of making a type I error
How do calculate the familywise error rate ?
Week 3
a’ = 1 - (1- a)^c
where c is the number of comparisons
e.g for 3 IV levels (3 comparisons) (ab ac bc)
1 - (1 - 0.05) ^3 = .143 = 14% chance of type I error
for 4 IV levels (6 comparisons (ab ac ad bc bd cd) )
1 - (1 - 0.05)^6 = .264 = 26% chance of type 1 error
Why do we use omnibus tests?
Week 3
To control familywise error rate
What is the null hypothesis of the F ratio/ANOVA?
Week 3
there is no difference between populations means under different levels of IV
H0:u1=u2=u3
what is the ratio for the F value.
Week 3
Variance between IV levels/ Variance within IV levels
What does the closeness of the F value to 0 indicate
Week 3
F value close to 0 = small variance between IV levels relative to within IV levels
F Value further from 0 = large variance between IV levels relative to within IV levels
What assumptions do we make for an independent one way ANOVA
Week 3
Same as those for independent T-test
Normality: DV should be normally distributed, under each level of the IV
Homogeneity of variance : Variance in the DV, under each level of the IV, should be (reasonably) equivalent
Equivalent sample size : sample size under each level of the IV should be roughly equal
Independence of observations : scores under each level of the IV should be independent
What do we do when the assumptions of the independent one-way anova aren’t met?
Week 3
We use the non-parametric equivalent, the Kruskal Wallis test
1.
What is the model sum of squares?
Equation
Week 3
Model Sum of Squares (SSM): sum of squared differences between IV level means and grand mean (i.e. between IV level variance)
What is the residual sum of squares?
Week 3
Residual Sum of Squares (SSR): sum of squared differences between individual values and corresponding IV level mean (i.e. within IV level variance)
What is SSt and how is it calculated
Week 3
Sum of squares total
= SSm( Sum of squares model ) + SSr (Sum of squares residual)
What is the mean square value and how is it calculated? What are the two types?
Week 3
MS = SS/df (Sum of squares/ degrees of freedom)
MSm = model Mean square value
MSr = residual mean square value
What do we use mean square values for?
Week 3
To calculate the F statistic
How do we calculate the F statistic
mean square values
Week 3
MSm/MSr
aka
model mean square value / residual mean square value
What do we do when the assumption of homogeneity is violated in an independent 1-way ANOVA
Week 3
We report Welch’s F instead of ANOVA F
What happens to the degrees of freedom when we use Welch’s F?
Week 3
The degrees of freedom are adjusted (to make the test more conservative)
How is the ANOVA F value reported
Week 3
F(dfm,dfr)=F-value, p =p-value
How do we calculate degrees of freedom for an independent 1 way ANOVA
Week 3
find the difference between the number of measurements and the number of parameters estimated
i.e. no. of measurements – no. parameters estimated
How do we calculate df for between IV level (model) variance where N is total sample size and k is number of IV levels
Week 3
K-1
How do we calculate df for within IV level (residual) variance where N is total sample size and k is number of IV levels
Week 3
N-k
What are post hoc tests
Week 3
Secondary analyses used to assess which IV level mean pairs differ
When do we use post-hoc tests
Week 3
only when the F-value is significant
How do we run post-hoc tests?
Week 3
As t-tests, but we include correction for multiple comparisons
what are the 3 type of post-hoc test
Week 3
- Bonferroni
- least significant difference (LSD)
- Tukey honestly significant difference (HSD)
Which post hoc test has a very low Type I error risk, very high type II error risk and is classified as ‘very conservative’
week 3
Bonferroni
Which post-hoc test has a high type I error risk, a low type II error risk and is classified as ‘liberal’
Least significant difference (LSD)
Which post-hoc test has a low type I error risk , a high type II error risk and is classified as ‘reasonably conservative’
week 3
Tukey Honestly significant difference (HSD)
What are the three levels of effect size for partial eta^2 for ANOVA
week 3
> 0.01 is small
0.06 is medium
0.14 is large
what is effect size measured in for ANOVA
calculated in 2 ways, cohens d and partial eta squared
How do you calculate partial eta squared
week 3
Model sum of squares/ (model sum of squares + residual sum of squares)
In a repeated measures design for a one way ANOVA, what contributes to variance between IV levels
Week 4
- Manipulation of IV (treatment effects)
- Experimental error (random & potentially constant error
In a repeated measures design for one way ANOVA, what contributes to variance within IV levels
Week 4
Experimental error (random error)
**
how do we calculate total variance?
Week 4
Model variance(variance between IV levels)/ residual variance (variance within IV levels) -
Individual differences (in independent designs)
what is the t/F ratio and how do we calculate it?
Week 4
variance between IV levels/ variance within IV levels (excluding variance due to individual diffs WHEN IN RM design)
how is the F ratio calcated in terms of Mean square values
Week 4
Mean sum of squares model/ mean sum of squares residual
What are the 3 assumptions made in a repeated measures 1-way ANOVA
Week 4
- Normality - distribution of difference scores under each IV level pair should be normally distributed
- Sphericity (homogeneity of covariance) - the variance in difference scores under each IV level pair should be reasonably equivalent
- Unique to RM 1-way anova
- Equivalent sample size: sample size under each level of the IV should be roughly the same
What corrects for the sphericity assumption.
Week 4
Greenhouse-geisser