RMB: PARAMETRIC TESTS WEEK 3 Flashcards
1
Q
What are parametric assumptions/tests?
A
- A parametric assumption assumes things about the data-set
- Assumes data will be normally distributed and that both data sets will have homogeneity of variance
- Usually done on interval or ratio level data > Data measured on a scale, like cm’s measured with a ruler (called ratio-level data), or temperature which has scale but does not have a true 0 (called interval-level data)
2
Q
Parametric vs Non-parametric and why we have them
A
- Parametric and non-parametric tests are distinct from each other
- Important because based on how the data looks (parametric or non-parametric) depends on what test you can do > if the data doesn’t meet parametric assumptions then a non-parametric test is needed
3
Q
Normal distribution
A
- Population distribution should be roughly normal (Gaussian distribution)
- We don’t want the results to be significantly different > want them to belong to a normal distribution
4
Q
Homogeneity of variance
A
- when comparing more than one group, the groups should have equal variance (looks the same/similar) > should not vary too differently from each other
- Homogeneity means same/similar
- We don’t want the results to be significantly different > want them to have similar results
5
Q
Non-parametric tests
A
- don’t make assumptions about the population distribution (distribution free tests) > these tests are lower in power + less flexible than parametric tests > only used when parametric assumptions are NOT met
- Usually used on ordinal and nominal data (but can be used on interval + ratio)
6
Q
Are parametric tests better than non-parametric tests?
A
- Prefer using parametric testing because it is more powerful and robust, limitations are well documented
- If the data is not normally distributed or doesn’t have homogeneity of variance we could transform the data using logs to normalise distributions
- Occasionally we can use “equal variances not assumed” if parametric assumptions are not met but we want to use a parametric test (e.g. independent t-test)
- Parametric tests are powerful but can be abused if assumptions aren’t met
7
Q
Kolmogorov-Smirnov test for normality of data
normal distribution test
A
- Tests to see likelihood of data being distributed normally
- K-S test tests the null hypothesis that the distribution is normal
- if p is less than 0.05 then the data is NOT normally distributed > this is because the p value is significant so the distribution isn’t normal (p has to be non-significant to show a normal distribution)
- if p is over 0.05 then the data is not significant + normally distributed > we want p to be OVER 0.05 (e.g. p = 0.001 is significant so not ND but p = 0.9 is not significant and IS ND)
8
Q
Levene’s test for homogeneity of variance
A
- determines if the data sets are from the same population > tests null hyp that each sample has a similar variance
- If the samples do have homogeneity of variance, then the test will NOT be significant > p has to be over 0.05 to have similar variance (non-significant) > we want p to be over 0.05 to use parametric testing > if p is under 0.05 then there is NOT homogeneity of variance because the result is significant
9
Q
How do we confirm our assumptions?
A
- If the data is not significantly different from the normal distribution and (if appropriate, e.g. doing a test of difference) there is no significant difference between the variance of samples (aka there is homogeneity of variance) then we can perform a parametric test
- To perform a parametric test, we need both tests to not be significant > if one out of 2 doesn’t work, a non-parametric test should be used
- If these assumptions are not fulfilled, we have to do the equivalent non-parametric test
BUT - some parametric tests are very robust + sometimes they will be used even if the tests of parametric assumptions are not fulfilled (standard can vary depending on the test)
10
Q
What is an experiment?
A
- Manipulation of one or more variables. e.g., coffee intake.
- Determine the effect of this manipulation on another variable. e.g., driving when tired.
- To test of cause-effect relationship between variables.
- Test of causality > e.g., does coffee improve your driving when tired?
11
Q
What is a hypothesis?
A
- Science is about testing hypotheses.
- Hypotheses are derived from theories (one theory may generate thousands of H’s)
- A hypothesis is a testable prediction.
12
Q
Experimental/alternative hypothesis.
A
- ‘Learning with background music does lead to lower marks.’
- Treatment leads to an effect.
13
Q
Null hypothesis
A
- ‘Learning with background music does NOT lead to lower marks.’
- Treatment does not lead to an effect.
14
Q
Independent & Dependent variables
A
- The independent variable is what you change and the dependent variable is what you measure
- Manipulating the independent variable changes the value of the dependent variable.
15
Q
Nuisance variable
A
- An additional factor that affects the dependent variable.
- E.g. testing affect of music on marks but nuisance variable could be the environment or time of testing