Statistics 5 Flashcards
What is the most common type of data analysis in geography?
Non-parametric data
What is a benefit and a problem with using non-parametric data analysis?
Problem: only simple arithmetic (non-normal)
Benefit: less assumptions made which makes it easier to work with
What are inferential statistics?
Tests of difference
What does a one-way chi-squared test do? What is the parametric equivalent?
Test for the difference between sample and population for nominal and ordinal data. Parametric equivalent is a one-sample t-test
What are we interested in from what SPSS produces for a one-way chi-squared test?
The value produced (not that interesting) and the p-value (very interesting) because this is the significance value
How do you interpret the p value SPSS produces for a one-way chi-squared test?
Normally
What does a two-way chi-squared test do? What is the parametric equivalent?
Tests for difference between two samples for nominal and ordinal data. Parametric equivalent is the a two-sample t-test
What is cross tabulation and what does it enable us to do?
Cross tabulation enables us to compare two different responses in a survey
What is an example of cross-tabulation?
If one question on a survey asks for Gender then as well as this being data collected about the people we can also use it as a lens from which to analyse responses to other questions such as alcohol consumption. We can then see how the alcohol consumption differs along gender lines.
How do we interpret the p-value that SPSS produces for a two-way chi-squared test?
normally
When is a two-way chi-squared test invalid?
If more than 20% of the expected values for each result/type of answer in a survey have a count less than 5
Give an example of when a two-way chi-squared test is invalid
If when looking at opinions on gay marriage where the responses that people can provide are between strongly disagree, disagree, neutral, agree, strongly agree (5 values for each of the 5 categories of response) then if the expected result for 1 (20%) of these categories is below 5 (i.e. E(X) is 4) then the test is invalid
What do statisticians commonly do to solve the invalid two-way chi-squared problem?
they combine categories - i.e. if you combine the expected values for two categories then you automatically dramatically increase the chances that the new category that is created as a result of their combination will have an E(X) >5.
What have you got to be careful of when solving the invalid two-way chi-squared problem?
You do not simply combine categories that are opposites, i.e. you want to combine ones that are very similar for example “agree” and “strongly agree”
What is a Mann-Whitney U test?
This test compares the mean ranks of equal/unequal sample sizes for two samples