Statistics 5 Flashcards

1
Q

What is the most common type of data analysis in geography?

A

Non-parametric data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a benefit and a problem with using non-parametric data analysis?

A

Problem: only simple arithmetic (non-normal)
Benefit: less assumptions made which makes it easier to work with

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are inferential statistics?

A

Tests of difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does a one-way chi-squared test do? What is the parametric equivalent?

A

Test for the difference between sample and population for nominal and ordinal data. Parametric equivalent is a one-sample t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are we interested in from what SPSS produces for a one-way chi-squared test?

A

The value produced (not that interesting) and the p-value (very interesting) because this is the significance value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do you interpret the p value SPSS produces for a one-way chi-squared test?

A

Normally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a two-way chi-squared test do? What is the parametric equivalent?

A

Tests for difference between two samples for nominal and ordinal data. Parametric equivalent is the a two-sample t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is cross tabulation and what does it enable us to do?

A

Cross tabulation enables us to compare two different responses in a survey

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an example of cross-tabulation?

A

If one question on a survey asks for Gender then as well as this being data collected about the people we can also use it as a lens from which to analyse responses to other questions such as alcohol consumption. We can then see how the alcohol consumption differs along gender lines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do we interpret the p-value that SPSS produces for a two-way chi-squared test?

A

normally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When is a two-way chi-squared test invalid?

A

If more than 20% of the expected values for each result/type of answer in a survey have a count less than 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give an example of when a two-way chi-squared test is invalid

A

If when looking at opinions on gay marriage where the responses that people can provide are between strongly disagree, disagree, neutral, agree, strongly agree (5 values for each of the 5 categories of response) then if the expected result for 1 (20%) of these categories is below 5 (i.e. E(X) is 4) then the test is invalid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What do statisticians commonly do to solve the invalid two-way chi-squared problem?

A

they combine categories - i.e. if you combine the expected values for two categories then you automatically dramatically increase the chances that the new category that is created as a result of their combination will have an E(X) >5.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What have you got to be careful of when solving the invalid two-way chi-squared problem?

A

You do not simply combine categories that are opposites, i.e. you want to combine ones that are very similar for example “agree” and “strongly agree”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a Mann-Whitney U test?

A

This test compares the mean ranks of equal/unequal sample sizes for two samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explain how a mann-whitney U test works?

A

It ranks all the responses in their order and determines for each sample what rank the calculated mean value would be within this order. It then compares the ranks of these means for both samples against each other to determine the difference between them

17
Q

What is the Mann-Whitney U test similar to and how is it different?

A

Two-way chi-squared test for ordinal and ratio and interval data that are not normally distributed.

18
Q

Why is nominal data not able to be tested through the Mann-Whitney U test?

A

Because it is not as measurable

19
Q

How do you interpret the p-value that SPSS produces for the Mann-Whitney U test?

A

normally

20
Q

What is a Kruskal-Wallis test and what is the parametric equivalent?

A

Tests for difference between 3 or more samples. Parametric equivalent is ANOVA

21
Q

Explain how the Kruskal-Wallis test works?

A

Identical process to Mann-Whitney U test - It ranks all the responses in their order and determines for each sample what rank the calculated mean value would be within this order. It then compares the ranks of these means for each sample with the other samples to determine the difference between them

22
Q

How do we interpret the p-value that SPSS produces for a Kruskal Wallis Test?

A

normally

23
Q

If we are comparing a lot of samples in a Kruskal Wallis test then what might me want to do to determine the specific relationship between two of them?

A

Isolate the two samples and carry out a Mann-Whitney U test not a two-way chi-squared

24
Q

What data type are relational statistics not suitable for? What data types are they suitable for?

A

Nominal not suitable. Ordinal and Ratio and Interval that are not normally distributed

25
Q

What are relational statistics?

A

Test to determine correlation

26
Q

When carrying out relational statistics what is it typically best to represent ordinal statistics using?

A

Table

27
Q

How do we interpret the table that SPSS produces for Spearman’s rank?

A

Cover 1/2 of the table along a line form the top left to the bottom right as the values are the same. Each row has three sub-rows which are correlation coefficient, significance value and N value

28
Q

How do we interpret the significance for a spearman’s rank test in SPSS?

A

Look at the number of asterisk that are superscript on the correlation coefficient value

29
Q

What does two asterisk mean?

A

99% confident that it did not occur due to chance i.e. 99% statistical significance threshold.