Categorical analysis 2 Flashcards

Question 1

Q

What should you use if you are comparing two nominal variables?

Answer

A

Chi-square test of association or test of independence

tests if two nominal-scale variables are related to each other

Question 2

Q

What is ‘effect size’?

Answer

A

The outcome of a hypothesis depends on the sample size (larger better)

Question 3

Q

Why is it recommended that an independent measure of effect size be used when reporting a significant statistical effect?

Answer

A

Small treatment effect can be statistically significant if the sample is large enough

Question 4

Q

What does the effect size estimate a metric provide information about?

Answer

A

The size of an effect that is not influenced by factors such as sample size

Measures how ‘big’ the difference between the data and the null hypothesis predictions actually were

Question 5

Q

What does ‘cramer’s v’ do?

Answer

A

Measures effect size in categorical analysis (chi-square)

Question 6

Q

What is the R command for ‘Cramers v’?

Answer

A

associationTest() prints it automatically but can also use cramersV() for it directly

Question 7

Q

How should you roughly interpret Cramer’s V?

Question 8

Q

Why are assumptions necessary in a test?

Answer

A

Necessary to allow inference

If assumptions are wrong though, you can make mistakes

Question 9

Q

Is sampling distribution equal to ‘chi-square’ in chi-square tests?

Answer

A

No, only approximately

Question 10

Q

What assumptions do both chi-square tests (‘goodness of fit’ and ‘association’) make?

Answer

A

‘Large’ expected frequencies

Independence of the data

Question 11

Q

What are ‘large’ expected frequencies an assumption of chi-square tests?

Answer

A

Data only becomes chi-square if we can presume that there are enough observations for the underlying binominal distributions to be ‘normal’

Question 12

Q

What test should you use for comparing nominal variables if frequencies are too small?

Answer

A

Fisher Exact Test

Question 13

Q

What is the Fisher Exact Test?

Answer

A

An analogue of the chi-square test of association

However, it doesn’t require large expected frequencies (works best for small frequencies)

Question 14

Q

What assumptions does the Fisher Exact Test make that the chi-square test of association doesn’t?

Answer

A

It assumes that row and column totals are fixed

(can’t be changed and are the same number)

Question 15

Q

How does the Fisher Exact Test work?

Answer

A

By calculating the exact probability of obtaining a particular contingency table (i.e. cross-tabulation)

The p-value is calculated by summing over all contingency tables that are “more extreme” than the observed one.*
The definition of “more extreme” is tricky, but basically means “more uneven”*

Question 16

Q

What is the main thing to note in the Fisher Exact Test when looking at the results?

Answer

Study These Flashcards

A

p-value

Question 17

Q

What does the second assumption of chi-square tests ‘independence of data’ mean?

Answer

Study These Flashcards

A

Can’t have any ‘special relationship’ among some of your observations

(e.g. same people participating in two of the same experiments)

Question 18

Q

What test should you use to analyse categorical data if the two sets of data are not independent of one another?

Answer

Study These Flashcards

A

McNemar test

Question 19

Q

What is McNemar’s ‘limited solution to a standard problem’?

Describe the problem and his solution.

Answer

Study These Flashcards

A

What do you do when you have multiple observations from each person? (e.g. pre-test and post-test)

Can’t use chi-square because this violates the independence assumption

Solution:

You have a binary outcome measure (e.g. yes or no) and you measure it twice (e.g. pre- and post-)

Question 20

Q

McNemar’s test is testing which two ‘cells’ in the cross-tabulation before answers (yes, no) and after answers (yes, no)?

Answer

Study These Flashcards

A

Question 21

Q

How do you do McNemar’s test in R?

Answer

Study These Flashcards

A

allAds <- xtabs(~after+before, data=ads)

mcnemar.test(x=allAds)

Categorical analysis 2 Flashcards

(21 cards)