Week 6 day 2 Flashcards

1
Q

What is the difference between a chi-squared Goodness of Fit Test and a chi-square test for independence?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are residuals?

A

They are the difference of what was expected.
The difference between observed and expected values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Does a test statistic need to be a single number?

A

yes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

As X-square values increase, are we more likely to reject the null?

A

Yes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How are the degrees of freedom calculated for chi-square tests for independence?

A

df = (R-1)(C-1)

R - number of rows.
C - number of columns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to calculate the p-value for a chi-square in R?

A

1-pchisq(95th quantile x-square value,df=…)

95th quantile X-square: qchisq(0.95,df=…)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Do people ever use Pearson residuals and ‘standard’ or ‘adjusted’ residuals?

A

Yes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What effect size is used often with chi-square testing?

A

Cramer’s V.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does effect size tell us?

A

Tell how big an observed difference is, regardless of p-value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why don’t we just use chi-square stat for an effect size measure?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What function do we use to calculate Cramer’s V?

A

CramerV() from DescTools package.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In the chisqr() function, if we do not specify the p= argument, what does it assume it is?

A

It assumes that the expected frequency is an equal distribution across all variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the assumptions for a chi-square test?

A
  1. “Large” expected frequencies.
  2. Independence of the data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do we determine whether our assumptions for our test are met “enough”?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the FIscher Exact Test, and when do we use it?

A

The Fischer Exact Test is used when there are low expected frequencies (generally less than 5) per cell.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Does R warn us if our expected frequencies are too low for the assumptions required for a chi-square test?

A

Yes.

17
Q

What does the Fischer Exact Test tell us?
Does it give us a test stat?
Does it give us a p-value? If so, what does the p-value represent?

A

The Fischer exact test does not give a test stat, it only gives a p-value. The p-value tells us whether the distribution of data is the same as what would be expected or not.

18
Q

Why is the chi-square distribution the shape that it is?

A
19
Q

When a X-squared stat that is significant, how do we determine WHERE the significant difference within the data is?

A

Using adjusted residuals.

If the adjusted residuals are + or - 1.96 then the significant difference has come from these cells.

20
Q

What are the two assumptions that are required for a chi-squared analysis?

A
  1. “Large” expected frequencies.
  2. Independence of data.
21
Q

What tests can we do if either of the two underlying assumptions for chi-squared analysis are violated?

A
  1. 2.
22
Q

What is the Fischer test and when do we use it?

A

The Fischer Exact test is used for when the expected frequencies are less than 5 (generally) and we want to measure whether a statisically significant relationship exists between two categorical variables, such as whether Flopsy and Shadow have the same distribution of colours in their wardrobes, or whether if women are more likely to vote for Greens than men.

23
Q

What is the McNemar test and when do we use this?

A

The McNemar chi-squared test is used when our data is not independent. This could be when we are measuring someone’s response pre- and post- a treatment as measured by a non-continuos scale.