W3: Tests of Association Flashcards

1
Q

Another possibility is that response we are interested in is in the form of a count, examples of count variables are (2)

A
  • number of students with graduate jobs
  • number of individuals passing a test
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

One question of interest is if the counts change (Y) based on

A

one or more binary covariates (X)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

We are interested whether there is an association between

A

a binary factor and their response –> not regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Suppose we are interested in association of binary factor and response, summarise in table below:

If there is no association between binary factor (e.g., degree = psych, enginerring) and response (yes/no to having a grad job) then expect

A

proportion of a/g to be similar to the proportion b/h

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Formally test the contigency table of whether proportion are similar or not using

A

Chi-squared tets for independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The null hypothesis for this test is that

A

There is no association between the factor and response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Alternative hypothesis is that

A

There is an association between factor and response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Example of null and alternate hypothesis (2) using this table:

Chi squared

A

HO: No association between region and access to services
H1: Association between region and access to services

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Calculate expected frequencies for each cell of the contigency table assuming no association (H0 is true) using formula:

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Whats column and whats row?

Chi squared

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How to calculate overall total of a table? - for example in this table?

A

10 + 5 + 2 +27 + 25 + 8 = 77

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How to note down E while doing it for each cell?

A

ETL
EBL
EBR
ETR
EMR
T = top
B = Bottom
L = Left
R = Right
M = Middle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

After hypothesis and expected frequencies, we calculate the test statistic
Formula is:

A

We do it for each cell in table for observed (O) then expected frequencies we just calculated and add it together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Chi-squared test formula still holds the same when response and/or factorrs have

A

more than two levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

After calculating x^2 chi-squared statistic, we calculate degrees of freedoom by:

A

R = no. of rows in contigency actual table
C = no. of columns in contigency actual table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Calculate DF for this table

Chi squared

A

r = 3
c = 2
(3 - 1) * (2 - 1) = 2 * 1 = 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

For tables with 2 rows and 2 columns we use the

Chi squared

A

x2/1 disturbition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

The DF calculated corresponds to what p value you look at:
If x2/1 disturbition then look at

Chi squared

A
19
Q

What do we say before calculating expected frequencies?

Chi squared

A

We calculate the expected frequencies

20
Q

What do we say before stating hypotheses

Chi squared

A

We test:
List HO/H1

21
Q

What do we say before calculating test statistic

A

We calculate the test statistic as

22
Q

After calculating test statistic and then calculating DF we say:

Chi squared

A

We compare this x^2 with the x2/1 disturbition

23
Q

What table of critical sig levels do we list to compare x^2 value?

Chi square

A

10%
5%
1%
0.1%

24
Q

If x^2 larger than 0.1% value then

A

p <0.001

25
Q

Example of a conclusion of chi-squared test being significant at 0.1% level

A

We see p < 0.001 and so we reject the H0 at the 0.1% level. We have very strong evidence of an association between personality and degree.

26
Q

List of concluding statements - p-value greater than 10%

A

The p-value is greater than 10% so there is no evidence against the null hypothesis H0 and we do no reject it.

27
Q

List of concluding statements - p-value less than 10%

A

The p-value is less than 10% but greater than 5% so there may be slight evidence against the null hypothesis H0 but we do no reject it.

28
Q

List of concluding statements - p-value less 5%

A

he p-value is less than 5% but greater than 1% so there is moderate evidence against the null hypothesis H0 and we reject it and accept the alternate hypothesis H1.

29
Q

List of concluding statements - p-value less 1%

A

The p-value is less than 1% but greater than 0.1% so there is strong evidence against the null hypothesis H0 and we reject it and accept the alternate hypothesis H1.

30
Q

List of concluding statements - p-value less 0.1% level

A

The p-value is less than 0.1% so there is very strong evidence against the null hypothesis H0 and we reject it and accept the alternate hypothesis H1.

31
Q

Assumptions of Chi-squared test - (2)

A
  • We require independent observations
  • For 2x2 table, all expected frequencies must be larger than 5.
    For a larger table, no more than 20% of all cells may have an expected frequency less than 5 and all expected frequencies must be larger than 1
32
Q

In assumptions of chi-squared , we should always check the

A

second assumption

33
Q

An alternative test to Chi-squared test is

A

Fisher’s exact test

34
Q

Why is Fisher’s exact test a suitable alternative test to chi squared

A

This is based on an exact p-value rather than asymptotic properties
of the test statistic, and so is approriate even when there are small
counts in some cells of the contingency table.

35
Q

When is Fisher’s exact test utilised? - (2)

A

The Chi-squared test is exact asymptotically, which means that it is more
and more accurate as the sample size gets bigger.
This is why, if any of the expected frequencies are less than 5, we should
not use it, as the sample size is not large enough for the test to be
accurate.

36
Q

Chi-squared 2x2 contiegency table in SPSS

A

Interpret same as before - seeing difference in proportion between the rows

37
Q

Chi-squared test statistics, DF and p-value

A

Pearson Chi-squared = 0.024
DF = 1
P -value = 0.877

38
Q

Checking chi-squared test if all expected frequencies are above five in SPSS

A

Little note that 0 cells have expected counts less than 5
So all assumptin are satisfied so use results of Chi-squared test

39
Q

Fisher’s exact test gives same 2x2 contiegency table as well as data on DF and p-value:

A

DF 1 since exact-sig (2-sided)
P-value is 0.534

40
Q

2X2 contigency table in R

A
41
Q

Chi-squared test statistic, DF, p-value in R

A
42
Q

Chi-squared test expected frequencies output

A
43
Q

Fisher’s exact test results in R

A

P value = 1 (not sig)