Term 2 L7: Chi square and Contingency Tables Flashcards

1
Q

Two types of chi square test:

A

for goodness of fit;

of independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What level of data are chi square

A

The tests are often useful for

nominal or ordinal variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Both tests designed to test Hypotehses about:

A

frequencies (counts).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Goodness of Fit - what is it answering?

A

whether the deviation of observed results from expected values is large enough for us to reject the null hypothesis?

Is the difference between observed and expected values statistically significant?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The Chi-Square (Ο‡2) Statistic (goodness-of-fit test)

A

πœ’
2 = βˆ‘(π‘‚βˆ’πΈ)2 / 𝐸

O - observed frequencies;
E - expected frequencies;
Ξ£ - sum (to be taken over all cells in the table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Degrees of freedom for the goodness-of-fit test

A

df = β€˜Number of cells’ – 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The distribution of the Ο‡

2 statistic depends on….

A

the associated degrees of

freedom (df, denoted by k in the graph above)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Interpreting results of chi square goodness of fit

A

called a goodness-of-fit test,
because we are testing whether the distribution of a variable (here: accuracy of guesses) fits an expected distribution (under
the null hypothesis).

In this case, we saw that the data do not fit the expectation. Practitioners of Therapeutic Touch performed worse than their
theory would have led to suppose (and also worse than chance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you report the result of chi square statstic?

A

Ο‡2 (1) = 4.128

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Chi-square test of independence

Analysis of Contingency Tables

A

Contingency tables are frequently used in the analysis of

categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does Chi-square test of independence test?

A

whether the null hypothesis
that the row variable and the column variable are independent
(i.e. that there is no association between the two variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give an example of a hypothesis for a Chi square test of independence

A

Does the table
constitute evidence for an effect of β€œtype of treatment” on the likelihood of suicide attempt, (or could we have obtained the numbers in the table by chance, even if there was no effect in the
population)?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Marginals

A

allow us to work out expected outcomes

and if we have one of the observed values we can caluate all the observed values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Degrees of freedom for Ο‡2 test of independence

A

𝑑𝑓 = (π‘Ÿ βˆ’ 1) Γ— (𝑐 βˆ’ 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What to do with small expected frequencies? for test of independence

SPSS vs Howell what are their minimum values?

A

Small expected frequencies
The chi-square test can be inaccurate if one or more of the expected values are very small.
One convention is to set a minimum value to 5. This convention is overly cautious.

β€’ For a 2Γ—2 table, a sample size of n=10 will ensure reasonable accuracy (see Howell 2013, p. 152). This corresponds to an average expected value of 2.5!

β€’ For larger tables, all cells should have expected values greater
than 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Measures of effect size

is chi sqare a measure of effect size?
why?

A

The chi-square statistic is not a measure of effect size.

Chi-square is sensitive to sample size. That means that you cannot
compare chi-square values computed on different tables, if the sample
sizes of the two tables are different.
β€’ For example, if two samples have the same effect size, but one
sample is twice as large as the other, then the value for chi-square
for the larger sample will be twice as high as that of the smaller
sample.

Also: If the sample is large, the chi-square statistic may be significant
even if the actual effect is very small. This is an issue of power:
β€’ If the sample is large, the chi-square test is very powerful, which
means that it may detect small effects that exist in the population –
including effects that are too small to be meaningful.
β€’ If the sample is small, the test has little power, which means that
even large effects may not lead to statistically significant results.

17
Q

There are several ways to measure the effect size in contingency
tables (4)

A

Absolute risk reduction:

Using MBT, by how many percentage
points do we reduce the risk of a suicide attempt compared to TaU?

β€’ Numbers Needed To Treat:

How many patients do we need to treat
with MBT to avoid one suicide attempt (compared to TaU)?

Relative risk:

How much higher is the risk of a suicide attempt under
TaU compared to Mentalization-based Treatment MBT?

Odds Ratio:

How much higher are the odds of a suicide attempt
under TaU compared to MBT?

18
Q

Interpreting odds ration

A

As before, let’s define: 𝑂𝑅 =
π‘‚π‘‘π‘‘π‘ π‘‡π‘Žπ‘ˆ/ 𝑂𝑑𝑑𝑠𝑀𝐡𝑇

Then the observed odds ratio can be interpreted as follows:
OR = 1 β†’ No effect (no difference between groups; odds
are the same).
OR > 1 β†’ β€œPositive” effect. Higher odds for TaU than MBT.
OR < 1 β†’ β€œNegative” effect. Lower odds for TaU than MBT.

Examples
OR = 2 β†’ The odds of having a suicide attempt are twice as high under TaU as under MBT.
OR = 1.5 β†’ The odds of having a suicide attempt are 50% higher under TaU compared to MBT.

OR = 0.5 β†’ The odds of having a suicide attempt are halved under TaU compared to MBT.

19
Q

CIs for Effect Sizes: Odds Ratios

What is effect size?
why calculate CIs? (SE)

A

Effect sizes computed from a sample are estimates of the β€˜true’ effect size. Like all statistics, effect size estimates are subject to sampling error. It is therefore good practice to
report confidence intervals for effect size estimates

This means that we are 95% confident that the interval
between .94 and 12.97 contains the true odds ratio. This
confidence interval is rather wide, reflecting the uncertainty of
our estimate.

20
Q

CIs is 1 or 0 an important figure?

A

1

We are 95% confident that the interval between
0.94 and 12.97 contains the true odds ratio.

  • Note that the confidence interval contains the value 1.
  • In the case of OR, 1 signifies β€œno effect”.

β€’ This agrees with the non-significant result of the chi-square test on these data (which led us to conclude that we had no evidence to reject the null hypothesis of β€˜no effect’).