Lecture 8: Non-parametric tests Flashcards

1
Q

What does the chi-square tests?

A

whether there is a relationship between two categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What leads to chi-squared test?

A

Q: What sort of measurement? A: Categorical (in this case counts or frequencies)
Q:How many predictor variables? A: One
Q: What type of predictor variable? A: Categorical
Q: How many levels of the categorical predictor? A: Not relevant
Q: Same or Different participants for each predictor level? A: Different
This leads us to and Chi-square test for independence of groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

IV and DV in chi-squared tests - (2)

A

One categorical DV (because of frequencies)

with one categorical IV with different participants at each predictor level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In chi-square since we are using categorical variables we can not use

A

mean or any similar statistic hence cannot use any parametric tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In chi-square test when measuring categorical variables we are interested in

A

frequencies (number of items that fall into combination of categories)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does chi-square compare?

A

observed frequencies from the data with frequencies which would be expected if there was no relationship between the two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In chi-square test, participants is allocated to one and only one category such as - (3)

A

pass or fail,

pregnant or not pregnant,

win, draw or lose

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is assumptions of chi-square test? - (3)

A

Data values that are a simple random sample from the population of interest.

Two categorical or nominal variables. Don’t use the independence test with continuous variables that define the category combinations. However, the counts for the combinations of the two categorical variables will be continuous.

For each combination of the levels of the two variables, we need at least five expected values. When we have fewer than five for any one combination, the test results are not reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Since each participant is allocated to one category in chi-squared test each individual therefore

A

contributes to the frequency or count with which a category occurs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

In chi-squared categorical outcomes, the null hypothesis is set

A

up on the basis of expected frequencies, four all four variable combinations, based on the idea that the variable of interest has no effect on frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Example of scenario using chi-square

A

We have a list of movie genres; this is our first variable. Our second variable is whether or not the patrons of those genres bought snacks at the theatre. Our idea (or, in statistical terms, our null hypothesis) is that the type of movie and whether or not people bought snacks are unrelated. The owner of the movie theatre wants to estimate how many snacks to buy. If movie type and snack purchases are unrelated, estimating will be simpler than if the movie types impact snack sales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

We have a list of movie genres; this is our first variable. Our second variable is whether or not the patrons of those genres bought snacks at the theatre. Our idea (or, in statistical terms, our null hypothesis) is that the type of movie and whether or not people bought snacks are unrelated. The owner of the movie theatre wants to estimate how many snacks to buy. If movie type and snack purchases are unrelated, estimating will be simpler than if the movie types impact snack sales.

Is the Chi-square test of independence an appropriate method to evaluate the relationship between movie type and snack purchases? - (3)

A

We have a simple random sample of 600 people who saw a movie at our theatre. We meet this requirement.

Our variables are the movie type and whether or not snacks were purchased. Both variables are categorical.

But last requirement is for more than five expected values for each combination of the two variables. To confirm this, we need to know the total counts for each type of movie and the total counts for whether snacks were bought or not. = check later

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

We have a list of movie genres; this is our first variable. Our second variable is whether or not the patrons of those genres bought snacks at the theatre. Our idea (or, in statistical terms, our null hypothesis) is that the type of movie and whether or not people bought snacks are unrelated. The owner of the movie theatre wants to estimate how many snacks to buy. If movie type and snack purchases are unrelated, estimating will be simpler than if the movie types impact snack sales.

Diagram of contigency table in Chi-square and calculating row totals and colum and grand total - (7)

A

50 + 125 + 90 +45 = 310

75 + 175 + 30 + 10 = 290

50 + 75 = 125

125 + 175 = 300

90 + 30 = 120

45 + 10 = 55

310 + 290 = 600

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

We have a list of movie genres; this is our first variable. Our second variable is whether or not the patrons of those genres bought snacks at the theatre. Our idea (or, in statistical terms, our null hypothesis) is that the type of movie and whether or not people bought snacks are unrelated. The owner of the movie theatre wants to estimate how many snacks to buy. If movie type and snack purchases are unrelated, estimating will be simpler than if the movie types impact snack sales.

Diagram of contigency table in Chi-square of calculating eexpected counts

A

e.g., for action and snacks it would be column total (310) * row total (125) divided by grand total of 600 = 65

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How to calculate chi-square test statistic? - (4)

A
  1. Calculate the difference from actual and expected for each Movie-Snacks combination.
  2. square that difference.
  3. Divide by the expected value for the combination.
  4. We add up these values for each Movie-Snacks combination. This gives us our test statistic.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Example of calculating chi-square from table

A

For this it would be 65.03

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How to understand your test statistic from chi-squared? - (5) if you have test statistic of 65.03

A
  1. Set your significance level = .05
  2. Calculate the test statistic -> 65.03
  3. Find your critical value from chi-squared distribution table based on df & significance level
  4. Degrees of freedom: df (r – 1) x (C-1)
    For the movie example this is; Df = (4-1) x (2-1) = 3 -> 7.815
  5. compare test statistic with critical level
    65.03 > 7.82 so reject the idea that movie type and snack purchases are independent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Example of research question and hypothesis and sig level of chi-square test of independence- (4)

A

Research question:
Does the area of psychology that a person prefers depend on whether they would select a cat or a dog as a pet?

Hypotheses:
H0: The area of interest in psychology and type of pet preferred are independent of each other.
H1: The area of interest in psychology and type of pet preferred are not independent of each other. That is the primary area of interest in psychology depends on whether you prefer a cat or a dog.
Significance level: α = .05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Does the area of psychology that a person prefers depend on whether they would select a cat or a dog as a pet? - chi-square test of independence

Chi-square example we need to check the assumptions below - (2)

A

Independence
Each item or entity contributes to only one cell of the contingency table.

The expected frequencies should be greater than 5.
In larger contingency tables up to 20% of expected frequencies can be below 5, but there is a loss of statistical power.
Even in larger contingency tables no expected frequencies should be below 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Does the area of psychology that a person prefers depend on whether they would select a cat or a dog as a pet? - chi-square test of independence

Chi-square example we need to check the assumptions of The expected frequencies should be greater than 5.

What does it show? - (4)

A

Here we see that all the expected counts in the cat group and one expected count in the dog group are below 5.

We also have one in the cat group that is below 1.

So, SPSS has flagged that we have 60% of the expected counts falling below 5.

So assmption of expected frequencies greater than 5 is not assumed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

If chi-square assumption that The expected frequencies should be greater than 5 is not satisfied then do - chi-square test of independence

A

We should use Fisher’s Exact Test which can correct for this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Does the area of psychology that a person prefers depend on whether they would select a cat or a dog as a pet? - - chi-square test of independence

If assumptions were met (expected frequencies greater than 5) then.. report - (2)

A

A chi-square independence test was performed to examine whether there was a relationship between their area of studies in psychology and their preference for cats or dogs.

The relationship between these variables was not significant, χ²(4, N = 46) = 1.46, p = .834, so we fail to reject H0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Are directional hypotheses possible with chi-square?

A.Yes, but only when you have a 2 × 2 design.
B.Yes, but only when there are 12 or more degrees of freedom.
C.Directional hypotheses are never possible with the chi-squared test.
D.Yes, but only when your sample is greater than 200.

A

A = only when you have 2 variables to compare and can’t do non-directional in chi-square have to use loglinear or goodness of fit tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Example situations you can do chi-square directional and not possible - (5)

A

If we are just comparing pet preferences between males and females, we can make a directional hypothesis (2 x 2 – male/female, cats/dogs).

Males prefer cats or females prefer dogs.

However, when we start adding variables to the design it gets complicated.

If we wanted to compare drink preferences at different times of the day for students/lecturers, we couldn’t form a directional hypothesis.

This is because we have 3 main effects and several interactions to consider. We need to use loglinear analyses to do this.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Table scenario in which cats can be trained to dance more effectively with food or affection at reward - chi-squared test

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Table scenario in which cats can be trained to dance more effectively with food or affection at reward - chi-squared test

what are the four categories? - (4)

A
  • could they dance - yes
  • could they dance - no
  • food as reward
  • affection as reward
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Table scenario in which cats can be trained to dance more effectively with food or affection at reward - chi-squared test

highlight the frequencies for four categories

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Table scenario in which cats can be trained to dance more effectively with food or affection at reward - chi-squared test

what do the rows give?

A

Row totals give frequencies of dancing and non-dancing cats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Table scenario in which cats can be trained to dance more effectively with food or affection at reward - chi-squared test

what do the columns give? - (2)

A

The column totals give frequencies of food and affection as reward

These are the numbers in each group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Loglinear analysis is a …. of chi-square

A

extension

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Chi-square only analyses two variables at a time, whilst log-linear models

A

can determine complex interactions in multidimensional contingency tables with more than two categorical variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Loglinear is appropriate when

A

there’s no clear distinction between response and explanatory variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

think of

A

Think of chi-square like t-tests (2 groups) and log-linear like ANOVA (more than 2 groups).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Example of RQ, hypothesis and sig level of loglinear - (3)

A

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Hypotheses:
H0: Treatment, type of animal and improvements are independent of each other.
H1: Treatment, type of animal and improvements are associated with each other.

Significance level: α = .05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

Assumptions of log linear - (2)

A

Independence
Expected counts > 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Checking assumption of expected counts - (3):

A

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical.

We look and see that all of the expected counts are above 5.

So met assumption of independence and expected counts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical.

In loglinear model selection it begins with - (2)

A

all terms present (all main effects and all possible interactions

main effects: Animal, Treatment and Improvement

interactions: Animal * Treatment, Animal * Improvement, Treatment * Improvement and Treatment* Animal* Improvement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical.

In loglinear model selection after including all main effects and interactions then it - (4)

A

Remove a term and compares the new model with the one in which the term was present.

Starts with the highest-order interaction (including max number of variables/categories)

Uses the likelihood ratio to ‘compare’ models below:

If the new model is no worse than the old, then the term is removed and the next highest-order interactions are examined, and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Model selection of loglinear - what does it show?

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical. - (3)

A

We can see that the model selection worked in a way that it first tried to remove the 3-way interaction.

However, we can see here that it * affected the fit of the model, so it was left in.

Since removing the highest-order interaction made a * difference to the fit of the model, we get a final model that is the saturated model (it contains all main effects and interactions).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Loglinear SPSS K way and Higher order effects what does it show?

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical. - (2)

A

we are using the likelihood ratio here because that’s how we compare the models to find the best fit

. We see that all main effects and interactions are significantly contributing to explaining the variance in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

loglinear what does K represent and what does K = 1,2 and 3 represent? - (4)

A

K represents the level of the terms.

For example, K=1 would be the main effects,

K=2 would be our 2-way interactions and

K=3 is our 3-way interaction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

Loglinear SPSS - what does parameter estimates show?

Research question: Is the new treatment associated with improvements in health in cats and dogs?

Here we have 3 things we are comparing: animal (cat/dog), treatment (yes/no) and improvement (yes/no) all of which are categorical. - (3)

A

There is a significant three-way interaction between animal, treatment and improvement, as well as two significant two-way interactions between animal and improvement and treatment and improvement (p < .001)

a * 3-way interaction between animal, treatment and improvement as well as two * 2-way interaction between animal/improvement and treatment/improvement.

Like our post-hoc tests, this is telling us where the * differences are.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

Loglinear after seeing statistical tests we go to raw data showing that…

A

Based on the raw data, there seems to be indication that the cats responded better to treatment than dogs, this should be followed up by chi-square tests separately for cats and dogs to determine whether the association between treatment and improvement is present in both cats and dogs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

When conducting a loglinear analysis, if our model is a good fit of the data then the goodness-of-fit statistic for the final model should be:

A. Significant (p should be smaller than .05)

B. Non-significant (p should be bigger than .05)

C. Less than 5 but greater than 1

D. Greater than 5

A

B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

The goodness of fit tests in log linear tests

A

hypothesis that frequencies predicted by model (expected frequencies) are sig different from actual frequencies in data (obsevered)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

A significant goodness of fit result mean

A

our model was significantly different from our data (i.e., the model is a bad fit to the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

A recent story in the media has claimed that women who eat breakfast every day are more likely to have boy babies than girl babies. Imagine you conducted a study to investigate this in women from two different age groups (18–30 and 31–43 years).

Looking at the output tables below, which of the following sentences best describes the results? = chi-square

A. Women who ate breakfast were significantly more likely to give birth to baby boys than girls.
B. There was a significant two-way interaction between eating breakfast and age group of the mother.
C. Whether or not a woman eats breakfast significantly affects the gender of her baby at any age.
D. The model is a poor fit of the data.

A

C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Chi square and log linear are both

A

non-parametric methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
49
Q

Non-parametric tests used when

A

When data violate the assumptions of parametric tests we can sometimes find a nonparametric equivalent
eg. normality of distribution

50
Q

Non-parametric tests work on the principle of

A

randomization or ranking the data for each group

51
Q

Ranking data gets rid of in non parametric

A

outliers and skew

52
Q

How does ranking work in non-parametric? - (2)

A

Add up the ranks for the two groups and take the lowest of these sums to be our test statistic

The analysis is carried out on the ranks rather than the
actual data.

53
Q

Non-parametric equivalent of independent/unrelated t-tests

A

Mann-Whitney or Wilcoxon
rank-sum test

54
Q

Non-parametric equivalent of repeated t-test

A

Wilcoxon signed-rank test

55
Q

Non-parametric equivalent of : One-way independent (between-subjects) ANOVA

A

Kruskall-Wallis or (for trends)
Jonckheere-Terpstra

56
Q

Non-parametric equivalent of one-way repeated ANOVA

A

Friedmanʼs ANOVA

57
Q

Non-parametric equivalent of Multi-way between or
within-subjects ANOVA

A

Loglinear analysis (categorical
outcome, with participants as a factor)

58
Q

Non-parametric equivalent of correlation

A

Spearman’s Rho or Kendall’s Tau

59
Q

Mann-Whitney/Wilcoxon rank-sum Test - Compares

A

two independent groups of scores

60
Q

Wilcoxon signed rank Test - Compare

A

two dependent groups of scores

61
Q

Kruskal-Wallis Test - Compares

A

> 2 independent groups of scores

62
Q

Friedman’s Test - Compares

A

> 2 dependent groups of scores

63
Q

Spearman’s Rho & Kendall’s Tau - Measures the extent to which

A

two continuous variables are related (pattern of responses across variables)

64
Q

Logic behind Wilcoxon’s rank sum test, what does SPSS do? - (3)

A

Step 1: Get some not normally distributed data

Step 2: Rank it (regardless of group)

Step 3: Significance testing
Does one of the groups have more of the higher ranking scores than the other?

65
Q

What is DF of chi-square?

A

(r-1)(c-1)

66
Q

The likelhood ratio in loglinear model preferred

A

small sample sizes

67
Q

DF of likelhood ratio in loglinear

A

df = (r-1)(c-1)

68
Q

Decision tree of Mann Whitney - (4)

A

1 DV = Ordinal (e.g., high school, bachelors, order is meaningful) or continous
1 IV = Categorical and 2 levels
Different partiicpants
Does not meet assumption of parametric

69
Q

Wilcoxon rank sum and Man Whitney U is

A

same procedure and used to compare two independent groups and assess whether samples come from same distribution

70
Q

For Mann-Whitney U/Wilcoxon Rank Sum they comparing 2 independent conditions the two steps - (2)

A

Rank all the data on the the basis of the scores irrespective of the group

compute the sum of ranks of each group

71
Q

For wilcoxon rank sum, the statistic Ws is

A

the lower of the two sums of ranks

72
Q

For Mann-Whitney, the statistic U use the

A

sum of ranks for group 1, R1, as follows

73
Q

Example of table where comparing 2 independent conditions of Wilcoxon rank sum or Mann Whitney U test

A

Here we have data for two groups; one taking alcohol, the other ecstasy. The scores for a measure of depression. Scores were obtained on two days; Sunday and Wednesday. The drugs were administered on Saturday.

74
Q

Example of table where comparing 2 independent conditions of Wilcoxon rank sum or Mann Whitney U test

Here we have data for two groups; one taking alcohol, the other ecstasy. The scores for a measure of depression. Scores were obtained on two days; Sunday and Wednesday. The drugs were administered on Saturday.

Two steps for both statistics:
Rank all the data on the the basis of the scores irrespective of the group
compute the sum of ranks of each group - (5)

A

The graphic here shows how we can list the scores in order and as a result assign each score a rank.

When scores tie, we give them the average of the ranks.

If we ensure we keep track of the group the scores came from we can relatively easily add the ranks up for each group.

Note, that if there was little difference between the groups the sums of their ranks would be similar, as they are for the data shown her for Sunday.

However, the sum of ranks differ considerably for the data obtained on Wednesday.

75
Q

For Wilcoxon sum of ranks = comparing 2 independent groups the W s statistic,

the group sizes of n1 and n2 the mean of W2 is given:

A
76
Q

For Wilcoxon sum of ranks = comparing 2 independent groups the W s statistic,

the standard error of Ws is given

A
77
Q

For Wilcoxon sum of ranks = comparing 2 independent groups the z score of Ws can be calculated

A
78
Q

For Mann Whitney , the statistic U use the the sum of ranks for group 1, R1, as follows

A
79
Q

For Mann Whitney , the statistic U use the the sum of ranks for group 1, R1, as follows

Specificy equation - (2)

A

The first terms involving n1 and n2 actually compute the maximum possible sum of ranks for group 1.

U is zero when all those in group one have scores that exceed the scores of those in group 2.

80
Q

In Mann Whitney U there is a standardised test statistic which is z score that can allow you to compute

A

effect size so r = z / square root of N (number of pps

81
Q

What is decision tree of Wilcoxon signed rank test? - (4)

A

1 IV categorical with 2 levels

Same participants in each predictor level

1 DV - Ordinal or continous

Does not meet assumption of parametric tests

82
Q

Steps of Wilcoxon signed rank test - (4)

A
  1. Compute the difference between scores for the two conditions
  2. Note the sign of the difference (positive or negative)
  3. Rank the differences ignoring the sign and also exclude any zero differences from the ranking
  4. Sum the ranks for positive and negative ranks
83
Q

Example of Wilcoxon signed rank test carrying out steps - (9)

A

The table shown here has the Depression Scores taken on Sunday and Wednesday for those taking ecstasy on Saturday.

Data for Sunday are in the first column and Wednesday in the second column.

The third column shows the difference between scores obtained on Sunday and Wednesday.

NOte some could be negative, some positive. In this example however the difference is always positive apart from two values when the difference is zero.

The fourth column notes the sign of the difference or notes it is going to be excluded because the difference was zero.

The fifth column ranks the differences in terms of their size, but not sign.

The sixth and seventh column list the ranks that were for positive and negative differences, respectively.

It is these two columns that are summed to get the relevant statistics, called T+ and T-.

Because T+ and T- are not independent, we take only the T+ value.

84
Q

For Wilcoxon signed rank test the group size n the mean T is given

A
85
Q

For Wilcoxon signed rank test thestandard error fo T is given:

A
86
Q

For Wilcoxon signed rank test compute z score of T by

A
87
Q

Kruskal Wallis decision tree like one-way independent ANOVA - (4)

A

1 DV of ordinal

1 IV categorical predictor of more than 2 levels

Diff participants in each predictor level

Does not meet assumption of parametric

88
Q

Kruskal Wallis steps - (2)

A

Rank all the data on the the basis of the scores irrespective of the group

Compute the sum of ranks of each group, Ri , where i is the group number

89
Q

For Kruskal-Wallis, the statistic H is as follows

A
90
Q

What is decision tree of Friedman test? - (4)

A

1 DV continous or ordinal

1 IV predictor categorical with more than 2 levels

Same participants in each predictor level

Doesnot meet assumption of parametric tests

91
Q

What is steps of Friedman test? - (2)

A

Rank the scores or each individual - that means you will have ranks varying from 1 to the number of conditions the participants took part in

Compute the sum of ranks, Ri , for each condition

92
Q

For Friedman, the statistic F is as follows

A

K = conditions
N = number of pps

93
Q

Example when using chi-square test - (3)

A
  • In this example, they wanted to look at whether attendance at lectures had an impact on their exam performance on whether they passed or failed
  • Attendence was coded as 1 if participants generally attended lectures , barirng illness, and 2 if they did not attend
  • Exam was scored as 1 = Pass and 2 = Fail
94
Q
  • In this example, they wanted to look at whether attendance at lectures had an impact on their exam performance on whether they passed or failed - chi square

What does it show? - (4)

A
  • Attendence, Attended Lectures , Count = this is people who attended lecture and number of people who passed was 84 and people who failed was 29
  • % Within attendence give same info so 74.3% passed and 25.7% failed when attended lectures
    -Going to didn’t attend lectures, 22 people passed and 35 failed and below is in percentages:
  • Easier using percentages writing up
95
Q
  • In this example, they wanted to look at whether attendance at lectures had an impact on their exam performance on whether they passed or failed - chi square

What does it show? - (2)

A
  • At top row, pearson chi-squared which chi-square statistic which was 20.617, DF which is 1 and p-value was 0.000
  • 0 cells have a count less than 5  met assumption of chi-square test that expected counts greater than 5
96
Q
  • DF is always … in two-by-two chi-square
A

1

97
Q

If SPSS output shows below in chi-square that

A

0 cells have a count less than 5  met assumption of chi-square test

98
Q
  • In this example, they wanted to look at whether attendance at lectures had an impact on their exam performance on whether they passed or failed - chi square

What does it this effect size show? - (2)

A
  • x^2 (1) = 20.62, p < 0.001
  • Cramer’s V = 0.35 , indicating a medium effect size
99
Q

Effect size guideline of r correlation coefficient - (3)

A

Small effect = 0.1
Medium effect = 0.3
Large = 0.5 and above

100
Q

Cramer’s V can be interpreted similar to

A

correlation coefficient:

101
Q

In chi-square we can calculate odds

A

ratio

102
Q

Example of calculating odds ratio for chi-square - (3)

A

odds of passing/failing for students who attended lecture = no. of students who attended and passed (84) / no. of students who attedned and failed (29) = 2.897

odds of passing/failing for students who did not attend = no. of students who did not attend lectures and passed (22) / no of students who did not attended and fail (35)

Odds ratio = odds of P/F of attended/ odds of P/F of not attended = 2.897/ 0.629 = 4.606 saying for an individual who attended lectures lead them to be more likely to pass exam

103
Q

Example research scenario of Mann Whitney - (4)

A
  • Independent sample design
  • One IV, two conditions = existing vs new medication
  • One DV (symptoms) but this time on ordinal (scale from 1 to 5) and got combination of non-normally distributed data and small sample size (very problematic for t-tests)
  • Mann Whitney U Test
104
Q

Example of using Mann Whitney U = skew

A
105
Q

What does this Mann Whitney U show?

  • Independent sample design
  • One IV, two conditions = existing vs new medication
  • One DV (symptoms) but this time on ordinal (scale from 1 to 5) and got combination of non-normally distributed data and small sample size (very problematic for t-tests)
  • Mann Whitney U Test
A
  • This box summarises the p-value ( p = 0.026) and tells you whether to accept or reject the null hypothesis.
106
Q

What does this output show of Mann Whitney U? - (3)

A
  • Mann Whitney U test statistic is 166.000 to report and also people report standardised test statistic 2.292 which is z score so handy to report as know if its above +/-1.96 then p-value we get out of test is significant
  • P-value of exact significant is p = 0.026
  • This is significant difference between the 2 groups
107
Q
  • Next we would want to look at the median scores to see which group is scoring highest and lowest after sig Mann Whitney U test

What does this output show? - (3)

  • Independent sample design
  • One IV, two conditions = existing vs new medication
  • One DV (symptoms) but this time on ordinal (scale from 1 to 5) and got combination of non-normally distributed data and small sample size (very problematic for t-tests)
  • Mann Whitney U Test
A
  • For existing treatment, median score was 3.

And new treatment the median score was 4.

It suggests new treatment was more effective in reducing symptons than the existing treatment

108
Q

Example research scenario of Friedman ANOVA - (5)

A
  • Again we got ordinal data for DV not sure distances between levels is going to be the same
  • Related design
  • One IV, 3 conditions
  • One DV (level reached to video game)
  • Friedman’s ANOVA = more than 2 groups in related design
109
Q

What does this Friedman ANOVA output show?

A
  • We got total sample size which is 30 and test statistic which is 21.788, DF = 2 and p value is 0.000 so significant difference between the 3 groups
110
Q

For Friedman’s ANOVA we do

A

post hoc tests for pairwise comparisons to look where the differences are

111
Q

What do this Friedman ANOVA test post hoc tests show? - (7)

A
  • First one is Joy stick vs Vyper Max
  • Second one is Joystick vs Evo Pro etc…
  • Notice it gives two p-values of sig and adjusted sig
  • Adjusted sig control for multiple comparison and make correcitons to p-value (use this
  • Difference between joystick vs Vyper Max was sig at p = 0.005
  • Difference between Joystick vs Evo Pro was sig at p = 0.00
  • Difference between VyperMax vs Evopro is non-significant as p = 0.660
112
Q
  • The problem with non-parametric tests is that they have less power
A

to detect sig effects compared to parametric effects so maybe issue of dealing with power so may have median scores higher in one then another but not sig

113
Q

Non-parametric tests are used when

A. The assumptions of parametric tests have not been met.

B. You want to increase the power of your experiment.

C. You have more than the maximum number of tied scores in your data set.

D. All of these.

A

A = non parametric have fewer assumptions than parametric

114
Q

With 2  2 contingency tables (i.e., two categorical variables both with two categories) no expected values should be below ____.

A. 5

B. 1

C. 0.8

D. 10

A

A

115
Q

Which of the following statements about the chi-square test is false?

A. Which of the following statements about the chi-square test is false?

B. The chi-square test can be used to check how well a model fits the data.

C. The chi-square test is used to quantify the relationship between two categorical variables.

D. The chi-square test is based on the idea of comparing the frequencies you observe in certain categories to the frequencies you might expect to get in those categories by chance.

A

A = correct, because it is false. Chi-square can be used on categorical variables only

116
Q

When conducting a loglinear analysis, if our model is a good fit of the data then the goodness-of-fit statistic for the final model should be:

Hint: The goodness-of-fit test tests the hypothesis that the frequencies predicted by the model (the expected frequencies) are significantly different from the actual frequencies in our data (the observed frequencies).)

A. Non-significant (p should be bigger than .05)

B. Significant (p should be smaller than .05)

C.Greater than 5

D. Less than 5 but greater than 1

A

A = If our model is a good fit of the data then the observed and expected frequencies should be very similar (i.e., not significantly different

117
Q

What is the parametric equivalent of the Wilcoxon signed-rank test?

A. The paired samples t-test

B. The independent t-test

C. Independent ANOVA

D. Pearson’s r correlation

A

A

118
Q

Are directional hypotheses possible with chi-square?

A. Yes, but only when you have a 2 × 2 design.

B. Yes, but only when there are 12 or more degrees of freedom.

C. Directional hypotheses are never possible with the chi-squared test.

D. Yes, but only when your sample is greater than 200.

A

A =

119
Q

A psychologist was interested in whether there was a gender difference in the use of email. She hypothesized that because women are generally better communicators than men, they would spend longer using email than their male counterparts. To test this hypothesis, the researcher sat by the computers in her research methods laboratory and when someone started using email, she noted whether they were male or female and then timed how long they spent using email (in minutes). How should she analyse the differences in males and females (use the output below to help you decide)?

A. Mann–Whitney test

B. Paired t-test

C.Wilcoxon signed-rank test

D. Independent t-test

A

A.Mann Whitney Test

120
Q

What is the Jonckheere–Terpstra test used for?

A. To test for an ordered pattern to the medians of the groups you’re comparing.

B. To test whether the variances in your data set are approximately equal.

C. To test for an ordered pattern to the means of the groups you’re comparing.

D.To control for the familywise error rate.

A

A