Inferential tests Flashcards

1
Q

When should a chi-squared test be used?

A

Categorical design
Looking at the frequency of occurence across two categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When should a t-test be used?

A

Experimental design
Manipulating variables to look at differences between groups of participants or conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When should a correlational test be used?

A

Correlational design
Looking at relationships between two continuous variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What type of data and hypothesis should be used for a chi-squared test?

A

Nominal (categorical) data
Two-tailed hypothesis - is observed different from expected?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the expected frequency in a chi-squared test?

A

Frequency we would predict if belonging to each category were random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the observed frequency in a chi-squared test?

A

The frequency that occur in the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is a Chi-squared test calculated? (Equation)

A

The sum of ( (O-E) squared / E )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the point of obtaining statistical significance?

A

Means that you can infer something about the entire population from your findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why would a chi-squared test not be significant?

A

Small chi-squared value = little difference between observed and expected frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a p value and which p value is used in psychology?

A

Probability you have committed a type 1 error (smaller p value = smaller probability that you have committed a type 1 error)
In psychology = 0.05 = 5% chance of type 1 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is a chi-squared test reported?

A

x^2 (df, N = XX) = XX.XX, p =.XXX OR p < .001

x^2 means chi square
df = degrees of freedom
N = number of participants
XX.XX = statistic calculated value
p value = significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the difference between a chi-square Goodness of fit and a chi-square Test of association?

A

Goodness of fit:
- One variable
- Frequency table - analyses frequencies across different categories within one variable
Test of association:
- Two variables
- Contingency table - analyses frequencies across different categories for two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the difference between a frequency table and a contingency table?

A

Frequency table displays different categories of one variable and uses chi square Goodness of fit
Contingency table displays different categories of two variables and uses chi square test of association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are two important rules of contingency tables?

A

1) Categories should contain frequencies or counts - not percentages
2) Categories should be mutually exclusive - cannot have one participant in multiple categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the assumptions for a chi-squared test?

A

1) The level of measurement of all variables is nominal (or ordinal)
2) Each subject may contribute data to one and only one cell
3) The values in the cells should be frequencies or counts, not percentage
4) The value of the cell expected should be 5 or more in at least 80% of the cells, and no cell should have an expected of less than one
In this case, could combine categories so that expected should be more

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What steps are there in the process of psychological research?

A

1) Ask a research question
2) Review existing research
3) Develop a hypothesis
4) Collect data to test hypothesis
5) Analyse the data to draw a conclusion
(If you reject hypothesis, return to step 3)
6) Present and evaluate your findings
Repeat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are independent and dependent variables?

A

Independent = what you manipulate
Dependent = what you measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What type of data is the independent variable in an experimental design?

A

Nominal (always)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What type of data is the dependent variable in an experimental design?

A

Continuous - interval or ratio (always)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is an independent/ between subjects design?

A

Different participants are recruited into each of the conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a repeated/ within subjects design?

A

One group of participants repeatedly takes part in the study under different conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What kind of t-test would be used when there is one condition of the IV?

A

One-sample t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What kind of t-test would be used when there are two conditions of the IV and a within subjects design?

A

Repeated (paired sample) t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What kind of t-test would be used when there are two conditions of the IV and a between subjects design is used?

A

Independent sample t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What does the t value actually calculate?

A

Between group variance/ within group variance
(Difference between the two experimental conditions divided by the variability within the two experimental conditions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

When is there likely to be a significant difference? (variance)

A

When there is more variance between than within groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What are parametric assumptions met through methodological design?

A

All observations are independent
Data collected are at interval or ratio level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What parametric assumptions are tested after data have been collected?

A

Data are roughly normally distributed
Homogeneity of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What does homogeneity of variance mean?

A

The assumption that there is equal variance across samples
The average squared distance of a score from the mean is the same across all groups sampled in a study
Data are roughly distributed the same around the mean of a group

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

How is an independent t-test calculated?

A

t = (m1 - m2/ s1 + s2) x sq root of (n1 + n2)

m1 and m2 = average score for group 1 and group 2
s1 and s2 = standard deviation for group 1 and group 2
n1 and n2 = number of observation (participants) in group 1 and group 2
(1s and 2s should be subscript)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

How is an independent t test calculated if the size of the two groups is unequal?

A

Should use a different formulae with the ‘pooled variance’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the observed t value from a sample compared to?

A

Critical t-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What happens when the observed t value is more extreme?

A

There are more chances to reject your null hypothesis?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

How can normality and outliers be checked visually?

A

Using box plots and histograms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

How are normal distribution and outliers checked statistically?

A

Shapiro-Wilk test tests whether data differ from a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

When a Shapiro-wilk test is not significant what does this mean?

A

Data are normally distributed - it does not deviate significantly from a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

How is the homogeneity of variance statistically checked?

A

The Levene’s test tests whether the spread of scores within each group differs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What does it mean if a Levene’s test is not significant?

A

The assumption of homogeneity of variance has not been violated and the spread of scores within each condition is the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

What is the non-parametric equivalent of an independent t-test?

A

Mann-Whitney U

40
Q

What is the non-parametric equivalent of a repeated (paired sample) t-test?

A

Sign test and Wilcoxon signed rank test

41
Q

What is the non-parametric equivalent of a one sample t-test?

A

Sign test and Wilcoxon signed rank test

42
Q

What should be done if the homogeneity is violated in an independent t-test?

A

Use Welch correction

43
Q

Is homogeneity of variance relevant for repeated and one sample t-tests?

A

No - only one sample

44
Q

What does the effect size measure?

A

The magnitude of the effect - the bigger the effect size the greater the difference is between our two groups
How significant something is

45
Q

What numbers are considered small medium and large effect sizes? (Not for correlation)

A

> /= 0.2 = small effect
/= 0.5 = medium effect
/= 0.8 = large effect

46
Q

What is a measure of effect size?

A

Cohen’s d

47
Q

How are the results of a t-test reported (statistical) ?

A

t (df) = XX.XX, p=.XXX OR p < .001, d = XX.XX
t = t value (the calculated statistic) (use small t and italicise)
df = degrees of freedom
d = tells you the effect size (report cohen’s d wtihout the negative)

48
Q

How to report the results of a t-test (written) ?

A

An [independent OR repeated measures OR one-sample] t-test was conducted to compare [your DV] in [IV condition 1] and [IV condition 2].
As can be seen on Figure 1, there is a [significant OR not a significant] difference in the scores for [IV condition 1] (M=___, SD=___) and [IV condition 2] (M=___, SD=___, t(df)=____, p = ____, d = ____).
The results confirmed/ did not confirm our hypothesis that…

49
Q

How is a repeated measures t-test calculated?

A

t = md/ sd x sq root of N
(d’s should be subscript)

50
Q

For repeated measures t-test what should assumptions also be checked for?

A

Difference between the conditions (JASP does it automatically)

51
Q

What is the one set of data in a one sample t-test compared to?

A

A reference value (this has to be entered at ‘test value’ in JASP)

52
Q

How is a one sample t-test calculated?

A

t = (m - u) / (s / sq root of N)
(u should have a tail)

m = average score for your sample
u = average score of your population of reference
s = SD

53
Q

What is a parametric test?

A

A test that requires data from one of the large catalogue of distributions that statisticians have described.
- Based on the normal distribution - require basic assumptions that must be met for the test to be accurate

54
Q

What are the four parametric assumptions?

A
  • Normally distributed sampling population
  • Homogeneity of variance
  • Interval or ratio data
  • All observations are independent
55
Q

What types of data are parametric and non-parametric?

A

No parametric = nominal and ordinal
Parametric = interval and ratio

56
Q

Which parametric assumption is key for independent measures design?

A

Homogeneity of variance

57
Q

What is a non-parametric test?

A

A family of statistical procedures that do not rely on the restrictive assumptions of parametric tests. In particular, they do not assume that the sampling distribution is normally distributed.

58
Q

What are tied values?

A

When ranking scores, if two or more raw scores are the same, the mean of the rank numbers should be calculated so they all have the same rank
e.g. raw scores in ranks 2 & 3 are the same, so instead they both have a rank of 2.5 (2+3/2 = 2.5)

59
Q

What is the ideal sample size for evaluating normality?

A

30+

60
Q

What descriptive statistic should be used for non-parametric tests and why?

A

Median
This is not affected by outliers
(Have to have ordinal data)

61
Q

Is parametric or non-parametric more powerful at finding a significant effect?

A

Parametric (assuming assumptions have been met)
- If there is a genuine effect in our data then a parametric test is more likely to detect it than a non-parametric one
- This is because parametric tests have a greater sensitivity to the data.

62
Q

What is Mann-Whitney U the non-parametric equivalent of?

A

Independent T-test

63
Q

How does the Mann-Whitney U test work?

A

Scores of both groups are ranked together from lowest to highest
Calculation is done
Basically, if one group is generally in the higher ranks and the other group is generally in the lower ranks, there is likely to be a significant difference

64
Q

What is the equation for Mann-Whitney U?

A

U = n1 x n2 + (n1(n1 + 1)/2) - T
(1s and 2s should be subscript)
n1 = number of ppts in group 1
n2 = number of ppts in group 2
T = rank totals
- Do this for each group - different rank totals
- Then, pick the smallest U value

65
Q

Which U value should be picked after calculating Mann-Whitney U?

A

The smallest one

66
Q

What is effect size shown as for Mann-Whitney U?

A

Rank-Biserial Correlation

67
Q

How should the results of a Mann-Whitney U test be reported? (Statistics)

A

U = XX.XX, p=.XXX OR p<.001, rb = XX.XX
(b should be subscript)
U = calculated statistic (capital and italics)
p = significance
rb = effect size (rank-biserial correlation) (report without the negative)

68
Q

How should the results of a Mann-Whitney U test be reported? (Written)

A

A Mann-Whitney U test was conducted to compare [your DV] in [IV condition 1] and [IV condition 2].
This test was used as we had an experimental between groups design, and these parametric assumptions were violated…
As can be seen on Figure 1, there is a [significant OR not a significant] difference in the scores for [IV condition 1] (Median=___, SD=___) and [IV condition 2] (Median=___, SD=___, U=____, p = ____, = rb ____).
The results confirmed/ did not confirm our hypothesis that…

69
Q

What is the Sign test and Wilcoxon test a non-parametric equivalent of?

A

Repeated t-test or one sample t-test

70
Q

What is the logic behind the Sign test and the Wilcoxon test?

A

Scores are ranked
If rank scores mainly show increasing values or decreasing values, there is likely a significant effect

71
Q

How is the sign test calculated?

A

The difference between two conditions of repeated measures is calculated and recorded as either + or -
Total + and total - are calculated
Smallest value is the sign value
N value is total number of + and - (if the difference is 0, this is not counted)

72
Q

How is the Wilcoxon signed rank calculated?

A

After completing sign test:
Sum all the negative ranks
Sum all the positive ranks
T value is the smallest one

73
Q

What assumptions should additionally be checked on the Wilcoxon test?

A

Difference between the conditions
(Jasp does it automatically)

74
Q

How are the results of a Wilcoxon test reported? (Statistics)

A

W = XX.XX, p =.XXX OR p< .001, rb = XX.XX
(b should be subscript)
W = calculated statistic (capital and italicise)
p = significance
rb = effect size (rank-biserial correlation) (report without the negative)

75
Q

How are the results of a Wilcoxon test reported? (Written)

A

A Wilcoxon signed rank test was conducted to compare [your DV] in [IV condition 1] and [IV condition 2/ population average].
This test was used as we had an experimental within groups/ one sample design, and these parametric assumptions were violated…
As can be seen on Figure 1, there is a [significant OR not a significant] difference in the scores for [IV condition 1] (Median=___, SD=___) and [IV condition 2] (Median=___, SD=___, W=____, p = ____, = rb ____).
The results confirmed/ did not confirm our hypothesis that…

76
Q

Why might you get a non-significant result on the sign test, but a significant result on the Wilcoxon test?

A

Because the Wilcoxon test takes into account the magnitude of the differences and not merely whether there is a positive or negative difference

77
Q

How are the calculations done for a one-sample non-parametric test?

A

Instead of calculating difference between two sets of scores, the difference is calculated between your sample and the reference
(Comparing a specific sample to the rest of the population)

78
Q

How is a one-sample non-parametric test (Wilcoxon) reported? (Written)

A

(Statistical is the same for repeated and one sample Wilcoxon tests)
The DV for our sample was significantly higher/lower/ not significantly different (median = XX.XX) compared to the average population DV (median = XX.XX, full statistic report)

79
Q

How should a non-parametric test result be reported (statistics)

A

The assumption of parametric tests were violated [justify why - because data are ordinal or because data are not normally distributed**] therefore a [Mann-Whitney U or Wilcoxon signed rank] test was conducted to compare [your DV] in [IV group/condition 1] and [IV group/condition 2].
The results showed is a [significant OR not a significant] difference in the scores for [IV condition 1] (Median=___) and [IV condition 2] (Median=___, W OR U=____, p < .001 or = ____, rb = ____).

** note if thisis the case where normality has been violated, then report the results of the Shapiro
E.g: ‘The Shapiro-Wilks revealed that data violated the assumption of normality (p = XXX)

80
Q

What does a correlational design investigate?

A

The relationship between two continuous variables
- Positive, negative or none
- No IV and DV
- Variables have to be continuous (interval or ratio) or ordinal, not categorical

81
Q

What do different Pearson’s r numbers mean?

A

-1 = perfect negative correlation
-0.6 to -0.99 = strong negative correlation
-0.3 to -0.59 = moderate negative correlation
-0.1 to -0.29 = weak negative correlation
0 to 0.9 = no correlation
+0.1 to +0.29 = weak positive correlation
+0.3 to +0.59 = moderate positive correlation
+0.6 to +0.99 = strong positive correlation
+1 = perfect positive correlation

82
Q

What is covariance?

A

Pearson’s correlation looks at this - how much the data points change together

83
Q

What does Pearson’s r calculate?

A

r = covariance between the variables/ variance within the variables

84
Q

What type of data should be used with a Pearson’s correlation?

A

Ratio or interval

85
Q

Does Pearson’s look at linear or non-linear relationships?

A

Linear only

86
Q

What should correlations be checked for?

A

Outliers

87
Q

What hypotheses can you have for a Pearson’s r?

A

Null hypothesis - no relationship
Two tailed hypothesis - there will be a relationship
One tailed hypothesis - positive relationship
One tailed hypothesis - negative relationship

88
Q

How are the results of a Pearson’s test reported? (Statistics)

A

r (df) = .XX, p =.XXX OR p<.001
r = calculated statistic (lower case and italicise)
df = degrees of freedom (N-2)
p = significance

89
Q

How are the results of a Pearson’s test reported? (Written)

A

There is a significant positive/negative correlation/ no significant correlation between the two variables (full statistical report), showing that… (explain actual variables)

90
Q

What do effect sizes show for Pearson’s r?

A

r statistic:
Small effect = .10
Medium effect = .30
Large effect = .50

91
Q

What is the non-parametric equivalent of Pearson’s r and when would this be used?

A

If one or both variables uses ordinal data:
Spearman’s rho
Ranks scores = non-parametric

92
Q

What is Spearman’s rho not as sensitive as Pearson’s r to?

A

Outlying points

93
Q

How are the results reported for spearman’s rho? (Statistics)

A

rs (df) = .XX, p =.XXX OR p <.001
(s should be subscript)
rs = calculated statistic (italicised r)
df = degrees of freedom (N-2)
p = significance

94
Q

How are the results reported for spearman’s rho? (Written)

A

There was a significant positive/ negative correlation/ no significant correlation (full statistic report). This shows that… (describe variable relationship)

95
Q

What should always be done when dealing with correlations?

A

Always draw and examine a scatterplot and look for the diagonal line of best fit and the slope of this, and look for outliers

96
Q

How should a correlation be reported? (Written)

A

A [Pearson’s or Spearman correlation] was conducted to test if there is a significant relationship between [variable 1] and [variable 2].

IF RESULTS ARE SIGNIFICANT:

There was a significant [positive/negative] correlation (r (df) = .xx, p = .xxx).
This shows that…

IF RESULTS ARE NOT SIGNIFICANT:

The correlation between variable 1] and [variable 2] was not significant (r (df) = .xx, p = .xxx).
[no direction to interpret!!!]

Note: this is for Pearson, for Spearman, you should report ‘rs’