research methods statistics Flashcards

1
Q

what are the 3 criteria for choosing a statistical test

A
  1. looking for a difference or a correlation / association?
  2. is experimental design related (repeated measures / matched pairs) or unrelated (independent groups)
  3. what is the level of measurement
2
Q

unrelated design

A

using independent groups

3
Q

related design

A

using repeated measures or matched pairs

4
Q

test of difference designs

A
  • unrelated design
  • related design

5
Q

test of difference, unrelated design producing nominal data.
what is the appropriate statistical test

A

chi-squared

6
Q

test for difference, unrelated design producing ordinal data
what is the appropriate statistical test

A

Mann-Whitney

7
Q

test for difference, unrelated design producing interval data
what is the appropriate statistical test

A

unrelated t-test

8
Q

test for difference, related design producing nominal data

what is the appropriate statistical test

A

sign test

9
Q

test for difference, related design producing ordinal data

what is the appropriate statistical test

A

Wilcoxon

10
Q

test for difference, related design producing interval data

what is the appropriate statistical test

A

related t-test

11
Q

test for association or correlation producing nominal data

what is the appropriate statistical test

A

chi-squared

12
Q

test for association or correlation producing ordinal data

what is the appropriate statistical test

A

Spearman's rank

13
Q

test for association or correlation producing interval data

what is the appropriate statistical test

A

Pearson's r
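
The nine test-selection cards above can be summarised as a lookup table. This is only a revision sketch: the names `TESTS` and `choose_test` are made up for illustration, and "Pearson's r" is used as the usual name for the interval-data correlation test.

```python
# Lookup table built from the cards: (purpose, design, level of measurement) -> test.
# Design is None for tests of association, because it only matters for tests of difference.
TESTS = {
    ("difference", "unrelated", "nominal"): "chi-squared",
    ("difference", "unrelated", "ordinal"): "Mann-Whitney",
    ("difference", "unrelated", "interval"): "unrelated t-test",
    ("difference", "related", "nominal"): "sign test",
    ("difference", "related", "ordinal"): "Wilcoxon",
    ("difference", "related", "interval"): "related t-test",
    ("association", None, "nominal"): "chi-squared",
    ("association", None, "ordinal"): "Spearman's rank",
    ("association", None, "interval"): "Pearson's r",
}


def choose_test(purpose, level, design=None):
    """Return the test name for the three criteria (hypothetical helper)."""
    if purpose == "association":
        design = None  # design is ignored for correlations / associations
    return TESTS[(purpose, design, level)]


print(choose_test("difference", "ordinal", design="related"))
print(choose_test("association", "interval"))
```

Working through the three criteria in the same order each time (difference or association, then design, then level of measurement) is what makes the table easy to recall.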

14
Q

chi-squared test

A

used as a test of both difference and association / correlation.
data items must be unrelated
- test of difference, unrelated design, nominal data
or
- test of association or correlation, nominal data

15
Q

Mann-Whitney

A

test of difference
unrelated design
ordinal data

16
Q

unrelated t-test

A

test of difference
unrelated design
interval data

17
Q

sign test

A

test of difference
related design
nominal data

18
Q

Wilcoxon

A

test of difference
related design
ordinal data

19
Q

related t-test

A

test of difference
related design
interval data

20
Q

Spearman's rank

A

test of association or correlation

ordinal data

21
Q

Pearson's r

A

test of association or correlation

interval data

22
Q

nominal data

A

categories
each item can appear in only one category. there is no order.
e.g. people naming their favourite football team.

23
Q

ordinal data

A

placed in order, intervals are subjective
data are collected on a numerical, ordered scale but the intervals are variable, so a score of 8 is not twice as much as a score of 4
ordinal data lack precision because they are based on subjective opinion rather than objective measures
there are no units
e.g. asking someone to rate how much they like psychology on a scale of 1 to 10, where 1 is "do not like at all" and 10 is "absolutely love"

24
Q

interval data

A

units of equal size
interval data are based on numerical scales that include units of equal, precisely defined size.
this includes counts of observations in an observational study (8 tallies is twice as much as 4 tallies) or any public unit of measurement (time, temperature, length)

interval data are better than ordinal data because more detail is preserved, as the scores are not converted to ranks

25
Q

what happens if the statistical test is not significant

A

the null hypothesis must be accepted
the null hypothesis states that there is no difference or no correlation between the conditions
the statistical test indicates which hypothesis (null or alternative) the data support, and so whether we accept or reject the null hypothesis.

26
Q

the null hypothesis is accepted or rejected based on what

A

a particular level of probability
probability is a measure of the likelihood that a particular event will occur, where 0 is a statistical impossibility and 1 is a statistical certainty. there are no statistical certainties in psychology, but there is a significance level: the point at which the null hypothesis is accepted or rejected.

27
Q

what level of significance is used

A

0.05 or 5%

this means the probability that the observed effect occurred by chance is equal to or less than 5%

28
Q

the calculated value

A

the value you calculate from the statistical test. this is compared to the critical value

29
Q

what is the critical value

A

found from the table of critical values at the 0.05 significance level. based on probabilities
the calculated value is compared to the critical value

30
Q

how do you find the correct critical value ( 3 criteria)

A
  • whether the hypothesis is one-tailed (directional) or two-tailed (non-directional)
  • number (N) of participants or degrees of freedom (df)
  • level of significance (or p value), usually 0.05
31
Q

what is a type 1 error

A

the null hypothesis is rejected and the alternative hypothesis is accepted when the null hypothesis is true

this is an optimistic error or false positive as a significant difference or correlation is found when one does not exist.

32
Q

what is a type 2 error

A

the null hypothesis is accepted but in reality the alternative hypothesis is true
this is a pessimistic error or false negative

33
Q

when is a type 1 error likely

A

more likely if the significance level is too lenient (too high, e.g. 0.1 or 10%)

34
Q

when is a type 2 error likely

A

more likely if the significance level is too stringent (too low, e.g. 0.01 or 1%), as potentially significant values may be missed

35
Q

what is a correlation

A

it illustrates the strength and direction of an association between 2 co-variables

36
Q

what is a positive correlation

A

co-variables rise or fall together

37
Q

what is a negative correlation

A

one co-variable rises and the other falls
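
As an illustration of direction and strength, here is a minimal sketch that computes Pearson's r by hand. The function name and all the data below are invented for the example; a positive result means the co-variables rise or fall together, a negative result means one rises as the other falls.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson's correlation coefficient for two equal-length lists of scores."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (numerator) and sums of squared deviations.
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return sp / sqrt(ss_x * ss_y)


# Invented co-variables for five hypothetical participants.
hours_revised = [1, 2, 3, 4, 5]
exam_score = [52, 55, 61, 64, 70]
errors_made = [9, 8, 6, 5, 2]

print(pearson_r(hours_revised, exam_score))   # positive: both rise together
print(pearson_r(hours_revised, errors_made))  # negative: one rises, the other falls
```

The coefficient always lies between -1 and +1; the sign gives the direction and the distance from 0 gives the strength.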

38
Q

what are the differences between correlations and experiments

A
  • in an experiment the researcher manipulates the IV and records the effect on the DV. In a correlation there is no manipulation of variables and so cause and effect cannot be demonstrated
  • in a correlation the influence of EVs is not controlled, so a third untested variable may be causing the relationship between the co-variables (called an intervening variable)
39
Q

evaluation of correlations

A

+ useful starting point for research. by assessing the strength and direction of a relationship, correlations provide a precise measure of how 2 variables are related. if variables are strongly related it may suggest hypotheses for future research
+ relatively economical. unlike a lab study, there is no need for a controlled environment and no manipulation of variables is required. correlations are less time-consuming than experiments
- no cause and effect. correlations are often presented as causal when they only show how 2 variables are related. there may be intervening variables that explain the relationship
- methods used to measure variables may be flawed. for example, the method used to work out an aggression score might be low in reliability (observational categories might have been used). this would reduce the validity of the correlational study

40
Q

what are the measures of central tendency (3)

A
  • mean
  • median
  • mode
41
Q

what is the mean (and evaluate)

A

arithmetic average: add up all the scores and divide by the number of scores
+ sensitive. includes all the scores in the data set within the calculation. gives more of an overall impression of the average than the median or mode
- may be unrepresentative. one very large or very small score can distort it. the median and the mode are not distorted so easily.

42
Q

what is the median (and evaluate)

A

middle value, place scores in ascending order and select middle value. if there are 2 values in the middle, the mean of these is calculated
+ unaffected by extreme scores. the median is only focused on the middle value. it may be more representative of the data set as a whole
- less sensitive than the mean. not all scores are included in the calculation of the median. extreme values may be important

43
Q

what is the mode (and evaluate)

A

most frequent or common value, used with categorical / nominal data
+ relevant to categorical data. when data are discrete, i.e. represented in categories, the mode is sometimes the only appropriate measure.
- an overly simple measure. there may be many modes in a data set. it is not a useful way of describing data when there are many modes.
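
A short sketch of the three measures using Python's standard `statistics` module. The scores are invented; one extreme score (48) is included to show how it distorts the mean but not the median or mode.

```python
from statistics import mean, median, multimode

# Invented data set with one extreme score.
scores = [3, 4, 4, 5, 6, 7, 48]

print(mean(scores))       # 11: pulled upwards by the extreme score
print(median(scores))     # 5: the middle value once the scores are in order
print(multimode(scores))  # [4]: a list, because a data set can have several modes

# Without the extreme score the mean falls sharply but the median barely moves.
trimmed = scores[:-1]
print(mean(trimmed), median(trimmed))
```

`multimode` is used rather than `mode` because it returns every mode, which matches the point that a data set may have more than one.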

44
Q

what are the measures of dispersion (2)

A
  • range
  • standard deviation

45
Q

what is range (and evaluate)

A

the difference between the highest and lowest values (+1 is sometimes added to allow for rounding)
+ easy to calculate. arrange values in order and subtract the smallest from the largest. simple formula, easier than the standard deviation
- does not account for the distribution of the scores. the range does not indicate whether most numbers are closely grouped around the mean or spread out evenly. the standard deviation is a much better measure of dispersion in this respect

46
Q

what is standard deviation (and evaluate)

A

measure of the average spread around the mean. the larger the standard deviation, the more spread out the data are.
+ more precise than the range. includes all values within the calculation. a more accurate picture of the overall distribution of the data set
- it may be misleading. may hide some of the characteristics of the data set. extreme values may not be revealed, unlike with the range
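
To see why the range says nothing about how scores are distributed, here is a minimal sketch with two invented data sets that share the same range but differ in standard deviation. Assumptions: `value_range` is a hypothetical helper that uses the plain highest-minus-lowest formula without any +1 correction, and `stdev` is the sample standard deviation from Python's `statistics` module.

```python
from statistics import stdev


def value_range(scores):
    """Highest score minus lowest score (no +1 correction applied)."""
    return max(scores) - min(scores)


# Invented data sets: same lowest and highest score, different spread in between.
clustered = [1, 5, 5, 5, 5, 5, 9]
spread_out = [1, 2, 4, 5, 6, 8, 9]

print(value_range(clustered), value_range(spread_out))  # identical ranges
print(stdev(clustered), stdev(spread_out))               # different standard deviations
```

Both sets have a mean of 5, so the larger standard deviation of the second set reflects only how far its scores sit from that mean.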

47
Q

what is a normal distribution

A

symmetrical, bell-shaped curve. most people are in the middle area of the curve, with very few at the extreme ends.
the mean, median and mode all occupy the same midpoint of the curve

48
Q

what is a skewed distribution

A

a distribution that leans to one side or the other because most people are at either the lower or the upper end of the distribution

49
Q

what is a negative skew

A

most of the distribution is concentrated towards the right of the graph, resulting in a long tail on the left.
mode is the highest point of the peak then the median next to the left and the mean is dragged across to the left (if scores are arranged from lowest to highest)

50
Q

what is a positive skew

A

most of the distribution is concentrated towards the left of the graph resulting in a long tail on the right
the mode is the highest point of the peak, the median comes next to the right and the mean is dragged across to the right (if scores are arranged from lowest to highest)
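
The ordering of mode, median and mean in skewed distributions can be checked on a small example. The scores below are invented; the negatively skewed set is simply the mirror image of the positively skewed one.

```python
from statistics import mean, median, mode

# Invented scores: most are low, with a long tail of high scores (positive skew).
positive_skew = [1, 2, 2, 2, 3, 3, 4, 5, 9, 14]

# Mirror the scores to get a long tail of low scores instead (negative skew).
negative_skew = [15 - x for x in positive_skew]


def describe(scores):
    """Return (mode, median, mean) for a list of scores."""
    return mode(scores), median(scores), mean(scores)


print(describe(positive_skew))  # mode < median < mean
print(describe(negative_skew))  # mean < median < mode
```

In both cases the mean is the measure dragged furthest towards the tail, because it is the only one that uses the size of the extreme scores.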

51
Q

significance

A

the difference/ association between 2 sets of data is greater than what would occur by chance - coincidence or fluke.
to find out if the difference / association is significant we need to use a statistical test

52
Q

how do you calculate the sign test

A

(test of difference, related design, nominal data)
1. the score for condition B is subtracted from the score for condition A to produce the sign of the difference (either + or -)
2. the total number of + signs and the total number of - signs are calculated
3. participants who achieved the same score in condition A and condition B are disregarded and deducted from the N value
4. the S value is the total of the less frequent sign
if S is equal to or less than the critical value, S is significant and the experimental hypothesis is retained.
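
The four steps can be sketched in code as follows. This is an illustration under stated assumptions: the function names and the scores are invented, and the critical value of 1 (N = 9, two-tailed, 0.05 level) is supplied by hand, so check it against the table you are using.

```python
def sign_test(condition_a, condition_b):
    """Return (S, N) for paired scores, following the four steps of the sign test."""
    # Step 1: subtract condition B from condition A for each participant.
    differences = [a - b for a, b in zip(condition_a, condition_b)]
    # Step 2: count the + signs and the - signs.
    plus = sum(1 for d in differences if d > 0)
    minus = sum(1 for d in differences if d < 0)
    # Step 3: participants with no difference are dropped, so N counts only + and -.
    n = plus + minus
    # Step 4: S is the total of the less frequent sign.
    s = min(plus, minus)
    return s, n


def is_significant(s, critical_value):
    """S is significant when it is equal to or less than the critical value."""
    return s <= critical_value


# Invented scores for ten hypothetical participants.
condition_a = [5, 7, 6, 8, 4, 6, 7, 5, 9, 6]
condition_b = [7, 8, 6, 9, 6, 5, 9, 7, 10, 8]

s, n = sign_test(condition_a, condition_b)
print(s, n)  # S = 1, N = 9 (one participant scored the same in both conditions)

# Assumed critical value for N = 9, two-tailed, 0.05 level: confirm in your own table.
print(is_significant(s, critical_value=1))
```

Note that the sign test is unusual in that the calculated value must be equal to or less than the critical value to be significant.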