Correlation Redux Flashcards

1
Q

what is Pearson’s r?

A

tells you the relationship between two variables: strength (‘size of effect’) or direction [+ or -]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what the p value tell us?

A

if it’s sig diff from 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a ‘small effect’ size?

A

± 0.1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a ‘medium effect’ size?

A

± 0.3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is a ‘large effect’ size?

A

± 0.5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is significance?

A

testing the null hypothesis that r = 0 (correlation value is zero)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

when can we reject the null hypothesis? / suggest there’s a significant different

A

when p < 0.05 we can reject the null hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is a positive correlation?

A

both variables increase together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

the interpretation of correlation and causality:

A
  • correlation = not sufficient evidence for causality between variables
  • does not imply causation
  • gives no indication of the direct of causality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is a negative correlation?

A

one goes up, the other goes down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what does a correlation coefficient give you?

A
  • How much the two variables vary together
  • The further the score from zero, the stronger the relationship
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why can’t we infer causation from a correlation?

A

The relation between the two variables is often due to the each’s variables relation to the third

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what can happen if we take account of this relationship with the third variable (‘control for’)?

A

the original relatonship disappears as it was spurious (not genuine)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what may the third variable be?

A

a cofounding variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how can we examine third variables?

A

partial correlations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what are some issues with correlation?

A
  • shape of the relationship
  • outlier
  • restricted ranges
  • sample size
  • reliability of measures
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what should relationships be like?

A

linear relationship (similar to regression)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what does Pearson’s Correlation Coefficient measure?

A

linear relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what doesn’t Pearson’s Correlation Coefficient measure?

A

non-linear relationships
* correlations be non-linear but Pearson’s r can not pick this up/do anything about it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

non-linear correlations

A

will reduce the correlation
* still a relationship, just a different shape

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Outliers

A

Individual scores can reduce or enhance the ‘r’
* i.e. if you add a pair of scores -> ‘no relationship’ is turned in a small/medium relationship by one data point
* i.e. if you add a different pair of scores to the data set -> a weak but significant relationship may be removed by one data point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

outliers can cut both ways, what does this mean?

A

can artificially enhance the relationship or reduce it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what do we do about outliers?

A

we tend to remove them from the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

how can we fixed stunted relationship?

A

by having a wide range of scores on all variables to get a clear view of the relationship between your variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

what is stunted relationships?

A

when there is a restricted range in our scores collected (aka. may be a specific demographic), we fall into the danger of a stunted relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

what is an example of a stunted relationship?

A

in our original data, we collect students from across the board - medium relationship (r = 0.353, - < 0.001)
BUT say we only collect data from students who turn up to lectures (most conscientious) - no conscientiousness score below 3.5 (while main was 2.5 in usual student sample)
^ there is a restricted range of the variable conscientiousness due to the scores
This reduces our correlation to 0.043 (p = 0.778)

27
Q

what is sample size?

A
  • determines significance
  • bigger, the better
  • lower correlation need large samples to achieve significance
  • .3 need around 50 to 60 participants
  • .10 needs close to 300
  • .10 is small but could be meaningful (important and may be the result you were looking for)
28
Q

what happens if we have a larger correlation?

A

more likely to achieve a significance and gain credible findings

29
Q

what is reliability?

A
  • internal consistency of a measure
  • how much can rely on a score from a test, sub scale etc.
  • how good is your measure at measuring something (reliability doesn’t tell you that something is
  • says nothing about what something is
30
Q

what are three ways in which we can measure reliability?

A
  • Test-Retest Method
  • Split-Half Method
  • Cronbach’s Alpha
31
Q

Test-Retest Reliability Method

A

Test once and then test later after a certain time

32
Q

Split-Half Reliability Method

A

Splits the questionnaire into two random halves, calculates scores and correlates them.

33
Q

Cronbach’s Alpha

A

Measures internal reliability using individual items
* Ranges from 0 (no reliability) to 1 (complete reliability)

34
Q

what is Cronbach’s Alpha?

A

measure of reliability for set scores

35
Q

[Kilne, 1999] When is the scale reliable?

A

α > 0.7
(tells you that you have a reliable measure)
- higher score = more reliable

36
Q

[Kilne, 1999] Why does α depend on the number of items?

A

more questions tend to produce a bigger α (more items, produce a bigger alpha)

37
Q

Conditions of α (in a way)

A
  • Treat Subscales separately (you are measuring reliability of the skill to measure ONE thing)
  • test of a measure’s ability to test for one ‘thing’
  • All scores should be in the same direction
  • reverse code reversed questions
38
Q

α is a property of a set of test scores, what does this mean?

A

so α can change across different populations
* do your own reliability measures even for established tests (score you get, you need to check the reliability for)

39
Q

what does the reliability of a measure put limits on?

A

strength of any relationship

40
Q

weaker reliability means..

A

you’re less likely to find a result / relationship between the variables

41
Q

weak reliability leads to…

A

lower correlations
* more random variation in the scores from your measures
* ‘Attenuation’ = more random in your scores if there’s less reliability

42
Q

Low reliability means…

A

more random variation
* more error / less true error of what you’re looking for (lower correlation)

43
Q

Score = True Score + Error

A

Systematic Variation + Random Variation

44
Q

SPSS and Cronbach’s Alpha

A

Item-total statistics TABLE in SPSS is:
* Useful to see which items are troublesome
* Ones with low Item-total correlation
* Ones that if removed increases the Alpha

45
Q

A student was interested if executive control was related to problem-solving. He used a standardised measure of executitive control (Wisconsin Card Sort) and had participants solve simple sudokus (to measure problem-solving). He found that there was no relation between the measures.

A

This is a case of restricted ranges

46
Q

A student wished to see if social media usage was related to mood IRL. She asked students to measure how long they spent on each social media platform over the week. She then designed a 10-item questionnaire that measured participant’s general mood over the past week. When the measure were correlation there was only a weak relationship found

A

Low Reliability of Measures

47
Q

What is a Partial Correlation?

A
  • we suspect that a correlation between two variables (X and Y) is due to a third variable (Z)
    BUT the real relationship is between each of the two variables and the third seperately
48
Q

how do we check for the real relationship between each of the two variables and third seperately?

A

partial correaltions
* examine the relationship between two variables after you have taken into consideration X and Y separate relationship with Z (third variable)

‘control for Z and examine the relationship between X and Y’

49
Q

what happens when you control for a third factor?

A

sometimes the relationship between the two (X and Y) variables can disappear

i.e. “When participants’ scores on the Extroversion scale were controlled for, the relationship between Small talk and life satisfaction was not significant (r=0.02)”

50
Q

what is covariance?

A

measure of how two variables vary together
* uses the difference between score and sample meal
* multiples these differences together
* we then need to standardised it and the standardised score is the correlation

51
Q

If there is a relationship between two variables then…

A

as one variable changes, the other should also change in a consistent way

52
Q

how do we measure change?

A

as deviation from the mean for each set of scores

53
Q

if the deviation from the means are similar for each group then what?

A

the scores vary together

54
Q

things that are related should…

A

show a similar distance from the mean
* and if they do this, this increases the covariance

55
Q

how to calculate the sums for partial correlation?

A
  1. take each score away from the mean for that variable
    - will tell us where each score is in relation to the mean (above / below / how far)
  2. we multiple residuals (difference between the scores and the means) for each participants
    - if both score above mean (=answer positive); the bigger each score’s differences from the mean the bigger this answer (‘the crossproduct’)
    - if one above, one below (=answer negative)
    - if both negative (=answer positive)
  3. add up the cross products
  4. divide by n-1 giving you the covariance for each variable
    - bigger one variable, stronger it is
56
Q

covariance: the larger the covariance

A

the more the variables co-vary

57
Q

covariance can be..

A

positive
- one variable goes up, so does the other

negative
- one variable goes up, other goes down

58
Q

Covariance

A
  • Affected by unit of measurement / sample size etc
  • You can’t compare across samples
  • Can’t real say this is a big or small relationship in absolute terms until you standardise it
59
Q

What can correlation do?

A
  • Standardizes the covariance
  • Allows you to compare across samples etc.
  • Gives you the standard Pearson “r”
60
Q

How to Calculate Pearson’s ‘r’?

A

r=covxy/sxsy

  • a standardized measure of the covariance
  • you divided the covariance by the product of Standard deviations of the samples (the SD’s multiplied together)
61
Q

Pearson’s R

A

correlation coefficient that measures linear correlation between two sets of data

62
Q

if individuals are scoring below the mean in one condition and above in the other, what type of covariance is this?

A

high negative
- all cross products would be negative

63
Q

if there were no consistent pattern between the data in conditions, what type of covariance is this?

A

low

64
Q

if participants are scoring either positive in both conditions or negative, what type of covariance is this?

A

high positive
- consistent parting in the relationship