Lecture 3 Flashcards

1
Q

What is a correlation?

A

A relationship between variable A and B. It is the easiest way of quantifying a relationship between 2 numeric variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Are correlations useful?

A

Not particularly useful so we follow them up with regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the value of a perfect positive relationship?

A

r=1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does r=0 mean?

A

A complete absence of a relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the value of a perfect negative relationship?

A

r=-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name the two features of the correlation

A

Direction and magnitude

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean if there is a weak relationship?

A

Someone’s performance on the Y variable is not determined by their performance on X, and there must be other variables involved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What should you always do first?

A

Visualise the data. Knowing a mean and SD doesn’t give you much unless you visualise the points, you need to plot data to see the distribution of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Name the different types of scatterplot and what they look like, and would you use r?

A
  1. Normal - linear, yes.
  2. Curvilinear - semi-circle shape, r would underestimate the strength of the relationship because it only measures linear relationships
  3. A perfect relationship between X and Y - all point lie on a straight line except one outlier, r would underestimate the strength of the relationship
  4. No relationship - vertical line with an extreme outlier, r would lead to an overestimation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do outliers pose a big threat when the sample is small?

A

Because they exert a strong influence on the results - can influence the value and interpretation of the correlation. Need to plot your data to see if there are any point which don’t lie with the rest of your data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What would qualify as at outlier?

A

There is no absolute definition, but in general, a score that is more than 3SDs from the mean would qualify as an outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do z-scores do?

A

Convert individuals’ scores on a variable to standard scores which are normally distributed with a mean of 0 and SD of 1. Need to standardise the score to understand where it falls on the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When are deviation products positive/negative, respectively?

A

When X and Y scores are both above the mean/both below the mean OR when X and Y scores are not both above the mean/below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can the correlation between X and Y can be obtained?

A
  • Multiplying each individual’s z-score on X by their z-score on Y
  • Summing these products
  • Dividing this sum by N or by N-1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why do we have to do n-1?

A

If not, we have an over-estimated correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is r a measure of?

A

The strength of the linear association between 2 variables

17
Q

How do you compare the magnitude of correlations?

A

Calculate the proportion of shared or common variance

18
Q

What is a better way of looking at the strength of a correlation?

A

R^2