5. Inter relationships between variables Flashcards

1
Q

Inter relationships between variables

A

the strength and nature of the relationship between two sets of figures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

strength

A

correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

nature

A

Is it a straight line? is regression used to try and find the equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Correlation

A

2 variables are correlated if they are related to each other i.e. if the value of one changes does the other data set also change

can only ever be between +1 and -1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Scatter diagrams

A

data points are plotted on a graph x axis independent variable (we choose it) y axis is dependant on the value of x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

regression

A

is what we use to work out the equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Perfect positive linear correlation

A

r = + 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Perfect negative linear correlation

A

r = - 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

High Positive Correlation

A

r ≈ 0.9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Moderate negative correlation

A

r ≈ -0.7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

No correlation

A

r ≈ 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Non linear or curvilinear

A

N/A

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Pearsons correlation coefficient

A

n = number of data points

FORMULA GIVEN IN EXAM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Coefficient of determination

A

take correlation coefficient and square it

makes it easier to interpret the coefficient of correlation

tells us the proportion of changes in y that can be explained by the changes in x, assuming a linear relationship.

i.e. correlation coefficient is 0.7, correlation of determination is 0.49, meaning 49% if the changes in y are caused by the changes in x and that 51% relate to other factors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Spurious correlation

A

where there is high value of correlation but no direct cause and effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Regression

A

where the regression line has the equation y = a +bx

find b using formula

then use formula to work out a

a = mean of y - bmean of x

17
Q

Interpolation

A

is when you estimate y when using a give value of x that is between the limits (outside the limit is extrapolation)

18
Q

Limitations of linear regression

A

just because we have a straight line doesn’t mean it is useful for forecasting

need to assume the relationship many not be linear

it could be spurious

need to be careful when using the regression line for extropolation

19
Q

Rank correlation: Spearman’s coefficient

A

when you want to calculate the correlation between two variables but one or both of them is not in a suitable quantitive form

i.e. student comes top in maths exam does that mean they will come top in econmics also, we interested in the rank not the absolute mark

d = difference in ranks

n = sample size