correlation and regression (w7) Flashcards

1
Q

when may data be a mistake

A

perfect straight line, one or more datapoints a long way away from others, no relationship at all between things you expect to be related

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does correlation find

A

line of best fit by minimising the differences between data and line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

equation for correlation/line of best fit

A

r = Sxy (how much x and y change together) / Sx.Sy (how much x and y change separately)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does the r value tell you

A

the direction of and how strong the correlation is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

when is correlation positive

A

r is above 0
1>(and equal to) r > 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when is correlation negative

A

r is below 0
-1< (and equal to) r < 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

when is correlation strong

A

if r is close to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

when is correlation weak

A

if r is close to 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does the r2 (r-squared) value tell you

A

how much of the variance is explained by your correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does r2 close to 1 mean

A

correlation explains a lot of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what does r2 close to 0 mean

A

correlation explains only a little variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is r2 also called

A

coefficient of determination

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is 1-r2

A

amount of variance not explained

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

regression: when x increase by 1, what happens to y

A

y increases by the slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

correlation and regression: describes how many relationships

A

C - describes single relationship
R - can describe multiple relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

correlation and regression: what do they describe about the relationship

A

C - direction and strength of relationship
R - directionS and strengthS of relationshipS

16
Q

correlation and regression: are x and y variables inter-changable

A

C - yes
R - no

17
Q

correlation and regression: do they allow prediction

A

C - no
R - yes

18
Q

correlation and regression: what are the co-efficients

A

C - r, r2
R - R, R2, F, t, SE, ß1-n

19
Q

how do you report correlations

A

r ([df]) = [pearson’s r] , p = [p-value]
r(64) = .881 , p < .001

20
Q

how do you report regressions (overall model fit)

A

F ([df1] , [df2]) = [F-value] , p = [p-value]
F (1 , 64) = 223 , p < .001

21
Q

how to report regressions (individual variables)

A

estimate = 121 , SE=8.10 , t(64) = 14.9 , p < .001

22
Q

what is t when reporting regression

A

t = estimate/SE (this is a t test)

23
Q

in multiple regression what is the outcome and predictor variables, how many

A

single outcome variable - y
multiple predictor variables - x1 , x2 , …

24
Q

in multiple regression, what do you find instead of line of best fit

A

find the best fitting surface (3d graph with a 2d shape (surface) acting as line of best fit)

25
Q

in multiple regression, what are residuals

A

distance from surface

26
Q

in multiple regression, what can predictors be

A

can be almost anything:
continuous, ordinal or discrete
normally - distributed or not
linear or non-linear

27
Q

what is a problem with correlation and regression, what may you have to do

A

extrapolation
non linear relationships (may have to transform data - quadratic, cubic, logarithmic)