Simple Regression Flashcards
linear regression
used when the relationship between two variables can be described with a straight line
- proposes a model of the relationship
correlation vs regression
- correlation determines the strength of the relationship between X and Y
- regression allows us to estimate how much Y will change as a result of a given change in X
terminology in regression
- regression distinguishes between variable being predicted and variable(s) used to predict
variable being predicted: y
- outcome variable
- DV (only ever one)
- criterion variable
- vertical axis
variable used to predict: x
- predictor variable
- IV(s)
- explanatory variable
- horizontal axis
when might we use regression
- to investigate strength of effect x has on y
- estimate how much y will change as a result of a given change in x
- predict future value of y based on x
what does regression assume + what does it not tell us
- y is dependent (to some extent) on x
- regression doesn’t tell us if this dependency is causal
3 stages of linear regression
- analysing the relationship between variables: strength and direction (correlation)
- proposing a model to explain that relationship: model is a line of best fit
- evaluating the model: assessing goodness of fit
regression line
(step 2)
- line of best fit
- intercept: value of y (on line of best fit) when x is 0
- slope: how much y changes as a result of 1 unit increase in x
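The slope and intercept above can be sketched in Python with NumPy; the data and values below are hypothetical, invented for illustration:

```python
import numpy as np

# Hypothetical data: x = miles travelled, y = taxi fare in £
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([4.0, 6.0, 8.0, 10.0, 12.0])

# Least-squares line of best fit: slope = cov(x, y) / var(x)
slope = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
intercept = y.mean() - slope * x.mean()  # value of y on the line when x = 0
```

For this made-up data the slope is 2.0 (fare rises £2 per mile) and the intercept is 2.0 (fare when distance is 0). In practice SPSS reports both in the Coefficients table.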
evaluating the model; simplest model vs best model
simplest model:
- using the average/mean value of y as the prediction for every case
- assumes no relationship between x and y
best model:
- based on relationship between x and y
- regression line
sum of squares total
the sum of the squared differences between the observed values of y and the mean of y
- variance in y not explained by simplest model
- not required to perform in exam
sum of squares residual
the sum of the squared differences between the observed values of y and those predicted by the regression line
- variance in y not explained by regression model
- not required to perform in exam
difference between SST and SSR
reflects the improvement in prediction using the regression model compared to the simplest model (SSm = SSt − SSr)
- goodness-of-fit
- sum of squares of the model
- not required to perform in exam
the larger the SSm…
… the bigger the improvement in prediction using the regression model over the simplest model
final thing in goodness-of-fit test
- use ANOVA for F-test to evaluate the improvement due to the model (SSm), relative to the variance the model does not explain (SSr)
- ANOVA uses mean square values instead of SS
- this takes d.f. into account
- provides the F-ratio
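The SST / SSR / SSm breakdown and the F-ratio from the cards above can be sketched as follows; all data here are hypothetical (and the exam never requires this by hand):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])  # predictor (made-up)
y = np.array([3.0, 5.0, 4.0, 8.0, 9.0])  # outcome (made-up)

# Best model: the regression line
slope = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
intercept = y.mean() - slope * x.mean()
y_hat = intercept + slope * x

sst = np.sum((y - y.mean()) ** 2)  # total: error of the simplest model (mean of y)
ssr = np.sum((y - y_hat) ** 2)     # residual: error of the regression model
ssm = sst - ssr                    # model: improvement due to the regression line

df_m, df_r = 1, len(x) - 2         # one predictor; n - 2 residual d.f.
msm, msr = ssm / df_m, ssr / df_r  # mean squares take d.f. into account
f_ratio = msm / msr
```

For this invented data, SST = 26.8, SSR = 4.3, SSm = 22.5 and F ≈ 15.7; SPSS reports all of these in the ANOVA table.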
F-ratio
measure of how much the model has improved the prediction of y, relative to the level of inaccuracy of the model
interpreting F-ratio
- if the regression model is good at predicting y (relative to the simplest model), the improvement due to the model (MSm) will be large, while the inaccuracy of the model (MSr) will be small
e.g. an F value well above 1
H0 when assessing goodness of fit
regression model and simplest model are equal (in terms of predicting y)
equivalently, the model provides no improvement (population slope b = 0)
if p < .05, reject H0: the regression model fits the data better than the simplest model
note of SS
you never need to calculate it by hand
regression equation
y = bx + a
a = intercept
b = slope
y = predicted value of y
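A one-line illustration of using the equation to predict; a and b below are made-up coefficients, not from any real output:

```python
# y = bx + a with hypothetical coefficients
a, b = 2.5, 3.2         # intercept and slope (made-up values)
x_new = 4.0             # new value of the predictor
y_pred = b * x_new + a  # predicted value of y
```

With these values the prediction is 2.5 + 3.2 × 4 = 15.3.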
linear regression assumptions
- linearity: x and y must be linearly related
- absence of outliers (should be removed)
- normality, linearity and homoscedasticity, independence of residuals
- NO NON-PARAMETRIC EQUIVALENT
homoscedasticity of residuals
variance of residuals about the outcome should be the same for all predicted scores
SPSS output for regression
in model summary
- don’t need this in write-up
ANOVA SPSS output for regression
F = MSm / MSr
if p < .05 it is significant improvement when using regression model vs simplest model
SPSS Coefficient table
gives us elements for regression equation
beta: standardised coefficient, in standard deviation units (the others are in normal units, e.g. £)
SPSS coefficient table outputs: t-test
- t-test tests the null hypothesis that the value of b is 0
- provides the CIs for the slope, which we need in the write-up (simple regression)
how is r^2 calculated
= SSm/SSt
- (multiply R^2 by 100 for a percentage)
- in regression we use this to report how much of the variance in y is explained by x
e.g. distance travelled explains a significant amount of variance in taxi fare, F…P… R^2 = .814, or: distance travelled explained 81% of the variance in taxi fare
square root of r^2
= r
IF WE ONLY HAVE ONE PREDICTOR
(remember we will lose the sign)
how do we calculate variance not explained by model
1 - R^2
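R^2, r (with its sign restored from the slope, since squaring loses it) and the unexplained variance can be sketched together; the data below are hypothetical:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([10.0, 8.0, 9.0, 5.0, 4.0])  # made-up negative relationship

slope = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
intercept = y.mean() - slope * x.mean()
y_hat = intercept + slope * x

sst = np.sum((y - y.mean()) ** 2)
ssr = np.sum((y - y_hat) ** 2)
r2 = (sst - ssr) / sst            # R^2 = SSm / SSt: proportion of variance explained
r = np.sign(slope) * np.sqrt(r2)  # one predictor only: take the sign from the slope
unexplained = 1 - r2              # variance not explained by the model
```

For this invented data R^2 ≈ .84, r ≈ −.92 (negative, matching the downward slope), and about 16% of the variance is unexplained.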
write up
no design
- results in text
- we conducted a linear regression to examine the influence of X on Y. Mean X (SD, CIs) (from descriptive stats at top of output) and mean Y (SD, CIs).
- preliminary analysis confirmed no violation of normality, linearity or homoscedasticity assumptions
- X explained / did not explain a significant / non-significant amount of variance in Y, F(,) = __.__, p < .__, R^2 = __. (ANOVA table for F and p, R^2 in model summary table)
- for every (1 unit, e.g. mile) increase in X (e.g. journey distance), Y (taxi fare) increased by (slope) (coefficients table); 95% confidence interval limits for the slope were [,] (coefficients table)
simple regression discussion
the findings suggest that Y can be predicted by X, with longer/shorter/higher/lower X resulting in higher/lower Y