Simple Linear Regression Flashcards
what is correlation?
- looking at how two variables are related to each other
-> we aren't making predictions from one to the other
-> relationship is symmetrical
what is regression?
- trying to predict one variable from another using the model
-> predict the criterion variable from the predictor variable
-> relationship is asymmetrical
-> assuming one (the predictor) precedes the other (outcome)
what is the whole idea of a regression?
predict an outcome (dependent / criterion variable) from a predictor (independent) variable
what's an example of a regression question?
How can you predict university success from school results?
* Tariff score and Honours Classification
How can we predict regression?
Y = b0 + b1X
b0
intercept
-> where our line crosses the y axis - it's constant
b1
"gradient/slope"
how does the "slope" work?
the gradient of the line has been fitted to the data
* for every unit X goes up
* Y goes up (or down) in line with the gradient
i.e. if the slope is 0.5, for every unit of X that goes up, Y goes up 0.5 of a unit [assuming a perfect prediction]
X = 2. What is Y?
Y = 0 + 0.5 (2)
Y is 1
If b0 = 3.75 and b1 (slope) = .469. An individual scores 7 on their maths test. What is Y?
Y = 3.75 + .469(7)
Y = 7.03
what is the issue with Y though?
fit of our line is not perfect, yet weāre interested in being able to quantify the gap
b0 = 11.35 and b1 = -0.722. What is the equation?
Y = 11.35 - 0.722(X)
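The prediction equations on these cards can be sketched as a small Python function; the b0 and b1 values are the ones from the cards, while the X = 4 plugged into the negative-slope equation is made up for illustration:

```python
def predict(b0, b1, x):
    """Simple linear regression prediction: Y-hat = b0 + b1 * X."""
    return b0 + b1 * x

print(predict(0, 0.5, 2))                   # slope example: X = 2 -> 1.0
print(round(predict(3.75, 0.469, 7), 2))    # maths-test example -> 7.03
print(round(predict(11.35, -0.722, 4), 3))  # hypothetical X = 4 for the negative slope
```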
what is the regression outcome?
statistics we look at to predict how good our predictor is at predicting our outcome variable
What is the technique about making decisions about the data?
aim is to ensure the line of best fit produces small residuals
* not always a good fit, but it's the best fit -> we can measure how good a fit is and estimate how good our regression is (how good is our equation at predicting the outcome, knowing the predictor)
* and whether it's significant
There are two outcomes
What are these two outcomes?
R^2: how good the model / regression is at predicting [testing the null hypothesis that r = 0]
F ratio: is it significant or not [the null hypothesis is that there is no predictive relationship / no variation explained]
what are the questions we are asking ourselves?
- The general question we are asking is: how good is our model at predicting the actual data (Y, the dependent measure, the criterion variable)?
- The technical question is how much of the variance in the Y data set can we predict/account for using our model?
- Outcome of the analysis is what proportion of the variation in the data set can we predict using our model
what can we use to calculate this proportion?
- model
- data the model produces (the predicted Y score)
- the actual data (observed/actual Y scores)
There are different types of variation. What is the first one called?
the residual -> differences between the observed and predicted Y scores
* actual Y score minus the predicted Y score using the equation and X value
* squared to stop them cancelling each other out
* the gap between the actual and the predicted
the gap between the actual score and the predicted score - what does this tell us?
The weaker the prediction, the greater the residual variance
* the bigger the gap between the actual scores and the scores that our model predicts
if the gap is small?
youāve got a good prediction
if the gap is large?
you donāt have a good prediction
what is variation not predicted by?
the model/equation/regression
what does the residual tell us?
the difference between the score predicted by the equation and the score we actually have
How do we calculate the SSResidual?
Y = score for each participant
Ŷ = score for each participant calculated by the equation (predicted Y)
Y - Ŷ = actual score for each participant minus the score calculated by the equation
(Y - Ŷ)^2 = that difference squared (so the gaps don't cancel each other out)
The Equation: ∑(Y - Ŷ)^2
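The SSresidual steps above can be sketched in Python; the Y scores here are made-up numbers, not data from the cards:

```python
def ss_residual(y_actual, y_predicted):
    """Sum of squared residuals: sum of (Y - Y-hat)^2 over all participants."""
    return sum((y - yhat) ** 2 for y, yhat in zip(y_actual, y_predicted))

# Made-up scores: a perfect fit gives 0; two gaps of 1 each give 1 + 1 = 2
print(ss_residual([2, 4], [2, 4]))  # 0
print(ss_residual([2, 4], [1, 5]))  # 2
```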
what is the total variance?
- the total variance of Y scores in the data set
All the variation that there is to explain
how to calculate SS total?
sum of (Y-M)^2
* each data point minus the mean for all Y data points
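The SS total formula above, as a sketch with made-up data points:

```python
def ss_total(y):
    """Total sum of squares: sum of (Y - mean of Y)^2 over all data points."""
    mean_y = sum(y) / len(y)
    return sum((yi - mean_y) ** 2 for yi in y)

print(ss_total([1, 2, 3]))  # mean is 2 -> 1 + 0 + 1 = 2.0
```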
So.. How do we figure out the variance?
Sum of Squares of the Residual (SS residual): an estimate of the amount of variation that is not predicted by our regression in our sample (gap between the actual and the predicted)
Total Sum of Squares (SS total): an estimate of all the variation in the sample
What do we need to find out?
an estimate of how much of the variation is actually predicted by our model
How do we find an estimate of how much of the variation is actually predicted by our model?
Take away the SS residual from the SS total -> sum of squares model
what is SSm (Sum of Square of the Model) / SS reg (Sum of square of the regression)
an estimate of the amount of variance explained by the regression or the model
How can we calculate SSreg directly
take the mean of the actual Y scores away from each predicted Y score (then square and sum the differences)
-> gives you the variance explained by the regression equation or model
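The direct SSreg calculation can be sketched as below; the scores are made up, and for a least-squares fit this comes out the same as SStotal minus SSres:

```python
def ss_regression(y_predicted, y_actual):
    """SSreg computed directly: sum of (Y-hat - mean of actual Y)^2."""
    mean_y = sum(y_actual) / len(y_actual)
    return sum((yhat - mean_y) ** 2 for yhat in y_predicted)

# Made-up case: a perfect prediction, so SSreg equals SStotal (here 2.0)
print(ss_regression([1, 2, 3], [1, 2, 3]))
```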
what is SS total?
an estimate of all the variance in the data set
what is SSm or SSreg
estimate of the variance accounted for by the model/regression (gives us an idea of the variation explained)
What is SSreg/m affected by?
sample size and amount of total variation in the sample
- you can't compare it across different studies and samples, as different sample sizes etc. produce different estimates -> yet being able to generalise and compare results would be very useful
instead we need a standardised measure of the total proportion of the variation explained by the regression
what is the standardised measure of the total proportion of the variation explained by the regression?
R^2
R^2
Proportion of the variance predicted by the regression equation
* SSreg divided by SStotal
* Between 0 and 1 -> the larger, the better
* can be expressed as a percentage i.e. 80% of the variance is explained by the model/regression
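The R^2 calculation can be sketched in Python using the two sums of squares defined above (the Y scores are made-up numbers):

```python
def r_squared(y_actual, y_predicted):
    """R^2 = SSreg / SStotal, equivalently 1 - (SSres / SStotal)."""
    mean_y = sum(y_actual) / len(y_actual)
    ss_tot = sum((y - mean_y) ** 2 for y in y_actual)
    ss_res = sum((y - yhat) ** 2 for y, yhat in zip(y_actual, y_predicted))
    return 1 - ss_res / ss_tot

print(r_squared([1, 2, 3], [1, 2, 3]))                # perfect prediction -> 1.0
print(round(r_squared([1, 2, 3], [1.1, 2, 2.9]), 2))  # small residuals -> 0.99
```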
Sstotal
an estimate of all the variance in the data set
Ssres
A measure of the amount of variance not explained by our regression
SSreg or SSm
-> an estimate of the variance accounted for by the model / regression
Take SSres from SStotal and that leaves us with the amount of variance explained by our equation
R^2
Standardise this by dividing SSreg by SStotal
-> what proportion of the total variation is explained by the regression/model
what is the F ratio?
ratio between variance that is predicted and the variance that is not predicted (error)
* a way to see whether a significant amount of the variance is explained
If F ratio is high -> this means the effect is strong; there is lots of variance explained in relation to the variance that is not explained (we should get a significant result)
How do we calculate the F ratio?
"mean square error"
* SS divided by degrees of freedom
what is F ratio?
the ratio between the two mean square errors
-> SS / Df
The degrees of freedom for the regression model is simply the number of predictors
SS reg/m divided by the number of predictors in the model/regression ("k")
how many predictors are there in the linear regression?
1
SS res is divided by N minus the number of parameters in our model. What are these parameters?
- These are the intercept and the predictors
- There is always one intercept and "k" predictors
how many degrees of freedom is the F ratio reported with?
2 -> one for each of the mean square errors (df of MSreg/m, df of MSres)
What does it mean if the F value (found in the F table) is large and the p value is significant?
it's predicting a significant amount of the variance -> and a lot of variance too
what does the p-value mean?
tells us that the result is significant
-> allows us to make decisions about the null hypothesis
If p < 0.05
can reject the null hypothesis
in regression, the null hypothesis means
the variance explained by the model is 0
in t-tests, the null hypothesis means
there is no difference between the two means (or that the data comes from the same population)
F
The ratio of the Mean square model (or āregressionā) error to the mean square residual error.
Big F ->
little p values
where is the R-squared?
in the model summary; we sometimes report adjusted R squared next to it
what are the assumptions of a simple regression?
- variable type: the outcome must be continuous (the predictor can be continuous or discrete)
- non-zero variance: predictors must not have zero variance
- independence: all values of outcomes should come from a different person or item
- linearity: the relationship we model is, in reality, linear (plotting x and y is still important to see if there's a relationship)
- homoscedasticity: for each value of predictors, the variance of the error term should be constant
- independence of errors: plot ZRESID (y-axis) against ZPRED (x-axis)
- normally-distributed errors: the residuals must be normally distributed (if they don't form a normal distribution, we have some problems with the data)
→ do a normal probability plot or "save" the residuals and then compute all the usual tests for normality
How to calculate F?
1. MSreg/m = SSreg/m divided by "k" (the number of predictors)
2. MSres = SSres divided by N - k - 1
3. F = MSreg/m divided by MSres
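The F-ratio steps can be sketched as below; the sums of squares and N are made-up numbers, and k defaults to 1 for a simple linear regression:

```python
def f_ratio(ss_reg, ss_res, n, k=1):
    """F = MSreg / MSres, with df1 = k (predictors) and df2 = n - k - 1."""
    ms_reg = ss_reg / k            # step 1: mean square for the model
    ms_res = ss_res / (n - k - 1)  # step 2: mean square for the residual
    return ms_reg / ms_res         # step 3: the ratio

# Made-up values: SSreg = 10, SSres = 5, N = 12, one predictor
print(f_ratio(10, 5, 12))  # 10 / 0.5 = 20.0
```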
Regression
way of predicting an outcome
SStotal
total sum of squares of the differences between data points and the mean of y (all the variance there is to explain/account for)
SSres
total sum of squares of the differences between the data points and the line of best fit (variation that is not explained by the model)
(an estimate of the variance that is not accounted for by the model/regression)
SSmodel/regression
difference between SStotal and SSres
-> variation explained by the model
R^2
SSmodel/regression / SStotal
-> proportion of variance explained by the model
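To tie the cards together: the b0 and b1 values used throughout come from a least-squares fit. A minimal sketch using the standard estimates (b1 = covariance of X and Y divided by variance of X, b0 = mean of Y minus b1 times mean of X), with made-up data points:

```python
def fit_simple_regression(x, y):
    """Least-squares fit: returns (b0, b1) for Y-hat = b0 + b1 * X."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    b1 = (sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
          / sum((xi - mean_x) ** 2 for xi in x))
    b0 = mean_y - b1 * mean_x
    return b0, b1

# Made-up data lying exactly on Y = 2X, so the fit recovers b0 = 0, b1 = 2
print(fit_simple_regression([1, 2, 3], [2, 4, 6]))  # (0.0, 2.0)
```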