L4 - Intro Regression Flashcards

1
Q

What is a residual value?

A

It is the value of the deviation of a data point from a slop line, relative to the y axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What should the sum of squared residuals be if you have a good slope line?

A

zero!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the method of least squares?

A

It is a method that finds the best location for the straight line function because it estimates values for the slope, and the intercept that minimises the sum of squared residuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two fundamental equations in the linear regression model?

A
  • Full regression equation

- Regression model equation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do the full regression equation and the regression model equation differ?

A

‘Model’ equations will always have a predicted Y score on the LHS, and no residual score on RHS (e)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

of x and y, which is the dv and iv?

A

X - INDEPENDENT

Y - DEPENDENT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a and b in the regression equations?

A

a - intercept

b - slope

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a partial regression coefficient?

A

The regression slope parameter, noted as b, in multiple linear regression.

(Regression w/ 2 or more IVs)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the two features we focus on in the linear regression model?

A
  • The strength of the overall prediction model

- The strength of the prediction of each individual IV considered separately within the overall model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Does strong prediction in the overall model imply that each individual IV is a good predictor?

A

No!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does R Squared measure?

A

This indicates the strength of prediction in the overall model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the 3 sources of variability in scores on the DV?

A
  • Observed scores on the DV (SStotal)
  • Predicted scores on the DV (SSReg, variation accounted for by the model)
  • Difference between observed and predicted scores (SSRes, variation not accounted for by the model)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the formula for R squared?

A

R squared = SS Reg / SS Total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the formula for SS total?

A

SSTotal = SS Reg + SS Res

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the formula for SS Reg?

A

SS Reg = SS total - SS Res

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a good value of R squared?

A

1 = no residuals! everything has been predicted by the model

The closer R squared is to 1, the stronger the prediction.

17
Q

What is the multiple correlation coefficient?

A

This is the square root of R square… = R

where 0 ≤ R ≤ 1

R is also a sample statistic for measuring the strength of prediction. It equals the PEARSON CORRELATION between OBSERVED SCORES AND PREDICTED SCORES ON THE DV.

18
Q

What is statistic represents the pearson correlation between observed scores and predicted scores on the DV?

A

R!

Square root of R squared!

19
Q

What are the 3 ways we can assess the strength of individual IVs in linear regression?

A
  • Unstandardised (partial) regression coefficient
  • Standardised (partial) regression coefficient
  • the squared semipartial correlation correlation (for multiple regression)
20
Q

what is the regression coefficient?

A

“b” in the regression equations

the slope

21
Q

What is the unstandardised simple regression coefficient?

A

This is the EXPECTED CHANGE in observed scores on the DV for A UNIT CHANGE in the observed raw score on the IV.

since it’s not standardised, it’s difficult to know what’s big or small.

22
Q

what does it mean when a regression coefficient is positive?

A

That an INCREASE in scores on the IV = expected INCREASE in scores on the DV.

or the other way round.

23
Q

What does it mean when a regression coefficient is negative?

A

That an INCREASE in scores on the IV = expected DECREASE in scores on the DV.

or the other way round.

24
Q

What are standardised regression coefficients?

A

This is the expected change in Z SCORE UNITS on the DV for a change of ONE STANDARD DEVIATION on the IV.

When standardised,

  • The intercept (a) is always zero, so it’s left out of the full regression equation
  • the regression coefficient is notated as beta.
25
Q

What is an unstandardised partial regression coefficient?

A

This coefficient indicates the expected change in scores on the DV for a unit change on the FOCAL IV, when holding constant scores on all over IV’s in the regression model

  • -> multiple regression.
  • -> only 1 IV changing
26
Q

What is a standardised partial regression coefficient?

A

It indicates the expected change in standard deviation units on the DV, for one standard deviation increase (or decrease) on the focal IV, when holding constant scores on all over IVS.

27
Q

What is the semipartial correlation?

A

This is the pearson correlation between scores on the DV and that part of scores on an IV that are NOT ACCOUNTED FOR by the model.

(residual scores)

alternative that is used to assess the strength of prediction of each IV on regression model.

28
Q

What can the regression intercept tell us?

A

It tells us the expected value on the DV when all scores on the IV are equal to zero!!!

intercept is always zero when standardised!
(if metric is arbitrary, then score of zero is often meaningless, so generally we are not interested in making inferences based on intercept)