Linear Regression Flashcards

1
Q

How to calculate error in single data point?

A

difference between actual value and predicted value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How to calculate Sum of Squared Errors (SSE)

A

(E1)2 + (E2)2+… + (En)2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the disadvantage of SSE

A
  • Depend on number of data points

* Units hard to interpret

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How to calculate Root mean squared errors?

A

sqrt(SSE/N)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to calculate R squared ? ( another model)

A
  • Compares the best model with baseline model
  • Baseline model does not use any variables
  • 1 - (SSE/SST) [SST = errors for baseline model)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between multiple and adjusted R squared?

A

Multiple r squared increase when adding more variable but Adjusted r squared decrease.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is t value in linear regression model?

A
  • Estimate / Standard Error , larger this value good possibility to significant.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Why check correlation in model?

A

to identify correlated variables. remove these variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

calculate test set R squared

A

SSE = sum((tempPredict - test$Temp)^2)

SST = sum( (mean(train$Temp) - test$Temp)^2)

R2 = 1 - SSE/SST

How well did you know this?
1
Not at all
2
3
4
5
Perfectly