Lecture 3 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Residual standdard Error

A

Average amount that the response deviate from the regression line

y~ is the estimate
n is the sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RSE small implies model fits data well

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RSE high implies model does not fit Data well

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Any prediction of lpsa based on lweight will still be off by 1.046 units on average.

If it is accepted or not it depends on the problem

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

RSE is measured in units of the output

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

R-squared is a measure of the fit however without the units

A

YES

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

RSS: Amount if variablity that is left unexplained after performing the regression

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

TSS: total variance in response to Y

A

Amount of variability in response before regression is performed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

R squared measure the proportion of variability in response y that can be performed using x

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

R squared close to 1 : large proportion of variability is explained by x which is good

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

R squared close to 0 => Regression did not explain much of the variability

A

True

Linear regression thus can be wrong

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When the application we are considering to approximate is far from being approximated using he model then R2 will be near zero

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

R2 is highly affected by the number iof predictors we have

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Since R2 is highly affected by the number of predicotrs we have what is called adjusted R2,(how we pick predicotrs)

A

Regards

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

We want F-statistics to be as far from 1 as it can be

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The larger the F-statics , the more it indicates that we have a relation between what we are feeding into the model and the response

A

True

17
Q

Correlation : measure of linear relationship between X and Y

A

True

18
Q

Correlation does not imply casuality, it meeans how much value vary in the same way

A

True

19
Q

Multiple linear regression model , we want to add more predictors to our response variable

A

Trues

20
Q

Interaction effect the or as known as the synergy effect in marketing

A

Accounting for possible interactions between the predictors

21
Q

I introduce a new coefficient and a new variable given by X1 * X2 which allws me to account for interaction

A

True

22
Q

Linear regression: I am assuming the relationship between response and the predictor is linear

A

True

23
Q

Since the relationship between the predictor and the response is not always linear thus we can generate a polynomial regression model

A

True

24
Q

We should always ask ourselves, is it worth it to create a higher order model?

A

True

25
Q

I data science we are taking a sample from the population in order to get something that we can say about the population

A

TRUE

26
Q

W want to keeep part of our data aside in order to test our model

A

TRUE

27
Q

WE want to see how our model is performing on different subsets of data

A

TRUE

28
Q

We want to estimate the test prediction error of our model

A

TRUE

29
Q

Resampling: Given one sample we repeatadily draw samples from it in order to refit our model

A

TRUE

30
Q

Cross validation is when we want to evaluate the performance of our model by estimating it is test error

A

TRUE

31
Q

When you have a flexible model the training error might underestimate the test error

A

True

32
Q

We divide our data into part for creating model and part for testing

A

TRUE

33
Q

WE need to randomly split our data

A

TRUE