The Linear Model Flashcards
What is the residual (ei) of a model?
The (vertical) difference between the observed value and the model's estimate: eᵢ = yᵢ − ŷᵢ
What does it mean when the residual (ei) is lower than 0?
The model overestimates the outcome for observation i.
What does it mean when the residual (ei) is larger than 0?
The model underestimates the outcome for observation i.
What is RSS?
The RSS is the Residual Sum of Squares: RSS = Σᵢ eᵢ² = Σᵢ (yᵢ − ŷᵢ)²
What is the regression called when we estimate the parameters by minimizing the RSS?
Ordinary Least Squares regression (OLS)
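A minimal Python/numpy sketch of these first cards (the toy data and variable names are made up for illustration): it fits an OLS line, then computes the residuals eᵢ = yᵢ − ŷᵢ and the RSS.

```python
import numpy as np

# Toy data, purely illustrative
rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 2.0 + 0.5 * x + rng.normal(scale=1.0, size=50)

# OLS fit of y = b0 + b1*x: least squares minimizes the RSS
X = np.column_stack([np.ones_like(x), x])         # design matrix with intercept
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

y_hat = X @ beta_hat        # fitted values
e = y - y_hat               # residuals e_i = y_i - y_hat_i (negative -> overestimate)
rss = np.sum(e ** 2)        # Residual Sum of Squares
```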
What is the expected relationship between the residuals and the fitted values of a linear model?
There should be no relationship; if there were, it would indicate that the true model is not a linear model at all.
What behaviour do we expect in a histogram of the residuals for a linear model?
The residuals should be roughly symmetric around 0, with large (absolute) residuals occurring less often than small ones.
What is RMSE, and what does its formula look like?
Root mean squared error: RMSE = √(RSS / n) = √((1/n) Σᵢ eᵢ²)
What is RSE?
Residual standard error: RSE = √(RSS / (n − p − 1)), where p is the number of predictors.
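A small helper sketch for the two error measures; the function name and the degrees-of-freedom convention n − p − 1 for the RSE are assumptions following the usual textbook definition.

```python
import numpy as np

def rmse_and_rse(y, y_hat, p):
    """Return (RMSE, RSE) given observed y, fitted y_hat and p predictors."""
    rss = np.sum((y - y_hat) ** 2)
    n = len(y)
    rmse = np.sqrt(rss / n)           # RMSE divides by n
    rse = np.sqrt(rss / (n - p - 1))  # RSE divides by the residual degrees of freedom
    return rmse, rse
```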
What is the formula for the Coefficient of Determination or R²?
R² = 1 − RSS/TSS, where TSS = Σᵢ (yᵢ − ȳ)² is the total sum of squares.
What does the Coefficient of Determination or R² tell us?
It tells us how much of the total observed variability in the outcome is accounted for (explained) by the model.
Is the coefficient of determination (R²) the square of Pearson's correlation coefficient r?
Only if we have one independent variable (IV) in our model.
Does R² = 1 imply we have the true model, i.e. that all estimated parameters of the model (βⱼ) match the truth?
No. When R² = 1 the model accounts for all variability within the data set, but this does not mean we found the model that generated the data. For instance, R² = 1 can always be reached by overfitting the model.
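A short numpy sketch (toy data, illustrative names) computing R² from RSS and TSS and checking that, with a single predictor, it equals the square of Pearson's r.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.uniform(0, 10, size=100)
y = 1.0 + 0.8 * x + rng.normal(scale=2.0, size=100)

X = np.column_stack([np.ones_like(x), x])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ beta_hat

rss = np.sum((y - y_hat) ** 2)        # residual sum of squares
tss = np.sum((y - y.mean()) ** 2)     # total sum of squares
r_squared = 1 - rss / tss             # coefficient of determination

r = np.corrcoef(x, y)[0, 1]           # Pearson's correlation coefficient
print(np.isclose(r_squared, r ** 2))  # True: with one IV, R^2 = r^2
```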
What is meant by the two caveats?
In linear modelling, the residuals, and hence the RSS and RMSE, are calculated vertically (in the direction of Y). If they were calculated horizontally (in the direction of X), the estimated model would differ from the vertical one.
What are the 3 main reasons to add more predictors to a model?
- It reduces the RSS, and hence makes the model more accurate.
- It accounts for factors other than the one of interest; adding these other factors controls for their effect on the outcome Y.
- If the effect of X on Y depends on a third variable, we need to model that interaction explicitly (see the sketch after this list).
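A hedged sketch of the last point: when the effect of x on y depends on a third variable z, the interaction is modelled explicitly by adding an x·z column to the design matrix. All values and names here are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 200
x = rng.uniform(0, 10, size=n)
z = rng.integers(0, 2, size=n)  # third variable, e.g. a group indicator
y = 1.0 + 0.5 * x + 1.0 * z + 0.7 * x * z + rng.normal(size=n)

# Explicit interaction column x*z lets the slope of x depend on z
X = np.column_stack([np.ones(n), x, z, x * z])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta_hat)  # estimated intercept, x, z and x:z effects
```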