STAT 2 - LAB 2 Flashcards

1
Q

fit a linear model with 2 coefficients

A

lm.multiple <- lm(formula = medv ~ rm+chas,
data=Boston)
summary(lm.multiple)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

fit a model with 2 coefficients and their interaction

A

lm.multiple.add <- lm(formula = medv ~ rm*chas, data=Boston)
summary(lm.multiple.add)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

test ß3 = 0 using anova

A

anova(lm.multiple,lm.multiple.add)
1. model with 2 coefficient
2. model with interaction terms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

create a fitted vs residuals plot

A

plot(fitted(lm.simple),resid(lm.simple),xlab = “Fitted values”,ylab=”Residuals”)
abline(h=0,col=”orange”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Fit a polynomial of order 2 to predict medv through rm

A

lm.poly <- lm(formula = medv ~ rm+ I(rm^2), data=Boston)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Fit a polynomial of order 4 to predict medv through rm

A

lm.fit.nonlinear4 <- lm(formula = medv ~ poly(rm,4,raw=T), data=Boston)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Create four diagnostic plots of the model in EQ. 4 using the function plot.

A

plot(lm.poly)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Are there any points of large leverage?

A

sum(hatvalues(lm.poly) > 2*mean(hatvalues(lm.poly)))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Are there any point with a large residual?

A

sum(abs(rstandard(lm.poly))>2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Are there any influential points

A

cooks.lm.poly <- cooks.distance(lm.poly)
sum(cooks.lm.poly>4/length(cooks.lm.poly))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Fit the model in EQ. 4 on the Boston data if we remove the influential points.

A

no.cooks <- cooks.lm.poly <=4/length(cooks.lm.poly)
lm.no.cooks <- lm(formula = medv ~ rm + I(rm^2), data=Boston, subset = no.cooks)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

after you remove influential points, check the fitted vs residuals

A

plot(fitted(lm.no.cooks),residuals(lm.no.cooks), xlab=”Fitted values”, ylab=”Residuals)
abline(h=0,col=”orange”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Fit the multiple linear regression model with medv as the response variable, and all other variables as predictors except rad.

A

lm.fit.multiple(formula= medv ~ . -rad, data=Boston)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Calculate variance inflation factor (VIF) score for the predictor variables. Your comments?

A

library(car)
car::vif(lm.fit.multiple)
the ones above 3 are quite high

How well did you know this?
1
Not at all
2
3
4
5
Perfectly