Chapter 6 Flashcards

multi linear reg

1
Q

What is Multiple Linear Regression?

A

When a regression model has more than one input

Used to fit a linear relationship between a quantitative dependent variable Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Linear Regression

A

most commonly used predictive modelling technique, “best fit line”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Input, Target variables

A

must be numeric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Best fit

A

Minimizes the sum of squares if the vertical distance from the data points of the line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Goodness of Fit

A

Difference between the predicted and actual values, called residuals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

R squared

A

0 to 1 value - measures the proportion of variance in the target that is explained by the input.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Y

A

Dependent Variable, aka outcome or response variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

X

A

Predictors, aka independent or input variables, regressors, covariates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

B

A

B: Coefficients

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

E

A

The noise or unexplained part

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The data are used to estimate

A

the coefficients and the variability of the noise

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Objectives of fitting a model related to a quantitative outcome

A

Understanding the relationship between factors (focus of classical stats)
Predicting the outcome of new cases (focus of Data Mining)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explanatory vs Predictive Modeling

A

The choice of model is closely tied to which is the goal

Both use a dataset to fit a model (i.e. estimate coefficients)

However, there several differences between the two:
Explanatory fits data closely - Predictive predicts new cases accurately
Explanatory uses entire data set - Predictive splits into partitions
Performance measures:
Explanatory: How well data fits model
Predictive: Predictive accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explanatory Modeling

A

Goal: Explain relationship between predictors (explanatory variables) and target

Familiar use of regression in data analysis

Model Goal: Fit the data well and understand the contribution of explanatory variables to the model

“goodness-of-fit”: R2, residual analysis, p-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Predictive Modeling

A

Goal: predict target values in other data where we have predictor values, but not target values
Classic data mining context
Model Goal: Optimize predictive accuracy
Train model on training data
Assess performance on validation (hold-out) data
Explaining role of predictors is not primary purpose (but useful)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

*You cannot include all the binary dummies

A

*You cannot include all the binary dummies; in regression this will cause a multicollinearity error.
Other data mining methods can use all the dummies.

17
Q

Selecting Subsets of Predictors

A

Goal: Find parsimonious model (the simplest model that performs sufficiently well)
More robust
Higher predictive accuracy

We will assess predictive accuracy on validation data
Exhaustive Search = “best subset”
Partial Search Algorithms
Forward
Backward
Stepwise
18
Q

Partial Search Algorithms

A

Forward
Backward
Stepwise

19
Q

Exhaustive Search = Best Subset

A

All possible subsets of predictors assessed (single, pairs, triplets, etc.)
Computationally intensive, not feasible for big data
Judge by “adjusted R2”
Adjusted R2 for the models with 1 predictor, 2 predictors, 3 predictors, etc. (exhaustive search method)

Adjusted R2 rises until you hit 7-8 predictors, then stabilizes, so choose model with 7 predictors, according to the adj R2 criterion

20
Q

Forward Selection

A

Start with no predictors
Add them one by one (add the one with largest contribution)
Stop when the addition is not statistically significant

21
Q

Backward Elimination

A

Start with all predictors
Successively eliminate least useful predictors one by one
Stop when all remaining predictors have statistically significant contribution

22
Q

p value

A

if it is less than 0.05 then it is less likely to occur by chance

23
Q

Stepwise

A

Like Forward Selection

Except at each step, also consider dropping non-significant predictors