Lecture 5 - Linear Regression Flashcards

1
Q

Regression Analysis

A

Fit a relationship between a numerical outcome variable and a set of predictors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Variables

A
  • Numerical outcome variable Y also called response, target, or dependent variable
  • Set of predictors X1, X2, …, Xn also referred to as independent variables, input variables, regressors, or covariates
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Linear Regression Model

A

Arranged in, or extending, along a straight or nearly straight line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Single vs Multiple Linear Regression Model

A

Single: One independent prediction, i.e., single variable X

Multiple: two or more predictors, i.e., X1, X2, …

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Intuition for Single Linear Regression

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Intuition for Multiple Linear Regression

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Linear Regression Model

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Graphical visualisation of Linear Regression

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ordinary Least Squares (OLS)

A
  • Method for estimating the unknown parameters in a linear regression model
  • It minimises the errors associated with predicting values for the dependent variable Y
    • It uses a least squares criterion because without square we would allow positive and negative deviations from the model to cancel each other out
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Ordinary Least Square (OLS) pt2

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Objectives for single/multiple regression

A
  1. Predictive - detect the outcome value for new records given their input values
    1. Explanatory (or descriptive) - Quantifying / explaining the avg effect of inputs on an outcome & Data are treated as a random sample from a larger population of interest
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explanatory objective - in single/multiple regression

A

Generate statements useful for decision making

E.g. a unit increase in X is associated with an average increase of 2 points in Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Predictive objective - in single/multiple regression

A

Given an ew value for X, estimate the value for Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Overfitting

A
  • *Issue:**
  • The outcome correspond exactly, or is extremely close, to the given data set
  • I.e., the model learns the existing data too well
  • *Consequences:**
  • Model fails to include additional data or
  • Generates unreliable predictions
  • *Example:**
  • Considering creating a model for predicting grades of students given the hours they study
  • Using the data from UVT, applying it to TU/e data fails due to overfitting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Underfitting

A
  • *Issue:**
  • Model can’t accurately capture the data dependencies
  • Fails to identify effects supported by the data
  • Usually this happens due to the model’s simplicity
  • *Consequences:**
  • Model has bad generalisation capabilities when applied with new data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Complete Linear Regression Python example

A
17
Q

Quiz 1

A
18
Q

Quiz 2

A
19
Q

Quiz 3

A