Lecture 3 - Linear Regression Flashcards
What is Linear Regression?
It predicts a value by multiplying each input feature by a weight, summing these up and adding a bias term - REFER TO SLIDES FOR THE FORMULA
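For reference, the standard general form (general knowledge, check against the slides for the lecture's exact notation): ŷ = θ0 + θ1·x1 + θ2·x2 + … + θn·xn, written compactly as ŷ = θᵀ·x, where θ is the parameter vector (bias θ0 plus the feature weights) and x is the feature vector with x0 = 1.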
Linear Regression Example
REFER TO ONENOTE
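Since the original example lives in OneNote, here is a minimal stand-in sketch using scikit-learn (my own illustration, not the lecture's example; the data is hypothetical):

import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data: y = 4 + 3x plus Gaussian noise
X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X[:, 0] + np.random.randn(100)

lin_reg = LinearRegression()
lin_reg.fit(X, y)
print(lin_reg.intercept_, lin_reg.coef_)   # learned bias (theta0) and weight (theta1)
print(lin_reg.predict([[1.5]]))            # prediction for a new instance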
What is the Normal Equation and how is it used in the performance measure?
A closed-form mathematical equation that directly gives the value of theta that minimises the cost function (MSE), without iterating - REFER TO SLIDES FOR FORMULA
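The standard closed-form expression (general knowledge, verify against the slides): θ̂ = (Xᵀ·X)⁻¹ · Xᵀ · y, where X is the matrix of training features (with a column of 1s for the bias term) and y is the vector of target values.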
Linear Regression Code (With Normal Equation Code example)
REFER TO ONENOTE
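As the actual code is in OneNote, here is a minimal NumPy sketch of the Normal Equation (my own illustration; the data and variable names are assumptions):

import numpy as np

# Hypothetical data: y = 4 + 3x plus noise
X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X + np.random.randn(100, 1)

X_b = np.c_[np.ones((100, 1)), X]                     # add x0 = 1 to each instance
theta_best = np.linalg.inv(X_b.T @ X_b) @ X_b.T @ y   # Normal Equation
print(theta_best)                                      # should come out close to [[4], [3]]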
What is Gradient Descent?
An optimisation algorithm that tweaks parameters iteratively to minimise a cost function (MSE). It measures the local gradient of the error function with regard to the parameter vector theta and goes in the direction of descending gradient. Once the gradient reaches zero, it has reached a minimum.
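The update rule in its standard form (general knowledge, check the slides): θ(next step) = θ − η·∇θ MSE(θ), where η is the learning rate and ∇θ MSE(θ) is the gradient vector of the cost function.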
Gradient Descent Example (CODE AND MATH)
REFER TO SLIDES
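The "math" part in its standard form (general knowledge, not the lecture's worked numbers): for MSE the gradient vector is ∇θ MSE(θ) = (2/m)·Xᵀ·(X·θ − y), where m is the number of training instances; each step computes this gradient and then applies the update rule above.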
What are some issues with Gradient Descent?
When the learning rate η (the Greek letter eta) is too small there is slow convergence
When η is too big it can overshoot, so it never converges or can even diverge
What are the 3 ways to do Gradient Descent?
Batch GD
Stochastic GD
Mini-Batch GD
What is Batch GD?
Uses the whole training set to compute the gradients at every step, which makes it very slow when the training set is large
What is Stochastic GD?
Just picks a random instance in the training set at every step and computes the gradients based only on that single instance. Makes the algorithm much faster since it has very little data to manipulate
Example of Batch GD
REFER TO SLIDES
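A minimal Batch Gradient Descent sketch (my own stand-in for the slides' example; the data, eta and n_iterations are assumed values):

import numpy as np

X = 2 * np.random.rand(100, 1)
y = 4 + 3 * X + np.random.randn(100, 1)
X_b = np.c_[np.ones((100, 1)), X]   # add the bias column x0 = 1

eta = 0.1            # learning rate
n_iterations = 1000
m = len(X_b)

theta = np.random.randn(2, 1)        # random initialisation
for iteration in range(n_iterations):
    gradients = 2 / m * X_b.T @ (X_b @ theta - y)   # gradient over the whole training set
    theta = theta - eta * gradients
print(theta)   # close to [[4], [3]]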
Example of Stochastic GD
REFER TO SLIDES
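A minimal Stochastic GD sketch (again my own illustration; it reuses X_b, y and m from the Batch GD sketch, and the simple learning schedule is an assumption):

import numpy as np

n_epochs = 50
t0, t1 = 5, 50                      # learning schedule hyperparameters

def learning_schedule(t):
    return t0 / (t + t1)

theta = np.random.randn(2, 1)
for epoch in range(n_epochs):
    for i in range(m):
        random_index = np.random.randint(m)          # pick one instance at random
        xi = X_b[random_index:random_index + 1]
        yi = y[random_index:random_index + 1]
        gradients = 2 * xi.T @ (xi @ theta - yi)     # gradient from a single instance
        eta = learning_schedule(epoch * m + i)       # gradually reduce the learning rate
        theta = theta - eta * gradients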
What is Mini-Batch GD?
At each step, computes the gradients on small random sets of instances (mini-batches). It gets a performance boost from hardware optimisation of matrix operations (e.g., vectorisation, GPUs).
Example of Mini-Batch GD
REFER TO SLIDES
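A minimal Mini-Batch GD sketch (my own illustration; batch_size, eta and n_epochs are assumed values, and X_b, y, m come from the Batch GD sketch):

import numpy as np

batch_size = 20
eta = 0.1
n_epochs = 50

theta = np.random.randn(2, 1)
for epoch in range(n_epochs):
    shuffled = np.random.permutation(m)              # shuffle the training set each epoch
    X_shuffled, y_shuffled = X_b[shuffled], y[shuffled]
    for start in range(0, m, batch_size):
        xi = X_shuffled[start:start + batch_size]    # one mini-batch
        yi = y_shuffled[start:start + batch_size]
        gradients = 2 / len(xi) * xi.T @ (xi @ theta - yi)
        theta = theta - eta * gradients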
What is Polynomial Regression?
Similar to Linear Regression, but used when the data is not a straight line. It models the relationship with higher-degree terms (powers of the features)
Example of Polynomial Regression
REFER TO SLIDES
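A minimal stand-in for the slides' example, using scikit-learn's PolynomialFeatures (my own illustration; the quadratic data is an assumption):

import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

# Hypothetical quadratic data: y = 0.5x^2 + x + 2 plus noise
X = 6 * np.random.rand(100, 1) - 3
y = 0.5 * X[:, 0] ** 2 + X[:, 0] + 2 + np.random.randn(100)

poly = PolynomialFeatures(degree=2, include_bias=False)
X_poly = poly.fit_transform(X)          # adds the squared feature as an extra column
lin_reg = LinearRegression().fit(X_poly, y)
print(lin_reg.intercept_, lin_reg.coef_)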
What are Learning Curves?
Learning Curves are plots of the model’s performance on the training set and the validation set as a function of the training set size. To generate the plots, train the model several times on different sized subsets of the training set.
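One way this could be done in code (a sketch under assumptions: matplotlib for plotting, a simple train/validation split, RMSE as the error; plot_learning_curves is a hypothetical helper, not a library function):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

def plot_learning_curves(model, X, y):
    X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2)
    train_errors, val_errors = [], []
    for m in range(1, len(X_train)):
        model.fit(X_train[:m], y_train[:m])          # train on the first m instances only
        y_train_pred = model.predict(X_train[:m])
        y_val_pred = model.predict(X_val)
        train_errors.append(mean_squared_error(y_train[:m], y_train_pred))
        val_errors.append(mean_squared_error(y_val, y_val_pred))
    plt.plot(np.sqrt(train_errors), "r-+", label="train")
    plt.plot(np.sqrt(val_errors), "b-", label="val")
    plt.xlabel("Training set size")
    plt.ylabel("RMSE")
    plt.legend()
    plt.show()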
What is Logistic Regression
Logistic Regression (also called Logit Regression) is commonly used to estimate the probability that an instance belongs to a particular class (e.g., what is the probability that this email is spam?).
How does a Logistic Regression model make a prediction?
A Logistic Regression model computes a weighted sum of the input features (plus a bias term), then outputs the logistic (sigmoid) of this result as the estimated probability p̂ - REFER TO SLIDES FOR FORMULA
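In its standard form (general knowledge, verify against the slides): p̂ = hθ(x) = σ(θᵀ·x), where σ(t) = 1 / (1 + e^(−t)) is the logistic (sigmoid) function; the model then predicts ŷ = 1 if p̂ ≥ 0.5 and ŷ = 0 otherwise.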
Logistic Regression Example
REFER TO SLIDES - LINK TO TRAINING AND COST FUNCTIONS
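A minimal stand-in sketch using scikit-learn (my own illustration; the iris petal-width example is an assumption, not necessarily the lecture's):

import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

iris = load_iris()
X = iris.data[:, 3:]                      # petal width as the single feature
y = (iris.target == 2).astype(int)        # 1 if Iris virginica, else 0

log_reg = LogisticRegression()
log_reg.fit(X, y)

X_new = np.linspace(0, 3, 5).reshape(-1, 1)
print(log_reg.predict_proba(X_new))       # estimated probabilities for each class
print(log_reg.predict(X_new))             # class predictions (0 or 1)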