Lecture 7A: Logistic Regression Flashcards

Question 1

Q

What does logistic regression measure?

Answer

A

Logistic regression measures the relationship between a categorical dependent variable and one or more independent variables by estimating probabilities using the logistic function.

Question 2

Q

When should a logistic model be used?

Answer

A

Independent variables are continuous
Meets the assumptions of linear regression models
Distribution fits a linear model, but the target class is binary (normal distribution)

Question 3

Q

What does it mean to train the model?

Answer

A

It means finding the optimal values of the coefficients, such that we get the best predictive performance, or the best separation of Y(1)’s and N(0)’s.

Question 4

Q

Optimal Coefficients

Answer

A

● The optimal coefficients can be used to predict the outcome for unseen feature values (the x values in the equation)
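
As a minimal sketch of how trained coefficients might be applied to new x values: the coefficient and intercept numbers below are hypothetical, chosen only for illustration, and the 0.5 cut-off is an assumed (though common) threshold rather than something stated in the lecture.

```python
import math

def predict_probability(coefficients, intercept, features):
    """Return P(y = 1) for one sample using the logistic function."""
    # Linear combination of the features, then squashed into (0, 1).
    z = intercept + sum(b * x for b, x in zip(coefficients, features))
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical "trained" values, not taken from the lecture.
coefficients = [0.8, -0.5]
intercept = -1.0

# A sample the model has not seen during training.
unseen_sample = [2.0, 1.0]
probability = predict_probability(coefficients, intercept, unseen_sample)

# Assumed decision threshold of 0.5.
predicted_class = 1 if probability >= 0.5 else 0
print(probability, predicted_class)
```

Here the linear part evaluates to 0.1, so the probability lands just above 0.5 and the sample is assigned to class 1.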

Question 5

Q

Predictive Models

Answer

A

● Predictive models are “predictive” and they are expected to have “Errors”
○ Objective is to go as CLOSE as possible to the would be reality
○ Error is the gap between the prediction and the reality
○ Process: Feed the model with labeled data and modify the parameters to minimize the ERROR (the training process)

Question 6

Q

Feature importance of the model features can be found by

Answer

A

○ Multiplying the coefficients by the standard deviation of the corresponding feature
○ Or converting the data set to standardized data before getting the coefficients
○ Higher absolute coefficient values indicate larger influence of corresponding features on the Outcome (Target Variable)
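
The first option can be sketched with NumPy as follows. The data and the raw coefficients are hypothetical, and the sketch assumes the coefficients came from a model fitted on the unstandardized features. Taking the absolute value is an added assumption so that strongly negative coefficients also count as influential.

```python
import numpy as np

# Hypothetical data: two features on very different scales.
X = np.array([
    [50.0, 1.2],
    [60.0, 0.8],
    [70.0, 1.5],
    [80.0, 1.1],
])

# Hypothetical coefficients fitted on the raw (unstandardized) features.
raw_coefficients = np.array([0.05, 1.5])

# Scale each coefficient by the standard deviation of its feature.
feature_std = X.std(axis=0)
importance = np.abs(raw_coefficients * feature_std)

# Feature indices ordered from most to least influential.
ranking = np.argsort(importance)[::-1]
print(importance, ranking)
```

In this made-up example the second feature has the larger raw coefficient, yet the first feature ranks higher once scale is taken into account, which is why raw coefficients on unscaled data can mislead.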

Question 7

Q

Linear regression is similar to logistic regression, except…

Answer

A

Logistic Regression predicts if something is true or false, instead of predicting something continuous, like size…

Question 8

Q

Instead of fitting a line, like we do in linear regression, in logistic regression, we fit…

Answer

A

An “S”-shaped “logistic function”. The “S” curve goes from zero to one.
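
A short sketch of the curve itself, evaluated at a few arbitrary inputs, may make the zero-to-one range concrete. The function name `logistic` and the sample points are illustrative choices.

```python
import math

def logistic(z):
    """Standard logistic (sigmoid) function."""
    return 1.0 / (1.0 + math.exp(-z))

# Large negative inputs approach 0, large positive inputs approach 1,
# and an input of 0 sits exactly in the middle at 0.5.
for z in (-6, -2, 0, 2, 6):
    print(z, round(logistic(z), 4))
```

The output never reaches exactly 0 or 1, which is what lets it be read as a probability.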

Question 9

Q

What is logistic regression usually used for?

Answer

A

It is usually used for classification

Question 10

Q

Just like linear regression, logistic regression can work with what type of data?

Answer

A

Logistic regression can work with continuous data (like weight and age) and discrete data (like genotype and astrological sign)

Question 11

Q

Logistic regression does not have the same concept of a “residual” which is used in linear regression, so it can’t use least squares and it can’t calculate R^2, instead it uses…

Answer

A

Maximum Likelihood
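
For reference, the quantity being maximized is usually written as below. The notation is an assumption, not taken from the lecture: $p_i$ is the model's predicted probability that sample $i$ belongs to class 1, $y_i \in \{0, 1\}$ is its true label, and $\beta$ stands for the coefficients.

```latex
L(\beta) = \prod_{i=1}^{n} p_i^{\,y_i} (1 - p_i)^{1 - y_i},
\qquad
\ell(\beta) = \log L(\beta)
            = \sum_{i=1}^{n} \bigl[ y_i \log p_i + (1 - y_i) \log (1 - p_i) \bigr]
```

In practice the log-likelihood $\ell(\beta)$ is maximized, since a sum is easier to work with than a product.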

Question 12

Q

In summary, logistic regression can be used to

Answer

A

classify samples, and it can use different types of data (like size and/or genotype) to do that classification. It can also be used to assess what variables are useful for classifying samples

Question 13

Q

How do we find optimal values of the coefficients?

Answer

A

By minimizing a Cost Function (also called a Loss Function or Error Function)

Question 14

Q

Cost Function

Answer

A

Alternate Terms - Loss Function

Question 15

Q

Gap between “prediction” and “reality” is prediction “…”

Answer

A

“Error”

Question 16

Q

In Logistic Regression, how do we minimize error?

Answer

A

We feed the model with “labelled data” and continually modify its parameters (or coefficients) to minimize the ERROR (the training process).
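
One way this loop might look in practice is sketched below using plain gradient descent. Gradient descent is an assumed choice of optimizer (the lecture does not name one), and the data set, learning rate, and iteration count are hypothetical values picked for illustration.

```python
import numpy as np

def train_logistic_regression(X, y, learning_rate=0.1, n_iterations=5000):
    """Fit weights and bias by repeatedly nudging them to reduce the error."""
    n_samples, n_features = X.shape
    weights = np.zeros(n_features)
    bias = 0.0
    for _ in range(n_iterations):
        # Current predicted probabilities for every labelled sample.
        probabilities = 1.0 / (1.0 + np.exp(-(X @ weights + bias)))
        # Gap between prediction and reality.
        error = probabilities - y
        # Move the parameters a small step against the average gradient.
        weights -= learning_rate * (X.T @ error) / n_samples
        bias -= learning_rate * error.mean()
    return weights, bias

# Hypothetical labelled data: one feature, binary target.
X = np.array([[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]])
y = np.array([0, 0, 0, 1, 1, 1])

weights, bias = train_logistic_regression(X, y)
probabilities = 1.0 / (1.0 + np.exp(-(X @ weights + bias)))
predictions = (probabilities >= 0.5).astype(int)
print(weights, bias, predictions)
```

Each pass uses the labels to measure the error and then adjusts the coefficients slightly, which is the "continually modify" step described in the answer.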

Question 17

Q

How do we know when we reached the “trained state” where the ERROR is minimal?

Answer

A

It’s a mathematical optimization problem, and there are various approaches.
In linear regression, we try to minimize the Mean Squared Error (MSE).

Instead of using error values directly, we develop a function that measures the “cost” or “loss” related to the error, and the function is “continuous”.

We make the function “convex” so there is a clear “global minimum”.
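
The card does not name a specific cost function. A common choice for logistic regression is log loss (binary cross-entropy), which is continuous and convex in the coefficients. The sketch below assumes that choice and shows the cost of a single prediction at three hypothetical probabilities.

```python
import math

def log_loss(y_true, p_predicted):
    """Cost of one prediction: -[y*log(p) + (1 - y)*log(1 - p)]."""
    return -(y_true * math.log(p_predicted)
             + (1 - y_true) * math.log(1 - p_predicted))

# The true class is 1 in every case; only the predicted probability changes.
confident_right = log_loss(1, 0.9)
unsure = log_loss(1, 0.5)
confident_wrong = log_loss(1, 0.1)

print(confident_right, unsure, confident_wrong)
```

The cost grows steeply as the model becomes confidently wrong, so minimizing it pushes predicted probabilities toward the true labels.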

(17 cards)