11-logistic regression Flashcards

Question 1

Q

What is logistic regression?

Answer

A

Logistic regression is a binary classification model. It is a probabilistic discriminative model, because it optimises P(Y|x) directly. It doesn’t assume conditional independence

Question 2

Q

What are the log odds?

Answer

A

Log odds is a transformation used in the process of defining the logistic regression formula. It is calculated as the log(P(x) / (1-p(x)))

Question 3

Q

What is the logistic regression formula?

Answer

A

P(Y|x:theta) = 1/(1+e^-(regression formula))

Question 4

Q

How should the logistic regression function be interpreted?

Answer

A

If P(Y|X;theta) > 0.5, predict y = 1, otherwise y = 0

Question 5

Q

How does multinomial logistic regression compare to binomial logistic regression?

Answer

A

The probability of each class is calculated by passing through the softmax function, a generalisation of the sigmoid function

Question 6

Q

What are the pros of logistic regression?

Answer

A

It has a probabilistic interpretation
There are no restrictive assumptions on features
Often outperforms naive bayes
Particularly suited to frequency-based features

Question 7

Q

What are the cons of logistic regression?

Answer

A

It can only learn linear feature-data relationships
There are some feature scaling issues
Often needs a lot of data to work well
Overfitting can be a big problem

Question 8

Q

What is cross-entropy loss and its relation to negative log likeli-
hood?

Answer

A

Cross-entropy measures the difference between two probability distributions, p and q.

H(p, q) = − ∑p(x)log(q(x))

Question 9

Q

What happens if perceptron is applied to non-linearly separable data?

Answer

A

It will likely not converge, it will instead oscillate between multiple solutions

Question 10

Q

What is linear separability?

Answer

A

A dataset is linearly separable if we can separate all classes by drawing a line between them.

Question 11

Q

What is a linear classifier?

Answer

A

A classifier is linear if its decision boundary is a linear function