Week 2: Logistic Regression (only) Flashcards
What are two properties of the classes in log reg that can make criteria other than the (overall) misclassification rate the biggest priority?
1) An asymmetric problem - where it is more important to correctly predict some classes than others,
2) An imbalanced problem - where classes occur with very different frequencies.
Give an example of when it is more important to avoid falsely predicting the negative class than to avoid falsely predicting the positive class?
In a situation where we predict the health status of patients. Falsely predicting the negative class could mean classifying a sick patient as healthy.
Give an example of when we can encounter an imbalanced problem?
If we are modeling a very rare disease.
What is the equation that needs to be solved to compute the decision boundary in log reg? Why?
g(x) = 1 - g(x), i.e. g(x) = 0.5.
This is because the solutions are the points in input space for which the two classes are predicted to be equally probable, and these points therefore lie on the decision boundary.
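A short derivation sketch, assuming the standard logistic model for the positive-class probability, g(x) = exp(theta^T x) / (1 + exp(theta^T x)); it also shows why the resulting boundary is linear, which connects to the next card:

\[
g(\mathbf{x}) = 1 - g(\mathbf{x})
\iff g(\mathbf{x}) = \tfrac{1}{2}
\iff \frac{e^{\boldsymbol{\theta}^{\mathsf{T}}\mathbf{x}}}{1 + e^{\boldsymbol{\theta}^{\mathsf{T}}\mathbf{x}}} = \tfrac{1}{2}
\iff e^{\boldsymbol{\theta}^{\mathsf{T}}\mathbf{x}} = 1
\iff \boldsymbol{\theta}^{\mathsf{T}}\mathbf{x} = 0,
\]

so the decision boundary is the set of points where \(\boldsymbol{\theta}^{\mathsf{T}}\mathbf{x} = 0\), a (hyper)plane in the input space.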
What is a linear classifier?
A model whose decision boundary is linear.
What is the softmax function?
softmax(z)_m = exp(z_m) / sum_{j=1}^{M} exp(z_j), for m = 1, ..., M. It maps a vector of M real numbers to M non-negative values that sum to 1, so the output can be interpreted as class probabilities; it is the multiclass generalization of the logistic function.
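A minimal NumPy sketch of the definition above (illustrative only; the function name and the use of NumPy are my own choices, not from the course):

import numpy as np

def softmax(z):
    # Subtract the max before exponentiating for numerical stability;
    # this does not change the output because softmax is invariant to
    # adding the same constant to every component of z.
    z = np.asarray(z, dtype=float)
    e = np.exp(z - z.max())
    return e / e.sum()

# Three logits in, three probabilities out, summing to 1.
print(softmax([2.0, 1.0, 0.1]))  # roughly [0.659, 0.242, 0.099]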
How do we write the loss function L(y_hat, y_i) more specifically (for log reg)?
L[p(y_i = 1 | x_i; theta), y_i], i.e. the predicted probability of the positive class takes the place of the point prediction y_hat.
Write the full cost function where the log-likelihood has been simplified, and identify the loss and the cost. What do we call this loss function?
J(theta) = -(1/n) [ sum over i with y_i = 1 of ln g(x_i; theta) + sum over i with y_i = -1 of ln(1 - g(x_i; theta)) ].
The term contributed by a single data point, -ln g(x_i; theta) or -ln(1 - g(x_i; theta)), is the loss; the average of these terms over all n training points is the cost.
The loss function is called the binary cross-entropy loss.
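A small NumPy sketch of the cost above, assuming labels in {-1, +1} and the logistic model g(x) = 1 / (1 + exp(-theta^T x)); the function names and toy data are my own, not from the course:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def binary_cross_entropy_cost(theta, X, y):
    # X: (n, p) input matrix, y: (n,) labels in {-1, +1}, theta: (p,) parameters.
    g = sigmoid(X @ theta)                                   # p(y = 1 | x) for each data point
    losses = np.where(y == 1, -np.log(g), -np.log(1.0 - g))  # loss per data point
    return losses.mean()                                     # cost = average loss over n points

# Toy data: intercept folded into X as a column of ones.
X = np.array([[1.0, 0.5], [1.0, -1.2], [1.0, 2.3]])
y = np.array([1, -1, 1])
theta = np.array([0.1, 0.8])
print(binary_cross_entropy_cost(theta, X, y))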
Write only the (general) binary cross-entropy loss and simplify. Specify this general loss for logistic regression.
L[g(x_i; theta), y_i] = -I{y_i = 1} ln g(x_i; theta) - I{y_i = -1} ln(1 - g(x_i; theta)). Since exactly one of the indicators equals 1 for each data point, this simplifies to -ln g(x_i; theta) when y_i = 1 and to -ln(1 - g(x_i; theta)) when y_i = -1.
For logistic regression, g(x_i; theta) = exp(theta^T x_i) / (1 + exp(theta^T x_i)), and inserting this gives the logistic loss ln[1 + exp(-y_i theta^T x_i)].
What is the logistic loss?
ln [1 + exp(-y_i theta^T x_i)], with y_i in {-1, 1}.
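A quick numerical check (a sketch, under the same label and model assumptions as above) that the binary cross-entropy loss and the logistic loss give the same value for a single data point:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = 0.7  # an arbitrary value of theta^T x for one data point
for y in (1, -1):
    g = sigmoid(z)                                  # predicted p(y = 1 | x)
    bce = -np.log(g) if y == 1 else -np.log(1 - g)  # binary cross-entropy loss
    logistic = np.log(1 + np.exp(-y * z))           # logistic loss
    print(y, bce, logistic)  # the two loss values agree for both y = 1 and y = -1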
What is the log of (1/n)?
- log(n), since log(1/n) = log(1) - log(n) = 0 - log(n).
What is a classification rule?
A rule that maps predicted probabilities into class predictions.
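A minimal sketch of such a rule, assuming labels in {-1, +1}; the 0.5 threshold is the standard choice, and moving it (for example lowering it so the positive class is predicted more often) is one way to address the asymmetric and imbalanced problems from the earlier cards:

import numpy as np

def classify(probabilities, threshold=0.5):
    # probabilities: predicted p(y = 1 | x) for each data point.
    # Predict class 1 when the probability exceeds the threshold, otherwise class -1.
    probabilities = np.asarray(probabilities)
    return np.where(probabilities > threshold, 1, -1)

print(classify([0.9, 0.3, 0.55]))                 # [ 1 -1  1]
print(classify([0.9, 0.3, 0.55], threshold=0.2))  # [ 1  1  1]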
What is computationally convenient about the likelihood function for the Bernoulli distribution used in logistic regression?
That for each data point the pmf reduces to only one of its two parts: the part with the probability of y = 1 when y_i = 1, and the part with the probability of y = -1 when y_i = -1. The log-likelihood therefore becomes a simple sum of log-probabilities, one term per data point.
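A sketch in equation form, assuming labels y_i in {-1, 1} and writing g(x) for p(y = 1 | x; theta), of how this reduction makes the log-likelihood a simple sum:

\[
p(y_i \mid \mathbf{x}_i; \boldsymbol{\theta}) =
\begin{cases}
g(\mathbf{x}_i; \boldsymbol{\theta}) & \text{if } y_i = 1,\\[2pt]
1 - g(\mathbf{x}_i; \boldsymbol{\theta}) & \text{if } y_i = -1,
\end{cases}
\qquad
\ln \prod_{i=1}^{n} p(y_i \mid \mathbf{x}_i; \boldsymbol{\theta})
= \sum_{i:\, y_i = 1} \ln g(\mathbf{x}_i; \boldsymbol{\theta})
+ \sum_{i:\, y_i = -1} \ln \bigl(1 - g(\mathbf{x}_i; \boldsymbol{\theta})\bigr),
\]

which (multiplied by -1/n) is exactly the cost function from the card above.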