machine-learning-models Flashcards

1
Q

What is the problem with linear separable data in logistic regression?

A

When training data is linearly separable, logistic regression behaves in undesired ways.

Learning tries to find decision boundaries that separate the data: this leads over complex boundaries very prone to overfitting.

If data is linearly separable: coefficients can go to infinity and indirectly provide massive conficents and confidence in the answers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is linearly separable data?

A

Data is linearly separable if there a coefficient such that:

For all POSITIVE training data:

  • Score (x) = w_hat.T h(x) > 0

For all NEGATIVE training data:

  • Score (x) = w_hat.T h(x) < 0

training error (W_hat) = 0. This is very likely an overfitting. Specially with very complex models in high dimensional spaces.

more info: https://www.coursera.org/lecture/ml-classification/optional-another-perspecting-on-overfitting-in-logistic-regression-lzGuI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly