LR Flashcards
1
Q
continuous class, but a discrete learner (“classifier”)
A
discretization
2
Q
discrete class, but a continuous learner (“regressor”)
A
Build a suite of regression tasks via multi-response linear regression: • perform one regression per discrete class, with all instances of that class set to 1, and other instances set to 0 • classify a given test instance by regressing its value relative to each class, and selecting the class with the highest value
like “one-vs-rest” multi-class SVM, -> too slow....
3
Q
Logistic Regression
A
a model where 1 = interesting class and 0 = uninteresting class
4
Q
Logistic Regression and NB
A
they both try to maxmize P(c|T) but in LR we estimate it directly with beta without making any assumption like NB.
5
Q
How do determine β in LR?
A
use Gradient Descent to maximize the log-likelihood
6
Q
Pros and Cons of LR
A
Pros:
• Vast improvement on Naive Bayes
• Particularly suited to frequency-based features (so, popular in NLP)
Cons: • Slow to train • Some feature scaling issues • Often needs a lot of data to work well • often need regularisation due to overfitting