Naive Bayes Flashcards
Compare Naive Bayes to other methods (2)
- Training is very fast
- Naive Bayes computes its parameters in closed form by counting
What’s the Naive bayes assumption?
What’s wrong with the naive bayes assumption?
The features might not be independent!
What’s the naive bayes recipe for closed form MLE?
What does maximizing likelihood accomplish?
- There is only a finite amount of probability mass (i.e. sum-to-one constraint)
- • MLE tries to allocate as much probability mass as possible to the things we have observed…
- …at the expense of the things we have not observed
What are the different Naive Bayes models and what kind of features do they correspond to?
What does Naive bayes do?
Provide a framework for generative modeling
What’s the overall process for Naive bayes? (3)
- Choose p(xm | y) appropriate to the data (e.g. Bernoulli for binary features, Gaussian for continuous features)
- Train by MLE or MAP 4.
- Classify by maximizing the posterior
What kind of model is logistic regression?
Discriminative classifier
Describe discriminative classifier (3)
– Example: Logistic Regression
– Directly model the conditional: p(y|x)
– Learning maximizes the conditional likelihood
Describe generative classifiers (4)
Compare naive bayes and logistic regression in terms of
- error
- model efficiency
- assumptions
- number of examples needed to estimate the parameters
- If model assumptions are correct: Naive Bayes is a more efficient learner (requires fewer samples) than Logistic Regression
- If model assumptions are incorrect: Logistic Regression has lower asymptotic error, and does better than Naïve Bayes
- Naïve Bayes makes stronger assumptions about the data but needs fewer examples to estimate the parameters
What do we know about the features of naive bayes?
Features x are assumed to be conditionally independent
given y. (i.e. Naïve Bayes Assumption)
What does the naive bayes assumption say, in words?
Features x are assumed to be conditionally independent
given y? (double check this)
Compare Naive bayes and logistic regression wrt assumptions made about the featuers