Data Mining - Lecture Naive Bayes Flashcards
What is classification?
Training a model on labeled data so that it can assign new records to pre-defined classes.
What is clustering?
Grouping similar records together. There are no pre-defined classes; the groups are discovered from the data.
What is argmax P in Naive Bayes?
For every class, you compute the probability of that class given the observed values of the predictor variables.
You then pick the class that has the highest probability, hence argmax.
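A minimal sketch of the argmax step. The class scores below are hypothetical numbers chosen for illustration, not values from the lecture.

```python
# Hypothetical per-class scores (assumed values, not from the lecture).
scores = {"yes": 0.0159, "no": 0.0343}

# argmax: return the class label whose score is the highest.
predicted = max(scores, key=scores.get)
print(predicted)  # no
```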
What is the naive conditional independence assumption?
Assume that all features are independent given the class label y.
This means that you can compute the probability of each feature value given the class (Sunny given Yes etc.) individually and multiply them all together.
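Written as a formula, the assumption looks like this. The symbols x_1 to x_n for the feature values are assumed notation; the card itself only names the class label y.

```latex
P(x_1, x_2, \dots, x_n \mid y) = \prod_{i=1}^{n} P(x_i \mid y)
```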
How do you calculate P(Play Tennis = yes)?
You count the outcomes that are yes and divide that by the total number of outcomes.
The same calculation works for every other class.
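A small sketch of this count. The outcome column is hypothetical (9 yes and 5 no is an assumed split, not data given in these cards).

```python
from collections import Counter

# Hypothetical outcome column: 9 days with "yes", 5 days with "no".
play = ["yes"] * 9 + ["no"] * 5

# Prior of each class = count of that class / total number of outcomes.
counts = Counter(play)
priors = {label: n / len(play) for label, n in counts.items()}
print(priors)
```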
How do you calculate P(Outlook = sunny | Play Tennis = yes)?
You count the days that were sunny and on which tennis was played.
You divide that by the total number of days on which tennis was played.
So you compute a conditional probability: P(yes and sunny) / P(yes).
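The count ratio described above (days that were both sunny and yes, divided by all yes days) can be sketched as follows. It estimates the likelihood of sunny given yes. The records are a hypothetical sample, not data from the lecture.

```python
# Hypothetical (outlook, play) records, assumed for illustration.
records = [
    ("sunny", "no"), ("sunny", "no"), ("overcast", "yes"), ("rain", "yes"),
    ("rain", "yes"), ("rain", "no"), ("overcast", "yes"), ("sunny", "no"),
    ("sunny", "yes"), ("rain", "yes"), ("sunny", "yes"), ("overcast", "yes"),
    ("overcast", "yes"), ("rain", "no"),
]

# Keep only the days on which tennis was played.
yes_days = [outlook for outlook, play in records if play == "yes"]

# Count the sunny days among them and divide by all "yes" days.
sunny_and_yes = yes_days.count("sunny")
p_sunny_given_yes = sunny_and_yes / len(yes_days)
print(sunny_and_yes, len(yes_days))  # 2 9
```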
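How do you calculate the probability that a record belongs to a class given 3 variables?
P(class) * P(x1 given class) * P(x2 given class) * P(x3 given class)
This score is proportional to P(class given x1, x2, x3). Dividing it by the sum of the scores of all classes gives the actual probability.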
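A hedged sketch of the scoring step for three variables. It multiplies the class prior by the probability of each observed value given the class, which is the quantity Naive Bayes multiplies. The helper `class_score` and all numbers are hypothetical.

```python
from math import prod

def class_score(prior, likelihoods):
    """Prior times the product of P(x_i | class) for each variable."""
    return prior * prod(likelihoods)

# Hypothetical priors and per-variable likelihoods for two classes.
scores = {
    "yes": class_score(9 / 14, [2 / 9, 3 / 9, 3 / 9]),
    "no": class_score(5 / 14, [3 / 5, 4 / 5, 1 / 5]),
}

# Normalize the scores so that they sum to 1 across the classes.
total = sum(scores.values())
posteriors = {label: s / total for label, s in scores.items()}

# The predicted class is the one with the highest score.
predicted = max(scores, key=scores.get)
```

The ranking of the classes is the same before and after normalization, so the normalization can be skipped when only the predicted class is needed.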
Why do we sometimes need to use Laplace Smoothing?
If a value of an x variable never occurred together with the desired outcome class in the training data, its probability will be 0. Because all probabilities are multiplied, the whole product becomes 0 as well.
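A short illustration of the problem, using hypothetical numbers: one value was never seen together with the class, so its estimate is 0 and wipes out the other factors.

```python
from math import prod

# Hypothetical estimates for one class; the middle value was never
# observed together with this class, so its count is 0 out of 5.
prior = 5 / 14
likelihoods = [3 / 5, 0 / 5, 4 / 5]

score = prior * prod(likelihoods)
print(score)  # 0.0
```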
How do we use Laplace Smoothing?
For every conditional probability, you add 1 to the numerator and the number of possible values of that variable to the denominator.
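A minimal sketch of add-one smoothing, assuming the term added to the denominator is the number of distinct values the variable can take (the usual formulation). The function name `smoothed` and the counts are hypothetical.

```python
def smoothed(count, class_total, n_values, alpha=1):
    """Add-alpha estimate of P(value | class); alpha=1 is Laplace smoothing."""
    return (count + alpha) / (class_total + alpha * n_values)

# Hypothetical counts for the class "no" (5 days in total), where the
# variable outlook has 3 possible values: sunny, overcast, rain.
counts_no = {"sunny": 3, "overcast": 0, "rain": 2}

probs_no = {value: smoothed(c, 5, 3) for value, c in counts_no.items()}
print(probs_no["overcast"])  # 0.125
```

The value that was never observed now gets a small non-zero probability, and the smoothed probabilities of the three values still sum to 1.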
What are the advantages of Naive Bayes?
- Easy to understand and implement
- Handles categorical variables well
- Computationally efficient
- Good classification performance, especially with many predictors
What are the disadvantages of Naive Bayes?
- Requires a large dataset to perform well
- Assumes that attributes are independent, which is not always the case
Student number?
2064381