8 - Naïve Bayes Classification Flashcards
What is the basis of Naïve Bayes classification methods?
Bayes Theorem
Developed by Reverend Thomas Bayes, it updates knowledge about data parameters by combining prior knowledge with new information.
What does the prior distribution represent in Bayes Theorem?
Previous knowledge about the data parameters
It is denoted as p(Y = y*).
What is the posterior distribution in the context of Bayes Theorem?
Updated parameter knowledge after observing data
Denoted as p(Y = y* | X*).
In a dataset with predictors X and response variable Y, how many class values can Y take in the given example?
Three possible class values: y1, y2, and y3
What is the objective of using Bayes Theorem in classification?
Identify the most likely class for a combination of predictor variable values
Specifically, find which of y1, y2, or y3 is most likely for the combination X*.
What does p(Y = y* | X*) represent?
The likelihood of class value y* given observed predictor values X*
How can you classify a record using the maximum a posteriori hypothesis?
Classify as the value of Y with the highest posterior probability
What is the class conditional independence assumption?
It allows writing p(X* | Y = y*) as the product of independent events
For example, p(X* | Y = y) = p(X1 | Y = y) × p(X2 | Y = y*).
What are the two predictor variables used in the wine classification example?
Alcohol content and sugar content
What is the prior probability of a wine being Red if there are 500 red wines out of 1000 total?
0.5
What is the marginal probability of Alcohol_flag being High in the wine dataset?
0.486
What is the marginal probability of Sugar_flag being Low in the wine dataset?
0.584
What is p(Alcohol_flag High | Type Red)?
0.436
What is the conditional probability p(Sugar_flag Low | Type White)?
0.4
How does the Naïve Bayes algorithm classify a wine with low alcohol and low sugar content?
It classifies it as Red based on higher posterior probability
What is the posterior probability of a low alcohol, low sugar wine being Red?
72.15%
What is the prior probability of a wine being White?
0.5
What is the probability of a low alcohol, low sugar wine being White?
30.92%
What happens to the classification when comparing prior and posterior probabilities?
Posterior probabilities can significantly change based on new data
What is the posterior probability of a wine being Red given high alcohol and high sugar content?
25.02%
Fill in the blank: The denominator in Bayes Theorem, p(X*), is known as the _______.
Marginal probability of the data
What is the posterior probability of a wine being red given high alcohol and high sugar?
25.02%
This is calculated using the Naïve Bayes algorithm.
What is the posterior probability of a wine being white given high alcohol and high sugar?
79.53%
This indicates the Naïve Bayes algorithm classifies the wine as white.
What is the Naïve Bayes classification for low alcohol and high sugar wine?
White
This classification is based on the alcohol and sugar content.