Naive Bayes Model Flashcards by Unknown Unknown

What is an assumption of the Naive Bayes model?

Independence among predictors.

The effect of the value of a predictor variable on a given class is not affected by the values of other predictors.

Explain Bayes’ theorem

Bayes’ theorem calculates the posterior probability: the probability of an event occurring after new information has been taken into account. For example, a weather data set can be used to build a model that decides whether to go outside and play soccer.

Naive Bayes

A classifier that calculates the posterior probability of each possible outcome and predicts the outcome with the highest probability.
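The prediction rule above can be sketched from scratch on a toy data set. This is a minimal illustration, not a library implementation: the weather rows, the feature names (outlook, windy), and the `predict` helper are all hypothetical and chosen only to show the mechanics.

```python
from collections import Counter

# Hypothetical toy data: (outlook, windy) -> play soccer?
rows = [
    (("sunny", "no"), "yes"),
    (("sunny", "yes"), "no"),
    (("rainy", "yes"), "no"),
    (("overcast", "no"), "yes"),
    (("sunny", "no"), "yes"),
    (("rainy", "no"), "yes"),
    (("rainy", "yes"), "no"),
    (("overcast", "yes"), "yes"),
]

# Count how often each class occurs, and how often each
# (feature position, feature value) occurs together with each class.
class_counts = Counter(label for _, label in rows)
feature_counts = Counter()
for features, label in rows:
    for i, value in enumerate(features):
        feature_counts[(i, value, label)] += 1

def predict(features):
    """Return (predicted class, posterior per class) for one observation."""
    scores = {}
    for label, n_label in class_counts.items():
        score = n_label / len(rows)  # prior P(A)
        for i, value in enumerate(features):
            # P(B_i | A), multiplied in under the independence assumption
            score *= feature_counts[(i, value, label)] / n_label
        scores[label] = score
    total = sum(scores.values())  # plays the role of P(B); assumed non-zero here
    posteriors = {label: s / total for label, s in scores.items()}
    return max(posteriors, key=posteriors.get), posteriors

prediction, posteriors = predict(("sunny", "yes"))
print(prediction, posteriors)
```

The unnormalized scores are divided by their sum so that the posteriors add up to 1; the predicted class is the one with the largest posterior.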

What type of machine learning model is the Naive Bayes model?

Supervised learning: classification.

Bayes’ Theorem equation

The equation finds the probability of an event, A, given that another event, B, is true.

P(A|B) = P(B|A) * P(A) / P(B)
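As a quick worked example of the formula, the sketch below plugs in made-up numbers. The events (A = "play soccer", B = "weather is sunny") and all three probabilities are assumptions for illustration only.

```python
def posterior(p_b_given_a: float, p_a: float, p_b: float) -> float:
    """Bayes' theorem: P(A|B) = P(B|A) * P(A) / P(B)."""
    return p_b_given_a * p_a / p_b

# Hypothetical values, not taken from any real data set.
p_a = 0.6          # P(A): prior probability of playing soccer
p_b_given_a = 0.5  # P(B|A): probability of sun on days when soccer is played
p_b = 0.4          # P(B): overall probability of sun

p_a_given_b = posterior(p_b_given_a, p_a, p_b)
print(round(p_a_given_b, 2))
```

With these numbers, seeing sunny weather raises the probability of playing from the prior of 0.6 to a posterior of 0.5 * 0.6 / 0.4 = 0.75.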

P(A)

The overall probability of the outcome: the prior probability of event A before any evidence (feature) is seen.

P(B|A)

The conditional probability: the probability of B, given A.

P(B)

The probability of the value of the predictor variable (the evidence).

In probability, what does “A” represent?

class label: one of the possible outcomes or categories within a dataset.

In probability, what does “B” represent?

predictor value

What does P(A|B) stand for?

The posterior probability: the probability of the class label (A) after the evidence (B, the feature) has been seen.

In the context of probability, what is conditional independence?

Variables B and C are independent of one another on the condition that a third variable, A, is given. In Naive Bayes, this is the assumption that each predictor variable (the different Bs in the formula) is independent of the others, conditional on the class (A).

Conditional independence is about how variables (Bs) interact with each other when you take into account the influence of a third variable (A).

What’s the conditional independence equation?

P(B|C, A) = P(B|A)

the probability of B, given C and A, is equal to the probability of B, given A.

Or given A, introducing C does not change the probability of B.
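The equation can be checked numerically on a small joint distribution that is built to satisfy it. In the sketch below, the binary variables and all probability values are hypothetical; the joint is constructed as P(A) * P(B|A) * P(C|A), so B and C are conditionally independent given A by design.

```python
from itertools import product

# Hypothetical parameters for three binary variables.
p_a = {0: 0.3, 1: 0.7}          # P(A = a)
p_b_given_a = {0: 0.2, 1: 0.9}  # P(B = 1 | A = a)
p_c_given_a = {0: 0.5, 1: 0.1}  # P(C = 1 | A = a)

def bern(p, x):
    """Probability that a binary variable with P(1) = p takes value x."""
    return p if x == 1 else 1 - p

# Joint distribution P(A, B, C) = P(A) * P(B|A) * P(C|A).
joint = {
    (a, b, c): p_a[a] * bern(p_b_given_a[a], b) * bern(p_c_given_a[a], c)
    for a, b, c in product((0, 1), repeat=3)
}

def prob(**fixed):
    """Sum the joint over all outcomes matching the fixed values of a, b, c."""
    return sum(
        p for (a, b, c), p in joint.items()
        if all({"a": a, "b": b, "c": c}[k] == v for k, v in fixed.items())
    )

p_b_given_ca = prob(a=1, b=1, c=1) / prob(a=1, c=1)  # P(B=1 | C=1, A=1)
p_b_given_a_only = prob(a=1, b=1) / prob(a=1)        # P(B=1 | A=1)

# Without conditioning on A, B and C are still dependent here.
p_b_given_c = prob(b=1, c=1) / prob(c=1)             # P(B=1 | C=1)
p_b = prob(b=1)                                      # P(B=1)

print(p_b_given_ca, p_b_given_a_only, p_b_given_c, p_b)
```

The first two values agree, which is the conditional independence equation. The last two differ, showing that conditional independence given A does not imply that B and C are independent overall.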

Naive Bayes assumption (in reality)

The predictor variables (B and C) are assumed to be conditionally independent of each other, given the target variable (A).

In reality, this assumption is very often not true.

However, Naive Bayes models still often perform well even when the data violates the assumption.

Naive Bayes assumption on predictor variables

the individual predictor variables (Bs, Cs) are assumed to contribute equally to the model’s prediction

Name 2 Naive Bayes assumptions

Predictor variables are independent of each other, conditional on the class.
Predictor variables contribute equally to the model’s prediction.

Name 3 Advantages of Naive Bayes

One of the simplest classification algorithms
Fast training time compared with many other classifiers
Highly scalable

Naive Bayes Use Cases

document analysis/classification
spam filtering

Name 2 Disadvantages of Naive Bayes

Few datasets have truly conditionally independent predictors.
The “zero frequency” problem: a category or event has not been observed in the training data, which leads to a probability of zero for that category.

Zero Frequency problem

Occurs when the dataset has no occurrences of a class label together with some value of a predictor variable. The estimated conditional probability for that combination is then zero. Since the final posterior probability is found by multiplying all of the individual probabilities together, a single zero automatically makes the whole result zero.
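The effect of a single zero can be seen in a few lines. The sketch below uses hypothetical counts and a hypothetical `conditional` helper; it also shows additive (Laplace) smoothing, a common remedy that is not covered on the card itself, as one way to avoid the zero.

```python
def conditional(count: int, class_total: int, n_values: int, alpha: float = 0.0) -> float:
    """Estimate P(value | class); alpha > 0 applies additive (Laplace) smoothing."""
    return (count + alpha) / (class_total + alpha * n_values)

# Hypothetical counts for one class that has 10 training rows.
prior = 0.5
class_total = 10
counts = [0, 6]    # first feature value never seen with this class, second seen 6 times
n_values = [3, 2]  # number of distinct values each feature can take

unsmoothed = prior
smoothed = prior
for count, k in zip(counts, n_values):
    unsmoothed *= conditional(count, class_total, k)           # contains a factor of 0
    smoothed *= conditional(count, class_total, k, alpha=1.0)  # every factor is > 0

print(unsmoothed, smoothed)
```

Without smoothing, the unseen combination wipes out the evidence from the other feature. With alpha = 1, every count is treated as if it had been seen once more, so the product stays positive.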

What type of variables is BernoulliNB used for?

Used for binary/Boolean features

What type of dataset is CategoricalNB used for? And name a use case?

Specifically designed for categorical features.
Models the probability of each feature value given a class with a categorical distribution (a multinomial distribution with a single trial), one per feature.

What is ComplementNB used for?

Primarily designed to handle imbalanced datasets.
Estimates feature statistics from the complement of each class (all training samples that do not belong to that class) to improve performance on imbalanced data.
Typically works with multinomial data (like text).

What is GaussianNB used for?

Used for continuous features that are assumed to be normally distributed within each class.
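The Gaussian assumption can be illustrated with a per-class mean and standard deviation. This is a simplified sketch rather than scikit-learn's GaussianNB implementation; the temperature values, class names, and the `gaussian_pdf` helper are hypothetical.

```python
import math
from statistics import mean, pstdev

def gaussian_pdf(x: float, mu: float, sigma: float) -> float:
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

# Hypothetical temperatures (in degrees C) observed for each class.
temps = {"yes": [20.0, 22.0, 24.0], "no": [8.0, 10.0, 12.0]}

# Fit one normal distribution per class: P(temperature | class).
params = {label: (mean(values), pstdev(values)) for label, values in temps.items()}

x = 19.0  # new observation
likelihoods = {label: gaussian_pdf(x, mu, sigma) for label, (mu, sigma) in params.items()}
best = max(likelihoods, key=likelihoods.get)
print(best, likelihoods)
```

Each likelihood takes the place of P(B|A) in Bayes' theorem. With equal priors for the two classes, the class with the larger likelihood also has the larger posterior.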

What is MultinomialNB used for and what are some use cases?

Used for multinomial (discrete) features. Suitable for text classification, document categorization, spam filtering, and other tasks involving count data.

What does setting stratify=y mean? When should it be done?

If the full data set has a class split of 80/20, stratifying ensures that this **proportion is maintained** in both the **training and test data.** Setting stratify=y tells the splitting function (for example, scikit-learn's train_test_split) that it should **use the class ratio** found in the **y variable** (our target).

When should you set stratify=y, and what are the consequences of not doing it?

* The **greater your class imbalance**, the more important it is to stratify when you split the data.
* If we didn’t stratify, then the function would split the data randomly, and we could get an **unlucky split** that **doesn’t include any of the minority class** in the test data, which means we get an **ineffective model evaluation.**
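What stratification does can be sketched with the standard library alone. The `stratified_split` helper below is hypothetical and deliberately simplified; in practice, scikit-learn's `train_test_split(X, y, stratify=y)` performs this kind of split for you.

```python
import random
from collections import Counter, defaultdict

def stratified_split(labels, test_fraction=0.25, seed=0):
    """Return (train_idx, test_idx) that preserve the class proportions of labels."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for i, label in enumerate(labels):
        by_class[label].append(i)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)  # random choice within each class
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return train_idx, test_idx

y = [0] * 80 + [1] * 20  # hypothetical target with an 80/20 class split
train_idx, test_idx = stratified_split(y)

train_counts = Counter(y[i] for i in train_idx)
test_counts = Counter(y[i] for i in test_idx)
print(train_counts, test_counts)
```

Because the split is made separately inside each class, both the training and test sets keep the 80/20 ratio, and the minority class is guaranteed to appear in the test data.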

Naive Bayes Model Flashcards

(27 cards)