Lecture 6 - Naïve Bayes Flashcards

Question 1

Q

Classification & clustering

Answer

A

Both result in a categorisation of records into one or more classes based on their values

Question 2

Q

Classification

Answer

A

Trains a model that allows classifying new records to one of the classes
Assumes the existence of predefined classes

Question 3

Q

Clustering

Answer

A

Divides the records into clusters
Records with high similarity reside inside a cluster and records of two clusters are dissimilar

Question 4

Q

Example of classification & clustering of e-mails

Answer

A

Classification: does e-mail go to inbox or spam?

Clustering: e-mail to work, friends or family folder?

Question 5

Q

Why do we need classification?

Answer

A

Organising documents is hard work
- Route email messages into folders
- Route help-desk inquiries to correct staff
- Place documents in predefined categories/topic hierarchy
Decided about (predefined) user interests/skills/…
- User modelling
- Instead of using human-authored expert system, let computer to induce rules or models from log data

Question 6

Q

Classification: Learning & applying

Question 7

Q

Classification Techniques

Answer

A

Naïve Bayes
Nearest Neighbor
Decision Trees
Support Vector Machines
Logistic regression
Deep learning
Ensemble classification
…

Question 8

Q

Naïve Bayes example

Question 9

Q

Naïve Bayes Classifier

Question 10

Q

Bayes theorem

Answer

A

Bayes rule is a standard formula for inverting conditional probabilities

Question 11

Q

Naïve Bayes Assumption

Answer

A

Naive conditional independence: assume that all features are independent given the class label y

Question 12

Q

Laplace Smoothing

Answer

A

Having a probability zero is problematic, because it wipes out all information in other probabilities

Solution:

Laplace Smoothing, or Correction, or Estimator

Incorporates a small-sample correction in every probability computation
Increase the numerator/denominator
Thus, no probability will be zero

Question 13

Q

Lecture Summary

Answer

A

Naïve Bayes is not so Naïve:

Its beauty is in its simplicity
Ability to handle categorical variables directly
Computational eefficient
Good classification performance, especially when the number of predictors is very large

Negative aspects:

Requires a very large number of records to obtain good results
Independence assumption may not hold for some attributes

Lecture 6 - Naïve Bayes Flashcards

(13 cards)