Classification Flashcards

Question 1

Q

In Supervised Learning, what is the goal?

Answer

A

To find a function f̂ that maps input x to output y, based on training data (xi, yi)

Question 2

Q

What is the difference between Classification and Regression in Supervised Learning?

Answer

A

Classification: y belongs to discrete classes (e.g., binary or multiclass)
Regression: y is a continuous value

Question 3

Q

What is the probabilistic view of Supervised Learning?

Answer

A

Specify a model P(X, Y | Θ), estimate parameters θ, and predict output using the estimated model

Question 4

Q

What is a Generative classifier?

Answer

A

A classifier that models the joint distribution P(X, Y) and uses Bayes’ rule for prediction

Question 5

Q

What is the Naive Bayes assumption?

Answer

A

Features xi are independent, conditionally on the class

Question 6

Q

How does the Naive Bayes classifier represent documents in text classification?

Answer

A

Using a bag of words model, where each document is represented by a binary vector of word presence/absence

Question 7

Q

What is the curse of dimensionality in classification?

Answer

A

As the number of features increases, the number of parameters needed grows exponentially (2^d - 1 per class)

Question 8

Q

How does Gaussian Naive Bayes differ from a full multivariate Gaussian classifier?

Answer

A

Gaussian Naive Bayes assumes the covariance matrix Σc is diagonal (features are independent given the class)

Question 9

Q

What are the steps for classifying using MAP (Maximum A Posteriori) in a Naive Bayes classifier?

Answer

A

See hand written paper.

Question 10

Q

What are two methods for estimating parameters in Naive Bayes?

Answer

A

Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP) estimation