Classification Flashcards
In Supervised Learning, what is the goal?
To find a function f̂ that maps input x to output y, based on training data (xi, yi)
What is the difference between Classification and Regression in Supervised Learning?
Classification: y belongs to discrete classes (e.g., binary or multiclass)
Regression: y is a continuous value
What is the probabilistic view of Supervised Learning?
Specify a model P(X, Y | Θ), estimate parameters θ, and predict output using the estimated model
What is a Generative classifier?
A classifier that models the joint distribution P(X, Y) and uses Bayes’ rule for prediction
What is the Naive Bayes assumption?
Features xi are independent, conditionally on the class
How does the Naive Bayes classifier represent documents in text classification?
Using a bag of words model, where each document is represented by a binary vector of word presence/absence
What is the curse of dimensionality in classification?
As the number of features increases, the number of parameters needed grows exponentially (2^d - 1 per class)
How does Gaussian Naive Bayes differ from a full multivariate Gaussian classifier?
Gaussian Naive Bayes assumes the covariance matrix Σc is diagonal (features are independent given the class)
What are the steps for classifying using MAP (Maximum A Posteriori) in a Naive Bayes classifier?
See hand written paper.
What are two methods for estimating parameters in Naive Bayes?
Maximum Likelihood Estimation (MLE) and Maximum A Posteriori (MAP) estimation