Supervised Learning Flashcards

1
Q

What is Naive Bayes classification?

A

Naive Bayes is a probabilistic classifier. It is based on probability models that incorporate strong independence assumptions…therefore they are considered as naive. It is one of the simple and most effective classification algorithms which helps in building the fast machine learning models that can make quick predictions. It makes predictions based on the probability of an object.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the formula for conditional probability assuming independent events?

A

P(A | B) = P(A,B) / PB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why do you need to standardize your variables when using kNN?

A

kNN is measuring the distance between data points. If variables aren’t on the same scale, variables with larger scale numbers will disproportionately impact the distance calculations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When will a parametric approach such a least-squares linear regression outperform non-parametric approaches such as KNN regression.

A

Simple; when the parametric form that has been chosen is a close approximation of the true model form.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When it comes to the test error rate, why do we take out predictors that do not appear to be significantly related to the outcome variable?

A

Using predictors that have no relationship with the response tends to cause a deterioration in the test error rate since such predictors cause an increase in variance without a corresponding decrease in bias

How well did you know this?
1
Not at all
2
3
4
5
Perfectly