HW 3 Solution Flashcards
linear or non linear: perceptron
linear
linear or non linear: multilayer perceptron
nonlinear
linear or non linear: decision tree
nonlinear
linear or non linear: k-nearest neighbors (kNN)
nonlinear
linear or non linear: logistic regression
linear
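A minimal sketch of why the perceptron counts as linear, assuming a 2-input perceptron with a step activation and the classic update rule. The helper name train_perceptron and the AND/XOR toy data are illustrative and not part of the homework. AND can be separated by a line, so training reaches an error-free pass; XOR cannot, so no number of epochs produces one.

```python
# Hypothetical helper for illustration; not taken from the homework.
def train_perceptron(data, epochs=100):
    """Train a 2-input perceptron; return (weights, bias, converged)."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for (x1, x2), label in data:
            pred = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
            update = label - pred  # 0 when correct, +1 or -1 when wrong
            if update != 0:
                w[0] += update * x1
                w[1] += update * x2
                b += update
                errors += 1
        if errors == 0:  # a full pass with no mistakes: a separating line exists
            return w, b, True
    return w, b, False

AND_DATA = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR_DATA = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]

_, _, and_converged = train_perceptron(AND_DATA)
_, _, xor_converged = train_perceptron(XOR_DATA)
print("AND separable by a line:", and_converged)  # True
print("XOR separable by a line:", xor_converged)  # False
```

A multilayer perceptron adds a hidden layer with a nonlinear activation, which is what lets it represent XOR-like boundaries.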
Decision trees can perform _ classification
multiclass
Decision trees can reduce overfitting by having a _ max depth
smaller
An ensemble of decision trees is called a _
forest
The prediction is given by the _ node in a decision tree
leaf
Feature values are encoded in the _ of a decision tree
edges
If there are N features, a decision tree can perfectly encode all combinations of
feature values with a depth of _
N
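A small sketch of the depth-N claim, assuming every feature is binary (the card does not state this). The names build_full_tree, predict, and tree_depth are hypothetical. The tree splits on one feature per level, the edges carry the feature values, and the leaves hold the predictions, so a depth-N tree has one leaf for each of the 2^N value combinations and can store any labeling of them. Parity is used as the label because it depends on all N features at once.

```python
from itertools import product

def build_full_tree(n_features, label_fn, prefix=()):
    """Split on feature len(prefix) at each level; leaves store label_fn's output."""
    depth = len(prefix)
    if depth == n_features:
        return {"leaf": label_fn(prefix)}
    return {
        "feature": depth,
        # Each edge is keyed by the feature value (0 or 1) that leads to the child.
        "children": {
            v: build_full_tree(n_features, label_fn, prefix + (v,)) for v in (0, 1)
        },
    }

def predict(tree, x):
    """Follow the edges matching x's feature values until a leaf is reached."""
    while "leaf" not in tree:
        tree = tree["children"][x[tree["feature"]]]
    return tree["leaf"]

def tree_depth(tree):
    if "leaf" in tree:
        return 0
    return 1 + max(tree_depth(child) for child in tree["children"].values())

N = 3

def parity(x):
    return sum(x) % 2

tree = build_full_tree(N, parity)
all_inputs = list(product((0, 1), repeat=N))
matches = all(predict(tree, x) == parity(x) for x in all_inputs)
print("depth:", tree_depth(tree), "inputs:", len(all_inputs), "all matched:", matches)
# depth: 3 inputs: 8 all matched: True
```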
If the decision boundary is linear, do we expect LDA or QDA to perform better on the
training set? On the test set?
We would expect QDA to perform better on the training set because its
increased flexibility will result in a closer fit. If the Bayes decision boundary is linear, we
expect LDA to perform better than QDA on the test set, as QDA could be subject to
overfitting.
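A rough one-dimensional sketch of the training-set half of this answer, under several assumptions that are not in the homework: two balanced Gaussian classes with equal variance (so the Bayes boundary is a single threshold), maximum-likelihood estimates, and hypothetical helper names. Sharing one variance across classes plays the role of LDA, and per-class variances play the role of QDA. Because the shared-variance model is a constrained version of the per-class model, the QDA-style fit always reaches at least as high a training log-likelihood. Training accuracy is not guaranteed to be higher on every sample, and the test accuracies depend on the seed, so none are stated here.

```python
import math
import random

def gaussian_logpdf(x, mean, var):
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def fit(xs, ys, shared_variance):
    """Fit 1-D Gaussian class models. shared_variance=True is LDA-like, False is QDA-like."""
    params = {}
    pooled_ss = 0.0
    for k in (0, 1):
        pts = [x for x, y in zip(xs, ys) if y == k]
        mean = sum(pts) / len(pts)
        ss = sum((x - mean) ** 2 for x in pts)
        pooled_ss += ss
        params[k] = {"prior": len(pts) / len(xs), "mean": mean, "var": ss / len(pts)}
    if shared_variance:
        for k in (0, 1):
            params[k]["var"] = pooled_ss / len(xs)  # pooled MLE variance
    return params

def log_joint(params, x, k):
    p = params[k]
    return math.log(p["prior"]) + gaussian_logpdf(x, p["mean"], p["var"])

def predict(params, x):
    return max((0, 1), key=lambda k: log_joint(params, x, k))

def accuracy(params, xs, ys):
    return sum(predict(params, x) == y for x, y in zip(xs, ys)) / len(xs)

def train_log_likelihood(params, xs, ys):
    return sum(log_joint(params, x, y) for x, y in zip(xs, ys))

def sample(n, rng):
    # Alternating labels keep the classes balanced; both classes have variance 1.
    ys = [i % 2 for i in range(n)]
    xs = [rng.gauss(0.0 if y == 0 else 1.5, 1.0) for y in ys]
    return xs, ys

rng = random.Random(0)
train_x, train_y = sample(20, rng)
test_x, test_y = sample(2000, rng)

lda = fit(train_x, train_y, shared_variance=True)
qda = fit(train_x, train_y, shared_variance=False)
ll_lda = train_log_likelihood(lda, train_x, train_y)
ll_qda = train_log_likelihood(qda, train_x, train_y)

print("train log-likelihood  LDA-like:", ll_lda, " QDA-like:", ll_qda)
print("test accuracy         LDA-like:", accuracy(lda, test_x, test_y),
      " QDA-like:", accuracy(qda, test_x, test_y))
```

Rerunning with different seeds and training sizes gives a feel for how the extra variance parameter affects test accuracy when the true boundary is linear.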
If the decision boundary is non-linear, do we expect LDA or QDA to perform better
on the training set? On the test set?
If the Bayes decision boundary is non-linear, we expect QDA to perform better
on both the training and test sets.
In general, as the sample size n increases, do we expect the test prediction
accuracy of QDA relative to LDA to improve, decline, or be unchanged? Why?
We expect the test prediction accuracy of QDA relative to LDA to improve as n
gets bigger. In general, as the sample size increases, a more flexible method will
yield a better fit as the variance is offset by the larger sample size.
True or False: Even if the Bayes decision boundary for a given problem is linear, we
will probably achieve a superior test error rate using QDA rather than LDA because
QDA is flexible enough to model a linear decision boundary. Justify your answer.
Solution: False. With few sample points, the variance from using a more flexible
method, such as QDA, would likely result in overfitting, yielding a higher test error rate
than LDA.