KNN Flashcards
1
Q
Can we determine the significance of the predictor in KNN?
A
No
2
Q
In which case KNN outperforms LDA and Logistic regression?
A
KNN is a completely non-parametric approach, no assumptions are made about the shape of the decision boundary. So, when the decision boundary is highly non linear, it dominates LDA and Logistic regression.
3
Q
In case of non-linear data, does KNN always outperforms other classification techniques?
A
No, even when the data exhibits the complex non-linear relationship, KNN can still give poor result if the level of smoothness is not chosen correctly
4
Q
How to determine the value of K in KNN?
A
Cross-validation
However, it is also sometime taken as Sqrt(n)
5
Q
Disadvantage of KNN?
A
- Lazy learning technique - There is no such thing like training a KNN model. So if the number of data set is too large, it is too computationally expensive.
- For high dimensional data, distance calculation using Euclidean distance is not reliable.
- Affected by outliers
- Poor performance in Imbalance dataset
- Not interpretable - Black box model
- Non-homogeneous dataset (need scaling)