Projects Flashcards
What does make_scorer do in sklearn?
P 342
To use our custom performance metric, we can
use the make scorer() scikit-learn function that takes the name of our custom function and creates a metric that we can use to evaluate models with the scikit-learn API.
SMOTE and ENN both work better when the input data is scaled beforehand. True/False why?
P 374
True, This is because both techniques involve using the nearest neighbor algorithm internally and this algorithm is sensitive to input variables with different scales.
Kde plots are somewhat like smoothed histograms, what does the difference between the shapes of kde plots of a variable for two classes mean?
P 433
Yes. We can see that the distributions for many variables do differ for the two class labels, suggesting that some reasonable discrimination between the classes will be feasible