Quiz #4 Flashcards
Exam Prep
The functions that govern how disagreements among the predictions of ensemble models are reconciled are known as \_\_\_\_\_\_\_\_\_ functions. A. stacking B. sigmoid C. allocation D. combination
D. combination
The recall of a model is a measure of the completeness of the results of its predictions. This measure has the same value as the \_\_\_\_\_\_\_\_\_\_\_\_ of the model. A. sensitivity B. specificity C. precision D. kappa
A. sensitivity
Which of these is not a problem with the partitioning approach of the holdout method?
A. It’s not always possible to create representative partitions of a data set.
B. Some samples may have too many or too few difficult cases, easy-to-predict cases, or outliers.
C. Each partition may have a larger or smaller proportion of some classes.
D. Substantial portions of data must be reserved to test and validate the model.
A. It’s not always possible to create representative partitions of a data set.
The ROC curve is a measure of the True Positive Rate against the \_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_\_ of a model. A. False Positive Rate B. False Negative Rate C. Specificity D. True Negative Rate
A. False Positive Rate
One of the weaknesses of a Random Forest model is that unlike a decision tree, the model is not easily interpretable.
True
False
True
One of the limitations to using the F-score is that it assumes that ________________________.
A. recall is always more important than precision
B. precision is always more important than recall
C. equal weight should be given to both precision and recall
D. the harmonic mean of precision and recall is zero
C. equal weight should be given to both precision and recall
The sampling approach that creates a training set of equal length as the original data, using sampling with replacement, is known as \_\_\_\_\_\_\_\_\_\_\_\_\_\_\_. A. cross-validation B. stratified sampling C. bootstrapping D. equal length sampling
C. bootstrapping
The AUC for a classifier with no predictive value is ___
0.5
Classification algorithms usually have parameters that need to be set before the learning process begins. These parameters are known as \_\_\_\_\_\_\_\_\_\_\_\_\_. A. hyperparameters B. tuning grids C. supervised parameters D. parametric parameters
A. hyperparameters
The error generated by a classifier during the training stage is know as the \_\_\_\_\_\_\_\_\_\_\_\_ error. A. validation B. holdout C. bootstrap D. resubstitution
D. resubstitution
One of the major disadvantages of the leave-one-out cross-validation approach is that it _______________.
A. uses too much data
B. violates the holdout principle
C. is not a good predictor of future performance
D. is computationally expensive
D. is computationally expensive
The AUC metric and ROC curve can be used interchangeably because if two models have the same or identical AUC values, they will always have the same ROC curve.
True
False
False
The technique that sequentially builds strong learners as a linear combination of weak learners is known as \_\_\_\_\_\_\_\_\_\_. A. bumming B. bagging C. boosting D. bootstrap aggregation
C. boosting
The meta-learning approach that utilizes the principle of creating a varied team of experts is known as an \_\_\_\_\_\_\_\_. A. bagged learner B. assemble C. meta-learner D. ensemble
D. ensemble
The process of conducting a search to identify the optimal combination of hyperparameters to use for the learning process using a choice of evaluation methods and metrics is known as \_\_\_\_\_\_\_\_\_\_\_\_\_\_\_ tuning. A. automated parameter B. model settings C. search space D. automatic hyper
A. automated parameter