Quiz #4 Flashcards

Exam Prep

1
Q
The functions that govern how disagreements among the predictions of ensemble models are reconciled are known as _________ functions.
  A. stacking
  B. sigmoid
  C. allocation
  D. combination
A

D. combination
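
Study note: two common combination functions are a plain majority vote and a weighted vote. The sketch below is illustrative only; the function names `majority_vote` and `weighted_vote` and the example weights are assumptions, not part of the quiz.

```python
from collections import Counter

def majority_vote(predictions):
    """Return the label predicted by the most models (ties go to the label seen first)."""
    return Counter(predictions).most_common(1)[0][0]

def weighted_vote(predictions, weights):
    """Return the label with the largest total weight across models."""
    totals = {}
    for label, weight in zip(predictions, weights):
        totals[label] = totals.get(label, 0.0) + weight
    return max(totals, key=totals.get)

# Three hypothetical models disagree about one example.
votes = ["spam", "ham", "spam"]
print(majority_vote(votes))                    # spam: two of three models agree
print(weighted_vote(votes, [0.2, 0.7, 0.1]))   # ham: the second model carries the most weight
```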

2
Q
The recall of a model is a measure of the completeness of the results of its predictions. This measure has the same value as the ____________ of the model.
  A. sensitivity
  B. specificity
  C. precision
  D. kappa
A

A. sensitivity
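
Study note: recall and sensitivity are both TP / (TP + FN), which is why they share a value. A minimal sketch with made-up confusion-matrix counts (the numbers are assumptions chosen for illustration):

```python
def recall(tp, fn):
    """Share of actual positives that the model found (same formula as sensitivity)."""
    return tp / (tp + fn)

def precision(tp, fp):
    """Share of predicted positives that were actually positive."""
    return tp / (tp + fp)

def specificity(tn, fp):
    """Share of actual negatives that the model correctly rejected."""
    return tn / (tn + fp)

# Hypothetical counts: 40 true positives, 10 false negatives,
# 20 false positives, 30 true negatives.
tp, fn, fp, tn = 40, 10, 20, 30
print(recall(tp, fn))       # 0.8
print(specificity(tn, fp))  # 0.6
print(round(precision(tp, fp), 3))
```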

3
Q

Which of these is not a problem with the partitioning approach of the holdout method?
A. It’s not always possible to create representative partitions of a data set.
B. Some samples may have too many or too few difficult cases, easy-to-predict cases, or outliers.
C. Each partition may have a larger or smaller proportion of some classes.
D. Substantial portions of data must be reserved to test and validate the model.

A

A. It’s not always possible to create representative partitions of a data set.

4
Q
The ROC curve plots the True Positive Rate against the _________________ of a model.
  A. False Positive Rate
  B. False Negative Rate
  C. Specificity
  D. True Negative Rate
A

A. False Positive Rate
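
Study note: each point on a ROC curve is an (FPR, TPR) pair at one decision threshold. The sketch below sweeps the threshold over a handful of made-up scores; `roc_points` and the data are hypothetical, and labels are assumed to be 1 for positive and 0 for negative.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, sweeping the threshold from strict to lenient."""
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points

# Hypothetical predicted probabilities and true labels.
scores = [0.9, 0.8, 0.6, 0.4, 0.2]
labels = [1, 1, 0, 1, 0]
for fpr, tpr in roc_points(scores, labels):
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

The curve always starts at (0, 0) and ends at (1, 1); a better model bends further toward the top-left corner.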

5
Q

One of the weaknesses of a Random Forest model is that unlike a decision tree, the model is not easily interpretable.
True
False

A

True

6
Q

One of the limitations to using the F-score is that it assumes that ________________________.
A. recall is always more important than precision
B. precision is always more important than recall
C. equal weight should be given to both precision and recall
D. the harmonic mean of precision and recall is zero

A

C. equal weight should be given to both precision and recall
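
Study note: the F-score on this card is the F1 score, the harmonic mean of precision and recall. The general F-beta form is not part of the card; it is shown here only to make the equal-weight assumption visible (beta = 1 weights both equally, beta > 1 favors recall, beta < 1 favors precision). The input values are made up.

```python
def f_beta(precision, recall, beta=1.0):
    """F-beta score; beta=1 gives the usual F1 (harmonic mean of precision and recall)."""
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Hypothetical model with low precision but perfect recall.
p, r = 0.5, 1.0
print(round(f_beta(p, r), 3))            # F1: equal weight
print(round(f_beta(p, r, beta=2.0), 3))  # F2: recall counts more, so the score rises
print(round(f_beta(p, r, beta=0.5), 3))  # F0.5: precision counts more, so the score falls
```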

7
Q
The sampling approach that creates a training set of the same length as the original data, using sampling with replacement, is known as _______________.
  A. cross-validation
  B. stratified sampling
  C. bootstrapping
  D. equal length sampling
A

C. bootstrapping
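
Study note: a minimal sketch of one bootstrap draw. Because sampling is with replacement, some rows repeat and others are never picked; the never-picked rows (often called out-of-bag) can serve as a test set. For large data sets roughly 1/e, about 36.8%, of rows end up out-of-bag. The helper name `bootstrap_sample` and the seed are assumptions.

```python
import random

def bootstrap_sample(data, rng):
    """Draw len(data) items with replacement; also return the never-drawn items."""
    indices = [rng.randrange(len(data)) for _ in data]
    sample = [data[i] for i in indices]
    chosen = set(indices)
    out_of_bag = [item for i, item in enumerate(data) if i not in chosen]
    return sample, out_of_bag

rng = random.Random(42)          # fixed seed so the draw is repeatable
data = list(range(1000))         # stand-in for 1000 rows of a data set
train, test = bootstrap_sample(data, rng)
print(len(train))                # same length as the original data
print(len(test) / len(data))     # out-of-bag share, expected to be near 0.368
```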

8
Q

The AUC for a classifier with no predictive value is ___

A

0.5
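
Study note: AUC can be read as the probability that a randomly chosen positive example gets a higher score than a randomly chosen negative one, with ties counted as half. A classifier that gives every example the same score therefore lands on 0.5, and one that ranks every positive above every negative reaches 1.0. The pairwise `auc` helper below is an illustrative sketch, not an optimized implementation.

```python
def auc(scores, labels):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

labels = [1, 0, 1, 0, 1, 0]
print(auc([0.5] * 6, labels))                        # 0.5: constant scores carry no information
print(auc([0.9, 0.1, 0.8, 0.2, 0.7, 0.3], labels))   # 1.0: every positive outranks every negative
```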

9
Q
Classification algorithms usually have parameters that need to be set before the learning process begins. These parameters are known as _____________.
  A. hyperparameters
  B. tuning grids
  C. supervised parameters
  D. parametric parameters
A

A. hyperparameters

10
Q
The error generated by a classifier during the training stage is known as the ____________ error.
  A. validation
  B. holdout
  C. bootstrap
  D. resubstitution
A

D. resubstitution
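
Study note: resubstitution error is measured on the same data the model was trained on, so it tends to be optimistic. The sketch below uses a 1-nearest-neighbor classifier on made-up one-dimensional data; it memorizes the training set (zero resubstitution error) yet still misses on unseen points. All names and values are hypothetical.

```python
def nearest_neighbor_predict(train, x):
    """Predict the label of the training point closest to x (1-nearest-neighbor)."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]

def error_rate(train, data):
    """Fraction of rows in data that the 1-NN model built on train gets wrong."""
    wrong = sum(1 for x, y in data if nearest_neighbor_predict(train, x) != y)
    return wrong / len(data)

# Hypothetical (feature, label) pairs; two training points act as label noise.
train = [(1.0, "a"), (2.0, "a"), (3.0, "b"), (4.0, "a"), (5.0, "b"), (6.0, "b")]
holdout = [(1.5, "a"), (2.9, "a"), (4.1, "b"), (5.5, "b")]

print(error_rate(train, train))    # 0.0: resubstitution error
print(error_rate(train, holdout))  # 0.5: error on unseen data
```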

11
Q

One of the major disadvantages of the leave-one-out cross-validation approach is that it _______________.
A. uses too much data
B. violates the holdout principle
C. is not a good predictor of future performance
D. is computationally expensive

A

D. is computationally expensive
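
Study note: leave-one-out cross-validation fits one model per row, so the cost grows with the size of the data set. The sketch below counts the fits using a deliberately trivial model that always predicts the training mean; `leave_one_out`, `fit_mean`, and `predict_mean` are hypothetical helpers.

```python
def leave_one_out(data, fit, predict):
    """Run LOOCV; return (mean absolute error, number of model fits)."""
    errors, fits = [], 0
    for i, (x, y) in enumerate(data):
        model = fit(data[:i] + data[i + 1:])  # train on every row except row i
        fits += 1
        errors.append(abs(predict(model, x) - y))
    return sum(errors) / len(errors), fits

def fit_mean(rows):
    """Toy model: remember the mean target value of the training rows."""
    return sum(y for _, y in rows) / len(rows)

def predict_mean(model, x):
    """Toy model ignores x and always predicts the stored mean."""
    return model

data = [(x, 2.0 * x) for x in range(10)]
mae, fits = leave_one_out(data, fit_mean, predict_mean)
print(fits)  # 10: one fit per row
```

With 10 rows this is cheap, but a data set with a million rows would need a million fits of a possibly slow learner.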

12
Q

The AUC metric and ROC curve can be used interchangeably because if two models have identical AUC values, they will always have the same ROC curve.
True
False

A

False
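
Study note: AUC collapses a whole curve into one number, so differently shaped curves can share it. The two hand-made curves below (lists of (FPR, TPR) points, chosen purely for illustration) have different shapes but the same trapezoidal area.

```python
def trapezoid_auc(points):
    """Area under a piecewise-linear curve given as (FPR, TPR) points sorted by FPR."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area

# Curve A jumps to TPR 0.5 immediately, then climbs slowly.
curve_a = [(0.0, 0.0), (0.0, 0.5), (1.0, 1.0)]
# Curve B climbs steadily and reaches TPR 1.0 at FPR 0.5.
curve_b = [(0.0, 0.0), (0.5, 1.0), (1.0, 1.0)]

print(trapezoid_auc(curve_a))  # 0.75
print(trapezoid_auc(curve_b))  # 0.75
```

Model A is better at very low false positive rates and model B at moderate ones, which the single AUC value hides.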

13
Q
The technique that sequentially builds strong learners as a linear combination of weak learners is known as __________.
  A. bumming
  B. bagging
  C. boosting
  D. bootstrap aggregation
A

C. boosting

14
Q
The meta-learning approach that utilizes the principle of creating a varied team of experts is known as an ________.
  A. bagged learner
  B. assemble
  C. meta-learner
  D. ensemble
A

D. ensemble

15
Q
The process of conducting a search to identify the optimal combination of hyperparameters to use for the learning process using a choice of evaluation methods and metrics is known as _______________ tuning.
  A. automated parameter
  B. model settings
  C. search space
  D. automatic hyper
A

A. automated parameter
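
Study note: a rough sketch of what such a search does, assuming a grid search over one hyperparameter (`k` of a hand-rolled k-nearest-neighbor classifier), a single validation set as the evaluation method, and accuracy as the metric. Every name and data value here is hypothetical; real tuning tools usually add resampling on top of this loop.

```python
from itertools import product

def knn_predict(train, x, k):
    """Majority label among the k training points nearest to x (use odd k for two classes)."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - x))[:k]
    labels = [label for _, label in nearest]
    return max(set(labels), key=labels.count)

def grid_search(train, validation, grid):
    """Try every hyperparameter combination; keep the first one with the best accuracy."""
    best_params, best_accuracy = None, -1.0
    names = list(grid)
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        correct = sum(1 for x, y in validation if knn_predict(train, x, **params) == y)
        accuracy = correct / len(validation)
        if accuracy > best_accuracy:
            best_params, best_accuracy = params, accuracy
    return best_params, best_accuracy

# Hypothetical 1-D data with two noisy training labels (at 3.0 and 7.0).
train = [(1.0, "a"), (2.0, "a"), (3.0, "b"), (4.0, "a"),
         (6.0, "b"), (7.0, "a"), (8.0, "b"), (9.0, "b")]
validation = [(2.9, "a"), (3.2, "a"), (6.9, "b"), (7.2, "b"), (1.2, "a"), (8.8, "b")]

best_params, best_accuracy = grid_search(train, validation, {"k": [1, 3, 5]})
print(best_params, best_accuracy)  # {'k': 3} 1.0
```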

16
Q

One of the disadvantages of the random cross-validation approach is that some instances may not be used and others may be used more than once.
True
False

A

True
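
Study note: a small sketch of why this happens, assuming "random cross-validation" means repeating an independent random train/test split several times. Since each split is drawn without regard to the earlier ones, nothing forces every row into a test set exactly once. The helper name and the sizes are assumptions.

```python
import random
from collections import Counter

def random_holdout_test_counts(n_items, test_size, repeats, rng):
    """Count how often each item index lands in the test set across random splits."""
    counts = Counter({i: 0 for i in range(n_items)})
    for _ in range(repeats):
        # Each split samples its test rows independently of earlier splits.
        counts.update(rng.sample(range(n_items), test_size))
    return counts

rng = random.Random(0)
counts = random_holdout_test_counts(n_items=20, test_size=4, repeats=5, rng=rng)
never_tested = [i for i, c in counts.items() if c == 0]
tested_repeatedly = [i for i, c in counts.items() if c > 1]
print(never_tested)
print(tested_repeatedly)
```

By contrast, k-fold cross-validation assigns every row to exactly one test fold.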

17
Q

The AUC for a perfect classifier is ____

A

1.0

18
Q

The kappa statistic is an adjustment of accuracy by accounting for the possibility of a correct prediction by chance alone.
True
False

A

True
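
Study note: kappa is (observed agreement - expected agreement) / (1 - expected agreement), where expected agreement is what the class proportions alone would produce. The made-up confusion matrix below shows how a model on imbalanced data can have high accuracy but a much lower kappa. `cohen_kappa` is an illustrative helper for the two-class case.

```python
def cohen_kappa(tp, fp, fn, tn):
    """Cohen's kappa for a 2x2 confusion matrix."""
    total = tp + fp + fn + tn
    observed = (tp + tn) / total  # plain accuracy
    # Chance agreement: product of predicted and actual class shares, summed over both classes.
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / total ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical imbalanced data: only 10 of 100 cases are positive.
tp, fp, fn, tn = 5, 5, 5, 85
print((tp + tn) / (tp + fp + fn + tn))        # 0.9 accuracy
print(round(cohen_kappa(tp, fp, fn, tn), 3))  # 0.444 after adjusting for chance
```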

19
Q
The goal of cross-validation is to _____________ across the iterations.
  A. choose the coolest model
  B. improve the performance of a model
  C. evaluate future performance
  D. eliminate bad models
A

C. evaluate future performance

20
Q

Occam’s Razor states that when presented with competing hypothetical answers to a problem, one should select the answer that makes the most assumptions.
True
False

A

False