Module 3 Flashcards
Hyperparameter
Settings of the learning algorithm that are chosen before training, rather than learned from the data
Hyperparameter tuning
- split dataset into training/validation/test
- typical split ratios range from 60/20/20 to 80/10/10
- try different hyperparameters
- select best according to the validation dataset accuracy
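The tuning loop above can be sketched with a toy one-parameter "model" (the decision threshold stands in for a hyperparameter; the dataset and candidate values are made up for illustration):

```python
import random

# Toy labelled dataset: label is 1 when x > 0.5 (illustrative only).
data = [(i / 100, int(i / 100 > 0.5)) for i in range(100)]
random.seed(0)
random.shuffle(data)

# Split dataset into training/validation/test (here 60/20/20).
n = len(data)
train = data[:int(0.6 * n)]
val = data[int(0.6 * n):int(0.8 * n)]
test = data[int(0.8 * n):]

def accuracy(threshold, split):
    # One-hyperparameter "model": predict 1 when x exceeds the threshold.
    return sum(int(x > threshold) == y for x, y in split) / len(split)

# Try different hyperparameters and select the best
# according to validation accuracy.
candidates = [0.2, 0.4, 0.5, 0.6, 0.8]
best = max(candidates, key=lambda t: accuracy(t, val))
final_estimate = accuracy(best, test)  # report test accuracy once, at the end
```

The test split is touched only once, after the hyperparameter has been chosen, so the final estimate is not biased by the selection step.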
Cross-validation
- divide dataset into k equal folds
- k-1 folds for training+validation
- 1 for testing
- iterate k times
- each iteration tests on a different fold
- performance on all k held-out test sets can be averaged
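A minimal sketch of the procedure; the `evaluate` callback is a placeholder for training a model on the first argument and scoring it on the second:

```python
def k_fold_cv(data, k, evaluate):
    # Divide the dataset into k equal folds; each iteration holds out
    # a different fold for testing and trains on the remaining k-1.
    fold_size = len(data) // k
    scores = []
    for i in range(k):
        held_out = data[i * fold_size:(i + 1) * fold_size]
        rest = data[:i * fold_size] + data[(i + 1) * fold_size:]
        scores.append(evaluate(rest, held_out))
    # Performance on all k held-out sets is averaged.
    return sum(scores) / k
```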
Cross-validation parameter tuning
Each iteration:
- 1 fold for testing
- 1 fold for validation
- k-2 folds for training
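The per-iteration split above can be sketched as follows; `evaluate(hp, train, split)` is a hypothetical callback that trains with hyperparameter `hp` on `train` and returns accuracy on `split`:

```python
def cv_tune(data, k, candidates, evaluate):
    fold_size = len(data) // k
    folds = [data[i * fold_size:(i + 1) * fold_size] for i in range(k)]
    test_scores = []
    for i in range(k):
        test_fold = folds[i]                  # 1 fold for testing
        val_fold = folds[(i + 1) % k]         # 1 fold for validation
        train = [x for j, f in enumerate(folds)   # k-2 folds for training
                 if j not in (i, (i + 1) % k) for x in f]
        # Pick the hyperparameter that does best on the validation fold ...
        best = max(candidates, key=lambda hp: evaluate(hp, train, val_fold))
        # ... then score that choice on the untouched test fold.
        test_scores.append(evaluate(best, train, test_fold))
    return sum(test_scores) / k
```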
Cross-validation in production
- can use all the available data for training the model ✅
- don’t have a way of estimating the performance of the final trained model any more ❌
Confusion matrix: accuracy
(TP + TN) / (TP + TN + FP + FN)
Confusion matrix: precision
TP / (TP + FP)
Recall
TP / (TP + FN)
Macro-averaged recall
Average of recall for each class
F-measure
F1 = 2 × precision × recall / (precision + recall)
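The four confusion-matrix metrics above, computed from hypothetical binary counts:

```python
# Hypothetical binary confusion-matrix counts.
TP, FP, FN, TN = 40, 10, 5, 45

accuracy = (TP + TN) / (TP + TN + FP + FN)     # fraction of all correct
precision = TP / (TP + FP)                     # of predicted positives, how many real
recall = TP / (TP + FN)                        # of real positives, how many found
f1 = 2 * precision * recall / (precision + recall)
```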
Micro-averaging
Sum the raw counts (TP, FP, TN, FN) across classes, then compute the metric from the pooled totals
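Macro- vs micro-averaged recall, shown on made-up per-class (one-vs-rest) counts:

```python
# Hypothetical per-class counts for a 3-class problem.
counts = {
    "a": {"TP": 8, "FP": 2, "FN": 2},
    "b": {"TP": 3, "FP": 1, "FN": 7},
    "c": {"TP": 1, "FP": 9, "FN": 1},
}

# Macro: compute recall per class, then average the per-class recalls.
macro_recall = sum(c["TP"] / (c["TP"] + c["FN"])
                   for c in counts.values()) / len(counts)

# Micro: pool the raw counts across classes first,
# then compute one recall from the pooled totals.
tp = sum(c["TP"] for c in counts.values())
fn = sum(c["FN"] for c in counts.values())
micro_recall = tp / (tp + fn)
```

The two disagree here because macro weights every class equally while micro weights every example equally.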
Mean squared error
MSE = (1/N) Σᵢ (yᵢ − ŷᵢ)²
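The formula as a small helper (the values are illustrative):

```python
def mse(y_true, y_pred):
    # Mean squared error: average squared difference
    # between targets and predictions.
    return sum((y - yh) ** 2 for y, yh in zip(y_true, y_pred)) / len(y_true)

error = mse([3.0, 1.0, 2.0], [2.5, 0.0, 2.0])  # (0.25 + 1.0 + 0.0) / 3
```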
Imbalanced test set: solutions
- downsample the majority class
- upsample the minority class
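Both resampling strategies, sketched on a made-up 90/10 split where class 0 is the majority:

```python
import random
random.seed(0)

majority = [(i, 0) for i in range(90)]   # class 0: 90 examples
minority = [(i, 1) for i in range(10)]   # class 1: 10 examples

# Downsample the majority class to the minority's size.
downsampled = random.sample(majority, len(minority)) + minority

# Upsample the minority class (with replacement) to the majority's size.
upsampled = majority + random.choices(minority, k=len(majority))
```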
Overfitting
- good performance on training data
- poor generalisation to other data
Underfitting
- poor performance on training data
- poor generalisation to other data