Experimental Setup Flashcards

Question 1

Q

The Training Dataset is used to ___ the parameters of the model

Question 2

Q

The Validation Dataset is used to ___ the model with the parameters selected during the ___ and to adjust ___

Answer

A

test
training phase
hyper-parameters

Question 3

Q

The Test Dataset is used to provide an ___ of a final model fit on the training dataset

Answer

A

unbiased evaluation

Question 4

Q

Overfitting happens when the techniques used tend to exploit relations in the training data that ___

Answer

A

do not generalize to the rest of the data

Question 5

Q

The K-fold Cross-Validation works by ___ the data in K sets and one by one a set is selected as ___

Answer

A

splitting

test set

Question 6

Q

Error Measurement is the process of computing the error for the ___ and the ___

Answer

A

training set

test set

Question 7

Q

Balanced dataset are datasets where all classes are ___ while in imbalanced datasets some classes are ___

Answer

A

equally represented

more prevalent than others

Question 8

Q

We can ___ the training set but never the validation and tes sets

Question 9

Q

A baseline is the ___ of a ___

Answer

A

result

very basic model

Question 10

Q

Cross-Validation is mostly used to get a more certain value of ___ for our model or to helps us ___ it better or select ___

Answer

A

performance
tune
another one

Question 11

Q

How to calculate the F1-score?

Answer

A

2* (Precision * Recall) / (Precision + Recall)

(11 cards)