Experimental Setup Flashcards
The Training Dataset is used to ___ the parameters of the model
fit
The Validation Dataset is used to ___ the model with the parameters selected during the ___ and to adjust ___
test
training phase
hyper-parameters
The Test Dataset is used to provide an ___ of a final model fit on the training dataset
unbiased evaluation
Overfitting happens when the techniques used tend to exploit relations in the training data that ___
do not generalize to the rest of the data
The K-fold Cross-Validation works by ___ the data in K sets and one by one a set is selected as ___
splitting
test set
Error Measurement is the process of computing the error for the ___ and the ___
training set
test set
Balanced dataset are datasets where all classes are ___ while in imbalanced datasets some classes are ___
equally represented
more prevalent than others
We can ___ the training set but never the validation and tes sets
balance
A baseline is the ___ of a ___
result
very basic model
Cross-Validation is mostly used to get a more certain value of ___ for our model or to helps us ___ it better or select ___
performance
tune
another one
How to calculate the F1-score?
2* (Precision * Recall) / (Precision + Recall)