Experimental Setup Flashcards

1
Q

The Training Dataset is used to ___ the parameters of the model

A

fit

2
Q

The Validation Dataset is used to ___ the model with the parameters selected during the ___ and to adjust ___

A

test
training phase
hyper-parameters

3
Q

The Test Dataset is used to provide an ___ of a final model fit on the training dataset

A

unbiased evaluation
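
Note: the three datasets from cards 1 to 3 can be sketched as one shuffled split. This is a minimal illustration only; the function name `split_dataset`, the 60/20/20 proportions, and the fixed seed are assumptions, not part of the cards.

```python
import random

def split_dataset(samples, val_fraction=0.2, test_fraction=0.2, seed=0):
    """Shuffle a copy of samples and cut it into train / validation / test lists."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)  # fixed seed (assumed) for repeatability
    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)
    test = shuffled[:n_test]                 # held out for the final evaluation
    val = shuffled[n_test:n_test + n_val]    # used to adjust hyper-parameters
    train = shuffled[n_test + n_val:]        # used to fit the model parameters
    return train, val, test

train, val, test = split_dataset(range(100))
```

The test list is touched only once, after all tuning is done, which is what keeps its evaluation unbiased.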

4
Q

Overfitting happens when the techniques used tend to exploit relations in the training data that ___

A

do not generalize to the rest of the data

5
Q

K-fold Cross-Validation works by ___ the data into K sets; one by one, each set is selected as the ___

A

splitting

test set
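
Note: a minimal sketch of the K-fold procedure, working on sample indices only. The helper name `k_fold_indices` is hypothetical, and the data is assumed to be already shuffled; no model is trained here.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) once for each of the k folds."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]                # this fold is the test set
        train_idx = indices[:start] + indices[start + size:]  # the other folds train
        yield train_idx, test_idx
        start += size

folds = list(k_fold_indices(10, 3))
```

Each sample lands in the test set exactly once, so averaging the K test errors uses all of the data for evaluation.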

6
Q

Error Measurement is the process of computing the error for the ___ and the ___

A

training set

test set

7
Q

Balanced datasets are datasets where all classes are ___, while in imbalanced datasets some classes are ___

A

equally represented

more prevalent than others

8
Q

We can ___ the training set but never the validation and test sets

A

balance
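
Note: the card does not say how to balance. One common option is random oversampling, sketched below as an assumption; the function name `oversample` and the toy data are hypothetical. It is applied to the training data only.

```python
import random
from collections import Counter

def oversample(train_samples, train_labels, seed=0):
    """Duplicate random minority-class samples until every class has equal count."""
    rng = random.Random(seed)
    by_class = {}
    for sample, label in zip(train_samples, train_labels):
        by_class.setdefault(label, []).append(sample)
    target = max(len(items) for items in by_class.values())
    balanced = []
    for label, items in by_class.items():
        # Draw extra copies (with replacement) to reach the largest class size.
        extra = [rng.choice(items) for _ in range(target - len(items))]
        balanced.extend((s, label) for s in items + extra)
    rng.shuffle(balanced)
    return balanced

balanced_train = oversample(["a", "b", "c", "d", "e"], [0, 0, 0, 0, 1])
class_counts = Counter(label for _, label in balanced_train)
```

The validation and test sets keep their original class proportions, so their scores still reflect the data the model will actually see.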

9
Q

A baseline is the ___ of a ___

A

result

very basic model
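
Note: one example of a very basic model is a classifier that always predicts the most frequent training label. The sketch below assumes that choice; the names `majority_class_baseline` and `accuracy` and the toy labels are hypothetical.

```python
from collections import Counter

def majority_class_baseline(train_labels):
    """Return a predictor that ignores its input and outputs the most common label."""
    most_common_label, _ = Counter(train_labels).most_common(1)[0]
    return lambda sample: most_common_label

def accuracy(predict, samples, labels):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(predict(s) == y for s, y in zip(samples, labels))
    return correct / len(labels)

predict = majority_class_baseline(["spam", "ham", "ham", "ham"])
baseline_accuracy = accuracy(predict, ["m1", "m2", "m3", "m4"],
                             ["ham", "spam", "ham", "ham"])
```

A real model is only worth keeping if it beats this kind of result on the same test data.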

10
Q

Cross-Validation is mostly used to get a more reliable estimate of ___ for our model, or to help us ___ it better or select ___

A

performance
tune
another one

11
Q

How to calculate the F1-score?

A

2 * (Precision * Recall) / (Precision + Recall)
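
Note: a small worked example of the formula, assuming the usual definitions Precision = TP / (TP + FP) and Recall = TP / (TP + FN). The counts are made up for illustration, and returning 0.0 when there are no true positives is a convention chosen here.

```python
def f1_score(true_positives, false_positives, false_negatives):
    """Compute F1 from raw counts via precision and recall."""
    if true_positives == 0:
        return 0.0  # assumed convention: avoids dividing by zero
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * (precision * recall) / (precision + recall)

# Illustrative counts: 8 true positives, 2 false positives, 4 false negatives.
score = f1_score(true_positives=8, false_positives=2, false_negatives=4)
```

With these counts, Precision is 0.8 and Recall is about 0.667, which gives an F1 of about 0.727.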
