04_supervised concepts Flashcards
What is the goal for supervised problems?
find a function f (the task)
that relates input data (x)
to output data (y)
via model parameters (θ)
such that f(x;θ) = y
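A minimal sketch of this idea in Python, assuming a simple linear model (all names and data are illustrative):
```python
import numpy as np

# illustrative model: f(x; theta) = theta[0] + theta[1] * x
def f(x, theta):
    return theta[0] + theta[1] * x

# seen data generated by y = 1 + 2x; the parameters theta are learned from it
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.0, 3.0, 5.0, 7.0])
theta = np.polyfit(x, y, deg=1)[::-1]  # least-squares fit -> (intercept, slope)
print(f(x, theta))  # close to y
```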
What is a hyperparameter?
a model parameter that the model does not learn (given by programmer)
What is the traditional (rule-based) approach to a supervised learning setup?
a person looks at the seen data (x and y) and writes the function by hand
the hand-written function is then tested on unseen x and unseen y
What is a machine-learning approach to a supervised learning setup?
the model finds the rules (function f) by itself
What tasks can ML learn?
- (multi-class) Classification
- binary classification
- multi-label classification
- regression
- object detection
- semantic segmentation
- instance segmentation
- synthesis
What is multi-class classification?
mapping input features to discrete classes of a single label
(one label with multiple classes)
eg:
label: color
classes: red, green, blue
What is binary classification?
mapping input features to a binary label
eg:
label: status
two classes: on, off
What is multi-label classification?
mapping input features to discrete classes of multiple labels (with multiple classes)
eg:
labels: color, sort, quality
with multiple classes per label,
eg red/green/blue for color or good/medium/bad for quality
What is regression?
mapping input features to a continuous variable
eg x = time, y = value
What is object detection?
approximately localize objects in image data with bounding boxes
eg: boxes in picture for cat and dog
What types of segmentation are there?
- semantic segmentation
- instance segmentation
What types of classification are there?
- multi-class classification
- binary classification
- multi-label classification
What is semantic segmentation?
assign class label to each pixel of an image based on what it is showing
eg what is part of cat? what is part of dog?
What is instance segmentation?
assign class label to each pixel of an image based on what it is showing
AND discriminate different instances of the class
eg: for the class animal, the model identifies two separate instances of that class
What is synthesis?
generate new data points based on a learned distribution
eg StyleGAN2 (creates faces) or Style Transfer
What is iid?
independent and identically distributed data
–> individual samples in both data sets are produced by the same data generation process
What is assumed when running an ML model on previously unseen data?
that unseen (new) data and the already seen (training) data are iid
–> the individual samples in both data sets are produced by the same data generation process
What is the lesson in iid?
for small sample sizes, data sets that are iid may still differ significantly!
the larger the sample, the more similar the data sets appear
What will the distributions in real data sets look like?
What are the implications on the performance of the model?
they will look different despite being iid, because of their limited size
successful training on one data set does not imply good performance on unseen data!
–> therefore, the model has to generalize well by preventing overfitting
What is a decision boundary?
the decision boundary separates the different classes
as learned by the trained model
What is overfitting?
the model memorizes the structure of the training/seen data
–> as a result, it generalizes badly on the overall data distribution
What can we do against overfitting?
regularization methods
When does a model generalize well?
when the decision boundary leads to equally good performance on both data sets
How do we measure, if the model generalizes well?
We have to define a performance metric
What does a performance metric do?
provide a quantitative assessment of how well our model performs
On what basis does the performance metric assess the model performance?
- ML implementation (model type and loss function)
- dataset
- task to be learned
What does the Performance metric need to be?
tailored to the model
and the task
and the dataset
How can we identify overfitting?
By comparing the performance on the seen/training data and some previously unseen/test data
What is usually done to the existing data in supervised learning?
the data is randomly split into three parts:
1) Training data set
2) Validation data set (hyperparameter-tuning)
3) Test data set (evaluate model performance)
What are typical ratios for splitting the dataset?
Train: 0.7
Validate: 0.15
Test: 0.15
What is a stratified split?
If we have more than one class, each class is split into train/validate/test on its own according to the set ratios
–> preserves the class fractions in split data sets
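A minimal sketch of a 0.7/0.15/0.15 stratified split, assuming scikit-learn (the data is random and illustrative):
```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(100, 2)             # illustrative features
y = np.random.randint(0, 2, size=100)  # illustrative binary labels

# take 70% for training, then split the rest 50/50 into validation and test;
# stratify preserves the class fractions in every split
X_train, X_tmp, y_train, y_tmp = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42)
X_val, X_test, y_val, y_test = train_test_split(
    X_tmp, y_tmp, test_size=0.5, stratify=y_tmp, random_state=42)
```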
How can the data be validated if the data set available is not big enough to perform a meaningful split?
“recycle” data by using cross-validation to get a better estimate
How is the k-fold cross-validation carried out?
1) split the shuffled data set into k parts (eg k = 3)
2) train with k−1 parts and test with the remaining part
3) repeat, reassigning the test and train parts
–> k independent runs
4) results in k performance metrics
–> report: avg + std / best-of-k
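A minimal sketch of k-fold cross-validation with k = 3, assuming scikit-learn and illustrative random data:
```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.linear_model import LogisticRegression

X = np.random.rand(60, 2)              # illustrative features
y = np.random.randint(0, 2, size=60)   # illustrative binary labels

# 3 independent train/test runs -> 3 performance metrics
scores = cross_val_score(LogisticRegression(), X, y, cv=3)
print(scores.mean(), scores.std())     # report: avg + std
```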
Does k-fold cross-validation improve the model's performance?
No
because it uses independent runs that do not build on each other
it just gives a more reliable estimate of the model's performance
What are methods to force the model to generalize? (6)
- limiting model capacity
- introducing uncertainty
- dropout (only NN)
- introducing noise
- early stopping (only NN; see the sketch after this list)
- Bagging / Ensembling
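A minimal sketch of the early-stopping logic, where a fixed list of per-epoch validation losses stands in for real NN training (illustrative only):
```python
# validation loss per epoch: improves first, then the model starts to overfit
val_losses = [0.9, 0.7, 0.6, 0.61, 0.62, 0.63, 0.64]

best_val, best_epoch, patience, wait = float("inf"), 0, 2, 0
for epoch, val_loss in enumerate(val_losses):
    if val_loss < best_val:
        best_val, best_epoch, wait = val_loss, epoch, 0  # new best: reset counter
    else:
        wait += 1
        if wait >= patience:  # validation stopped improving -> stop training
            break
print(f"stopped at epoch {epoch}, keep the model from epoch {best_epoch}")
```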
What is the usual general supervised learning pipeline?
1) feature engineering
2) data scaling
3) data splitting
4) define hyperparameters
5) train model on training data for fixed hyperparameters
6) evaluate model on validation data
7) repeat 4 to 6 until the performance on the validation data is maximised
8) evaluate trained model on test data and report the test data performance
–> performance metrics should be similar between test and validation data before showing the model the test data
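A minimal end-to-end sketch of steps 2) to 8), assuming scikit-learn; the data, the model type (SVC) and the hyperparameter grid are illustrative:
```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X = np.random.rand(200, 2)              # illustrative features
y = np.random.randint(0, 2, size=200)   # illustrative binary labels

# 2) data scaling (in practice, fit the scaler on the training data only)
X = StandardScaler().fit_transform(X)

# 3) data splitting (0.7 / 0.15 / 0.15)
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.3, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

# 4)-7) train for fixed hyperparameters, evaluate on validation data, repeat
best_score, best_C = -1.0, None
for C in [0.1, 1.0, 10.0]:
    model = SVC(C=C).fit(X_train, y_train)
    score = model.score(X_val, y_val)
    if score > best_score:
        best_score, best_C = score, C

# 8) evaluate the final model on the test data (only once) and report
final = SVC(C=best_C).fit(X_train, y_train)
print("test accuracy:", final.score(X_test, y_test))
```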
How do we measure the performance of our model?
Benchmarking
What is benchmarking?
refers to the process of quantitatively assessing your ML model’s performance
What is a metric?
measure for performance, depends on the task and the data set
What are the two most important regression task metrics?
What is the intuition behind the calculations?
- MAE (mean absolute error):
MAE = (1/N) Σ_i |ŷ_i − y_i|
- RMSE (root mean square error):
RMSE = √[(1/N) Σ_i (ŷ_i − y_i)²]
(ŷ_i: model prediction, y_i: ground truth)
Intuition: by how much the model prediction deviates from the ground truth on average
What is the difference between MAE and RMSE for small datasets?
RMSE is more sensitive to outliers
whether this is beneficial or not depends on the model and the problem
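A minimal numpy sketch of both metrics; the values are illustrative and the last point is an outlier to show the difference:
```python
import numpy as np

y_true = np.array([3.0, 5.0, 2.0, 7.0])
y_pred = np.array([2.5, 5.5, 2.0, 10.0])   # last prediction is an outlier

mae = np.mean(np.abs(y_pred - y_true))            # MAE = 1.0
rmse = np.sqrt(np.mean((y_pred - y_true) ** 2))   # RMSE ≈ 1.54 > MAE
print(mae, rmse)   # RMSE is pulled up more strongly by the outlier
```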
What are (binary) classification metrics?
- Accuracy
- Precision
- Recall
What is the Accuracy Metric and how is it calculated?
What is the overall fraction of correct predictions?
Accuracy = (TP + TN) / (TN + TP + FP + FN)
95% of all our predictions (dog and not-dog) are correct
What is the Precision Metric and how is it calculated?
What fraction of our positive predictions is truly positive? (quantifies “correctness”)
Precision = TP / (TP + FP)
95% of the dogs we predicted are actual dogs
What is the Recall Metric and how is it calculated?
What fraction of actual positives has been identified? (quantifies “completeness”)
Recall = TP / (TP + FN)
we correctly found 95% of the dogs that are in the image
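A minimal numpy sketch computing all three metrics from TP/TN/FP/FN (the labels are illustrative):
```python
import numpy as np

y_true = np.array([1, 1, 1, 0, 0, 0, 1, 0])   # illustrative ground truth
y_pred = np.array([1, 0, 1, 0, 1, 0, 1, 0])   # illustrative predictions

tp = np.sum((y_pred == 1) & (y_true == 1))
tn = np.sum((y_pred == 0) & (y_true == 0))
fp = np.sum((y_pred == 1) & (y_true == 0))
fn = np.sum((y_pred == 0) & (y_true == 1))

accuracy = (tp + tn) / (tp + tn + fp + fn)   # fraction of correct predictions
precision = tp / (tp + fp)                   # "correctness" of positive predictions
recall = tp / (tp + fn)                      # "completeness" of actual positives
print(accuracy, precision, recall)
```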
What is the issue with Accuracy/Precision/Recall as Classification Metrics?
Class imbalance!
eg “Will this asteroid impact Earth?”
almost all asteroids miss, so a model that always predicts “no impact” gets near-perfect accuracy
here, Recall is the most important metric because we don’t want to miss any impacting asteroids
What is the confusion matrix?
a common way to visualize the performance of a classification model
provides information on systematic confusion learned by the classifier
in a row-normalized confusion matrix, all elements in one row sum up to unity
What is a sign for a well-trained classifier regarding the confusion matrix?
the diagonal values should be as high as possible while off-diagonal elements should be as low as possible
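A minimal sketch of a row-normalized confusion matrix, assuming scikit-learn (the labels are illustrative):
```python
import numpy as np
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 1, 1, 2, 2]   # illustrative ground truth
y_pred = [0, 1, 1, 1, 2, 0]   # illustrative predictions

cm = confusion_matrix(y_true, y_pred)
cm_norm = cm / cm.sum(axis=1, keepdims=True)   # each row sums to unity
print(cm_norm)   # high diagonal + low off-diagonal = well-trained classifier
```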
Is accuracy a good metric for object detection?
No
accuracy would be very high because it looks at all individual pixels (most of which are background); it gives no indication of whether the relevant pixels were identified correctly
What is a good metric for object detection and image segmentation?
Intersection over union metric
What is IoU and how is it calculated?
Intersection over union metric
IoU = intersection / union = (A ∩ B) / (A ∪ B)
A is the predicted shape, B is the ground truth
Where is the IoU undefined?
where there is no ground truth and no prediction (the union becomes empty, giving 0/0)
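A minimal sketch of IoU for two axis-aligned bounding boxes given as (x1, y1, x2, y2); the boxes are illustrative:
```python
def iou(a, b):
    # overlap of the two boxes (the intersection rectangle)
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else float("nan")  # undefined if union is empty

print(iou((0, 0, 2, 2), (1, 1, 3, 3)))   # 1 / 7 ≈ 0.14
```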
How to report metrics?
- individual metric
- best-of-n (eg with cross-validation)
- averaging results (average metric over n model runs + standard deviation)
best choice depends on the specific problem and use case