Safety and Reliability Flashcards
Erroneous Behaviour of a Classifier
Given a trained classifier f : R^n -> R^k (mapping n features to k class scores) and a target function h : R^n -> R^k, an erroneous behaviour of the classifier f is demonstrated by a legitimate input x in R^n such that
arg max_j f_j(x) != arg max_j h_j(x)
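A minimal sketch of this condition (not part of the flashcards): f and h are treated as functions returning k-dimensional score vectors, and their arg max is compared; the toy f and h below are hypothetical stand-ins.
```python
import numpy as np

def is_erroneous(f, h, x):
    """Return True if arg max_j f_j(x) != arg max_j h_j(x)."""
    return int(np.argmax(f(x))) != int(np.argmax(h(x)))

# Toy example with k = 3 classes: f and h are hypothetical score functions.
f = lambda x: np.array([0.2, 0.7, 0.1])    # classifier predicts class 1
h = lambda x: np.array([0.9, 0.05, 0.05])  # target function says class 0
print(is_erroneous(f, h, np.zeros(4)))     # True -> erroneous behaviour
```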
Loss function
L(y, f(x)): the loss between the prediction f(x) and the ground truth y.
Empirical Loss
The average loss over a finite set of samples (e.g., the training or test set).
Expected Loss
The expectation of the loss over the underlying data distribution, i.e., the loss the model is expected to incur on unseen data (estimated in practice on a held-out set).
Generalisation Loss
The difference between the expected loss and the empirical loss (the generalisation gap). A large generalisation loss is a sign of overfitting.
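A minimal sketch tying the last four cards together, assuming a cross-entropy loss, a scikit-learn-style model with predict_proba, and a held-out set used as a finite-sample proxy for the expected loss; all names and the toy dataset are illustrative.
```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def cross_entropy(y_true, probs, eps=1e-12):
    # Loss L(y, f(x)) for one example: negative log-probability of the true class.
    return -np.log(probs[y_true] + eps)

def empirical_loss(model, X, y):
    # Average loss over a finite set of samples.
    probs = model.predict_proba(X)
    return float(np.mean([cross_entropy(yi, pi) for yi, pi in zip(y, probs)]))

# The held-out loss approximates the expected loss; its difference to the
# training loss estimates the generalisation loss.
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
gap = empirical_loss(model, X_te, y_te) - empirical_loss(model, X_tr, y_tr)
print(f"generalisation gap ~ {gap:.4f}")   # a large gap suggests overfitting
```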
Overfitting
A machine learning model is overfitted if it performs well on the training data but poorly on unseen test data.
Adversarial Examples
Inputs obtained by slightly perturbing legitimate inputs so that the classifier's prediction changes; they represent erroneous behaviours with safety implications.
Measurements of adversarial examples
- Magnitude of perturbation -> ||x-x’||
- Probability gap before and after the perturbation -> |f_y(x) - f_y(x’)|
(Here x is the original example, x’ is the perturbed example, and f_y is the probability the classifier assigns to class y.)
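A minimal sketch of both measurements, assuming f returns a probability vector and y is the class of interest; the L2 norm stands in for ||x - x’|| here, though other norms (e.g., L_inf) are also common.
```python
import numpy as np

def perturbation_magnitude(x, x_adv, ord=2):
    # ||x - x'|| under the chosen norm (L2 by default).
    return np.linalg.norm(x - x_adv, ord=ord)

def probability_gap(f, x, x_adv, y):
    # |f_y(x) - f_y(x')|: change in the probability assigned to class y.
    return abs(f(x)[y] - f(x_adv)[y])

# Toy example with a softmax "classifier" over three classes.
f = lambda v: np.exp(v) / np.exp(v).sum()
x = np.array([0.1, 0.4, 0.5])
x_adv = x + np.array([0.02, -0.01, 0.0])
print(perturbation_magnitude(x, x_adv), probability_gap(f, x, x_adv, y=1))
```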
Data Poisoning
The injection of malicious data into a training process so that the trained model behaves in a way it should not.
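A minimal sketch of one simple poisoning strategy (label flipping), given only as an illustration rather than as a specific attack from these cards: a fraction of the training labels is altered before training.
```python
import numpy as np

def flip_labels(y, fraction=0.1, num_classes=2, seed=0):
    # Return a poisoned copy of y in which `fraction` of the labels are
    # changed to a different, randomly chosen class.
    rng = np.random.default_rng(seed)
    y_poisoned = np.array(y).copy()
    idx = rng.choice(len(y), size=int(fraction * len(y)), replace=False)
    shift = rng.integers(1, num_classes, size=len(idx))  # never zero, so the class always changes
    y_poisoned[idx] = (y_poisoned[idx] + shift) % num_classes
    return y_poisoned

# Training on flip_labels(y_train) instead of y_train degrades or skews the model.
```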
Model Stealing
Given a model f, a model-stealing agent reconstructs a surrogate model f’ (e.g., by querying model f).
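A minimal sketch under the assumption that the attacker only has query access to the victim's predict function (all names and the toy victim are illustrative): query inputs are labelled with f's outputs and a surrogate f’ is fitted on those pairs.
```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

def steal_model(f_predict, n_queries=1000, n_features=10, seed=0):
    # Fit a surrogate f' on (query, f(query)) pairs obtained from the victim f.
    rng = np.random.default_rng(seed)
    X_query = rng.normal(size=(n_queries, n_features))  # attacker-chosen queries
    y_query = f_predict(X_query)                        # victim's answers
    return DecisionTreeClassifier().fit(X_query, y_query)

# Toy victim model f; the surrogate approximates its decision boundary.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
victim = LogisticRegression(max_iter=1000).fit(X, y)
surrogate = steal_model(victim.predict)
```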
Membership Inference
Infers whether a given sample was part of a model's training data, typically by training shadow models that imitate the target model and observing differences in the model's behaviour and outputs on members versus non-members.
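A heavily simplified sketch of the shadow-model idea (one shadow model, the model's output probabilities as attack features); the dataset and names are illustrative only.
```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Shadow model: trained on data the attacker controls, so membership is known.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_in, X_out, y_in, y_out = train_test_split(X, y, test_size=0.5, random_state=0)
shadow = LogisticRegression(max_iter=1000).fit(X_in, y_in)

# Attack model: learns to separate "member" from "non-member" samples using
# the shadow model's output probabilities as features.
features = np.vstack([shadow.predict_proba(X_in), shadow.predict_proba(X_out)])
membership = np.concatenate([np.ones(len(X_in)), np.zeros(len(X_out))])
attack = LogisticRegression().fit(features, membership)

# Against the real target model, the same features computed from its outputs
# would be fed to `attack` to guess whether a sample was in its training set.
```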