MLSEC 10 Flashcards
ML Pipeline
Data Collection and Labeling
System Design and Learning
Performance Evaluation
Deployment and Operation
Pitfalls in Data Collection and Labeling
Sampling Bias
Label Inaccuracy
Pitfalls in System Design and Learning
Biased parameters
Spurious correlations
Data snooping
Pitfalls in Performance Evaluation
Inappropriate baselines
Inappropriate measures
Base-rate fallacy
Pitfalls in Deployment and Operation
Lab-only evaluation
Inappropriate threat model
Sampling Bias
The collected data does not sufficiently represent the true data distribution of the underlying security problem.
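A minimal sketch of this pitfall, using synthetic NumPy data and scikit-learn (all sources, locations, and numbers are invented): malware is collected from only one source, so the measured performance does not carry over to the true distribution.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)
benign    = rng.normal(0.0, 1.0, size=(1000, 5))
malware_a = rng.normal(3.0, 1.0, size=(500, 5))   # the only malware source that was collected
malware_b = rng.normal(0.5, 1.0, size=(500, 5))   # present in the wild, missing from the data set

# Biased sample: benign data plus malware from source A only
# (evaluated in-sample just to keep the sketch short)
X_biased = np.vstack([benign, malware_a])
y_biased = np.array([0] * 1000 + [1] * 500)
clf = LogisticRegression(max_iter=1000).fit(X_biased, y_biased)
print("biased sample:    ", accuracy_score(y_biased, clf.predict(X_biased)))

# Data reflecting the true distribution also contains source B
X_true = np.vstack([benign, malware_a, malware_b])
y_true = np.array([0] * 1000 + [1] * 1000)
print("true distribution:", accuracy_score(y_true, clf.predict(X_true)))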
Label Inaccuracy
The ground-truth labels are inaccurate, unstable, or erroneous, affecting the estimated performance.
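A minimal sketch, assuming synthetic data and a hypothetical noisy labeling source (e.g., a weak AV oracle): flipping a fraction of the ground-truth labels makes the estimated performance deviate from the actual one.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y_true = make_classification(n_samples=2000, random_state=0)
rng = np.random.default_rng(0)
flip = rng.random(len(y_true)) < 0.15            # 15% of labels are wrong (invented rate)
y_noisy = np.where(flip, 1 - y_true, y_true)

X_tr, X_te, yn_tr, yn_te, yt_tr, yt_te = train_test_split(X, y_noisy, y_true, random_state=0)
pred = LogisticRegression(max_iter=1000).fit(X_tr, yn_tr).predict(X_te)
print("estimated on noisy labels:", accuracy_score(yn_te, pred))
print("actual on true labels:    ", accuracy_score(yt_te, pred))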
Data Snooping
The learning-based system is trained with data or knowledge typically not available in practice.
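A minimal sketch of temporal data snooping, assuming each sample carries a made-up timestamp: a random split mixes future samples into the training data, while a chronological split does not.

import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 10))
timestamps = np.arange(1000)                     # pretend each sample has a timestamp

# Snooping: a random split ignores time, so the model trains on future samples
X_tr, X_te, t_tr, t_te = train_test_split(X, timestamps, test_size=0.25, random_state=0)
print("training data contains future samples:", t_tr.max() > t_te.min())   # True

# No snooping: split chronologically so training data strictly precedes test data
cut = int(0.75 * len(timestamps))
X_tr, X_te, t_tr, t_te = X[:cut], X[cut:], timestamps[:cut], timestamps[cut:]
print("training data contains future samples:", t_tr.max() > t_te.min())   # False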
Spurious Correlations
Artefacts unrelated to the security problem create shortcut patterns for separating the classes.
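A minimal sketch with synthetic data: a hypothetical artefact feature (e.g., an identifier of the collection sandbox) happens to align with the label and becomes the shortcut the classifier learns instead of actual behaviour.

import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=1000)
behaviour = rng.normal(size=(1000, 5))                              # features unrelated to the label
artefact = y.reshape(-1, 1) + 0.01 * rng.normal(size=(1000, 1))     # collection artefact aligned with the label

X = np.hstack([behaviour, artefact])
clf = DecisionTreeClassifier(max_depth=2, random_state=0).fit(X, y)
print(clf.feature_importances_)   # nearly all importance falls on the artefact column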
Biased Parameter Selection
Parameters of the learning-based system are not entirely fixed at training time and indirectly depend on the test data.
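A minimal sketch with synthetic data and scikit-learn: calibrating the detection threshold on the test set itself yields an optimistic score, whereas fixing it on a separate validation split does not.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import f1_score

X, y = make_classification(n_samples=3000, weights=[0.9], random_state=0)
X_tr, X_tmp, y_tr, y_tmp = train_test_split(X, y, test_size=0.4, random_state=0)
X_val, X_te, y_val, y_te = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
scores_val = clf.predict_proba(X_val)[:, 1]
scores_te = clf.predict_proba(X_te)[:, 1]
thresholds = np.linspace(0.05, 0.95, 19)

# Don't: pick the threshold that maximises the score on the test set itself
t_biased = max(thresholds, key=lambda t: f1_score(y_te, (scores_te >= t).astype(int)))
# Do: fix the threshold on a validation split, then evaluate once on the test set
t_fixed = max(thresholds, key=lambda t: f1_score(y_val, (scores_val >= t).astype(int)))

print("threshold tuned on test data:", f1_score(y_te, (scores_te >= t_biased).astype(int)))
print("threshold fixed beforehand:  ", f1_score(y_te, (scores_te >= t_fixed).astype(int)))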
Inappropriate Baseline
The evaluation is conducted with only limited baseline methods.
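A minimal sketch with synthetic data: comparing a complex model against a majority-class dummy and a simple linear baseline shows how much of the performance is actually attributable to the complex system.

from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, weights=[0.9], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for name, clf in [("majority-class dummy", DummyClassifier(strategy="most_frequent")),
                  ("linear baseline     ", LogisticRegression(max_iter=1000)),
                  ("complex model       ", RandomForestClassifier(random_state=0))]:
    print(name, clf.fit(X_tr, y_tr).score(X_te, y_te))   # accuracy on the test split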
Inappropriate Performance Measures
The performance measures do not account for the constraints of the security problem, such as class imbalance.
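A minimal sketch with invented numbers: under a 1% malware ratio, accuracy looks excellent for a detector that never raises an alarm, while precision and recall expose the failure.

import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score

y_true = np.array([1] * 10 + [0] * 990)    # 1% malware (invented ratio)
y_pred = np.zeros_like(y_true)             # a "detector" that never raises an alarm

print("accuracy: ", accuracy_score(y_true, y_pred))                    # 0.99, looks great
print("precision:", precision_score(y_true, y_pred, zero_division=0))  # 0.0
print("recall:   ", recall_score(y_true, y_pred))                      # 0.0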
Base Rate Fallacy
Class imbalance is ignored when interpreting the performance measures, leading to an overestimation of performance.
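A worked example with invented rates: even a detector with a 99% true-positive rate and a 1% false-positive rate produces mostly false alarms when the base rate of malicious events is only 0.1%.

# Hypothetical detector: 99% true-positive rate, 1% false-positive rate
tpr, fpr, base_rate = 0.99, 0.01, 0.001   # only 0.1% of events are malicious

# P(malicious | alarm) via Bayes' theorem
precision = (tpr * base_rate) / (tpr * base_rate + fpr * (1 - base_rate))
print(round(precision, 3))   # ~0.09: roughly 9 out of 10 alarms are false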
Lab-Only Evaluation
The learning-based system is evaluated solely in a laboratory setting; practical constraints are not considered.
Inappropriate Threat Model
The security of machine learning itself is not considered, exposing the learning-based system to attacks.