Anomaly detection Flashcards

Question 1

Q

Which of the following metrics is used to create a ROC (receiver operating characteristic) curve?

Select one:

a. Recall
b. F1 score
c. Precision
d. Accuracy

Answer

A

a. Recall
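
As a minimal sketch of why recall is the relevant metric: a ROC curve plots recall (the true positive rate) against the false positive rate while the decision threshold sweeps over the anomaly scores. The scores and labels below are hypothetical, and `roc_points` and `auc` are illustrative helper names, not part of any library.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, one per threshold, for anomaly scores.

    labels: 1 = anomaly (positive), 0 = nominal (negative).
    A sample is flagged as anomalous when its score >= threshold.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing is flagged
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))  # (FPR, recall)
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]  # hypothetical anomaly scores
labels = [1, 1, 0, 1, 0, 0]              # hypothetical ground truth
curve = roc_points(scores, labels)
print(curve)
print(auc(curve))
```

Each point needs both rates, so the area under the curve depends on the false positive rate as well as on recall.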

Question 2

Q

Concerning the isolation forest algorithm, select the wrong statement among the following:

Select one:

a. For each tree, the attribute selection order is random.
b. Classification is performed by computing the average depth for a given input sample.
c. Anomalies usually end up at the deepest levels of the trees (highest average depth).
d. Node splitting is repeated until every input point is located on a leaf.

Answer

A

c. Anomalies usually end up at the deepest levels of the trees (highest average depth).
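
The following is a simplified sketch of the idea, not a full isolation forest: it grows random trees on the whole dataset (real implementations usually subsample per tree), follows only the branch containing the query point, and averages the depth at which that point becomes isolated. The data, seeds, and function names are assumptions made for illustration. An isolated point far from the cluster is separated after few random splits, so it sits close to the root.

```python
import random


def isolation_depth(point, data, rng, depth=0, max_depth=50):
    """Depth at which `point` ends up alone in one randomly grown tree.

    Only the branch that contains `point` is followed, which is all that
    is needed to measure its depth. `point` must be an element of `data`.
    """
    if len(data) <= 1 or depth >= max_depth:
        return depth
    # Attributes on which the remaining points still differ.
    splittable = [a for a in range(len(point))
                  if min(row[a] for row in data) < max(row[a] for row in data)]
    if not splittable:
        return depth  # only duplicates left, nothing more to split
    attr = rng.choice(splittable)              # random attribute
    lo = min(row[attr] for row in data)
    hi = max(row[attr] for row in data)
    threshold = rng.uniform(lo, hi)            # random threshold
    if point[attr] < threshold:
        branch = [row for row in data if row[attr] < threshold]
    else:
        branch = [row for row in data if row[attr] >= threshold]
    return isolation_depth(point, branch, rng, depth + 1, max_depth)


def average_depth(point, data, n_trees=200, seed=0):
    """Average isolation depth of `point` over `n_trees` random trees."""
    rng = random.Random(seed)
    return sum(isolation_depth(point, data, rng) for _ in range(n_trees)) / n_trees


gen = random.Random(42)
cluster = [(gen.gauss(0, 1), gen.gauss(0, 1)) for _ in range(100)]  # nominal
outlier = (8.0, 8.0)                                                # anomaly
data = cluster + [outlier]
inlier = min(cluster, key=lambda p: p[0] ** 2 + p[1] ** 2)  # most central point

print("outlier:", average_depth(outlier, data))
print("inlier: ", average_depth(inlier, data))
```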

Question 3

Q

Which among the following conditions makes the WDAD (Well-Defined Anomaly Distribution) assumption NOT valid?

Select one:

a. Impossibility of using deep learning for anomaly detection
b. Adversarial conditions
c. Unknown anomaly

Answer

A

b. Adversarial conditions

Question 4

Q

Considering a sudden decrease in temperature in July (Northern Hemisphere), we can state that it is a ...

Select one:

a. Collective anomaly
b. Point anomaly
c. Contextual anomaly

Answer

A

c. Contextual anomaly
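
A small sketch of what makes the reading contextual: the same value can be unremarkable among all readings of the year yet extreme for its context (the month). The temperatures below are hypothetical, and the z-score is just one simple way to measure deviation.

```python
from statistics import mean, stdev

# Hypothetical daily temperatures (°C) for a Northern Hemisphere site.
history = {
    "January": [1.0, 3.0, 2.0, 0.0, 4.0, 2.5, 1.5],
    "July": [27.0, 29.0, 28.0, 30.0, 26.0, 28.5, 27.5],
}


def global_z(value):
    """Z-score of `value` against all readings, ignoring the month."""
    all_values = [v for values in history.values() for v in values]
    return (value - mean(all_values)) / stdev(all_values)


def contextual_z(value, month):
    """Z-score of `value` against readings from the same month only."""
    reference = history[month]
    return (value - mean(reference)) / stdev(reference)


reading = 12.0  # a sudden cold day in July
print("global z:    ", round(global_z(reading), 2))
print("contextual z:", round(contextual_z(reading, "July"), 2))
```

Ignoring the context, 12 °C lies well within the yearly range; judged against July alone, it is many standard deviations below normal.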

Question 5

Q

Which of the following classification strategies is suitable for anomaly detection?

Select one:

a. Isolation forest
b. All of the mentioned strategies
c. One-class SVM
d. Few-shot learning networks
e. None of the mentioned strategies

Answer

A

b. All of the mentioned strategies

Question 6

Q

Which among the following is NOT an anomaly type?

Select one:

a. Systematic anomaly
b. Collective anomaly
c. Contextual anomaly
d. Point anomaly

Answer

A

a. Systematic anomaly

Question 7

Q

Which among the following conditions makes the WDAD assumption valid?

Select one:

a. Analyst’s attention evolves and focuses on new samples over time
b. Anomalies are created by diverse, unknown generation models
c. Anomalies are created by a generation process different from the nominal process
d. Anomalies are created by an adaptive adversarial process (insider threats, cyber security)

Answer

A

c. Anomalies are created by a generation process different from the nominal process

Question 8

Q

Considering accuracy metrics for anomaly detection, select the most correct statement among the following ones.

Select one:

a. A low false positive rate implies a false alarm black-hole
b. Accuracy only depends on the True Positive and false positive samples
c. F1 score is a ratio between a (squared) geometric mean and an arithmetic mean
d. Area Under Curve (AUC) depends on True Positive rate only

Answer

A

c. F1 score is a ratio between a (squared) geometric mean and an arithmetic mean
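
For reference, F1 is the harmonic mean of precision and recall, which can equivalently be written as their squared geometric mean divided by their arithmetic mean. The sketch below checks that identity numerically on hypothetical confusion counts.

```python
from math import isclose, sqrt


def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


tp, fp, fn = 8, 2, 4  # hypothetical counts
precision = tp / (tp + fp)
recall = tp / (tp + fn)

geometric_mean = sqrt(precision * recall)
arithmetic_mean = (precision + recall) / 2

f1 = f1_score(tp, fp, fn)
print(f1, geometric_mean ** 2 / arithmetic_mean)
```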

Question 9

Q

Considering the isolation tree anomaly detection algorithm, which of the following statements is NOT correct?

Select one:

a. Final anomaly score depends on the expected depth
b. For a given tree, attribute selection is repeated until every sample in the dataset is a leaf
c. Where the score 2^-d is low, we have an anomaly (d is the average depth)
d. In the creation of a single classification tree, thresholds are chosen randomly

Answer

A

c. Where the score 2^-d is low, we have an anomaly (d is the average depth)
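
To see why the statement is wrong: a score of the form 2^-d decreases as the depth d grows, so shallow (easily isolated) points get a high score and are the anomalies. The sketch below assumes the normalised form commonly used for isolation forests, 2^(-d / c(n)), where c(n) is the average path length of an unsuccessful binary search tree lookup over n points; the card itself does not mention this normalisation.

```python
from math import log

EULER_GAMMA = 0.5772156649  # Euler-Mascheroni constant


def c(n):
    """Average path length of an unsuccessful BST search over n points."""
    if n <= 1:
        return 0.0
    if n == 2:
        return 1.0
    harmonic = log(n - 1) + EULER_GAMMA  # approximation of H(n - 1)
    return 2 * harmonic - 2 * (n - 1) / n


def anomaly_score(avg_depth, n):
    """Score in (0, 1]; values near 1 indicate anomalies."""
    return 2 ** (-avg_depth / c(n))


n = 256  # hypothetical subsample size per tree
print(anomaly_score(2.0, n))    # shallow point: high score
print(anomaly_score(10.0, n))   # deep point: lower score
```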

Question 10

Q

Which among the following ML and DL architectures is NOT suitable for an anomaly detection setup that learns from clean data?

a. RNN
b. ARMA models
c. Multi-class SVM
d. Deep autoencoder

Answer

A

c. Multi-class SVM
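
A rough sketch of a clean-data setup, assuming it means that the model is fitted on nominal samples only and then flags whatever it explains poorly. A single Gaussian stands in here for the models listed in the question; the data, threshold, and function names are hypothetical. A multi-class classifier does not fit this pattern because it needs labelled examples of more than one class.

```python
from statistics import mean, stdev


def fit_nominal(clean_samples):
    """Fit a one-dimensional Gaussian to nominal-only training data."""
    return mean(clean_samples), stdev(clean_samples)


def is_anomaly(x, model, z_threshold=3.0):
    """Flag x when it lies more than z_threshold deviations from the mean."""
    mu, sigma = model
    return abs(x - mu) / sigma > z_threshold


clean = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3]  # hypothetical nominal data
model = fit_nominal(clean)

print(is_anomaly(10.1, model))  # close to the nominal behaviour
print(is_anomaly(12.0, model))  # far from anything seen in training
```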

Question 11

Q

When is the WDAD assumption not valid?

Answer

A

Adversarial conditions (fraud, insider threats, cyber security): there is an attacker as well as an analyst, and the attacker knows that detection is in place, so the attacker tries to make anomalous samples look like nominal samples (a TTL attack is one example of adversarial conditions).
A diverse set of anomaly modes (new, unknown failures): some modes have never been profiled, so when something new appears we do not know whether it is nominal or anomalous.
The user’s notion of an anomaly changes over time (anomaly = interesting point or new data type).

Question 12

Q

What is Precision?

Answer

A

Precision = detected true anomalies (TP) / samples classified as anomalies (TP+FP) -> Positive Predictive Value (PPV)

Question 13

Q

What is recall?

Answer

A

Recall = detected true anomalies (TP) / total anomalies (TP+FN) -> True Positive Rate (TPR) or Sensitivity

Question 14

Q

What is specificity?

Answer

A

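Specificity = correctly classified nominals (TN) / total nominals (TN+FP) -> True Negative Rate (TNR). Its complement is the False Positive Rate: FPR = false alarms (FP) / total nominals (TN+FP) = 1 - Specificity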

Question 15

Q

What is the accuracy?

Answer

A

Accuracy = (TP+TN) / total samples (TP+TN+FP+FN) -> fraction of samples the classifier labels correctly
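
The four metrics from the last cards can be computed together from the confusion counts. This is a minimal sketch on hypothetical labels (1 = anomaly, 0 = nominal); specificity is taken here as the true negative rate TN / (TN + FP), with the false positive rate as its complement.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, FP, TN, FN) with 1 = anomaly and 0 = nominal."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def metrics(tp, fp, tn, fn):
    """Precision, recall, specificity, FPR and accuracy from the counts."""
    return {
        "precision": tp / (tp + fp),            # PPV
        "recall": tp / (tp + fn),               # TPR, sensitivity
        "specificity": tn / (tn + fp),          # TNR
        "false_positive_rate": fp / (tn + fp),  # 1 - specificity
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]  # hypothetical ground truth
y_pred = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]  # hypothetical detector output
result = metrics(*confusion_counts(y_true, y_pred))
print(result)
```

With few anomalies in the data, accuracy stays high even when precision and recall are mediocre, which is why it is rarely reported alone.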

Question 16

Q

Select traditional unsupervised learning algorithms for anomaly detection

RKDE: Robust Kernel Density Estimation
EGMM: Ensemble Gaussian Mixture Model
one-class SVM
SVDD: Support Vector Data Description
ABOD: kNN Angle-Based Outlier Detector
LOF: Local Outlier Factor
IFOR: Isolation Forest
LODA: Lightweight Online Detection of Anomalies

Answer

A

all of them

(16 cards)