Anomaly Detection Flashcards

Question 1

Q

DTW never results in a distance strictly greater than the Euclidean distance

Question 2

Q

DTW cannot be applied to sequences of different lengths

Question 3

Q

DTW can only be applied to univariate (one-dimensional, single-feature) sequences

Question 4

Q

Normalization is useful when applying DTW
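
The DTW statements in the cards above can be checked on toy data. The following is a minimal sketch, assuming the textbook dynamic-programming form of DTW with a squared pointwise cost; the names `dtw_distance`, `euclidean`, `z_normalize` and the sequences are hypothetical and not taken from the deck. With equal lengths and the same pointwise cost, the diagonal alignment is one valid warping path, so DTW cannot exceed the Euclidean distance under this definition.

```python
import math

def dtw_distance(a, b):
    """DTW with squared pointwise cost; returns sqrt of the accumulated cost."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            cost[i][j] = d + min(cost[i - 1][j],      # repeat b[j-1]
                                 cost[i][j - 1],      # repeat a[i-1]
                                 cost[i - 1][j - 1])  # advance both
    return math.sqrt(cost[n][m])

def euclidean(a, b):
    """Plain Euclidean distance; only defined for equal lengths."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def z_normalize(seq):
    """Shift to zero mean and scale to unit (population) standard deviation."""
    mean = sum(seq) / len(seq)
    std = math.sqrt(sum((v - mean) ** 2 for v in seq) / len(seq))
    return [(v - mean) / std for v in seq]

# Hypothetical toy sequences: the same bump, shifted, shortened, or rescaled.
x = [0, 0, 1, 2, 1, 0, 0]
y = [0, 1, 2, 1, 0, 0, 0]    # x shifted by one step
z = [0, 1, 2, 1, 0]          # shorter sequence with the same shape
s = [10 * v + 5 for v in y]  # y rescaled and offset

dtw_xy, euc_xy = dtw_distance(x, y), euclidean(x, y)
dtw_xz = dtw_distance(x, z)  # different lengths are fine for DTW
dtw_raw = dtw_distance(x, s)
dtw_norm = dtw_distance(z_normalize(x), z_normalize(s))

print("equal length:  DTW", dtw_xy, "vs Euclidean", euc_xy)
print("unequal length: DTW", dtw_xz)
print("rescaled: raw DTW", dtw_raw, "vs z-normalized DTW", dtw_norm)
```

In this sketch the warping absorbs the time shift and the length difference, but not the change in amplitude and offset, which is why the raw and z-normalized distances differ so much. Swapping the squared difference for a vector distance would extend the same recurrence to multivariate sequences.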

Question 5

Q

What does a low local reachability density (lrd) mean?

Answer

A

It means a large average reachability distance to the point's nearest neighbours

Question 6

Q

LOF(q) < 1 means what?

Answer

A

Inlier: q lies in a region of higher density than its neighbours

Question 7

Q

LOF(q) > 1 means what?

Answer

A

Outlier: q lies in a region of lower density than its neighbours
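
The lrd and LOF cards above can be made concrete with a small from-scratch sketch. It assumes the usual definitions (reachability distance of p from o is the larger of o's k-distance and d(p, o); lrd is the inverse of the mean reachability distance; LOF is the mean neighbour lrd divided by the point's own lrd) and, as a simplification, breaks distance ties by index instead of keeping all tied neighbours. The function names and the toy grid are hypothetical.

```python
import math

def dist(p, q):
    """Euclidean distance between two 2D points."""
    return math.sqrt((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2)

def knn(points, i, k):
    """Indices of the k nearest neighbours of points[i], excluding i itself."""
    others = [j for j in range(len(points)) if j != i]
    others.sort(key=lambda j: dist(points[i], points[j]))
    return others[:k]

def k_distance(points, i, k):
    """Distance from points[i] to its k-th nearest neighbour."""
    return dist(points[i], points[knn(points, i, k)[-1]])

def lrd(points, i, k):
    """Local reachability density: inverse of the mean reachability distance."""
    neigh = knn(points, i, k)
    reach = [max(k_distance(points, o, k), dist(points[i], points[o]))
             for o in neigh]
    return len(neigh) / sum(reach)

def lof(points, i, k):
    """Local outlier factor: mean lrd of the neighbours over the point's own lrd."""
    neigh = knn(points, i, k)
    return sum(lrd(points, o, k) for o in neigh) / (len(neigh) * lrd(points, i, k))

# Hypothetical data: a 3x3 unit grid (indices 0..8) plus one far point (index 9).
points = [(float(x), float(y)) for x in range(3) for y in range(3)]
points.append((5.0, 5.0))

k = 3
lof_center = lof(points, 4, k)   # (1, 1), the middle of the grid
lof_far = lof(points, 9, k)      # (5, 5), far from the grid

print("LOF of grid centre:", lof_center)
print("LOF of far point:  ", lof_far)
```

The far point has a low lrd (large reachability distances) relative to its neighbours, so its LOF lands well above 1, while the grid centre is denser than its neighbours and lands below 1.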

Question 8

Q

Advantages of nearest-neighbour (NN) methods?

Answer

A

Can be used in an unsupervised setting
Makes no assumptions about the data distribution
Intuitively appealing: based on distances

Question 9

Q

Disadvantages of nearest-neighbour (NN) methods?

Answer

A

Computationally expensive at test time
Requires distances, so all disadvantages of distance measures apply

Question 10

Q

Advantages of PCA?

Answer

A

Useful for modeling feature interaction
Computationally efficient

Question 11

Q

Disadvantages of PCA?

Answer

A

Assumes that normal and anomalous points are distinguishable in the reduced space
Context is not taken into account
PCA is sensitive to outliers
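
The sensitivity of PCA to outliers can be illustrated in two dimensions, where the direction of the first principal component follows in closed form from the 2x2 covariance matrix. This is only a sketch under stated assumptions: the helper `first_pc_angle` and both data sets are hypothetical, and a real pipeline would normally use a linear-algebra library instead.

```python
import math

def first_pc_angle(points):
    """Angle (degrees) of the first principal component of 2D points."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    # Entries of the (population) covariance matrix
    sxx = sum((p[0] - mx) ** 2 for p in points) / n
    syy = sum((p[1] - my) ** 2 for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n
    # Principal-axis angle of a symmetric 2x2 matrix
    return math.degrees(0.5 * math.atan2(2 * sxy, sxx - syy))

# Hypothetical data lying close to the line y = x
clean = [(-2, -2.1), (-1, -0.9), (0, 0.1), (1, 0.9), (2, 2.1)]
# The same data with a single extreme point added off that line
with_outlier = clean + [(-10, 10)]

clean_angle = first_pc_angle(clean)
outlier_angle = first_pc_angle(with_outlier)

print("first PC angle, clean data:  ", clean_angle)
print("first PC angle, with outlier:", outlier_angle)
```

On the clean data the first component points roughly along y = x. A single extreme point is enough to swing it to roughly the perpendicular direction, because variance is dominated by the largest squared deviations.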

Question 12

Q

What are the three types of anomalies?

Answer

A

Point (point x is strange)
Contextual (point x strange given set S)
Collective (set S is strange)

Question 13

Q

Outliers have no effect on PCA?

Question 14

Q

PCA assumes relationship between variables is linear?

Question 15

Q

LOF uses reachability distance instead of actual distance to lower effect of outliers?

Question 16

Q

LOF does not require distance metric to work properly and return sensible results?

Answer

A

True

Question 17

Q

If p, q have same distances to nearest neighbours, it is possible that LOF returns p as anomaly and q as normal?

Answer

A

True

Question 18

Q

Main use of LOF is to find collective anomalies?

Answer

A

False (I think it's point anomalies)

Question 19

Q

What are nearest-neighbour (NN) methods not suitable for?

Answer

A

Datasets that have modes with varying density
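
A small sketch of why varying density is a problem, assuming the common scoring rule "anomaly score = distance to the k-th nearest neighbour". The function `knn_score` and the one-dimensional data are hypothetical.

```python
def knn_score(data, i, k):
    """Anomaly score of data[i]: distance to its k-th nearest neighbour."""
    dists = sorted(abs(data[i] - x) for j, x in enumerate(data) if j != i)
    return dists[k - 1]

# Hypothetical 1D data with two modes of very different density
dense = [0.0, 0.1, 0.2, 0.3, 0.4]        # tight mode, spacing 0.1
sparse = [10.0, 12.0, 14.0, 16.0, 18.0]  # loose mode, spacing 2.0
local_outlier = [1.0]                    # unusual relative to the dense mode
data = dense + sparse + local_outlier

k = 2
scores = [knn_score(data, i, k) for i in range(len(data))]
dense_scores = scores[:5]
sparse_scores = scores[5:10]
outlier_score = scores[10]

print("dense mode scores: ", dense_scores)
print("sparse mode scores:", sparse_scores)
print("local outlier score:", outlier_score)
```

The local outlier scores several times higher than any point in the dense mode, yet lower than every ordinary point in the sparse mode. Any single global threshold on this score therefore either misses the local outlier or flags the whole sparse mode, which is the situation density-relative scores such as LOF are meant to handle.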

Question 20

Q

PCA assumptions?

Answer

A

- Relationships between variables/features are linear
- Principal components are orthogonal (linearly independent)
- The direction with the largest variance is the most informative

Question 21

Q

Is DTW scale invariant?

Answer

A

No

(21 cards)