Anomaly Detection Flashcards
Anomaly Detection / Deviation Detection / Exception Mining
The process of identifying patterns or instances in data that deviate significantly from the norm or expected behaviour.
2 Characteristics of Anomaly Detection Methods
Model-based - Learns a model of both the normal and anomalous classes.
Model-free
2 Type of Model-Based Anomaly Detection
Unsupervised - Anomalies are those points that don’t fit well
Supervised - Anomalies are regarded as a rare class
2 Anomaly Detection Techniques
Statistical Approaches
Gaussian Mixture Model (GMM)
The instances located in areas below that threshold density => outliers
Proximity-based
Distance-Based Approaches - The outlier score of an object is the distance to its kth nearest neighbour
Proximity-based approaches - Local perspective
Anomaly Detection vs Novelty Detection
Anomaly Detection - The algorithm is trained on a dataset that may contain outliers, and the goal is typically to identify these outliers (within the training set), as well as outliers among new instances.
Novelty Detection - The algorithm is trained on a dataset that is presumed to be “clean,” and the objective is to detect novelties strictly among new instances.