Jon's Topics Flashcards
Robust, Repeatable, Invariance and Constraints
Robust and Repeatable computer vision is achieved through engineered Invariance and applied Constraints. (We wants Robust and Repeatable, we achieve using Invariance and Constraints)
Feature Space
Abstract space defined by a feature extraction process. Transforming raw data into feature of some fixed number of elements (dimensionality of the space).
Distance Measures
Euclidean (L2) Distance: 2rt[SUMi=1->n (q_i-p_i)^2]
L1 Distance: SUMi=1->n ( | p_i-q_i | ). Where p,q = points
Supervised ML: Classification
Binary - Linear: Learn a hyperplane that separates two classes with a minimum error. Non-Linear: Can learn curved lines to fit data, lose generality by overfitting.
Multiclass - KNN
Unsupervised ML: Clustering
Aims to group data without any prior knowledge. K-Means: Guaranteed to find optimal results, doesn’t perform consistently. Trial and error to find best k value.
Image Features (4 types)
Global, Grid-based, Region-based, Local
Global Features
Extracted contents from entire image, outputs single Feature Vector. Image Histogram is common. Joint Colour Histogram usually considers all pixel colour components together.
Segmentation
Used for segmenting image into regions for Region based features. Using: Thresholding - Basic, Otsu, and Local or Adaptive Thresholding. K-Means
Otsu Thresholding
Otsu: Assumes image contains two classes of pixels (fore/background). Calculates optimum threshold to separate two classes with minimal intra-class variance.
Local or Adaptive Thresholding
Computes a different threshold for each pixel, based on values of neighbouring pixels. E.g. Mean adaptive thresh sets pixel to BG if it’s value is less than mean of it’s neighbours plus an offset.
Segmentation using K-Means
Simple method: Cluster colour vector of all pixels, assign each pixel based on closest cluster centroid. Could spatially encode FV to help base segments on relative position of pixels, to avoid loner different region pixels.
Connected Components
Type of segment in which all pixels are reachable to each other through a path of spatially adjacent pixels. Commonly 4-connectivity or 8-connectivity
Connected Component Labelling
Takes binary image and produces a set of connected components. Uses 2 pass algorithm
Simple Scalar Features
Area: #pixels in component Perimeter: Length around component Inner Border: Inner pixels forming edge Outer Border: Outer pixels of edge Compactness: Measure how tightly packed pixels are Dispersion: How spread out shape is
Moments
Describes Distribution of pixels in a shape. Standard 2D Cartesian Moments can be used as shape descriptors. Different shapes have different moments. Not invariant to scaling, translation, and rotation.
Central Moments + Normalised Central Moments + Hu Moments
Central: Translation invariant as moments computed about centroid of component.
Normalised Central: Translation and Scale Invariant.
Hu: 7 scale, rotation and invariant moments.