[1] Data Mining Flashcards
What is KDD?
Knowledge Discovery in Databases; the process of extracting knowledge from data
What are the steps for KDD?
[1] Selection - choose which data to use
[2] Pre-processing - handle missing data and errors
[3] Transformation - change the data format; consider feature reduction or extraction
[4] Data mining - apply algorithms to the transformed data to generate results
[5] Interpretation and evaluation - e.g. visualization of the results
Why is data mining important?
The exponential growth of data exceeds humans' ability to understand it unaided
What factors influence the selection of DM algorithms?
- The kind of data, e.g. numeric or symbolic
- Whether the task is supervised or unsupervised
- The desired output, e.g. a black box or an interpretable tree
What are the ways to approach KDD?
- Top-down discovery - analyst suggests hypotheses; results are analysed to see if they support this
- Bottom-up discovery - system automatically suggests patterns, analyst considers if they are relevant
- Mixed - analyst provides an area of search, and the system refines this based on the hypotheses generated
How are database queries different from DM?
Database queries return a subset of the data or an aggregate of it.
DM outputs a KDD object, e.g. a rule or tree, which didn't exist before
What is a data warehouse?
A collection of data on a single subject where the number of datasets can change
How is KDD applied to websites?
Web content mining - collect and analyse the content of pages
Web structure mining - use graph theory to study how pages are connected with hyperlinks
What is IR?
Information Retrieval i.e. search
At a high level, what do classification algorithms do?
Find separating curves. Note that SVMs provide the equations for these curves, while neural networks only give the resulting regions
What defines AI systems?
They are machines that mimic cognitive functions such as learning and problem solving
What are fuzzy sets?
In contrast to crisp sets, they allow partial membership based on a membership function
How do set operators work for fuzzy sets?
AND(x,y) -> m(x ∩ y) = min(m(x), m(y))
OR(x,y) -> m(x ∪ y) = max(m(x), m(y))
NOT(x) -> m(¬x) = 1 - m(x)
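The operators above translate directly into code; a minimal sketch in Python (function names are my own):

```python
# Standard (Zadeh) fuzzy set operators; membership degrees lie in [0, 1].

def fuzzy_and(mx, my):
    # m(x AND y) = min(m(x), m(y))
    return min(mx, my)

def fuzzy_or(mx, my):
    # m(x OR y) = max(m(x), m(y))
    return max(mx, my)

def fuzzy_not(mx):
    # m(NOT x) = 1 - m(x)
    return 1 - mx

print(fuzzy_and(0.3, 0.8))  # 0.3
print(fuzzy_or(0.3, 0.8))   # 0.8
```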
What is the process for fuzzy logic?
~ Fuzzify the input values into membership functions
~ Use rules to compute fuzzy outputs
~ De-fuzzify the output functions to get crisp outputs
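The three steps can be sketched end-to-end for a toy fan-speed controller; the membership functions, rule set, and numbers here are invented for illustration:

```python
# A minimal fuzzify -> rules -> de-fuzzify loop for an invented fan
# controller: two fuzzy sets ("cold", "hot"), two rules, and a
# weighted-average de-fuzzification.

def tri(x, a, b, c):
    """Triangular membership function rising from a, peaking at b, falling to c."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def fan_speed(temp):
    # 1. Fuzzify: degree to which temp is "cold" or "hot".
    cold = tri(temp, 0, 10, 25)
    hot = tri(temp, 15, 30, 40)
    # 2. Rules: IF cold THEN slow (speed 20); IF hot THEN fast (speed 90).
    #    Each rule fires with the strength of its antecedent.
    # 3. De-fuzzify: weighted average of the rule outputs -> crisp speed.
    total = cold + hot
    return (cold * 20 + hot * 90) / total if total else 0.0

print(fan_speed(10))  # fully "cold" -> 20.0
print(fan_speed(30))  # fully "hot"  -> 90.0
```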
What are the main DM tasks?
Classification - map data into #pre-defined# classes
Regression - map data to real-valued output
Prediction - predict a future state #in time# based on current state
Time Series Analysis - special case of classification, regression and prediction when the attributes vary over time
[C.R.P.T]
Clustering - the groups are #not# pre-defined, so it is unsupervised
Segmentation - a special case of clustering where the DB is partitioned into #disjoint# sets
Association Rules - find relationships between instances i.e. items that frequently occur together
Sequence Discovery - find relationships based on time i.e. if X then Y will happen
[C.S.A.S]
What are the assumptions used for Bayes classifiers?
- Only one hypothesis (class) can occur at a time
- Conditional independence - P(X1, X2 | Y) = P(X1|Y) * P(X2|Y)
What is a key practical consideration for Bayes classifiers?
When the conditional (and prior) probabilities are estimated from counts, start the counts at 1 (Laplace smoothing) to avoid multiplying by 0
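A minimal naive Bayes sketch over binary features, showing the smoothing from this card: every conditional count gets +1 so no probability is ever zero. The toy data and feature names are invented:

```python
# Count-based naive Bayes with Laplace smoothing. Invented toy data:
# features (windy, raining) -> class "yes"/"no" for playing outside.

from collections import defaultdict

def train(rows, labels):
    """rows: list of feature tuples; labels: class per row."""
    class_counts = defaultdict(int)
    cond_counts = defaultdict(int)  # keyed by (class, feature_index, value)
    for row, y in zip(rows, labels):
        class_counts[y] += 1
        for i, v in enumerate(row):
            cond_counts[(y, i, v)] += 1
    return class_counts, cond_counts

def predict(row, class_counts, cond_counts, n_values=2):
    total = sum(class_counts.values())
    best, best_p = None, -1.0
    for y, cy in class_counts.items():
        p = cy / total  # prior P(h_j)
        for i, v in enumerate(row):
            # Laplace smoothing: +1 on the count, +n_values on the total,
            # so P(x_i | h_j) is never zero.
            p *= (cond_counts[(y, i, v)] + 1) / (cy + n_values)
        if p > best_p:
            best, best_p = y, p
    return best

rows = [(0, 0), (0, 1), (1, 1), (0, 0)]
labels = ["yes", "yes", "no", "yes"]
model = train(rows, labels)
print(predict((1, 1), *model))  # "no"
```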
What are the names of the probabilities in Bayes classifiers?
Posterior - p(hj|xi) - [this is the target]
Prior - p(hj)
Unconditional probability - p(x)
Conditional probability - p(xi|hj)
What do SVMs do?
Find a hyperplane which correctly classifies the data points and #separates them as far as possible#
In the context of SVMs, what are margins and what types are there?
The margin measures how well the hyperplane separates the data: it is the distance to the nearest points.
A hard margin allows no points within it; a soft margin allows but penalizes points within it
What is a key limitation of SVMs, and how can it be addressed?
They can only classify data that is linearly separable.
However, kernels transform the data into a higher-dimensional space where it becomes separable (the hyperplane itself is always linear)
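The kernel idea can be illustrated without an SVM at all: 1-D points labelled by distance from zero are not separable by a single threshold on x, but the lifted feature x² makes them separable. The data and the helper below are invented; a real SVM would learn the hyperplane itself:

```python
# Invented 1-D data: class 1 = "far from zero". Not threshold-separable
# on x, but separable after the feature map x -> x**2.

points = [-3, -2, -0.5, 0.5, 2, 3]
labels = [1, 1, 0, 0, 1, 1]

def separable_by_threshold(xs, ys):
    """True if some threshold splits the points into pure-label halves."""
    for t in sorted(xs):
        left = {y for x, y in zip(xs, ys) if x <= t}
        right = {y for x, y in zip(xs, ys) if x > t}
        if len(left) <= 1 and len(right) <= 1:
            return True
    return False

print(separable_by_threshold(points, labels))  # False: not separable on x
lifted = [x * x for x in points]               # the "kernel trick" lift
print(separable_by_threshold(lifted, labels))  # True: separable on x**2
```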
What are the limitations of SVM?
- They require selecting a good kernel
- They use a lot of memory and CPU
- Special consideration is needed for multiple classes i.e. non-binary problems
How do SVMs handle multiple classes?
Transform the problem into a set of binary problems, and either:
- one-vs-rest - train a classifier for each class to determine whether an instance is or isn't that class
- one-vs-one - train a classifier for each pair of classes, and combine their outputs with a voting protocol
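The two decompositions can be made concrete by building the binary sub-problems themselves; the class names and data below are invented, and any binary classifier could be plugged into each sub-problem:

```python
# How a 3-class problem decomposes for an SVM (or any binary learner).

from itertools import combinations

classes = ["a", "b", "c"]
labels = ["a", "b", "c", "a", "b"]

# one-vs-rest: one binary problem per class (is / isn't that class).
ovr = {c: [y == c for y in labels] for c in classes}
print(ovr["a"])  # [True, False, False, True, False]

# one-vs-one: one binary problem per pair of classes, using only the
# rows belonging to either class; prediction combines them by voting.
ovo = {(c1, c2): [y for y in labels if y in (c1, c2)]
       for c1, c2 in combinations(classes, 2)}
print(ovo[("a", "b")])  # ['a', 'b', 'a', 'b']

def vote(pair_predictions):
    """Majority vote over the one-vs-one predictions for one instance."""
    return max(set(pair_predictions), key=pair_predictions.count)

print(vote(["a", "b", "a"]))  # 'a'
```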
In the context of DM, what is segmentation?
A special case of clustering where the resulting sets are disjoint, i.e. each tuple belongs to exactly one group
What is the capacity of a function?
A measure of its expressive power and flexibility
What does shattering mean?
A function f(theta) shatters a set of points X if, for every assignment of classes to those points, there exists a theta such that all the points are correctly classified
What is the VC Dimension?
A measure of #capacity# for a function based on the largest number of points it can shatter.
Note that it might only work for a particular set of locations for the points
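Shattering can be checked by brute force for a simple hypothesis class. The sketch below uses 1-D threshold classifiers (f(x) = [x > t] and their negations), a class whose VC dimension is 2; the point sets are invented:

```python
# Brute-force shattering check: enumerate every labelling and search
# for a threshold (in either orientation) that realises it.

from itertools import product

def shatters(points):
    """True iff some threshold classifier realises every labelling."""
    xs = sorted(points)
    # candidate thresholds: below, between, and above the points
    ts = [xs[0] - 1] + [(a + b) / 2 for a, b in zip(xs, xs[1:])] + [xs[-1] + 1]
    for labelling in product([0, 1], repeat=len(points)):
        realised = any(
            all((x > t if up else x < t) == bool(y)
                for x, y in zip(points, labelling))
            for t in ts for up in (True, False))
        if not realised:
            return False
    return True

print(shatters([1, 2]))     # True: 2 points can be shattered
print(shatters([1, 2, 3]))  # False (e.g. labelling 1,0,1 is impossible),
                            # so this class has VC dimension 2
```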
What is Structural Risk Minimization?
The process of finding models with the lowest risk
Risk is the chance of prediction error on unseen data points drawn i.i.d. from some unknown distribution
How is risk found?
The expected (true) risk can't be determined directly.
However, it is bounded by the empirical risk (the error on the points already classified) plus the VC confidence (a term based on the number of points and the model's VC dimension)
How can the risk of a model be reduced?
- Train a better model to reduce the empirical risk
- Minimize the VC dimension, i.e. use a simpler model
- Use more data points, as this lowers the VC confidence term
Why is Structural Risk Minimization often infeasible?
The VC dimension can only be calculated for a limited number of models
What are some key ways to measure the error of an algorithm?
- TSS (total sum of squared errors)
- MSE (mean squared error)
- RMSE (root mean squared error)
- R-squared - the proportion of the response's variance explained by the model
What is the advantage of R-squared, and how is it calculated?
It is easily interpreted: 1 is a perfect fit, 0 is no better than predicting the mean.
R² = 1 - TSS / SS_tot, where SS_tot = Σ(yᵢ - ȳ)² is the total variation of the response
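The error measures from the cards above, computed for an invented set of predictions. Note the naming follows the cards: "TSS" here means the total sum of squared *errors*:

```python
# TSS, MSE, RMSE, and R-squared for a regression, per the cards'
# conventions. The example values are invented.

def metrics(y_true, y_pred):
    n = len(y_true)
    tss = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # sum of squared errors
    mse = tss / n                                            # mean squared error
    rmse = mse ** 0.5                                        # root mean squared error
    mean_y = sum(y_true) / n
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)          # variation about the mean
    r2 = 1 - tss / ss_tot
    return tss, mse, rmse, r2

tss, mse, rmse, r2 = metrics([1, 2, 3, 4], [1, 2, 3, 4])
print(r2)  # 1.0 for a perfect fit
```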
What is accuracy? What is its opposite?
Accuracy is the proportion of instances correctly classified.
Error rate is the proportion of instances incorrectly classified
What is TPR?
The proportion of all actually positive instances which are classified as positive.
It is also known as sensitivity or recall
TPR = TP / (TP + FN)
What is sensitivity?
The TPR
What is recall?
The TPR
What is TNR?
The proportion of actually negative instances which are classified as negative
It is also known as specificity
What is specificity?
TNR
How are FPR and FNR related to TPR and TNR?
They are complements: FPR + TNR = 1 and FNR + TPR = 1
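All the rates from these cards can be computed from the four confusion-matrix counts; the counts below are invented:

```python
# Confusion-matrix rates: TPR (sensitivity/recall), TNR (specificity),
# their complements FPR and FNR, and accuracy.

def rates(tp, fp, tn, fn):
    tpr = tp / (tp + fn)  # sensitivity / recall
    tnr = tn / (tn + fp)  # specificity
    fpr = fp / (fp + tn)  # = 1 - TNR
    fnr = fn / (fn + tp)  # = 1 - TPR
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return tpr, tnr, fpr, fnr, accuracy

tpr, tnr, fpr, fnr, acc = rates(tp=8, fp=2, tn=5, fn=5)
print(acc)  # 0.65
```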
What is an ROC curve?
It plots the TPR (y-axis) against the FPR (x-axis) as the classifier's decision threshold varies