w6 gemini Flashcards

Question

What does a line between interest points in the real correspondence example represent?

Answer 1

A putative match.

Answer 2

It identifies a model consistent with a large number of matches (inliers) and rejects inconsistent matches (outliers).

Answer 3

It can find correspondence even with a high number of outliers.

Answer 4

Fitting algorithms, such as fitting a straight line to a set of points.

Answer 5

Hough Transform.

Answer 6

A straight line.

Answer 7

The fitted line will not represent the majority of the data points.

Answer 8

Whether those points are inliers or outliers for that specific line hypothesis.

Answer 9

Simple and effective, general method for various model fitting problems (segmentation, camera transformation, object trajectory).

Answer 10

Requires many iterations if the percentage of outliers is high, lots of parameters to tune.

Answer 11

Finding matching image elements across images.

Answer 12

Stereo vision, video analysis, object recognition.

Answer 13

Grouping looks for similar elements in a single image, while correspondence looks for the same elements in multiple images.

Answer 14

Correlation-based methods and feature-based methods.

Answer 15

Matching image intensities, usually over a window of pixels.

Answer 16

Matching sparse sets of image features.

Answer 17

Which locations to match, what properties to match, where to look for matches, how to evaluate matches, how to find true correspondence.

Answer 18

Image intensities, descriptors of image properties (e.g., SIFT).

Answer 19

All locations (correlation-based) or selected interest points (feature-based).

Answer 20

Exhaustive search or restricted search.

Answer 21

Using similarity measures (correlation, normalized correlation) or difference measures (SSD, SAD).

Answer 22

Matching based on a sparse set of features within each image.

Answer 23

Detect interest points, find corresponding pairs of points by comparing features.

Answer 24

Relatively insensitive to illumination and size changes, less computationally expensive than correlation-based methods.

Answer 25

Provides a sparse correspondence map, only suitable when good interest points can be extracted.

Answer 26

Repeatable detection and distinctive descriptors.

Answer 27

The same point is detected independently in both images, even with changes in scale, rotation, translation, and illumination.

Answer 28

Corresponding points can be correctly matched with high probability.

Answer 29

Points where two edges meet, characterized by high intensity gradients in two directions.

Answer 30

The maximum slope of intensity gradient at two orthogonal directions.

Answer 31

Both eigenvalues are large.

Answer 32

By defining a measure R based on the determinant and trace of the Hessian matrix. R = det(H) -k*(Trace(H))^2

Answer 33

Corner: R is large and positive. Edge: R is negative with large magnitude. Flat: |R| is small. Therefore corner is where R > threshold

Answer 34

Taking the points of local maxima of R after thresholding.

Answer 35

A process of setting the R value of a pixel to 0 if it has a neighbor with a larger R value.

Answer 36

To ensure that only the strongest corner responses in a neighborhood are selected.

Answer 37

Compute derivatives compute products of derivatives compute sums of products define the Hessian matrix compute the detector response R = Det(H) - k*(Trace(H))^2 threshold and apply non-maximum suppression.

Answer 38

It is invariant to translation and rotation. Eigenvalues remain the same.

Answer 39

It is not scale invariant.

Answer 40

By performing corner detection across a range of scales using an image pyramid (Harris-Laplacian).

Answer 41

Scale Invariant Feature Transform (SIFT).

Answer 42

Harris-Laplacian uses the Harris corner detector in space and scale, while SIFT uses the Difference of Gaussians.

Answer 43

To detect interest points in scale space.

Answer 44

If a pixel's value is larger or smaller than all its neighbors in a 3x3x3 neighborhood in scale space.

Answer 45

Keeping points with high contrast and sufficient structure using a threshold based on the ratio of trace and determinant of the Hessian matrix.

Answer 46

A measure of similarity between the points (a descriptor).

Answer 47

A small window around the interest point (pixel intensity values).

Answer 48

Euclidean distance, SSD, SAD.

Answer 49

Not robust to rotation, scale, changes in viewpoint or illumination.

Answer 50

A 128-element vector of intensity gradient orientations around the interest point.

Answer 51

1. Calculate orientation and magnitude of intensity gradient. 2. Create a histogram of orientations. 3. Create separate histograms for sub-windows.

Answer 52

By rotating all orientations based on the dominant orientation.

Answer 53

A 128-element vector, normalized to unit length.

Answer 54

Euclidean distance between vectors.

Answer 55

Robust to translation, rotation, scale, changes in viewpoint and illumination.

Answer 56

Matching pixel values within image regions.

Answer 57

The area in the second image where the corresponding region is searched for.

Answer 58

Using measures like cross-correlation, normalized cross-correlation, or correlation coefficient.

Answer 59

Size of correlation window and search area, and the method to measure similarity.

Answer 60

May not capture enough structure, may be noise sensitive.

Answer 61

Decreases precision, decreases tolerance to viewpoint.

Answer 62

Full correlation is computationally expensive.

Answer 63

Arbitrarily around the original pixel location or using task-specific knowledge (e.g., epipolar geometry).

Answer 64

Cross-correlation, normalized cross-correlation, correlation coefficient, SSD, SAD, Euclidean distance.

Answer 65

Cross-correlation, normalized cross-correlation, correlation coefficient.

Answer 66

SSD, Euclidean distance, SAD.

Answer 67

It is simple and computationally efficient, and the performance difference is often negligible.

Answer 68

Easy to implement, provides a dense correspondence map.

Answer 69

Computationally expensive, needs images with distinct patterns, doesn't work well with viewpoint changes or illumination changes.

w6 gemini Flashcards

(94 cards)