Feature detection/ matching Flashcards

1
Q

Describe the steps in the RANSAC algorithm

A
  1. Calculate model parameters from n random data points, where n is the minimum required points
  2. Evaluate inlier percentage for this model with a distance threshold t
  3. If inlier percentage is the current max, save this model
  4. Repeat N times
  5. Use the inliers from the best model and create a better model using least squares or similar methods
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe RANSAC in one sentence

A

RANSAC is a robust (handles outliers) method for estimating the parameters of a mathematical from observed data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What should the RANSAC threshold be if the noise in the data has a normal distribution

A

Around 2 sigma

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The formula used for deciding N(Number of runs) in the RANSAC algorithm is log(1-p)/log(1-w^n). p is the probability of sampling at least one set without outliers, n is the number of data points used to estimate parameters and w is the er probability of a data point being an inlier. How is p and w usually decided

A

p is usually set to 0.99

w is approximated for each run.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the full name of RANSAC

A

RANdom SAmple Consensus

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How is the canonical orientation of a feature point normally decided

A

The local patch is rotated so that the direction of the maximum gradient is pointing upwards.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Describe the steps from feature detection to feature matching

A
  1. Detect feature points in the image
  2. Define a local patch around the feature
  3. Extract and normalize the local patch
  4. Create a local descriptor
  5. Match features by calculating the distance between the descriptors.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is SIFT short for

A

Scale Invariant Feature Transform

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What type of descriptor do the SIFT and SURF algorithms use

A

HoG (Histogram of Gradients).

The other type of descriptors are binary descriptors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe the steps in the SIFT algorithm

A
  1. Use LoG pyramids to determine the canonical scale and HoG to determine the canonical orientation.
  2. Normalize the patch to 16x16 pixels, compute the gradients and apply a Gaussian weighting to the gradients.
  3. Divide the 16x16 patch into 4x4, compute 8 gradient directions in each square and concatenate these gradients into a 128 feature vector.
  4. Normalize to unit length, threshold to 0.2 and renormalize.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main advantage of binary descriptors, and how is this achieved

A

The main advantage is computational speed. The descriptors are binary strings and the hamming distance (XOR) can be used for matching. The XOR function is computationally very efficient.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Name some binary descriptor algorithms

A

BRIEF, ORB, BRISK, FREAK

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Name some distance functions used for feature matching

A

L1, L2, Hamming(XOR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain the ROC curve

A

The ROC curve measures true positive rate( true positives/matched features) vs. false positive rate( false postivies/ unmatched features)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Explain the ratio test

A

Keep the two best matches. if distance of the first divided by distance of the second is larger than some threshold (0.8) throw away the match.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the minimum number of corresponding data points in two images required to calculate a homography

A

4, where no 3 points are collinear

17
Q

Describe the DLT (direct linear transform) for finding homographies.

A
  1. Transform the problem to a matrix equation on the form Ah = 0.
  2. Normalize
  3. Obtain the SVD of A.
  4. If S is diagonal with positive values in descending order, h is the last column of V
  5. Denormalize and reconstruct the homography H from h.
18
Q

Why are the parameters often normalized in the DLT.

A

The DLT performs best when all parameters are of similar scale.

19
Q

What different kind of errors can be used when using RANSAC to determine Homographies

A

Algebraic error:
e_i = ||A_i*h||

Geometric errors:
e_i = d(Hu_i, u_i2) + d(u_i, (H^-1)u_i2) (Reprojection error)
e_i = d(Hu_i, u_i2)
e_i = d(u_i, (H^-1)u_i2)

20
Q

How is the SVD used to solve matrix equations on the form: Ah=0

A

The nullspace of A is a linear combination of the singular vectors with singular value equal to 0 or no singular value.

21
Q

How can we detect lines in images

A

Using the Hough transform

22
Q

What are the important characteristics of good feature points

A
  1. Distinct
  2. Local
  3. Efficient
  4. Reprodusable
23
Q

What is the interpretation of the eigenvalues and vectors of the M matrix in corner detection?

A

The eigenvalues describe the max/min change and the vectors describe the direction of the max/ min change

24
Q

What is an important property for edge features

A

Small movements in any direction should equal large changes in the feature point. This is equal to the smallest eigenvalue of M being large.

25
Q

What operator is often used to score corner features

A

The Harris operator (det(M) - alpha*trace(M))

26
Q

Is the Harris detector invariant to affine changes in pixel intensity?

A

Only partially (Yes to additive changes, no to scaling)

27
Q

Why is the Harris detector invariant to additive changes in pixel intensity

A

Only the derivatives are used.

28
Q

How can we solve the problem that the Harris detector isn’t invariant to image scaling?

A

We can compute the Harris score for several scales, and choose the largest. This is the canonical scale.

29
Q

When will the Laplace operator have the maximum response for a binary circle?

A

When the zeros of the Laplace align perfectly with the circle edge.

30
Q

Why is the Laplace pyramid used in blob detection?

A

To detect blobs at different scales.

31
Q

Name some methods used for edge detection

A

Canny, Laplace, Sobel

32
Q

What is the idea behind binary feature detectors

A

A local neighborhood is divided into points. These points are connected to each other and score of 1 is assigned if a point has a larger value than the previous point, else 0.

33
Q

What is the M matrix in edge feature detection?

A

The M matrix describes changes if the local patch is slightly moved.

34
Q

When are two images related by a homography?

A

If the images are captured from a planar scene

35
Q

What is a canonical affine transformation?

A

A local patch normalization using affine transforms. This allows rotated planes etc. to be matched.

36
Q

Describe the cross check test (Alternative to the ratio test)

A

Measures the projection both ways. Matches are accepted only if fa is the best match for fb AND fb is the best match for fa.

37
Q

What is: Invariance and Covariance?

A

– Invariance: image is transformed and corner locations do not change
– Covariance: if we have two transformed versions of the same image, features should be detected in corresponding locations