Lecture 6: Object Recognition & Line Extraction Flashcards

1
Q

Concerning the Bag of Words(BoW) approach, what are the 2 questions asked?

A

Is this image in my database?
Robot: Have I been to this place before?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Concerning the Bag of Words(BoW) approach, what 3 analogies can be used from text retrieval?

A

Visual words, Vocabulary of Visual Words, “Bag of Words”(BoW) approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the succession order in image retrieval?

A

Image collection –>Extract features –>Cluster Descriptors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is image retrieval in BoW?

A

represents a more general problem of object or place recognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What can we do with Image Retrieval?

A

describe a scene as a collection of words and look up in the database for image with a similar collection of words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What if we need to find an object/scene in a database of millions of images?(2 things)

A
  • Build Vocabulary Tree via hierarchical clustering
  • Use the Inverted File system: a way of efficient indexing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the format of the Inverted File System?

A

Each node in the tree is associated with a list of images containing an instance of this node.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Concerning Vocabulary trees, what is K-means clustering? What is minimized?

A

partitioning a point cloud into k clusters, such that each point belongs to one cluster.
the Sum of Squared Euclidean Distances between points and their nearest cluster-centers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the algorithm for Vocabulary trees?

A
  • Randomly initialize k cluster centers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Until Convergence, what 2 things do you do in the algorithm for Vocabulary?

A

-Assign each data-point to its nearest cluster-center
-Re-compute each cluster-center as the means of all points assigned to each cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In Vocabulary tree construction, what does each node represent?

A

a cluster of descriptors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

In Vocabulary tree construction, what does each leaf represent?

A

a visual word

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does an inverted file DB list? Inside it, what does each word point to?

A

all possible visual words.
a list of images where this word occurs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a Voting array have?

A

as many cells as the images in the DB - each word in query image votes for an image

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is tf-idf acronym for?

A

Term
Frequency
-Inverse
Document

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what does tf-idf measure?

A

the importance of a visual word inside a document(as part of a document DB)

17
Q

what is term frequency?

A

frequency of word in image

18
Q

why should tf-idf of word in image j ?

A

use it to weigh the importance of each word when voting for corresponding image

19
Q

What are the 3 main problems in line extraction from a point cloud?

A

How many lines are there?
(Which points belong to which line?)Segmentation
(Given points that belong to a line, how to estimate the line parameters?) Line Fitting/Extraction

20
Q

What are 4 algorithms used in line extraction from a point cloud?
(S-a-M, LR, 5, H-T)

A

Split-and-merge,
Linear regression,
RANSAC,
Hough-Transform

21
Q

When it comes to split-and-merge, how does it work?

A

Iterative end-point-fit: simply connects the end points for line fitting

22
Q

When it comes to line-regression, how does it work?

A

“Sliding window” of size N points; fit line-segment to all points in each window; merge overlapping line segments + re-compute line parameters for each segment.

23
Q

What does RANSAC stand for?

A

RANSAC = RANdom SAmple Consensus

24
Q

What is RANSAC?

A

A generic & robust fitting algorithm of models in the presence of outliers (i.e., points which do not satisfy a model)

25
Q

Where can RANSAC be applied?

A

in general to any problems, where the goal is to identify the inliers which satisfy a predefined model

26
Q

What are the 6+ typical RANSAC applications?

A

line extraction from 2D range data,
plane extraction from 3D data,
feature matching,
structure from motion,
camera calibration,
homograph estimation, etc

27
Q

How does RANSAC work?(2 adjectives)

A

is iterative and non-deterministic –> the probability to find a set free of outliers increases as more iterations are used.

28
Q

What is the drawback of RANSAC?

A

a non-deterministic method, results are different between runs

29
Q

What are the 5 steps of RANSAC?
(S.C.C.S.R)

A
  1. Select sample of 2 points at random
  2. Calculate model parameters that fit the data in the sample
  3. Calculate error function for each data point
  4. Select data that supports current hypothesis
  5. Repeat sampling
    Set with the maximum number of inliers obtained after k iterations
30
Q

How many iterations does RANSAC need?

A

k = #
* Ideally: check all possible combinations of 2 points in a dataset of N points.
* Number of all pairwise combinations: ?
* Computationally infeasible if N is too large.
* Example: 10,000 points to fit a line through –> need to check all
10,000* 9,999/2 = 50 million combinations!

31
Q

In RANSAC, do we really need to check all combinations, or can we stop after some iterations?

A
  • Checking a subset of combinations is enough if we have a rough estimate of the percentage of inliers in our dataset.
32
Q

In Hough-Transform, what do points vote for?

A

plausible line parameters

33
Q

What does Hough-Transform do?

A

maps image-space into Hough-space

34
Q

What is Hough-space?

A

voting accumulator, parameterized w.r.t, line characteristics

35
Q

What does a line in image space correspond to in Hough space?

A

a point

36
Q

What does a point in image space correspond to in Hough space?

A

a downward line

37
Q

In Hough-Transform, where is the line that contains both X0,Y0 and X1,Y1?

A

It is the intersection of the lines b = -x0m+ y0 and b = -x1m+ y1