wk4 Flashcards

1
Q

What is the issue with treating vision as an objective medium

A

-Vision is inferential and highly dependent on context
-changes in illumination can shift perceived colour of surrounding images

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What should selected features be invariant with respect to

A
  • illumination
  • scale
  • rotation
    -affine (stretching of image)
  • perspective
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do we help with illumination, scale and rotation invariance

A

illumination: histogram normalization or difference-based shift

Scale: change images to smaller versions via average or mean filter (similar to max pooling)( or use Difference of Gaussian

Rotation: rotate all features in same direction by taking the histogram of all directions and rotate all features to the most dominant direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How could you compress a dynamic video

A

look for static elements in the video e.g. background which doesn’t change and make the pixels static

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is optical flow

A

visualisation of motion between two images:
-identifies features in t and t+1 which correspond to one another
-plots a vector of movement of each pixel or feature from t to t+1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a simplistic way to track motion in two images t and t+1

A

(t+1) - (t) -> if the resulting pixel is greater than some threshold, motion. Else nothing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the problem with simply subtracting two images to determine motion and what should the solution be

A

-Two images could be noisy
-noise (being random by nature) will not be correspondent between the two images
-hence subtracting the two images will show differences in noise mistaken for motion
-gaussian filtering will suppress features so is not appropriate
-solution is using connectedness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is 4 and 8 neighbour and connectedness

A

4 neighbour:
- a feature is only a feature if it shares an adjacent pixel with 4 other features

8 neighbour:
- feature is a feature if it shares a common corner with 8 other features

Connectedness:
-P and Q are connected if a path from P to Q can be joined by a series of pixels such that the path contains 8 adjacent pixels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the aperture problem

A

If we only have access to say a slit or a pinhole camera so we see only local features of an image, we cant truly determine the overall motion of an image, directionality nor proper features when looking locally

As a solution we shouldn’t look at local features but instead identify interesting features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What interesting feature should we investigate rather than edges

A

Corners

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

which detector is used for detecting corners

A

Moravec operator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does Moravec operator work

A

-Take a filter e.g. 5x5, 7x7 etc.
- Place this filter on some point in the image call it filter A
- Shift the filter in each of the component directions (left, right, up, down, the 4 verticals) call this filter B
- for each B_i where i =[0,8]. Take (A-B)^2
- sum all these differences
- threshold the resulting pixels, the largest ones will be those with the greatest magnitude change in all directions which corresponds to corners

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the 3 principles of motion correspondence

A
  • Distinctiveness: individual points must be distinct from one another
  • Similarity: two points should resemble each other if they are the same point affected by motion i.e. point 1 in t and t+1 should look similar
  • Consistency: two matches should have moved in analogous ways to other matches
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

outline of the algorithm for motion correpsonence

A

-find points of interest using Moravec
-pair features of img1 at t and img2 at t+1
- calculate the degree of similarity between each point of interest ,i, in img1 and img2
-calculate the likelihood of each match by calculating similarity weights and converting them to probabilities
-do this for each patch in img1 to each patch in img2, highest probability = most likely match

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the equation for similarity weight from patch i to patch j where patch i is in some image at t and j is in some image at t+1

A

w_i,j = 1 / 1 + (alpha) S_i,j

where S_i,j is similarity between i and j which is equal to patch_j - patch i

and alpha is a constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Once you have all similarity weights, how do you determine the probability/likelihood of a match

A

Normalize the weights such that their sum is 1. The resulting weights correspond to the probability of a match

Highest probability is most likely match

17
Q

How do you translate between cartesian and polar coordinates

A

theta = tan^-1 (y/x)
w = x * cos ( theta ) + y * sin ( theta )

18
Q

How does a hough transform work

A
  • Edge or threshold an image ( can do without it but do it generally)
  • Given a point, translate it to its polar coordinates
    -each point has a representation of w and theta, but each point in the Hough space may be represented by different w and theta scaling proportionally
    -For this point in the hough space, find different values w and theta for it and plot lines to generate these points on the hough space w = x cos ( theta ) + y sin ( theta ). Do this for some range of theta e.g. 0-180
  • Repeat this for multiple points from the original image, this will generate many lines in the hough space which will intersect at different points
    -points in the hough space with many intersections are likely to represent actual / true lines and edges of the image
    -The same can be done for circles and elipses
19
Q

What are the advantages and disadvantages of the Hough Transform

A

Hough transform is good at finding lines and locating them in an image

It however does not find where the line starts or ends

20
Q

How could you potentially improve a hough transform

A
  • try suppress local maxima,
  • edge thinning
  • fill gaps between two lines e.g. if two lines are within some distance, connect them
  • use canny / sobel as a basis for the though transform