Lecture 4: Computer Vision Flashcards

Question 1

Q

What is the most powerful sense?

Question 2

Q

The retina has millions of what with a data-rate of what?

Answer

A

photoreceptors with a data-rate of 3 GBytes

Question 3

Q

Name 5 applications of Computer Vision.

Answer

A

AI, ML, Psychology NeuroSci, Robotics, Autonomous driving, hazardous waste clean-up, search and rescue, space exploration

Question 4

Q

What does automatic extraction mean?

Answer

A

‘extracting’ meaningful info from images and videos

Question 5

Q

What are the two types of info that can be extracted and provide examples for both.

Answer

A

Semantic Info(ground, outdoors, shade, roof, sky, door, building)
Geometric Info(shapes)

Question 6

Q

What are three examples of Automatic Reconstruction and Recognition?

Answer

A

Partial 3D from overlapping images
Dense 3D surface model
Face recognition

Question 7

Q

What reduces blurring in image formation?

Answer

A

Blocking most of the rays by adding a barrier between the object and the photoreceptive surface.

Question 8

Q

What is the opening in image formation called?

Answer

A

an Aperture

Question 9

Q

The depth of the room/box is what?

Answer

A

the effective focal length

Question 10

Q

What is the pinhole model? What’s the name of the point? Where is that image formed?

Answer

A

captures beams of rays through a single point.
that point is the Center of Projection or Optical Center.
the image is formed on the Image Plane

Question 11

Q

What are the effects of shrinking the aperture?

Answer

A

The image could suffer from diffraction effects because less light gets through, which increases the exposure.

Question 12

Q

What is the ideal pinhole?

Answer

A

There’s only one ray of light that reaches each point on the film.

Question 13

Q

What could make images more blurry?

Answer

A

making the pinhole bigger

Question 14

Q

Perspective is what?

Answer

A

the dependence of the apparent size of an object on its depth

Question 15

Q

What is Stereo Vision?

Answer

A

It uses 2 cameras with known relative position T and orientation R to recover the 3D scene information

Question 16

Q

What is Structure from Motion?

Answer

Study These Flashcards

A

It recovers the 3D scene structure & the camera poses(up to scale) from multiple images, from potentially unknown cameras.

Question 17

Q

What is Stereopsys?

Answer

Study These Flashcards

A

the brain allows us to see the left and right retinal images as 1 3D image, we can observe image disparity.

Question 18

Q

What is the ideal, simplified case of Stereo Vision?

Answer

Study These Flashcards

A

assume both cameras are identical and are aligned with the x-axis

Question 19

Q

What is the ideal, simplified case of Stereo Vision?

Answer

Study These Flashcards

A

assume both cameras are identical and are aligned with the x-axis

Question 20

Q

What is the name of the place where all rays parallel to the optical axis converge?

Answer

Study These Flashcards

A

focal point

Question 21

Q

What two ways do we use to measure distances with cameras?

Answer

Study These Flashcards

A

Stereo Vision, Structure from Motion

Question 22

Q

Vision is increasingly popular as a sensing modality because of what 4/5 things?

Answer

Study These Flashcards

A

-Descriptive,
-Compactness/compatibility
-Low cost
-Hardware advances necessary to support the processing of images

Lecture 4: Computer Vision Flashcards

(22 cards)