Lecture 4: Computer Vision Flashcards
What is the most powerful sense?
Vision
The retina has millions of what with a data-rate of what?
photoreceptors with a data-rate of 3 GBytes
Name 5 applications of Computer Vision.
AI, ML, Psychology NeuroSci, Robotics, Autonomous driving, hazardous waste clean-up, search and rescue, space exploration
What does automatic extraction mean?
‘extracting’ meaningful info from images and videos
What are the two types of info that can be extracted and provide examples for both.
Semantic Info(ground, outdoors, shade, roof, sky, door, building)
Geometric Info(shapes)
What are three examples of Automatic Reconstruction and Recognition?
Partial 3D from overlapping images
Dense 3D surface model
Face recognition
What reduces blurring in image formation?
Blocking most of the rays by adding a barrier between the object and the photoreceptive surface.
What is the opening in image formation called?
an Aperture
The depth of the room/box is what?
the effective focal length
What is the pinhole model? What’s the name of the point? Where is that image formed?
captures beams of rays through a single point.
that point is the Center of Projection or Optical Center.
the image is formed on the Image Plane
What are the effects of shrinking the aperture?
The image could suffer from diffraction effects because less light gets through, which increases the exposure.
What is the ideal pinhole?
There’s only one ray of light that reaches each point on the film.
What could make images more blurry?
making the pinhole bigger
Perspective is what?
the dependence of the apparent size of an object on its depth
What is Stereo Vision?
It uses 2 cameras with known relative position T and orientation R to recover the 3D scene information
What is Structure from Motion?
It recovers the 3D scene structure & the camera poses(up to scale) from multiple images, from potentially unknown cameras.
What is Stereopsys?
the brain allows us to see the left and right retinal images as 1 3D image, we can observe image disparity.
What is the ideal, simplified case of Stereo Vision?
assume both cameras are identical and are aligned with the x-axis
What is the ideal, simplified case of Stereo Vision?
assume both cameras are identical and are aligned with the x-axis
What is the name of the place where all rays parallel to the optical axis converge?
focal point
What two ways do we use to measure distances with cameras?
Stereo Vision, Structure from Motion
Vision is increasingly popular as a sensing modality because of what 4/5 things?
-Descriptive,
-Compactness/compatibility
-Low cost
-Hardware advances necessary to support the processing of images