Stereopsis Flashcards

1
Q

What is stereopsis

A

Process of recovering 3D geometry from multiple 2D images.
Leverages the slight difference in viewpoints (disparity) between two images to infer depth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Shape from X problem

A

Perspective cues: vanishing points provide depth information
Texture cues: Objects with detailed texture appear smaller as they recede
Shading Cues: lighting and reflections on surfaces convey depth
Focus cues: changing camera focus reveals the depth of objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why multiple views?

A

Single image -> ambiguous due to forced perspective or occlusions
multiple views help resolve ambiguities thru differing perspectives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Binocular stereo and depth perception

A

Binocular stereo:
Simulates how two eyes/cams view the same scene from slightly different angles
Difference in disparity is used to calculate depth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Epipolar Geometry

A

Defines geometric relationship between two images taken from different view points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Epipolar geometry key elements

A

Epipolar plane: contains 3D point P and the two camera centers
Epipolar lines: where the epipolar plane intersects the image planes
Epipoles: Projection of one camera’s optical center onto the other’s image plane

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Epipolar constraint:

A

Point in one image corresponds to a point along the epipolar line in the other image
Reduces search space for correspondence from 2D to 1D

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Epipolar geometry:
- Essential Matrix

A

encodes epipolar geometry when camera intrinsic parameters are known
links points in camera coordinates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Epipolar geometry:
- Fundamental matrix

A

Encodes epipolar gemotry for uncalibrated cameras
Maps a point in one image to a line in the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Depth from disparity

A

Triangulation:
Calculates the 3D position of a point P from its projection in two images
Depth is inversely proportional to disparity
Small disparities indicate distance objects, while large d indicate nearby objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Stereo Matching

A

Goal: Find corresponding points or patches between left and right images
Point matching: match specific features (corners, edges) across images
Region matching: Matches patches instead of individual points to include contextual information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Image rectification

A

Purpose:
Aligns images so that corresponding points lie on the same horizontal line
Simplifies stereo matching to search along one axis
Method: apply homography to warp images onto a common plane parallel to the baseline.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Applications of Stereopsis

A

Depth maps: Calc depth for each pixel, used in augmented reality, 3d reconstruction, and robotic navigation
3D reconstruction: recover 3D shapes from stereo images
Visual effects: replays in sports that reconstruct 3D movements
Autonomous vehicles: use depth maps to detect obstacles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Active vs Passive sensing

A

Passive sensing: uses ambient light and camera pairs, relies on natural textures for matching.
Active sensing: projects patterns or uses lasers to simplify correspondence
- eg. Kinect: projects structured light for depth sensing, LIDAR: emits laser pulses to measure distance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Epipolar in depth
Two cameras:
Image planes

A

Each camera captures a 2D project of a 3D scene
Optical centers Ol, and OR of left and right cameras
Img plane: Each camera has an image plane where 3D points are projected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

2 3D point and Projections

A

a 3D point P in the world is projected onto pL, left image plane
pR, right image plane

17
Q

Epipolar plane in depth

A

Plane defined by P, oL, and oR is called the epipolar plane.
epipolar plane intersects the two image planes along lines called epipolar lines.

18
Q

Epipolar lines in depth

A

Epipolar lines are the intersections of the epipolar plane with the image planes
a point pL in the left image corresponds to a point pR in the right image along the corresponding epipolar line