3D Vision Flashcards
3D Vision:
Name a difficulty that arises when real-world scenes are projected from 3D to 2D in camera images.
Depth estimation
What are the two main types of 3D Data Acquisition?
Passive and Active range sensing
How many types of passive methods are there?
4. Shape from:
- stereo;
- motion;
- shading;
- focus.
What are the pros of shape from stereo?
- Cheap (uses cameras)
- Fast acquisition
What are the cons of shape from stereo?
- Highly dependent on the quality of correspondences
- Still challenging
What are the characteristics of Shape from motion?
– Similar to stereovision in many ways
– Successive images can be considered as stereo pairs
– With texture, it is possible to find correspondences (matching techniques, optical flow…) and estimate the fundamental and essential matrices.
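A minimal sketch of that pipeline with OpenCV's Python API (the image file names and the intrinsics K are made-up, and ORB is just one possible way to obtain correspondences):

import cv2
import numpy as np

img1 = cv2.imread("frame1.png", cv2.IMREAD_GRAYSCALE)   # two views from a moving camera
img2 = cv2.imread("frame2.png", cv2.IMREAD_GRAYSCALE)
K = np.array([[700.0, 0.0, 320.0],                      # assumed camera intrinsics
              [0.0, 700.0, 240.0],
              [0.0, 0.0, 1.0]])

# Find correspondences between the two views.
orb = cv2.ORB_create()
kp1, des1 = orb.detectAndCompute(img1, None)
kp2, des2 = orb.detectAndCompute(img2, None)
matches = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True).match(des1, des2)
pts1 = np.float32([kp1[m.queryIdx].pt for m in matches])
pts2 = np.float32([kp2[m.trainIdx].pt for m in matches])

# Fundamental matrix from pixel correspondences; essential matrix using the intrinsics.
F, _ = cv2.findFundamentalMat(pts1, pts2, cv2.FM_RANSAC)
E, _ = cv2.findEssentialMat(pts1, pts2, K, method=cv2.RANSAC)

# Recover the relative camera motion (rotation R, translation direction t) from E.
_, R, t, _ = cv2.recoverPose(E, pts1, pts2, K)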
What are the characteristics of shape from shading?
- Given a continuous surface and known illumination, the intensity variation on the surface depends on its orientation
- Since most surfaces are not uniform and lighting is difficult to control, it is normally combined with other methods
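Shape from shading usually assumes a reflectance model; under the common Lambertian assumption the observed intensity is I = albedo * max(0, n . l), so intensity constrains the surface orientation. A tiny numerical illustration (albedo, normal and light direction are made-up values):

import numpy as np

albedo = 0.8                                                 # assumed surface reflectance
light = np.array([0.0, 0.0, 1.0])                            # light direction
normal = np.array([0.0, np.sin(np.pi/6), np.cos(np.pi/6)])   # surface tilted by 30 degrees

intensity = albedo * max(0.0, float(np.dot(normal, light)))
print(intensity)   # ~0.69, darker than a surface facing the light head-on (0.8)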
What is shape from focus?
- Objects away from the focal point are out of focus;
- with several images taken at different focus settings it is possible to extract depth information
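A rough sketch of shape from focus with NumPy/OpenCV, assuming a hypothetical focal stack with known focus distances; the squared Laplacian response is used as a simple per-pixel focus measure, and the index of the sharpest image gives a coarse depth:

import cv2
import numpy as np

files = ["focus_0.png", "focus_1.png", "focus_2.png", "focus_3.png"]   # hypothetical focal stack
focus_dist = np.array([0.3, 0.6, 1.0, 2.0])                            # focus distance of each image, m

stack = [cv2.imread(f, cv2.IMREAD_GRAYSCALE).astype(np.float64) for f in files]

measures = []
for img in stack:
    lap = cv2.Laplacian(img, cv2.CV_64F)                     # strong response where the image is sharp
    measures.append(cv2.GaussianBlur(lap * lap, (9, 9), 0))  # smooth the focus measure locally
measures = np.stack(measures)                                # shape: (n_images, H, W)

best = np.argmax(measures, axis=0)                           # sharpest image index per pixel
depth_map = focus_dist[best]                                 # coarse per-pixel depth, m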
How many types of Active range sensing are there?
2:
- Structured Light Systems
- Laser Range Finder – Time of Flight
How are Structured Light Techniques implemented?
- Projection of a known pattern
- Acquisition with a camera; 3D is obtained from the pattern's deformation in the scene.
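In the simplest rectified camera-projector arrangement the geometry is the same as stereo triangulation: the shift of a pattern element in the image acts as a disparity, and depth follows from Z = f * b / d. A toy sketch with made-up numbers:

f = 800.0        # focal length, in pixels (illustrative)
b = 0.10         # camera-projector baseline, m (illustrative)
d = 16.0         # observed shift of a pattern element, in pixels (illustrative)
Z = f * b / d    # depth of that scene point, m
print(Z)         # 5.0 m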
What is the only positive point about Structured Light Techniques?
It is very accurate
What are the cons of Structured Light Techniques?
- Takes time (often needs to scan through an area)
- Sensitive to environment brightness, usually only implemented in dark or indoor areas
- Short range
What are Laser Range Finders (LRF) used for?
They are used for larger areas (buildings, rooms)
How do Laser Range Finders (LRF) work?
Working principle:
- light pulse time of flight: laser pulse out -> reflection on the object -> pulse back to the sensor
- phase shift: amplitude or frequency modulation of the emitted light, comparison of the emitted and received phases
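The pulse time-of-flight principle boils down to one formula: the pulse travels to the object and back, so distance = c * t / 2. A minimal sketch in Python (the round-trip time is an illustrative value):

c = 299_792_458.0        # speed of light, m/s
t = 66.7e-9              # measured round-trip time of the laser pulse, s (illustrative)
distance = c * t / 2.0   # halved because the pulse travels out and back
print(distance)          # ~10 m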
What are the LRF pros?
- independent of external lighting;
- no need for texture in the scene;
- provide 3D measurements directly
What are the LRF cons?
They are:
- expensive;
- large sensors, which make acquisition more difficult;
- limited spatial resolution;
- no colour texture map
How do 3D ToF Cameras work?
They use the phase-shift principle between emitted and received infrared light to measure depth
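For continuous-wave ToF the depth follows from the measured phase shift of the modulated infrared signal, d = c * delta_phi / (4 * pi * f_mod). A minimal sketch (modulation frequency and phase are illustrative values):

import numpy as np

c = 299_792_458.0          # speed of light, m/s
f_mod = 30e6               # modulation frequency, Hz (illustrative)
delta_phi = np.pi / 2      # measured phase shift, rad (illustrative)

distance = c * delta_phi / (4 * np.pi * f_mod)
print(distance)            # ~1.25 m; the unambiguous range here is c / (2 * f_mod), about 5 m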
How could we define the performance and cost of Structured Light?
It has:
- best depth accuracy | shortest range | requires a dark environment
- highest cost
How could we define the performance and cost of Time of Flight (ToF)?
- range up to a hundred meters, depending on emitting power
- moderate cost
How could we define the performance and cost of a Camera Array?
- largest depth error | range depends on the baseline (distance between cameras, usually around 10 m) | requires a bright environment
- lowest cost | development mainly on the software side
In a nutshell, which performs better in each of the following categories: Active (Range - ToF) or Passive (camera arrays)?
Cost, acquisition, depth error, texture map, lighting, texture relevance and 3D processing
Cost, acquisition, depth error, texture map (first 4 categories):
Intensity (camera arrays) performs better - Passive
Lighting, texture relevance and 3D processing (last 3 categories):
Range (ToF) performs better - Active
What type does Kinect use?
Active - infrared pattern.
What composes a Kinect?
The Kinect has:
- a multi-array mic
- 3D Depth sensors
- RGB camera
- motorized tilt
Where can 3D Vision be applied?
Robotics:
- Navigation, localization, mapping, collision avoidance
AR / VR:
- sensing real 3D environments and reconstructing them in the virtual world
To what must AR / VR interaction devices respond accurately?
To 3D movement, therefore they need a high-performance depth sensor
What is a Range image?
- It is a rectangular array of numbers that quantifies the distance from the sensor to the surfaces within the field of view.
- It is also referred to as a depth image and is easily transformed into a cloud of points.
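A minimal sketch of that transformation with NumPy, assuming a pinhole camera model; the intrinsics (fx, fy, cx, cy) and the synthetic depth image are made-up values:

import numpy as np

def range_image_to_points(depth, fx, fy, cx, cy):
    # Back-project an H x W depth image (m) into an N x 3 point cloud.
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth
    x = (u - cx) * z / fx           # pinhole model: u = fx * x / z + cx
    y = (v - cy) * z / fy
    pts = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    return pts[pts[:, 2] > 0]       # keep only pixels with a valid range

depth = np.full((480, 640), 2.0)    # synthetic depth image: everything 2 m away
cloud = range_image_to_points(depth, fx=525.0, fy=525.0, cx=319.5, cy=239.5)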
How do edges differ between intensity and range images?
- In intensity images: edges are related to intensity changes (due to geometry or appearance - for example colour or shadow)
- In range images: jump or step edges; roof or crease edges; smooth edges
What is Registration?
It estimates the Rigid Body Transform that minimizes the distance between 2 scans
What is the ICP (Iterative Closest Point) and how does it work?
It is an algorithm used to perform the registration:
– Find closest point
– Compute transform that minimizes error
– Repeat until ending condition.
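A compact point-to-point ICP sketch with NumPy and SciPy, illustrating the loop above (closest points via a k-d tree, best rigid transform via SVD); it is an illustration, not a production implementation:

import numpy as np
from scipy.spatial import cKDTree

def best_rigid_transform(src, dst):
    # Least-squares rotation R and translation t mapping src onto dst (Kabsch / SVD).
    cs, cd = src.mean(axis=0), dst.mean(axis=0)
    H = (src - cs).T @ (dst - cd)
    U, _, Vt = np.linalg.svd(H)
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:        # guard against reflections
        Vt[-1] *= -1
        R = Vt.T @ U.T
    return R, cd - R @ cs

def icp(src, dst, iters=50, tol=1e-6):
    tree = cKDTree(dst)             # speeds up the closest-point search
    cur, prev_err = src.copy(), np.inf
    for _ in range(iters):
        dists, idx = tree.query(cur)                   # 1) find closest points
        R, t = best_rigid_transform(cur, dst[idx])     # 2) transform minimizing the error
        cur = cur @ R.T + t                            # 3) apply it and repeat
        err = dists.mean()
        if abs(prev_err - err) < tol:                  # ending condition: error stops improving
            break
        prev_err = err
    return cur                      # src aligned to dst (still needs a decent initial guess)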
What are the ICP problems?
- the algorithm may fall into local minima
- typically requires an initial guess
- may result in many outliers
Besides ICP, registration can also be done from clouds to surfaces. What types are there?
– Non-parametric surfaces (triangles, …)
– Parametric surfaces (cylinders, quadrics, …)
What are some examples of triangulation algorithms?
Delaunay triangulation 2D;
Marching cubes;
Marching triangles;
Ball-pivoting;
Poisson Surface Reconstruction;
Moving least-squares (MLS)
How does the Delaunay triangulation work?
For a set of 2D points P, ensure that no point of the set is inside the circumcircle of any triangle.
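With SciPy the construction is a one-liner; a minimal sketch on a made-up 2D point set:

import numpy as np
from scipy.spatial import Delaunay

points = np.random.default_rng(0).random((30, 2))   # hypothetical 2D point set
tri = Delaunay(points)                              # triangles satisfy the empty-circumcircle property
print(tri.simplices[:5])                            # each row: vertex indices of one triangle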
What is Zippering?
- remove overlapping portions of the meshes
- clip the meshes together
- remove triangles
What is Texture Mapping?
Mapping colour/texture information onto the reconstructed 3D geometry. Some 3D reconstruction techniques provide texture automatically, for example:
– Shape from …
– Structured Light Techniques
Do all kinds of reconstruction techniques provide texture?
No.
For example: the initial Laser Range Finder scan does not provide texture