Perceiving Depth Flashcards

1
Q

Sources of depth information that would be available in a 2D picture (cues that rely on only one eye)

A

Pictorial (monocular)

2
Q

Information obtained from/relating to motion

A

Kinematic Cues

3
Q

Cues based on sensing the position of the eyes and muscle tension

A

Oculomotor

4
Q

Information obtained by comparing input from the left and right eyes (binocular)

A

Stereoscopic

5
Q

Pretend this puppy is driving this car… Objects nearer to him appear to move

A

Faster

6
Q

Motion Parallax/Motion Perspective

A
  • when the observer moves, the displacement of an object’s image on the retina depends on the object’s distance
  • images of closer objects move more than images of farther objects
  • Optic flow: when the whole visual field is considered,
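As a rough illustration of the distance dependence in motion parallax, the sketch below assumes an observer translating sideways at constant speed and a stationary object directly to the side of the path. Under those assumptions the image's angular speed is the observer's speed divided by the object's distance. The function name `angular_speed_deg` and the example speed and distances are illustrative, not from the cards.

```python
import math


def angular_speed_deg(observer_speed_m_s: float, distance_m: float) -> float:
    """Angular speed (deg/s) of a stationary object directly abeam of a moving observer.

    Assumes straight-line observer motion and an object at the point of
    closest approach, where angular speed = speed / distance (in rad/s).
    """
    return math.degrees(observer_speed_m_s / distance_m)


if __name__ == "__main__":
    speed = 10.0  # assumed observer speed in m/s (hypothetical value)
    for distance in (2.0, 20.0, 200.0):
        print(f"{distance:6.1f} m -> {angular_speed_deg(speed, distance):8.2f} deg/s")
```

Nearer objects sweep across the visual field faster, which matches the "Faster" answer on the earlier card.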
7
Q

Expansion/Contraction

A
  • when an object approaches, its image expands
  • if the object is on a hit (collision) path, the expansion is symmetric

8
Q

Accretion/Deletion of Texture

A

when one surface moves relative to another, the nearer surface progressively covers (deletion) or uncovers (accretion) background texture on the farther surface

9
Q

Kinetic Information

A
  • means relating to motion
    • motion perspective/motion parallax
    • optical expansion/contraction
    • accretion/deletion of texture
10
Q

Stereoscopic: Binocular Disparity

A
  • differences in the two eyes’ views of an object
  • amount of disparity depends on the distance of an object from the observer
  • the two images of a three-dimensional world are not the same
11
Q

Stereoscopic: Horopter

A
  • refers to the set of points in the world that have zero binocular disparity relative to the fixated point (their images fall on corresponding locations in the two retinas)
  • crossed disparity indicates that a point is nearer to the observer than the point being fixated.
  • uncrossed disparity indicates that a point is farther from the observer than the point being fixated
12
Q

Oculomotor: Accommodation

A
  • refers to changes in the shape of the lens to achieve focused images at varying distances
  • may provide distance info via unconscious sensing of the muscular movements (in the ciliary muscles) that produce the lens changes
13
Q

Oculomotor: Convergence/Divergence

A
  • refers to the turning of the two eyes to bring a particular point onto the center of fixation (fovea) of each eye
  • provides depth info via unconscious sensing of the muscular movements used to turn the eyes
  • convergence: the eyes turn inward to fixate something nearer
  • divergence: the eyes turn outward to fixate something farther away
  • only works in near space (doesn’t work past 2 meters)
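The 2-meter limit can be made plausible with a little geometry. The sketch below assumes a fixation point straight ahead and an interpupillary distance of 6.3 cm; both the value and the function name `vergence_angle_deg` are illustrative assumptions, not from the cards. It prints the angle between the two lines of sight at several distances.

```python
import math

IPD_M = 0.063  # assumed interpupillary distance in meters (hypothetical value)


def vergence_angle_deg(distance_m: float, ipd_m: float = IPD_M) -> float:
    """Angle (degrees) between the two lines of sight for a point straight ahead."""
    return math.degrees(2.0 * math.atan(ipd_m / (2.0 * distance_m)))


if __name__ == "__main__":
    for distance in (0.25, 0.5, 1.0, 2.0, 4.0, 8.0):
        print(f"{distance:5.2f} m -> {vergence_angle_deg(distance):6.2f} deg")
```

Under these assumptions the angle changes by several degrees between 0.25 m and 0.5 m but by less than one degree between 2 m and 4 m, so the muscular signal carries little distance information beyond near space.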
14
Q

Oculomotor cues provide ______

A

metric: absolute distance information

* limitation: not useful past 2 meters, where the lens (accommodation) is as thin as it gets

15
Q

Pictorial Information (monocular)

A
  • monocular cues (can operate with only one eye)
  • mostly relates to the rules of optics and geometry that govern the projection of the world onto the retina
  • involves using the rules of projection in reverse (inverse optics)
  • laws of optics: scene → retina
  • inverse optics: retina → scene
16
Q

What are some results of the laws of optics that the brain might use to infer depth of objects in a 2D image?

A
  • Nearer objects take up more of the visual field
  • The farther away an object is, the nearer it appears to the horizon (vanishing point)

17
Q

Perception as an inference:

A

the brain infers the scene that is most probable

the nervous system calculates the probability of each scene given the sensory evidence, and prior knowledge, and chooses the scene that has the highest probability

    • unnoticed judgement: Al-Haytham/Alhazen
    • unconscious inference: von Helmholtz
  • Bayes rule
18
Q

Bayes Rule

A
  • the probability of a specific scene given an image is proportional to the probability that the scene gives rise to that image, times the probability of that scene in general
  • P(Sx | I) [posterior] ∝ P(I | Sx) [likelihood] × P(Sx) [prior]
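A minimal sketch of this rule over a small set of candidate scenes may help fix the roles of likelihood, prior, and posterior. The scene names and all the numbers are hypothetical, and the function name `posterior` is an illustrative choice. Each likelihood is multiplied by its prior and the products are normalized so they sum to 1.

```python
def posterior(likelihoods: dict[str, float], priors: dict[str, float]) -> dict[str, float]:
    """Return P(scene | image) for each scene from P(image | scene) and P(scene)."""
    unnormalized = {scene: likelihoods[scene] * priors[scene] for scene in priors}
    total = sum(unnormalized.values())  # equals P(image) over the listed scenes
    return {scene: value / total for scene, value in unnormalized.items()}


# Hypothetical example: the image is equally consistent with both scenes,
# so the prior decides which interpretation wins.
priors = {"convex": 0.7, "concave": 0.3}
likelihoods = {"convex": 0.5, "concave": 0.5}

post = posterior(likelihoods, priors)
best_scene = max(post, key=post.get)

if __name__ == "__main__":
    print(post, "->", best_scene)
```

Choosing the scene with the highest posterior corresponds to the "chooses the scene that has the highest probability" step described on the previous card.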
19
Q

Combining depth cues

A
  • P(d | c1, c2, c3, …) ∝ P(c1 | d) P(c2 | d) … P(d), where d is depth and c1, c2, … are the cues (assumed independent given d)
  • the nervous system follows an optimal statistical rule in combining different cues (a weighted average if P(d) is uniform)
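The weighted-average case can be sketched under two added assumptions that are not stated on the card: each cue gives a depth estimate corrupted by independent Gaussian noise, and the prior over depth is uniform. In that case the most probable depth is the average of the cue estimates weighted by their reliabilities (inverse variances). The function name `combine_cues` and the example numbers are hypothetical.

```python
def combine_cues(estimates: list[float], sigmas: list[float]) -> tuple[float, float]:
    """Combine per-cue depth estimates by inverse-variance weighting.

    Assumes independent Gaussian noise per cue and a uniform prior on depth.
    Returns the combined estimate and its standard deviation.
    """
    weights = [1.0 / sigma**2 for sigma in sigmas]
    total_weight = sum(weights)
    mean = sum(w * x for w, x in zip(weights, estimates)) / total_weight
    sigma = (1.0 / total_weight) ** 0.5
    return mean, sigma


# Hypothetical example: a reliable cue says 2.0 m, a noisier cue says 3.0 m.
combined_depth, combined_sigma = combine_cues([2.0, 3.0], [0.5, 1.0])

if __name__ == "__main__":
    print(f"combined depth = {combined_depth:.2f} m, sigma = {combined_sigma:.3f} m")
```

The combined estimate sits closer to the more reliable cue, and its uncertainty is smaller than that of either cue alone.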
20
Q

Which cue(s) provide non-metric information?

A

Occlusion

21
Q

Which cue(s) provide metric information?

A

Convergence, Accommodation, and Familiar Size

22
Q

Which cue(s) provide relative metric information?

A

Motion Parallax
Relative Size
Relative Height
Binocular Disparity

23
Q

Pictorial Depth cues

A
Relative Size
Familiar Size
Texture Gradient 
Relative Height
Linear Perspective
Occlusion 
Aerial Perspective
24
Q

Bayesian Inference

A

Combining multiple sources of information to arrive at a final percept of the depth to an object or a final interpretation of a 2D image

The final percept will be some combination of prior beliefs and the evidence at hand

25
Q

Priors

A

Before even looking at an image, we have an a priori belief about how likely it is that the world is in a given state

• The evidence we obtain from the image is combined with these a priori beliefs

26
Q

Bayesian Inference: The Basic Idea

A
  • Multiple kinds of information are taken into account to arrive at a final decision or percept
  • A priori, humans have some belief about the likelihood of the world being in a given state (PRIOR)
  • The retinal image could have been produced by many possible states of the world. However, assuming a given state of the world, that state has a likelihood of producing the retinal image supposing an image of the scene were taken from a random viewpoint (LIKELIHOOD)
27
Q

Likelihood:

A

Assuming a given state of the world is true, the likelihood of that state producing the 2D image shown

28
Q

Posterior:

A

Probability that the reality is a given state of the world given that we observed image I

29
Q

Generic viewpoint:

A

The vast majority of random viewpoints of a scene will fall into this category; these viewpoints provide similar information about the scene and suggest roughly the same surfaces

30
Q

Accidental viewpoint:

A

A kind of “freak accident” viewpoint. The 3D surfaces suggested at this viewpoint are very unlike those existing in the real world.