Depth Perception Flashcards
Why is depth perception important?
3D perception is vital for interacting with the
world and recognising objects
Why is depth perception a problem?
How do we obtain a 3D
percept from two 2D
images on the retina?
Define the inverse problem
Any retinal image is consistent with infinitely
many possible configurations of the world
simply = The challenge of determining the objects and properties in the environment that produced a particular sensory input
Any retinal image is consistent with infinitely
many possible configurations of the world
simply = The challenge of determining the objects and properties in the environment that produced a particular sensory input
This is known as…?
The inverse problem
There are multiple sources of 3D information
List 4 types of 3D cues
- Binocular
- Motion
- Pictorial
- Oculomotor
- Binocular
- Motion
- Pictorial
- Oculomotor
What cues are these?
3D cues
Define binocular disparity
We have 2 eyes and each eye produces a different retinal image each
We have 2 different retinal images due to different views
Brain puts those 2 retinal images together, to see things in 3D
We have 2 eyes and each eye produces a different retinal image each
We have 2 different retinal images due to different views
Brain puts those 2 retinal images together, to see things in 3D
This is known as…?
Binocular disparity
Define motion cues
As we move, bits of the world move with our retina at different rates
As we move, bits of the world move with our retina at different rates
This is known as…?
Motion cues
What is motion parallax
Things in the distance move slowly in the retinal image to tell us about how far away things are
What is kinetic depth (KDE)
Due to object motion, bits at the side will move slower, bits in the middle will move faster and a lot more
Due to object motion, bits at the side will move slower, bits in the middle will move faster and a lot more
This is known as…?
kinetic depth (KDE)
Things in the distance move slowly in the retinal image to tell us about how far away things are
This is known as…?
Motion parallax
What are the 6 types of pictorial cues?
- Texture
- Elevation
- Relative size
- Perspective
- Shading
- Occlusion
What is occlusion pictorial cue?
If one object block the view of another object, it is closer to us
What is perspective pictorial cue?
Lines that are converging tend to move away from us
What are the 2 types of oculomotor cues?
- Convergence
- Accomodation
Lens changes shape to focus light into the retinal image
Fatter lens:
a. To focus on things far from you
b. To focus on things close to you
b. To focus on things close to you
Do our eyes converge more or less when the object is near?
More
Do our eyes converge more or less when the object is far?
Less
How do our eyes accommodate for a near object?
Lens becomes fatter
How do our eyes accommodate for a far object?
Lens becomes thinner
If we have so many different cues to depth what is the problem?
List 2 problems
- Many cues are ambiguous - 2D image compatible with infinite 3D worlds
- With multiple cues available – how do we
perceive a single unified world?
Many of the 3D cues (particularly the ____
cues) are ambiguous
Pictorial
The retinal image of ambiguity in perspective could have been created by 3 types of cues
image = a train track (parallel lines that look like they are converging at a distance) is a 2D retinal image
What are they?
- Parallel lines in the world receding away from you (e.g. A)
- Converging lines that are a constant distance from you (e.g. B)
- Infinite other 3D stimuli
The retinal image of ambiguity in shading could have been created by 3 types of cues
image = circle with a lighter top and darker bottom (shading)
What are they?
- A convex object that is lit from above (A)
- A concave object lit from below (B)
- It could be a flat surface with a pattern of light and
dark greys painted on it
Many of the 3D cues (particularly the pictorial
cues) are ambiguous
How do we overcome this ambiguity?
By using prior knowledge, or prior assumptions, to interpret the image
We can overcome ambiguity in depth perception by using prior knowledge, or prior assumptions, to interpret the image
How are these assumptions gained?
Through our knowledge and experience of the physical
properties of the world
We can overcome ambiguity in depth perception by using prior knowledge, or prior assumptions, to interpret the image
Is this bottom-up or top-down processing?
Top-down processing
We can overcome ambiguity in depth perception by using prior knowledge, or prior assumptions, to interpret the image
What approach to vision does this type of processing support?
Constructivist approach, we draw out knowledge about the world to make sense of things
We grew up in a world full of parallel lines, so we often interpret parallel lines as…?
Moving away from us
True or False?
We make the assumption
that lines in the world tend to be parallel
True
True or False?
We make the assumption that light comes from below
False
We make the assumption that light comes from above
e.g. At home, lights are on the ceiling, sunlight comes from above
The higher up the object looks in the image, the ___ it is from us
a. Closer
b. Further
b. Further
How do we resolve ambiguity in elevation?
We assume the object rests on a ground plane (parallel to the ground)
We grew up in the world with gravity, everything rests on a ground plane
The retinal image of ambiguity in elevation could have been created by 2 types of cues
What are they?
- Objects sitting on a surface (A)
- Objects floating in the air (B)
This retinal image could have been created by:
- Objects sitting on a surface (A)
- Objects floating in the air (B)
What type of ambiguous retinal image does this apply to?
a. Ambiguity in elevation
b. Ambiguity in perspective
c. Ambiguity in shading
a. Ambiguity in elevation
This retinal image could have been created by:
- Parallel lines in the world receding away from you (e.g. A)
- Converging lines that are a constant distance from you (e.g. B)
- Infinite other 3D stimuli
What type of ambiguous retinal image does this apply to?
a. Ambiguity in elevation
b. Ambiguity in perspective
c. Ambiguity in shading
b. Ambiguity in perspective
This retinal image could have been created by:
- Aconvex object that is lit from above (A)
- A concave object lit from below (B)
- A flat surface with a pattern of light and dark greys painted on it
What type of ambiguous retinal image does this apply to?
a. Ambiguity in elevation
b. Ambiguity in perspective
c. Ambiguity in shading
c. Ambiguity in shading