6 -Depth Perception Flashcards
why is depth perception important?
3D perception is vital for interacting with the world and recognising objects.
why is depth perception a problem
How do we obtain a 3D
percept from two 2D
images on the retina?
what is the inverse problem
Any retinal image is consistent with infinitely many possible configurations of the world
what are binocular cues
– Binocular disparity (Stereo vision)
Two eyes looking at stimulus from different locations
Visual system matches left and right image
Different images - use binocular disparities differences to see what the depth looks like
what are motion cues
we move objects move
Things that are close to us will move more and things further away will move less
In the opposite direction to direction were travelling
motion parallax - due to self-motion
kinetic depth (KDE) - due to object motion
When an object is moving will cause motion in the retinal image
Different parts of surface will move at different speeds, texture closest moves fast but texture at side moving slower and less
what is kinetic depth
due to object motion
When an object is moving will cause motion in the retinal image
Different parts of surface will move at different speeds, texture closest moves fast but texture at side moving slower and less
what is motion parallax
due to self-motion
Things that are close to us will move more and things further away will move less
In the opposite direction to direction were travelling
what are the 6 pictorial cues
texture elevation relative size perspective shading occlusion
what is texture
change in density and size
higher density further away
what is relative size
big image closer
smaller image gurther away
what is perspective
lines converge as they get further away from us
what is shading
patterns of light and dark can give an impression of cavities and convexities
convergence for a far target and near target
Info we get bc of eyes
Converge towards each other to fixate on certain objects
Converge more when closer
Distance - parallel
sends to visul system how far away the thing were fixating on is
what is accomodation
Focused image on retina
Depending on how far away things is lens changes to focus light
Lens fat when thing close to you to bend image
Further away thinner stretched out more to bring far away object
Send signals to higher up in visual system to say how far away the thing is
If we have so many different cues to depth what is the problem?
- Many cues are ambiguous – 2D image compatible with infinite 3D worlds
- With multiple cues available – how do we perceive a single unified world?
ambiguity in depth perception
many of the 3D cues (particularly the pictorial
cues) are ambiguous
ambiguity in shading
This retinal image could have been created by: • a convex object that is lit from above (A) • a concave object lit from below (B) • (or it could be a flat surface with a pattern of light & dark greys painted on it)
how can we overcome ambiguity in depth perception
- Many of the 3D cues (particularly the pictorial cues) are ambiguous
- We can overcome this ambiguity by using prior knowledge, or prior assumptions, to interpret the image.
- These assumptions are gained through our knowledge and experience of the physical properties of the world.
• This is a type of top-down processing and supports the constructivist approach to vision
(see week 1)
e.g. interpreting train tracks
To interpret this image we can make the assumption that lines in the world tend to be parallel.
e.g. shading in image
To interpret this retinal image we use an assumption that light comes from above
convex chape
ambiguity in texture
This retinal image could have
been created by either:
1. A 2D square with a pattern of ellipses (ovals) that change shape
2. A 3D cylinder with a 3D pattern of circles wrapped around
to interpret ambiguity in texture
To interpret these retinal images we assume surface textures are isotropic (unbiased orientation) and homogeneous (uniform density)
• Making this assumption means any changes in image texture orientation or density are attributed to changes in 3D surface orientation