Perception of objects, motion and depth Flashcards
What was the evolutionary benefit of vision?
Early primates started foraging for a rich source of calories: fruits and
nuts that usually grow at the ends of tree branches → this requires
reaching and grasping.
They needed to identify the shapes and 3D positions of objects, then move to get
them: object recognition, motion and depth perception.
1/3 to 1/2 of the primate brain cortex is involved in visual perception.
Human brain further exploits this specialization: e.g. creating new
tools, reading.
What is the ventral stream for and what is it comprised of?
Ventral stream: object identification.
Cortical areas: V1 → V2 → V4 → Lateral Occipital Complex (LOC) & Infero-temporal cortex (IT)
What is the dorsal stream for and what is it comprised of?
Dorsal stream: motion and position in space.
Cortical areas: V1 → Middle Temporal (MT or V5) → Medial Superior Temporal (MST) → Posterior Parietal Cortex (PPC)
How do these usually work to encode vision and the perception of things?
There is a **hierarchical organization**: early visual areas encode image
features; at the later stages, features are combined into global
structures.
(V1 → V2 → V4 → Infero-temporal cortex)
Predictive coding: higher-order areas send a prediction of the input
to the early visual areas; the prediction error signal (the mismatch
between predicted and actual input) is sent back to the higher-order
areas to update the representation.
[Diagram labels: input, prediction, error signal]
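The predictive coding loop described on this card can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: a single scalar input, a single scalar representation `r`, a fixed `weight` linking the two, and a simple error-driven update rule. It is not a model of real cortical circuitry.

```python
def predictive_coding(sensory_input, weight=2.0, rate=0.1, steps=200):
    """Toy loop: a higher-order estimate r is refined until weight * r predicts the input."""
    r = 0.0  # hypothetical higher-order representation
    for _ in range(steps):
        prediction = weight * r             # top-down prediction sent to the early area
        error = sensory_input - prediction  # mismatch between actual and predicted input
        r += rate * weight * error          # bottom-up error signal updates the representation
    return r, sensory_input - weight * r


estimate, remaining_error = predictive_coding(4.0)
print(estimate, remaining_error)
```

As the representation improves, the error signal shrinks toward zero, so less and less needs to be sent up the hierarchy.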
How do we group features into objects?
By the principles of proximity, closure, similarity, connection and common fate.
What do the Gestalt principles refer to?
Proximity: circles placed right next to each other are seen as lines (made of circles).
Common fate: when features move together, we see them as one object.
Similarity: similar features are combined into an object, for example a group of red circles within a square of blue ones.
Connection: features that are connected together are perceived as one object.
Closure: when we look at a complex arrangement of visual elements, we tend to look for a single, recognizable pattern. In other words, when you see an image that has missing parts, your brain will fill in the blanks and make a complete image so you can still recognize the pattern. For example, a circle drawn with broken lines is still perceived as a circle.
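The proximity principle can be sketched as a simple grouping rule: elements that are linked by a chain of close neighbours end up in the same group. The function name `group_by_proximity` and the `max_gap` threshold are hypothetical, chosen only for this illustration; this is not a claim about how the visual system computes grouping.

```python
import math


def group_by_proximity(points, max_gap):
    """Group 2D points: two points share a group if a chain of neighbours within max_gap links them."""
    groups = []
    unvisited = set(range(len(points)))
    while unvisited:
        stack = [unvisited.pop()]  # start a new group from any remaining point
        group = []
        while stack:
            i = stack.pop()
            group.append(i)
            near = [j for j in unvisited if math.dist(points[i], points[j]) <= max_gap]
            for j in near:
                unvisited.remove(j)
                stack.append(j)
        groups.append(sorted(group))
    return sorted(groups)


# Two rows of closely spaced circles, with the rows far apart from each other.
circles = [(0, 0), (1, 0), (2, 0), (0, 5), (1, 5), (2, 5)]
print(group_by_proximity(circles, max_gap=1.5))  # → [[0, 1, 2], [3, 4, 5]]
```

Each row comes out as one group, which mirrors seeing two lines made of circles rather than six separate circles.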
Role of V1 in vision?
Detection of edges and their orientation – early stage of object processing
Direction selectivity – early stage of motion processing
Binocular cells – early stage of depth perception
Lesion – no conscious vision
Late (>100 ms) signal processing in V1 is sensitive to the global organization
of a scene due to context-dependent feedback from higher-order areas
(V4, IT, or MT), which modulates the V1 response.
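Edge and orientation detection can be illustrated with a crude stand-in for orientation-tuned cells: one detector that sums brightness changes across columns (vertical edges) and one that sums changes across rows (horizontal edges). This is only a sketch under simplifying assumptions. The function name `edge_responses` is hypothetical, and models of V1 simple cells more commonly use Gabor filters rather than plain differences.

```python
def edge_responses(image):
    """Return (vertical_edge_energy, horizontal_edge_energy) for a 2D list of brightness values."""
    rows, cols = len(image), len(image[0])
    # Brightness changes between horizontal neighbours signal vertical edges.
    vertical = sum(abs(image[r][c + 1] - image[r][c])
                   for r in range(rows) for c in range(cols - 1))
    # Brightness changes between vertical neighbours signal horizontal edges.
    horizontal = sum(abs(image[r + 1][c] - image[r][c])
                     for r in range(rows - 1) for c in range(cols))
    return vertical, horizontal


# Dark left half, bright right half: a single vertical edge.
vertical_edge_image = [[0, 0, 1, 1] for _ in range(4)]
print(edge_responses(vertical_edge_image))  # → (4, 0)
```

Only the detector whose preferred orientation matches the edge responds, which is the basic idea behind orientation selectivity.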
What contributes to contour saliency?
Factors that contribute to contour
saliency include the number of contour elements
(compare the first and second frames), the spacing of
the elements (third frame), and the smoothness of the
contour (bottom frame). When the spacing between
contour elements is too large or the orientation
difference between them too great, one must search
the image to find the contour.
How does contour integration in V1 reflect Gestalt principles?
Contour integration reflects the perceptual rules of
proximity and good continuation. Each of the four
images here has a straight line in the center, and all
four lines have the same oblique orientation. In some
images the line pops out more or less immediately,
without searching.
What is V2?
Visual area V2, or secondary visual cortex, also called prestriate cortex,[24] is the second major area in the visual cortex, and the first region within the visual association area. It receives strong feedforward connections from V1 (direct and via the pulvinar) and sends strong connections to V3, V4, and V5. It also sends strong feedback connections to V1.
In terms of anatomy, V2 has many properties in common with V1: Cells are tuned to simple properties such as orientation, spatial frequency, and color. The responses of many V2 neurons are also modulated by more complex properties, such as the orientation of illusory contours,[25][26] binocular disparity,[27] and whether the stimulus is part of the figure or the ground.[28][29]
Figure-ground segregation: how is it enabled by the perception of illusory contours?
There is a strong response to the real rectangle,
a response that is still quite strong to the illusory rectangle,
and no response to the separate parts of the same image.
What is that for?
It is very helpful in segregating figure from
ground, especially in natural images.
An item against the ground is basically
a highly correlated representation against one with very low correlation.
Take this chair against that wall: the
chair has large flat surfaces,
so there are a lot of correlations among its individual features.
How do illusory contours further help us in perception?
Detection of illusory contours allows us to detect and perceive objects and shapes that are partially occluded in natural environments.
What does V4 do?
Detects shapes. Different cells have different shape preferences irrespective of position in the receptive field (THIS IS CALLED POSITION INVARIANCE). Receptive fields are usually bigger in V4 neurons compared with V2. **Integrates local cues into a global, object-based shape representation.**
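Position invariance can be illustrated with a toy unit that matches its preferred shape at every position in its receptive field and keeps only the best match. This is a sketch under simplifying assumptions (a 1D "image", a hypothetical `invariant_response` function, and a max over positions); it is not taken from the lecture and is not a claim about how V4 neurons actually compute their responses.

```python
def invariant_response(signal, template):
    """Best template-match score over every position (a crude position-invariant unit)."""
    n = len(template)
    scores = [sum(s * t for s, t in zip(signal[i:i + n], template))
              for i in range(len(signal) - n + 1)]
    return max(scores)


preferred_shape = [1, 2, 1]
shape_on_left = [1, 2, 1, 0, 0, 0]
shape_on_right = [0, 0, 0, 1, 2, 1]
other_shape = [2, 0, 2, 0, 0, 0]

print(invariant_response(shape_on_left, preferred_shape))   # → 6
print(invariant_response(shape_on_right, preferred_shape))  # → 6
print(invariant_response(other_shape, preferred_shape))     # → 4
```

The response depends on which shape is present, not on where it sits in the receptive field.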
What would a V4 lesion result in?
V4 lesions in primates lead to severe disruptions of object discrimination.
How are objects detected by LOC?
V4 → LOC & IT (LOC possibly = TE in monkeys)
* Lateral Occipital Complex – representation of complex shapes. Results from human fMRI: participants were presented with real-life objects, degraded images and textures; LOC responded selectively to objects, both familiar and unfamiliar, and shows size invariance (Malach et al., PNAS 1995).
* LOC demonstrates form-cue invariance (an apple will be an apple regardless of colour and how it is presented: cartoon, real life, etc.).
An object recognition system should be insensitive to the precise physic