High level perception - Object perception Flashcards
what are the principles of the gestalt psychologists?
“The whole is more than the sum of the parts” —> There are grouping principles of perceptual organisation
what are the five grouping principles?
similarity
proximity
closure
good continuation
common fate
closure
Area bounded by closure is seen as a separate object - closing and area of space
good continuation
Good Continuation - continuity of lines to see them together
common fate
when they move in the same direction and belonging together
Figure ground
Figure ground - border creates two separate objects - like the vase with two faces
what type of model is Marrs model of recognition, what does it aim to explain ?
This is a computation model - processing and different stages
Attempts to explain how we recognise objects in the visual field.
what are the stages of Marrs model of recognition?
primal sketch
21/2 D sketch
3-d Model
primal sketch
What happens: Marr’s model begins with the detection of edges in the visual scene. These edges represent changes in intensity (light and dark) and are key for identifying boundaries of objects.
Why it’s important: Edges are the first step in understanding the basic structure of the objects in our environment. These edges are processed using edge filters, which detect contours and shape.
2.5 D
What happens: After detecting edges, the brain groups them into larger structures or objects. This is where Gestalt principles (such as proximity, similarity, and continuity) come into play, as the visual system organizes the edges into coherent shapes and surfaces. It begins to form a 2.5D sketch, which includes the shape, depth, and orientation of the object relative to the observer.
Why it’s important: This step is about organizing and simplifying the visual input into meaningful chunks that help us perceive objects in three-dimensional space.
3D model
What happens: In this stage, the visual information is combined and abstracted to create a more detailed, 3D representation of the object. The model is not specific to any one viewpoint but provides a flexible structure that can be recognized from different perspectives.
Why it’s important: This 3D representation allows for recognition from any angle or orientation, creating a more robust model for object recognition.
how do we recognise an object according to Marrs model?
What happens: Finally, the model of the object is compared to stored memory models in long-term memory (such as from previous experiences or learned knowledge). This comparison allows us to recognize and categorize the object.
Why it’s important: We match the newly processed object model with existing models stored in memory, which helps us quickly identify and understand the object.
principle axis
biggest cylinder in shape
limitation of Marrs
Problem - many objects are hard to recognise upside down
strengths of Marrs
Marr’s model predicts that:
Visibility of the principal axis is crucial for recognition. If it’s hidden, recognition is harder.
Once a 3D model is built using the axis, different orientations of the object are equally easy to recognize (viewpoint-invariant).
🔍 This explains why we struggle to recognize objects when their core structure is obscured, and why we can still recognize familiar objects from unusual angles.
Biederman Recognition by components
- Edge extraction
Surface characteristics: luminance, texture, colour - Detect arrangement of edges:
Curvature, parallel, co-terminating, symmetry, co-linear,
Such arrangements do not alter with view - Segment object into components (parts)
Detect concave parts - Determine GEON type for each component (part)
36 GEONS needed
Geon
Geon is a shape defined by whether the axis is straight or curved- the cross section (angular or smooth or symmetric)
strength of Biederman
Model properties - Need only few geons to describe most objects (3 geons = 150 million possible objects) Recognition is viewpoint invariant
limitations of Biederman
Problems - Does not differentiate objects within class (how do you tell the difference between your mug and your friends mug) , Does not use surface pattern, Recognition is viewpoint invariant - evidence suggests otherwise - view sometimes matter on how you see an object (Tarr & Bulthoff, 1995)
visual processing in v1 goes where
goes in two directions - temporal lobe and parietal lobe
when monkeys had a lesion to the temporal lobe did they struggle with pointing out the new object (one they had not been shown before) or did they struggle with reaching the food near the object?
They cannot work out which object is the new object
when monkeys had a lesion in the parietal lobe what did they struggle with?
They couldn’t reach for the food near the object
where is damage in the brain for people with object agnosia ?
temporal lobe
what are the symptoms of object agnosia?
No loss of intelligence
Failure to recognize objects
No simple visual impairment
Can see edges but cannot put them together
May draw object OK but not recognize drawing