Week 5-High-Level Perception Flashcards
Define Object Recognition
The ability to know what an object is (and process sensory information using our rod and cone cells) – it involves identifying the shape of
the object (despite changes in sensory input) and retrieving information from long-term memory about the object (e.g., its function, size, colour etc.). Object recognition takes around 200ms (very rapid).
Why is Object Recognition Important?
- Neuropsychology - Brain injury / damage – affecting object and face recognition (hard to understand without research=hard to implement treatments and intervention techniques for specific brain regions and deficits).
- Computational modelling / machine learning / robotics (can embody into different technological systems which can benefit us).
Define Object Constancy
The ability to recognise objects across variation in sensory input caused by changes in light (shadow), scale (size), viewpoint and occlusion (causing a different imprint on the retina). Essential for object / face recognition and high-level vision
Object Recognition: What is the problem of shadow?
- When there is shadow it is
challenging to identify the edges that define the object shape (using the primary visual cortex i.e., V1). - The visual system must work to figure out which edges belong to the object and which belong to the shadow.
- Our rod and cone cells can distinguish information based on lighting allowing us to extract information about edges
What other problems are there in Object Recognition?
- The problem of the variations in scale (size) – object size changes on
the retina depending on how far away they are e.g., closer to retina =larger (is there a notion of size in our long-term memory and representations? must have some stored information on what the size of an object is typically) - Object recognition is also difficult because of variations in spatial location– object position on the retina changes as objects move about.
- The problem of occlusion
caused by scene clutter – foreground objects partially
occlude background objects (note: we can still recognise objects despite incomplete sensory input! i.e., the brain has to reconstruct the missing information)
What are the Gestalt Laws of Perceptual Organisation?
These are principles that describe how the visual system can use ‘bottom-up’ processes (driven by sensory-input) to group image features into shapes to form a whole (i.e., what the brain has to overcome for successful object recognition)
-Kurt Koffka (1886-1941)
-Wolfgang Kohler (1887-1967)
-Max Wertheimer (1880-1943)
What are the 6 Gestalt Laws?
- Law of Similarity (Items that are similar to each other are grouped together)
- Law of Pragnanz (Reality should be transformed or reduced into the simplest form)
- Law of Proximity (The objects that are close to each other are grouped together)
- Law of Continuity (How our brain experiences visual line of elements that are grouped together)
- Law of Closure (The tendency of our mind to perceive incomplete shapes as a whole figure)
- The Law of Common Region (items within a boundary are perceived as a group and assumed to share some common characteristic or functionality).
What does the perception of shape involve?
Both bottom-up (for our sensory information) and top-down (previous knowledge on things) knowledge.
-NOTE! The brain uses the principles of perceptual organisation to help us make sense of sensory input
Where does Perceptual Organisation happen in the brain? Anatomical basis for high-level vision
-The VENTRAL pathway is associated with object recognition. Also associated with the parvocellular cells which rapidly respond to high spatial frequency (i.e., detailed images). Damage can vary e.g., face recognition problems but not object recognition problems
-Some of the main regions of the brain have been linked to the visual perception of different types of objects: LOC = Lateral Occipital Complex (objects).
-Other views of the brain regions associated with object recognition.
LO/pFs = Lateral Occipal Complex; FFA = Fusiform Face Area; PPA = Parahippocampal Place Area (around the hippocampus-activates when shown scenes)
-The best way to recognise things is through specialisation in brain regions according to the brain
What shape information is used to represent objects?
-Focuses on low-level image features such as edges and vertices (intersections between edges) to provide important shape information.
-We need to be able to reconstruct the shape of things.
What is Attneave’s (1954) ‘Sleeping Cat’?
- A simple demonstration that we can derive a lot of useful information about shape from the edge information alone.
- Also suggested we can recognise things from very limited information
- “Line drawing of a sleeping cat can still be identified when the smoothly curved contours are replaced by straight-line segments”
But….What is this object? (Biederman, 1987)
-The image on the right is easier to identify despite containing less edge contour. This shows that VERTICES are important, not just edges (i.e., edges aren’t enough for recognising an object vertices gives us more information about shape).
What evidence is there for edges and vertices being important sources of information for shape recognition? (Biederman, 1987)
-He presented objects with different amounts of edge contour (mid-segments) and vertices deleted, and measured recognition accuracy.
-The results showed that deletion of vertices affects recognition more than deletion of other edge contour (i.e., made more errors).
-Shows vertices are more important.
What other kind of features can describe shape?
- Surfaces (gives information about texture, patterns etc.,)
- Volumetric parts (individual parts of an object rather than a whole)
-Study note: These shape features are sometimes called ‘primitives’ which means elementary ‘units’ of shape. A volumetric primitive is one that has 3D volume (e.g., a volumetric part). E.g., we could say a table has five volumetric parts (four legs and a top)
-Low-level image features such as edges and vertices (intersections
between edges) provide important shape information.
What is one of the Structural Description Models of Object Recognition? (Biederman, 1987)
-In Biederman’s (1987) model 3D objects are represented and recognised using basic volumetric parts (primitives) known as ‘geons’ (like drawing art).
-We have 36 geons which we can mix and match to create a variety of abstract shapes
-Study note: You can think of geons as basic shapes like cylinders, bricks, pyramids etc., that you could use to ‘build’ more complex objects (like ‘Lego’ bricks!).
Goal of visual system - identify 3D parts that comprise the object