Object recognition Flashcards
The object recognition problem
Isolated neurons in V1 cannot draw conclusions by themselves on the nature of a whole object
Template Theory
visual system recognizes objects by matching the neural representation of the image with an internal representation of the same “shape” in the brain
Many templates for different object positions
- flexibility is limited because you cannot store a template for every object
General recognition theory (Ashby)
Categorization based on multivariate signal detection theory
- Categories are defined by probabilistic distributions, and categorization is based on decision boundaries that separate perceptual regions.
- If 2 objects have very similar features, they will be harder to distinguish. If 2 objects do not overlap in features, it is easier to detect differences and distinguish them.
Generalized Context Model (GCM) (Nosofsky)
Examplar based similarity : comparing new object to stored examples
Whatever exemplar is more similar to new object = what this new object is
GCM (Nosofsky) : categorizing faces
You store many specific faces you’ve seen before. When seeing a new face, you compare it to stored examples and assign the category based on similarity to past faces.
GRT (Ashby) : categorizing faces
rely on perceptual dimensions (e.g., face shape, jaw width, eye size) and make a decision based on statistical boundaries between the two categories.
Recognition by components (Irving Biederman)
We recognize objects using an alphabet of shapes, or geons (geometric ions) that together combined can form any given object
- 36 geons, that can make up any object
Limitations : does not handle object variability, very crude
Grandmother cell theory
- Suggests we have a single neuron for every single concept in our world
- Extreme version of localized representation in the brain (like template theory)
Jennifer Aniston cell?
- A single responded solely when presented with a picture of Jennifer Aniston
- If that cell died, the person would still probably recognize her
Deep Neural Network (DNN)
- Multilayer neural networks capable of being trained to recognise objects.
- Numerous instances of an object are shown to the network, with feedback
- Over time, the network learns to recognize label new instances of the object that it has never been explicitly trained on (generalization over the training set)
Deep Neural Networks (DNNs) rivalled the representational performance of the … in monkeys on an object recognition task.
inferior temporal cortex (IT)
Retinal Ganglion Cells & LGN detect _____ (localized contrast)
Spots
Primary Visual Cortex (V1) detects …
Edges and bars (orientation selectivity)
Role of Intermediate-level vision (V2, V3, V4, etc)
Grouping features into contours, textures, and surfaces.
Role of high-level vision (IT cortex)
Recognizing complex shapes, objects, and categories.
Object recognition is not just about simple features but about _______ processing across multiple visual areas.
Hierarchical
The receptive fields of _______ cells respond to visual properties crucial for object perception.
Extrastriate
Boundary Ownership
For a given edge or contour, V2 extrastriate neurons determine which side belongs to the object and which side belongs to the background—a fundamental process in figure-ground segregation.
Intermediate (mid) level vision
Loosely defined stage of visual processing after low-level feature extraction (e.g., edges, contrast) and before high-level object recognition and scene understanding.
Key Functions of intermediate vision
- Perception of edges and surfaces
- Determines which regions of an image should be grouped into objects
- Bridges low-level feature detection and high-level object recognition
Primary visual cortex (V1) neurons have ____ receptive fields that detect local edges and contrast.
Small
orientation-selective v1 neurons
Respond to edges at specific angles
Computerized edge detectors are not as effective as humans in detecting …
meaningful edges
Why do computers miss edges that humans easily perceive ?
Computers rely purely on local contrast and intensity differences.