Object Recognition Flashcards
What is object recognition for humans?
Perception of familiar items
What is object recognition for computers?
Perception of familiar patterns
Between category discriminations
e.g. Is this a chair, is it not a chair?
Is this object recognition for humans or computers?
Computers
Within category discriminations
e.g. Whose face is this?
Is this object recognition for humans or computers?
Humans
Why is object recognition difficult?
Environment contains hundreds of overlapping objects
Yet perceptual experience is of structured coherent objects which we can recognise, use and usually name
True or False?
Apparent size and shape of an object does not change despite large variations in retinal image
True
Apparent ____ and ____ of an object does not change despite large variations in retinal image
- size
- shape
What are 5 examples of variability in object recognition?
- Translation invariance
- Rotation invariance
- Size invariance
- Colour
- Partial occlusion
and presence of other objects
Define translation invariance
Recognising an object when it is not in the same location as before
Recognising an object when it is not in the same location as before
This is known as…?
Translation invariance
Define rotation invariance
Recognising an object when it is not in the same rotation as before
Recognising an object when it is not in the same rotation as before
This is known as…?
Rotation invariance
Define size invariance
Recognising an object when it is not the same size as before
Recognising an object when it is not the same size as before
This is known as…?
Size invariance
Define colour (variability)
Recognising an object when it is not the same colour as before
Recognising an object when it is not the same colour as before
This is known as…?
Colour (variability)
Define partial occlusion
and presence of other objects
Where an object is blocked off partially by some other object/space
Where an object is blocked off partially by some other object/space
This is known as…?
Partial occlusion
and presence of other objects
What is intra-class variation?
Image variations occur between different images of one class
e.g. These objects are all chairs but they all look so different. (different shapes, structure, colour, height)
Image variations occur between different images of one class
e.g. These objects are all chairs but they all look so different. (different shapes, structure, colour, height)
This is known as…?
Intra-class variation
What are the 2 challenges of object recognition?
- Intra-class variation
- Viewpoint variation
What is viewpoint variation?
Being able to recognise an object even from different viewpoints
Being able to recognise an object even from different viewpoints
This is known as…?
Viewpoint variation
e.g. Recognising that the object is a coffee pot from the top view (looks like a square with a circle in the middle and a rectangle at the side)
This is an example of…?
Viewpoint variation
How do we recognise the letter ‘A’ according to template theories?
When you see your letter A, you match it to what you have stored in your LTM memory (mini copy or template in LTM of all known patterns) and when you get a good enough match, you can recognise that as a letter A
simply = Multiple templates are held in memory, we compare stimuli to templates in memory for one with the greatest overlap until a match is found
When you see your letter A, you match it to what you have stored in your LTM memory (mini copy or template in LTM of all known patterns) and when you get a good enough match, you can recognise that as a letter A
This is known as…?
Template theories on 2D Pattern matching
How do we engage in 2D Pattern matching?
List 4 points
- Template theories
- Prototype theories
- Feature theories
- Structural descriptions
What are the 2 implications of template theories?
- Does not consider normalisation
- How many mini copies or templates of the object do we store in the LTM? Is it infinite?
Give 2 examples of template theory 2D pattern matching
- Barcodes
- Fingerprints
Multiple templates are held in memory, we compare stimuli to templates in memory for one with the greatest overlap until a match is found
This is known as…?
Template theories on 2D Pattern matching
What are the 3 limitations of template theories?
- Problem of imperfect matches
- Cannot account for the flexibility of pattern recognition system
- Comparison requires identical orientation, size, position of template to stimuli
How do we recognise the letter ‘A’ according to prototype theories?
Rather than storing lots of different templates, we have an average template stored in memory. When you see your letter A, you match it to the average template (prototype) stored in your memory.
Simply = Modification of template matching (flexible templates), Possesses the average of each individual characteristic, No match is perfect; a criterion for matching is needed
Rather than storing lots of different templates, we have an average template stored in memory. When you see your letter A, you match it to the average template (prototype) stored in your memory.
This is known as…?
Prototype theories on 2D Pattern matching
Modification of template matching (flexible templates), Possesses the average of each individual characteristic, No match is perfect; a criterion for matching is needed
This is known as…?
Prototype theories on 2D Pattern matching
Describe Franks & Bransford’s (1971) evidence for prototype theory
List 4 points
- Presented objects based on prototypes
- Prototype not shown
- Yet participants are confident they have seen the prototype
- Suggests the existence of prototypes
Simply = By seeing all these examples that all surround this prototype, you think as though you have seen the prototype even though you have not seen it at all
Which 2D pattern matching theory does this apply to?
Pattern consists of a set of features or attributes
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
c. Feature theories
Which 2D pattern matching theory does this apply to?
Modification of template matching (flexible templates)
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
b. Prototype theories
Which 2D pattern matching theory does this apply to?
Possesses the average of each individual characteristic
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
b. Prototype theories
Which 2D pattern matching theory does this apply to?
Mini copy or template in LTM of all known patterns
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
Which 2D pattern matching theory does this apply to?
Multiple templates are held in memory
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
Which 2D pattern matching theory does this apply to?
No match is perfect; a criterion for matching is needed
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
b. Prototype theories
Which 2D pattern matching theory does this apply to?
Compare stimuli to templates in memory for one with greatest overlap until a match is found
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
Which 2D pattern matching theory does this apply to?
Problem of imperfect matches
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
b. Prototype theories
Which 2D pattern matching theory does this apply to?
Cannot account for the flexibility of pattern recognition system
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
Which 2D pattern matching theory does this apply to?
Comparison requires identical orientation, size, position of template to stimuli
a. Template theories
b. Prototype theories
c. Feature theories
d. Structural descriptions
a. Template theories
How do we recognise the letter ‘A’ according to feature theories?
- Pattern consists of a set of features or attributes
- A = 2 straight lines and a connecting crossbar.
- But also need to know the relationship between features
- Pattern consists of a set of features or attributes
- A = 2 straight lines and a connecting crossbar.
- But also need to know the relationship between features
This is known as…?
Feature theories on 2D pattern matching
How do we recognise the letter ‘A’ according to structural descriptions?
- Describe the nature of the components of a configuration and the structural arrangement of these parts” (Bruce & Green, 1990)
- Capital letter T = 2 parts; 1 horizontal; 1 vertical; vertical supports horizontal; vertical bisects horizontal
- Describe the nature of the components of a configuration and the structural arrangement of these parts” (Bruce & Green, 1990)
- Capital letter T = 2 parts; 1 horizontal; 1 vertical; vertical supports horizontal; vertical bisects horizontal
This is known as…?
Structural descriptions on 2D pattern matching
How do we recognise 3D objects?
List 2 steps
- Firstly we must interpret the input to the visual system as coherent structures, segregated from one another and from the background (early image processing)
simply = Must know how to segregate an object as being a distinct one object that goes together and is distinct from the background
- Must be processed to give a description– which can then be matched to the descriptions of visual objects stored in memory.
What does Marr’s Computational Approach aim to explain?
3D Object recognition
3D Object recognition can be explained by which approach?
Marr’s Computational Approach
What are the 3 steps to Marr’s Computational Approach
- Primal Sketch
- 2 1⁄2-D Sketch
- 3-D Representation
What is a primal sketch in Marr’s Computational Approach?
2-D description includes changes in light intensity, edges, contours, blobs
What is a 2 1/2 D Sketch in Marr’s Computational Approach?
Includes information about depth, motion, shading. Representation is observer-centered.
What is a 3D representation in Marr’s Computational Approach?
A representation of objects and their relationships, observer-independent
2-D description includes changes in light intensity, edges, contours, blobs
This is known as…?
Primal sketch
Includes information about depth, motion, shading. Representation is observer-centered.
This is known as…?
2 1/2 D Sketch
A representation of objects and their relationships, observer-independent
This is known as…?
3D representation
What are the 4 questions asked when recognising objects?
- What elements are used in the description? (primitives)
- How is the relationship between these elements specified?
- How is the overall description invariant across views?
simply = How can you recognise objects from different viewpoints? How is it that we can create a description that works from different viewpoints
- What about viewpoint dependence?
simply = Does it depend on the task that you are doing?
What did Marr & Nishihara (1978) express about objects in relation to cylinders?
Every object can be broken down into a series of cylinders and we must specify the relationship between cylinders to create a structural description of the object
Every object can be broken down into a series of cylinders and we must specify the relationship between cylinders to create a structural description of the object
Who proposed this?
Marr & Nishihara (1978)
The relationship between these cylinders creates a structural description of the object
This works particularly well what kinds of objects?
Biological type objects
e.g. human body
Marr & Nishihara (1978) expressed structural relations by …?
A hierarchical organisation of cylinders
According to Marr & Nishihara (1978), structural relations are based on a hierarchical organisation of cylinders
What does this mean?
Each cylinder has an axis and the way in which others are joined is expressed as coordinates
The position of each cylinder is described relative to its own axis, resulting in a description which is invariant across viewpoints.
This is useful to determine what types of objects?
Argued to be useful in determining who is a human or gorilla, depending on the length of the arms, etc.
The position of each cylinder is described relative to its own axis, resulting in a description which is invariant across viewpoints.
This is not useful to determine what types of objects?
Other objects that are not biological/bodily types of objects
e.g. a piece of paper cannot be broken down into cylinders
Works well for biological objects but not other objects
Which model does this apply to?
Marr & Nishihara’s (1978) cylinder model
Who provided an alternative model to Marr & Nishihara (1978)?
Biederman (1987;1989)
Which theory proposes that objects are composed of basic shapes?
Biederman’s Recognition-by-components theory
What does Biederman’s Recognition-by-components theory propose about objects?
Objects composed of basic shapes
According to Biederman’s Recognition-by-components theory, objects are composed of basic shapes known as…?
GEONS
What are GEONS?
Geometrical ions
Basic shapes that make up an object
Give 4 examples of GEONS
- Blocks
- Cylinders
- Arcs
- Wedges
- Blocks
- Cylinders
- Arcs
- Wedges
These are examples of…?
GEONS
Approximately how many different volumetric shapes (GEONS) are there based on Biederman’s Recognition-by-components theory?
36
Which model/theory is viewpoint invariant?
Biederman’s Recognition-by-components theory
Which model/theory proposes that objects are made up of cylinders?
Marr & Nishihara (1978)
Which model/theory proposes that objects are made up of GEONS?
Biederman’s Recognition-by-components theory
Which model/theory proposes that there are small numbers of structural relationships?
Biederman’s Recognition-by-components theory
Biederman’s Recognition-by-components theory claims that there is a small number of structural relationship
Give 4 examples
- Relative size
- Verticality
- Centring
- Relative size of surfaces at join
Viewpoint-invariant theory of recognition is known as…?
Biederman’s Recognition-by-components theory
Recognition using 3D component parts (e.g. 36 GEONS), with structural relations between the parts
Which model/theory proposes this?
Biederman’s Recognition-by-components theory
Define viewpoint-invariant
The object can be identified from any angle