3 - Object Recognition Flashcards
What is Object Recognition for humans
For humans: perception of familiar items.
is perception of objects is different for humans than for computers
yes
what is object recognition for computers
For computers: perception of familiar patterns.
why is object recognition difficult
- Environment contains hundreds of overlapping objects.
- Yet perceptual experience is of structured, coherent objects which we can recognise, use and usually name.
• Apparent size & shape of an object does not change despite large variations in retinal
image.
give 6 examples of variability
translation invariance rotation invariance size invariance colour partical occlusion presence of other objects
what is a intra-class variation in object recognition
intra-class variation e.g. recognise chair - variation as to what a chair looks like
what is view point variation in object recognition
viewpoint variation
different viewpoints
recognition from unusual views
what is template theory an example of
2D pattern matching
what happens in template theory
mini copy or template in LTM of all known patterns normalisation? numerous templates? real life examples: bar codes, fingerprints compare against templates stored in LTM
see something new, determine what it looks like match with stored template
whats the problem with template theory
– Problem of imperfect matches
– Cannot account for the flexibility of pattern recognition system
– Comparison requires identical orientation, size, position of template to stimuli
looseness in argument
more compex - tricky
how mant templates? - fonts capitals small letters - where do we stop
what is template theory
– Multiple templates are held in memory
– Compare stimuli to templates in memory for one with greatest overlap until a match is found
what is prototype theory part of
2d pattern matching
what is prototype theory
- Modification of template matching (flexible templates)
- Possesses the average of each individual characteristic
- No match is perfect; a criterion for matching is needed
what is the evidence for prototype
Franks & Bransford (1971) – Presented objects based on prototypes – Prototype not shown – Yet participants are confident they had seen prototype – Suggests existence of prototypes
what is feature theory a part of
2d pattern matching
what is feature theory
- pattern consists of a set of features or attributes.
- A = 2 straight lines & connecting cross bar.
- But also need to know A
a relationship between features.
/ \ - = A??
define a letter a by features nd attributes that make a letter a
what is structural description an example of
2d pattern matching
what is structural description
“..describe the nature of the components of a configuration and the structural arrangement
of these parts” (Bruce & Green, 1990)
- Capital letter T = 2 parts; 1 horizontal ; 1 vertical; vertical supports horizontal; vertical bisects horizontal.
know features and relationship between fetrues
what are consistuent parts and how are they organised
what is 3D object recognition
Similar but more complex process………
• Firstly must interpret input to the visual system as coherent structures, segregated
from one another & from background. (Early image processing)
determine one object - create structural description - match to familiar obejcts strored in LTM
• Must be processed to give a description – which can then be matched to the
descriptions of visual objects stored in memory.
what happens in marrs computational approach
primal sketch
2 1/2 d sketch
3-d observation
what is a primal sketch
2-D description includes
changes in light intensity, edges, contours, blobs
what happens in 2 1/2 D sketch
Includes information about depth, motion, shading. Representation is observer-centered
what happens in 3-D representation
a representation of objects and their relationships
observer - independent
what are the 4 questions in object recognition
(1) What elements are used in the description? (primitives) - components features geons
(2) How is the relationship between these elements specified?
(3) How is the overall description invariant across views? recognise from different views
(4) What about viewpoint dependence? better at recognising in view points we usually see