L3 Object Recognition Flashcards
Translational invariance
able to recognise objects when they are moved to a different location
Size invariance
able to recognise objects when they have a different retinal image size
Colour invariance
CAN STILL RECOGNISE OBJECTS WHEN ANOTHER COLOUR IS LAYERED OVER THE TOP OF THEM
Rotation invariance
recognise when objects are rotated
occlusion
recognise when objects are occluded
variability in visual scenes
recognise objects when they are surrounded by or occluded by other objects
intraclass variation
recognise them as belonging to a category despite variance from template (chairs all differ)
viewpoint variation
we recognise things from all viewpoints
template hypothesis:
2D pattern matching - we store tmeplates of everything we’ve seen which we call on for recognition
- many many templates for each object
- one template with a degree of flexibility (barcodes)
template hypothesis problems
cant account for degree of flexibility - we’d need objects to be the same size and orientation
prototype hypothesis
2D pattern matching – came from the template hypothesis. –we combine all the images of something we’ve seen to form a prototype/prototypical characteristics of an object.
prototype evidence
when shows a load of images Ps are sure they’ve seen the prototype before
Feature theory
patterns consists of attributes (A=2 lines and a bar) but was too simplistic
Structural description theory
2D pattern matching –evolution of the feature theory. – The features and a description of how they are structurally related (T = horizontal and vertical line. Vertical supports and bisects horizontal)
Marrs and Nishihara
3D object recognition – objects are all composed of a hierarchy of cylinders (that can vary in size and shape) – the cylinder position is always the same relative to its axis –works much better for biological objects
Biedermans Geon hypothesis
(invariant/viewpoint independent)
36 different geons which can be used to describe any shape. consider their relative size, verticality, centring and size of surface at joints. to determine their form and the non accidental properties to map their 3D shape
non accidental properties
geon hypothesis – the 2D rep of the NAs will translate directly into the 3D rep to give an idea of how the object looks in space. They include the curvature, parallel, cotermination, symettry and colinearity (points on a straight line) of features.
support for geon
deleting parts of the objects where geons meet makes them harder to recognise than deleting curved regions
Biederman geon good points:
flexible parsimonious and recognises importance of part recognition
Biedermans geon bad points
- why 36?
- de emphasise importance of context
- simplifies contribution of viewpoint dependence
- structural info not always enough (peach vs. nectarine)
- within category discrimination
viewpoint dependent theories
these suggest we have stored templates for each view point of the object — they were designed to coexist with independent theories. genereally beleived we use ID for between category and D for within
issues to overcome in object recognition
- binding problem – how do other modalities help us to recognise
- when presented with many objects how do we figure out which geon belongs where
Beyond recognition original:
structural description (round) –> semantic (crunchy) –> name representation (apple) –> name
cascade model:
there is cross talk between each stage of recognition explaining why the similarity leads to higher confusability – it is supported by anecdotal and empirical evidence
Apperceptive agnosia
–LATERAL -early process problem –can recognise COMPONENTS of the object but cannot put them together in a meaningful representation of the object –Bad at segmentation, degraded image tasks and have difficulty recognising objects
Associative agnosia
-Bi lateral –late process –can structurally describe object/visually process it fully but cannot retrieve semantic representation so cannot name it. perform well on segmentation and copying tasks. no perceptual problems.
category specific agnosia
bilateral temporal lobe damage – have issues with animate objects only
why category specific agnosia?
- different ways of recognising (functional vs. sensory)
- kinaesthetic representations evoked by inanimate objects
- living things are more similar than non living things (lion line drawings, 20ms)
AGNOSICS
fail to feel familiarity with the object