high level vision Flashcards
why is object recognition hard?
because objects with similar functions come in different shapes and sizes and are viewed from different perspectives
what is representation?
the ability to match object to what we have stored in memory
what are the steps of perceptual organisation?
- identify edges and uniform regions
- group figure regions together
- fill in missing edges and surfaces
what does bierdman’s recognition by components theory suggest?
- restricted the set of 3D volumes to represent part shape - geons
- geons are viewpoint invariant (can see lines and vertices despite rotations in depth)
- objects are broken into geons
- object representations are assemblies of geons and their spatial relations
what is the support for RBC?
removal of contours defining concavities affect object recognition
what is the alternative to RBC?
we may just do it as they appear from a particular view and we have a stored representation with unfamiliar views needing transformation
what is recognition best for?
- learnt viewpoints
- viewed within 40 degrees of trained viewpoints
- could also interpolate between learned views
how does visual hierarchy impact representations?
representations of shape in V4 is richer than V1 but inferno is richer in V1 than V4 as receptive field size increases and neurons become more selective
so how do we recognise objects?
no convincing theories on the debate between structural descriptions and view based representations
are cell’s grandmother?
no, probability of finding one cells that responds to one object is very small
how many cells respond to seeing one person?
each cells probably responds to 50-150 people
what are the advantages of sparse and invariant coding?
- carries a large amount of info
- energy efficient
- allow a large storage capacity
- tolerance to degradations
what are deep neural networks?
multi level networks that can be trained to recognise objects
what can deep neural networks do overtime?
recognise new instances of the object that it has never been trained on when shown many instances of an object with feedback
what is visual agnosia?
impairment in deriving the meaning of a visually presented stimulus