lecture 2- object perception Flashcards

1
Q

how are the same object at different perspective processed?

A
  • recognise objects as same
  • but visual info coming into retina is differet
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is meant by gestalt psychology?

A
  • ‘grouping principles’ of perceptual organisation
  • group things in mind depending on different criteria
  • parts of image seen as belonging together (these parts are likely to arise from the same object)
  • ‘whole is more than the sum of parts’
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the gestalt grouping principles

A
  1. similarity (e.g in luminance, shape, colour)
  2. proximity (tend to see groups where closer together)
  3. closure (closing areas of space into e.g a square)
  4. good continuation
  5. common fate (discs moving together at same speed and direction vs. still ones)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is a figure-ground?

A
  • area bounded by contour (closure) is seen as a separate object
  • contours seen as belonging to one object at a time (can only see one option at a time, not simultaneously)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is Marr’s model of recognition?

A
  1. primal sketch of what objects look like- 2D rep of luminance (enables us to detect edges and contours)
  2. 2 1/2 D sketch- description of depth, orientation, shading, texture, motion, binocular disparity- (viewpoint dependent)
  3. 3D model- description of 3D shape of object (view point invariant)
  • analyse image with range of edge filters (hubel weisel cells in visual cortex)
  • use gestalt grouping (continuity) to find outline
  • segment outline at concavities (dips), from there identify principal axis of sections
  • define arrangments of parts (cylinders): start with biggest (principal axis) and work progressively through smaller
    NOTE: visibility (angle) of principal axis is important
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

explain Biederman’s ‘recognition by components’ computational model

A
  1. edge extraction- surface characteristics: luminance, texture, colour
  2. detect arrangment of edges: curvature, parallel, co-terminating, symmetry (don’t alter with viewpoint)
  3. segment object into components: detect concave parts
  4. determine GEON type for each component: 36 needed- any combo of these create any object (most only need few)
  5. determine arrangment of GEONs and match GEON description to memory
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are the problems with Biederman’s model?

A
  • does not differentiate objects within class (telling difference between different faces/mugs)
  • does not use surface pattern
  • recognition is viewpoint invariant (can recognise regardless of viewing angle) BUT can matter when see object from
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what are the two object processing pathways? (Ungerleider & Mishkin 1982)

A
  1. ‘what’- bilateral removing of inferior temporal area (TE) = severe impairment in object discrimination, but can grab correct option well
  2. ‘where’- bilateral removal of posterior parietal cortex = impairment in landmark discrimination, but could do novel object discrimination
    - DOUBLE DISSOCIATION
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are the symptoms of object agnosia and which pathway does it affect?

A
  • ventral visual pathway
  • no loss of intelligence
    failure to recognise objects
  • no simple visual impairment
  • can see edges but cannot put them together
  • may draw object ok but not recognise drawing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the case study for object agnosia?

A

DF- CO poisoning, Milner’s posting task
- can: accurate guidance of hand and fingers toward object (can reach/grasp/pick up)
- can’t: object size/shape/orientation (when asked to indicate size with fingers/match angle of object with slot when holding infront of it)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the ‘titchener circles’ illusion? (Aglioti 1995)

A
  • target circle size is influenced by surrounding array of circles
    2 options:
  • discs physically the same (but look different)
  • discs perceptually the same (but actually different)
  • perception of disc size strongly affected by illusion BUT grip aperture correct despite it
  • shows object analysis for action separate from conscious perception of object
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what are the symptoms of optic ataxia and which pathway does it affect?

A
  • dorsal visual pathway
  • difficulty completing visually-guided reaching tasks
  • difficulty reaching in the right direction
    difficulty positioning fingers correctly towards an object
  • little relationship between grip aperture and object size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are views on the purpose of vision?

A
  • early models: to construct an internal model of reality (outside = reality, trying to create accurate version of outside in head)- foundation for all visually derived thought and action
  • now: more focus on ‘requirements of vision’ (Goodale & Milner 1992)- considering what visual info is doing for us (instead of making neural representation- modularity based upon what ‘uses’ vision can have)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what are the requirements of vision?

A
  1. vision for perception (identification of object- ‘object-centered’)
  2. vision for action (how we can change that thing in space- ‘viewer-centered’)
  • require different transformations of visual signals
  • need to encode size, orientation and location of objects relative to others
  • start with this frame of reference- needs a perceptual representation of objects that transcends particular viewpoints (while preserving info about spatial relationships)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is the function of the LOC?

A
  • lateral occipital cortex
  • how we recognise the same object in more than one location => identity representation
  • location-tolerant object info and object-tolerant location info
  • doesn’t matter where object is, can atill recognise
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is the single cell representation of objects?

A
  • Hubel & Wiesel found hierarchical processing in visual cortex
  • simple cell to complex cell to hyper-complex cell (which has n-stopping property)
  • pattern-processing in inferotemporal cortex
17
Q

what object features are the different regions of the temporal lob responsible for?

A

V1: edges
V2: contours
V4: colour & shape
PIT: simple features
AIT: elaborate features

18
Q

what is the cell selectivity for pattern processing in the inferotemporal cortex?

A
  • cells respond to code shape, colour and texture
  • respond to all objects with these properties
  • generalise across position
  • organised in columns through cortex surface (e.g columns of object-selective cells in AIT)
  • posterior region cells more orientation- and size-specific (general), anterior region more responsive to specific objects
19
Q

what are the steps in Riesenhuber & Poggio’s 2002 hierarchical model of object recognition?

A
  1. input image, first processed by simple cells (different cells for different areas of visual space) => different orientations in different space coded for
  2. complex cells provide input into hypercomplex and V2 cells
    NOTE: two options for simple to complex cells
  3. MAX operation (complex cell will respond to which simple cell responds most)
  4. pooling (weighted sum of info, adding composites together)
20
Q

what are the properties of the hierarchical model of object recognition?

A
  • anatomically and physiologically plausible- based on known connections and properties of brain cells (from V1 to IT)
  • based on earlier hierarchical models
  • copes with viewpoint dependence and independence
  • incorporates theories of learning
  • copes with multiple objects and objects in different context
21
Q

what is the role of context in object recognition?

A
  • context can be normal or abnormal
  • wihtin scene -> recognition of objects is easier in correct context
  • within object-> word context biases interpretation
  • word superiority effect- detecting a letter is easier when in a word
22
Q

explain bottom-up processing of letter recognition

A
  • provided with stimulus, detected by low-level feature detectors that have features similar to letter
  • these low-level detector cells connected to mid-level pattern detectors which have a nice representation (more like the letter)
  • mid-level excites the correct high-level object detector and inhibits the incorrect ones
23
Q

explain top-down influence on letter recognition

A
  • we have memorised concepts in temporal lobe (e.g what has a tail? rat and cat-yes, mat- no)
  • excites possible highs, inhibits ones that don’t match concepts
  • excites suitable mids
  • anatomy- more connections descend than ascend
24
Q

what happens when we combine bottom-up processing and top-down influence in letter recognition?

A
  • bidirectional processing models: info flow is bottom-up and top-down
  • expectations (of what something might be) lower threshold for likely items (allows to detect better)
25
Q

what is the role of temporal context and object permanence in expections?

A
  • we expect an object moved behind a screen to reappear with the same form when screen removed
  • Bower: >6 months olds surprised if object gone if screen removed (expects object to be there)
  • indicates at this age, they have permanence of own toy (know objects exist)
26
Q

how does similarity factor into object representation?

A
  • similarity underlies how objects are represented and organised (objects are represented in brain in terms of similarity)
  • therefore guiding classification, naming, behaviour
  • similar objects = similar neural representation
27
Q

what makes objects similar/dissimilar? explain the experiement used here

A

Cichy 2019- investigated link between brain acitvity and different object properties
- groups: asked to judge similarity of shape, function, colour, background or freely without instruction
- asked to put similar objects close to each other within circle depending on group condition (close together = similar)
- compared perceived similarity (free vs. other to see which closest)
- ‘free’ group similar to ‘function’ group

28
Q

what do we find when we compare psychological similarity matrixes against similarity matrixes from different brain voxels?

A
  • lower perceptual distance = more similar
  • perceived similarity of objects related to ventral visual cortex activity
  • representations emerge within 200ms- object colour first (earlier in hierarchy), then shape, then background and free-arrangement in ventral temporal cortex
29
Q

how might AI perform object recognition and what do we need it to do?

A
  • should provide us with accurate and meaningful information
  • it understanding context/place can be used to help determine objects
  • usual method is supervised learning- train network on known objects
30
Q

what are the two requirements for supervised learning?

A
  1. algorithm must be suited to task
  2. must have training dataset with appropriate ‘coverage’ (led to rise of multi-million datasets of objects)
31
Q

what is a convolutional neural network (CNN)?

A
  • a deep-learning algorithm that learns features directly through data
  • (for objects) CNN + 1.2 million object database + 2.5 million scene database ~ human performance
  • take image and convolve into different layers that represent edges
32
Q

what do deep neural networks (DNNs) do?

A
  • predict behavioural assessment of perceived similarity
  • show spatial invariance: can extract relevant features, handle noisy/cluttered images and image classification/object detection/semantic segmentation
33
Q

what do you need in the AI to enable to carry out CNN/DNNs?

A
  • large computers
  • that do not ‘over-fit’ (trying to detect patterns when there aren’t any)
  • layers to be interpretable
  • ability to handle irregular structures