High Level Perception (P&C) Flashcards

1
Q

What are problems in object recognition

A

Image changes with distance, position, lighting, occlusion of parts etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Gestalt Psychology

A

The whole is more than the sum of parts

‘Grouping principles’ of perceptual organisation (we group things in our mind based on the following criteria)

  1. Similarity
  2. Proximity
  3. Closure
  4. Good continuation
  5. Common fate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define figure ground

A
  • area bounded by contour (closure) is seen as separate object
  • contours seen as belonging to one object at a time (we can either see two faces or a vase, but never at the same time)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define problems behind Gestalt theories of object recognition:

A
  1. No mechanisms – only a description of phenomena
  2. Speculative neurophysiology is rubbish
  3. Made assumptions that perceptual grouping was independent of perceptual learning – no evidence and wrong anyway.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the stages Marr’s model of recognition

A
  1. Primal sketch: 2 dimensional representation of luminance - edges, countours (raw primal sketch → full primal sketch)
  2. 2 1/2 D sketch: description of depth and orientation, shading, texture, motion and binocular disparity (viewpoint dependent)
  3. 3D model: description of 3D shape of objects (viewpoint invariant)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does Marr’s model work

A

→ Analyse image with range of edge filters

→ Use Gestalt grouping - such as continuity finds outline

→ Segment outline at concavities

→ Define arrangement of parts (cylinders)

  • start with biggest cylinder (principal axis)
  • work through progressively smaller cylinders

→ Match description of parts to 3D models in memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Marr’s model predict?

A

Marr’s model predicts that the visibility of principal axis is important and different orientations equally easy to recognise

→ BUT, many objects are hard to recognise if upside down or if the principle axis is tilted towards the viewer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Describe Biederman’s Recognition by Components

A
  1. Edge extraction: surface characteristics: luminance, texture and colour
  2. Detect arrangement of edges: curvature, parallel, co-terminating, symmetry, co-linear, such arrangements do not alter with view
  3. Segment object into components (parts): detect concave parts
  4. Determine GEON type for each component (part): 36 GEONS needed to have every possible version of objects in the entire world

→ determine arrangement of GEONS (above/below, bigger/smaller) and match GEON description to memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Object processing pathways (in the monkey brain)

A

Parietal lobe and temporal lobe have diff functions for our perception of objects

  • monkey presented with two objects and made to touch the new objects in multiple trials (novel object detection) → temporal lobe
  • bowl with lid and food inside and an object closer to the bowl with food, monkey can discriminate between objects and can locate where different objects are → parietal lobe
  • then produced lesions in different bits of cortex, lesions to temporal lobe, they cannot work out with object is new but the can do the landmark recognition task
  • monkeys without the parietal lobe could not do the landmark task but could work out the novel object detection
  • double dissociation
  • damage to temporal stops ability to locate objects in space but spared object location and detection
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describe object agnosia

A
  • damage to this area of cortex leads to difficulties in recognising objects (ventral visual processing)
  • they can see the object (like a carrot) but cannot put the parts of it together and recognise what the object is
  • symptoms: no loss of intelligence, failure to recognise objects, no simple visual impairment, can see edges but cannot put them together, may draw object OK but not recognise drawing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe Patient DF case study

A

carbon monoxide poisoning (object agnosia)

→ tested on her ability to see and report size, shape and orietnation of objects BUT, she could guide her hand and finger towards the object altough she could not tell you what it is or the size of it

→ 2 tasks she was tested with

  • Matching: putting a disc through its specific shape, DF could not match the angle of the disc with the angle of the slot infront of her
  • Posting: if you asked her to just post the disc through the slot she could do it (can’t describe the angle but can act on the object)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe Titchener circles illusion

A
  • Target circles size is influences by surrounding array
    1. Disks physically the same
    2. Disks perceptually the same
    the circle surrounding smaller circles seems larger than the circle surrounded by large circles but they are the same size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define optic ataxia

A

(in dorsal visual processing)

→ difficulty completing visually guided reaching tasks

→ difficulty reaching in right direction

→ difficulty positioning fingers correctly towards and object

→ little relationship between grip aperture and object size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe in what and where pathways

A

Early models assume purpose of vision is to construct an internal model of reality - foundation for all visually derived thought and action

More of a focus on requirements of vision - modularity based upon to waht uses vision can be put

Dorsal (where) stream and ventral (what stream)

  1. Identification of object - object centered (shape/size/colour)
  2. Action of an object - viewer centered (shape/surface)

Vision for perception and vision for action

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we recognise the same object seen in more than one location?

A
  • Lateral Occipital Cortex (LOC) = identity representation

What is LOC role in object location? → location tolerant object information and object tolerant location information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Hierarchical processing of visual information: Hubel & Weisel’s findings in visual cortex

A

Simple → complex → hyper complex
Down the temporal lobe:

V1: edges

V2: contours

V4: colour and shape

PIT: simple features

AIT: elaborate features

17
Q

Patterns processing in Inferotemporal cortex

A

Cell selectivity:

→ code shape (e.g. star shape) and colour and texture

→ respond to all objects with these properties

→ generalise accross position

→ organised in columns though cortex surface

→ posterior cells are orientation and size specific, anterior cells are less sensitive to orientation and size

so it doesnt matter what angle the shape is or how big it is, the visual aspect of the stimulus become less and less important going down the temporal lobe because the general understanding of the object is most important

18
Q

Hierarchical model of object recognition

A

(Riesenhuber & Poggio 1999) view based module

Input image - bent paper clip

Simple cells respond to different orientations

→ Some V1 cells respond to the simple lines

V1 provide input to complex cells and these input in hypercomplex cells
___
→ Anatomically and psychologically plausible - based upon knwon connections and properties of brain cells from V1 to IT
→ based on earlier hierarchical models

→ Copes with viewpoint dependence and viewpoing independence

→ Incorporates theories of learning

→ Copes with multiple objects and objects in different contexts

19
Q

Object recognition and context

A

context can help recognise objects that you’d otherwise find difficult to understand

20
Q

Bottom up processing model

A
  • bottom up processing mode (from the bottom): low lvl feature detectors -> mid level pattern detectors -> high lvl object detectors
21
Q

Top down influence model

A

from the top
memorised concepts -> high lvl object detectors -> mid lvl pattern detectors

22
Q

Top down and bottom up processing

A

Information flow is bottom up and top down
expectations of what something might be lower threshold for likely items

23
Q

Ascending and Descending information

A

Anatomy - more connections descend than ascend

→ 10 times more information going back down the visual system from temporal to LGN

Eye -> LGN -> V1 -> V2 -> V4 -> Temporal cortex

24
Q

Context within scene and objects

A
  • Within scene - recognition is easier with correct context
  • Within objects - word biases interpretation
25
Q

Word superiority effect

A

detecting a letter is easier when in a word

26
Q

Expectations: object permanence

A
  • We expect an object moved behind a screen to reappear with the same form when screen removed
  • Infants search for hidden objects as they understand they still exist

→ an object becomes hidden behind a screen and when the screen is moved >6 months infants surprised if the object is gone

→ they expect the object to still be there

27
Q

Object representation

A

Similarity underlies how objects are represented

  • similarity guides classification, naming and behaviour
  • What makes objects similar or dissimilar
  • If two objects are similar = neural representation similar
28
Q

What did Chichy et al investigate

A

investigated the link between brain activity and different object properties

→ judged similarity of shape, function, colour, background and freely without instructions

→ important to characterise the dimensions that relate psychological and neural representations of objects

29
Q

Describe the findings for object representation research

A

Perceived similarity of objects related to ventral visual cortex activity

→ Representations emerge within 200ms: object colour first and earlier in hierarchy, object shape next and later

→ Shape, background, free arangement (same place and time) in ventral temporal cortex

Object representations distributed and based upon similarity

30
Q

How might AI perform object recognition

A

→ AI should provide us with information that is accurate and meaningful

→ usually: supervised learning - you train the network on known objects

  1. Algorithm must be suited to task
  2. Must have a training dataset with appropriate coverage (led to the rise of multi million datasets)

→ convulational neural networks + 1.2M object database + 2.5M scene databse ~ human performance