Week 3 Lecture 3 - object recognition Flashcards

1
Q

What is object recognition?

A
  • perception of objects is different for humans and computers

for humans –> perception of familiar items

for computers –> perception of familiar patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is object recognition difficult?

A
  • environment contains hundreds of overlapping objects
  • objects have variability e.g., translation invariance, rotation invariance, size invariance, colour
  • objects can vary in the visual scene e.g., partial occlusions and presence of other objects
  • intra-class variation –> same object has different forms e.g., different types of chairs
  • only part of object may be visible
  • viewpoint variation –> may be more difficult to recognise an object from 1 viewpoint compared to another
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the theories of 2D pattern matching?

A
  • template theories
  • prototype theories
  • feature theories
  • structural descriptions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are template theories?

A
  • mini copy/template in LTM of all known patterns
  • compare stimuli to templates in memory for one with greatest overlap until a match is found
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some problems with template theories?

A
  • imperfect matches
  • cannot account for the flexibility of pattern recognition system
  • comparison requires identical orientation, size, position of template to stimuli
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are prototype theories?

A
  • modification of template matching (flexible templates)
  • possesses the average of each individual characteristic
  • no match is perfect –> criterion is needed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is some supporting evidence for prototype theories?

A

Franks and Bransford
- presented objects based on prototypes
- prototype not shown
- ppts confident they had seen prototype
- suggests existence of prototype

however difficult to conclude how you would come up with a prototype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are feature theories?

A
  • pattern consists of a set of features or attributes
  • e.g., letter A = 2 straight lines and connecting bar
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an issue with feature theories?

A
  • also need to know the relationship between features
  • e.g., / \ - = A?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are structural descriptions?

A
  • describe the nature of the components and the structural arrangement of these parts
  • composition of object and how they are related together
  • e.g., Capital letter T = 2 parts; 1 horizontal; 1 vertical; vertical support horizontal; vertical bisects horizontal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is 3D object recognition?

A
  • must interpret input to the visual system as coherent structures, segregated from one another and from background
  • must be processed to give a description which can then be matched to the descriptions of visual objects stored in memory
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What was Marr and Nishihara’s theory of 3D object recognition?

A
  • objects are comprised of cylinders
  • must specify relationship between cylinders (structural description)
  • expressed structural relations by a hierarchal organisation of cylinders
  • each cylinder has an axis and way in which other are joined are expressed as coordinates
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an issue with Marr and Nishihara’s theory of 3D object recognition?

A

difficult to think about how to break down ALL objects into a series of cylinders

may work better for biological entities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What was Biederman’s theory of 3D object recognition?

A
  • provided alternative to Marr and Nishihara’s theory of 3D object recognition

Recognition by components theory:
- objects comprised of basic shapes
- GEONS –> geometrical ions
- approx 36 different shapes
- viewpoint invariant theory
- relationship between geons can be described structurally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are some examples of the structural relationships set out by Biederman

A
  • relative size
  • verticality
  • centring
  • relative size of surfaces at join
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What did Biederman believe the concave parts of objects were helpful in?

A

segmenting visual image into parts

17
Q

Biederman specified Geons in terms of “non-accidental” properties

What were they?

A
  • curvature –> points on a curve
  • parallel –> sets of points in parallel
  • co-termination –> edges terminating in a common point
  • symmetry –> vs. asymmetry
  • co-linearity –> points in a straight line
18
Q

What does “non-accidental” mean?

A

regularities in the visual image thought to reflect actual regularities in the world

2D symmetry in the visual image indicates symmetry in 3D object
Present in both 2D and 3D views

19
Q

According to Biederman, what should form of degradation do?

A
  • disrupt the basis for identifying geons
  • make object more difficult to recognise
  • deletion of component affects matching stage –> reducing the number off components to match to
  • midsegment deletion makes it more difficult to determine components
20
Q

What study did Biederman conduct to show the effects of degradation?

A
  • deleted edges at points where easily reinstated or difficult to determine
  • 25, 45 or 65 % of contours removed
  • slow and inaccurate at “non-recognisable” but relatively good at “recognisable”
  • at brief exposures –> partial objects better recognised
  • at longer exposures –> midsegment deletion led to less errors
21
Q

What is some further support of Biederman’s viewpoint invariant theory?

A
  • Vogels, Biederman, Bar & Lorincz (2001) found some cortical neurons in monkeys sensitive to geons.
  • Assessed response of individual neurons in the inferior temporal cortex to change in geon or change in size of object.
  • Some neurons responded more to geon changes, providing support for geons.
22
Q

What is an evaluation of Biederman’s model?

A
  • flexible & comprehensive system for describing objects. But why 36 geons?
  • Experimental results consistent with model but doesn’t provide critical test.
  • Doesn’t explain how description are matched to those stored
23
Q

What are some advantages and disadvantages of Biederman’s theory?

A

advantages:
- Recognizes the importance of the arrangement of the parts
- Parsimonious: Small set of primitive shapes

disadvantages
- Structure is not always key to recognition: Peach vs. Nectarine –> texture necessary here
- Within category discrimination (which chair?)
- De-emphasise the role played by context in object recognition (affects later stages of object recognition)
- Simplifies the contribution of viewpoint-dependence

24
Q

What do viewpoint dependent theories assume?

A

assume changes in viewpoint reduce the speed and / or accuracy of object recognition

25
Q

What are viewpoint dependent theories?

A

object representations are collections of views that depict the appearance of objects from specific viewpoints

26
Q

What is viewpoint dependent more important for?

A

within category discrimination

27
Q

What does evidence suggest about viewpoint invariant and dependent mechanisms?

A

Evidence suggests that viewpoint invariant mechanisms used sometimes in object recognition whereas viewpoint dependent mechanisms used at other times

28
Q

What are the main differences in use between viewpoint dependent and viewpoint invariant?

A

Viewpoint dependent = complex within category decisions
Viewpoint invariant = easy categorical decisions

29
Q

What is Humphrey’s cascade model?

A
  • structural, semantic and name stages interact.
    -both within and between stages.

Makes different predictions about how subjects will perform in object naming task. Problems at one stage will have ‘knock on’ effect….

e.g. Patient JB. Naming visually confusable objects (birds, animals) had knock on effects, making it more difficult to identify their category.

30
Q

What is agnosia?

A

failure of knowledge or recognition

31
Q

What is visual agnosia?

A
  • feature processing and memory remain intact
  • recognition deficits are limited to the visual modality
  • Alertness, attention, intelligence and language are unaffected

Other sensory modalities (touch, smell) may substitute for vision in allowing objects to be recognized

32
Q

What is apperceptive agnosia?

A

problems with early processing (shape extraction)

  • Perceptual deficit, affects visual representations directly
  • components of visual percept are picked up, but can’t be integrated
  • effects may be graded
  • often affected: unusual views of objects
33
Q

What is associative agnosia?

A
  • problems with later processing (recognition)
  • Visual representations are intact, but cannot be accessed or used in recognition - Lack of information about the percept

“Normal percepts stripped of their meaning”