Lecture 3, Object Recognition Flashcards

1
Q

What is object recognition?

A

Perception of objects is different for humans and computers.
For humans –> perception of familiar items
For computers –> perception of familiar patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is object recognition difficult?

A
  • Environment contains hundreds of overlapping objects
  • Objects have variability, e.g., translation invariance, rotation invariance, size invariance, colour
  • Objects can vary in the visual scene, e.g., partial occlusions and presence of other objects
  • Intra-class variation -> same object has different forms, e.g., different types of chairs
  • Only part of object may be visible
  • Viewpoint variation -> may be more difficult to recognise an object from one viewpoint to another
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are theories of 2D pattern matching?

A

Template theories, prototype theories, feature theories, structural descriptors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are template theories?

A

Mini copy/template in LTM of all known patterns - compare stimuli to templates in memory for one with greatest overlap until a match is found

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some problems with template theories?

A
  • Imperfect matches
  • Cannot account for the flexibility of pattern recognition system
  • Comparison requires identical orientation, size, position of template to stimuli
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are prototype theories?

A

Modification of template matching (flexible templates) - possesses the average of each individual characteristic. No match is perfect –> criterion is missing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Is there any supporting evidence for prototype theories?

A

Documented by Franks and Bransford:
–> Presented objects based on prototypes
–> Prototype not shown
–> Ppts confident they had seen prototype
–> Suggests evidence of prototype
HOWEVER, difficult to conclude how you would come up with a prototype

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are feature theories?

A

Pattern consists a set of features or attributes, e.g., letter A = 2 straight lines and connecting bar. However, also need to know the relationship between features. e.g., / \ - = A?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are structural descriptions?

A

Describe the nature of the components and the structural arrangement of these parts. Composition of object and how they are related together. e.g., Capital letter T = 2 parts; 1 horizontal, 1 vertical; vertical support horizontal; vertical bisects horizontal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is 3D object recognition?

A

Must interpret input to the visual system as coherent structures, segregated from one another and from background. Must be processed to give a description which can then be matched to the descriptors of visual objects stored in memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Marr & Nishihara’s theory of 3D object recognition?

A

Objects are comprised of cylinders; must specify relationship between cylinders (structural description). Expressed structural relationships by a hierarchical organisation of cylinders. Each cylinder has an axis and way in which others are joined are expressed as coordinates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the limitations of Marr and Nishihara’s theory of 3D object recognition?

A

–> Difficult to think about how to break down all objects into a series of cylinders
–> May work better for biological entities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Bierderman’s theory of 3D object recognition?

A

Provided an alternative to Marr & Nishihara’s theory.
- Objects comprised of basic shapes
- GEONS -> Geometrical ions
- Approx 36 different shapes
- Viewpoint invariant theory
- Relationship between geons can be described structurally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are some examples of the structural relationships set out by Biederman?

A

Relative size, verticality, centring, relative size of surfaces at join.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What did Biederman believe concave parts of objects were helpful for?

A

Segmenting visual image into parts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Biederman specified GEONS in terms of “non-accidental” properties, what were they?

A
  • Curvature: points on a curve
  • Parallel: sets of points in parallel
  • Co-termination: edges terminating in a common point
  • Symmetry: vs. asymmetry
  • Co-linearity: points in a straight line
17
Q

What does “non-accidental” mean?

A

Regularities in the visual image thought to reflect regularities in the world.
2D symmetry in the visual image indicates symmetry in 3D object -> present in both 2D & 3D views

18
Q

What did Biederman say about form degradation?

A

It should disrupt the basis for identifying geons; make object more difficult to recognise; deletion of component affects matching stage, reducing the number of components to match to.

19
Q

Midsegment deletion makes it more difficult to what?

A

Determine components

20
Q

Describe Biederman’s study that documented effects of degradation?

A

-> Deleted edges at points where easily reinstated or difficult to determine
-> 25, 45, or 65% of contours removed
-> Slow & inaccurate at “non-recognisable” but relatively good at “recognisable”
-> At brief exposures -> partial objects are better recognised
-> At longer exposures -> midsegment deletion led to less errors

21
Q

What did Vogels, Biederman, Bar & Lorincz, 2001, find in monkeys sensitive to geons?

A
  • Cortical neurons.
    -> Assessed response of individual neurons in the inferior temporal cortex to change in geon or change in size of object.
    -> Some neurons responded more to geon changes, providing support for geons
22
Q

What are the evaluations of Biederman’s model?

A
  • Flexible and comprehensive system for describing objects, but why 36 geons?
  • Experimental results consistent with model but doesn’t provide critical test
  • Doesn’t explain how description are matched to those stored
23
Q

What are the advantages of Biederman’s theory?

A
  • Recognises importance of the arrangement of the parts
  • Parsimonious: small set of primitive shape
24
Q

What are the disadvantages of Biederman’s theory?

A
  • Structure is not always key to recognition, e.g., Peach vs. Nectarine -> texture is necessary
    -> Within category discrimination
    -> De-emphasise the role played by context in object recognition (affects later stages of object recognition)
    -> Simplifies the contribution of viewpoint-dependence
25
Q

What do viewpoint-dependent theories assume?

A

Changes in viewpoint reduce the speed and/or accuracy of object recognition. Object representations are collections of views that depicit the appearance of objects from specific viewpoints.

26
Q

What is viewpoint dependent more important for?

A

Within-category discrimination

27
Q

What does evidence suggest concerning viewpoint invariant mechanisms vs viewpoint dependent mechanisms?

A

Viewpoint dependent = complex within category decisions
Viewpoint invariant = easy categorical decisions

28
Q

What is Humphrey’s Cascade Model?

A

Structural, semantic and name stages - both within and between stages.
-> Makes different predictions about how subjects will perform in object naming task.
–> Problems at one stage will have a knock on effect.

e.g., Patient JB - naming visually confusable words (birds, animals) had knock-on effects, making it difficult to identify their category

29
Q

What is agnosia?

A

Failure of knowledge or recognition

30
Q

What is visual agnosia?

A

-> Feature processing and memory remain intact
–> Recognition deficits are limited to visual modality
—> Alertness, attention, intelligence and language are all unaffected
—-> Other sensory modalities (touch, smell) may substitute for vision in allowing objects to be recognised

31
Q

What is apperceptive agnosia?

A

Problems with early processing (shape extraction)
-> Perceptual deficit, affects visual representations directly
–> Components of visual percept are picked up, but can’t be integrated
—> Effects may be graded
—-> Often affected: unusual views of objects

32
Q

What is associative agnosia?

A

Problems with later processing (recognition). Visual representations are intact, but cannot be accessed or used in recognition - lack of information about the percept. “Normal percepts stripped of their meaning.”