Object Recognition Flashcards

1
Q

2 pathways of the brain

A

ventral and dorsal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is object agnosia

A

intact primary visual functions, verbal descriptions and logical reasoning but immediate object recognition is lost

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

which stream is damaged in object agnosia

A

ventral

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What kind of approach is Marr’s model of object recognition/vision

A

computational

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does Marr’s model of vision suggest

A
  1. vision is an information processing task
  2. we have to understand the nature of the task
  3. we have to understand how it can be accomplished
  4. we need to propose algorithms and mechanisms that can accomplish the visual task
  5. explanations of visual experience and visual physiology should come from an understanding of the implementation of those algorithms
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Different models of object recognition used

A

template-matching model
feature-detector models
structural-description models
view-dependent models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

template-matching model

A

match the exact pattern of light on the retina with previously remembered patterns.
Neuron makes connections with pixels and when the specific pixels are detected the neuron fires

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

problems with template-matching model

A
  1. very constrained - can’t cope with diversity of natural object variation
  2. doesn’t explain how we detect variances in images
  3. there would be too many template detectors
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what model solves the problems of the template-matching model

A

feature-detector model: looks at specific features in the image rather than pixels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Who put forward the idea of ‘feature-detectors’?

A

Hubel and Wiesel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what do feature-detector models suggest?

A

we can detect specific combinations of features rather than specific patterns
the features don’t have to exactly match a pre-existing template, it just needs to contain the same sub-features that

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what are demons in the feature-detector model

A

subroutines

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

who out forward the structural description model

A

Marr & Nishihara (1978)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What did Marr & Nishihara (1978) suggest the goal of vision is in the structural description model

A

to describe the object unambiguously in its core geometric components

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Marr & Nishihara’s critera for good representation of high level vision

A
  1. Accessibility
  2. scope
  3. uniqueness
  4. stability
  5. sensitivity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what did Marr & Nishihara (1978) suggest the primitives (basic units of information in representation) are

A

objects are described in terms of axes and the volumes around them
the description is hierarchical
this means the object can be described at many scales

17
Q

according to Marr & Nishihara (1978) how is information organised into an abject description?

A

recognition:

by finding the closest model in the model store and specifying its parameters

18
Q

What did Biederman (1987) suggest for structural description models

A

similar to Marr & Nishihara (1978) but proposed a specific “alphabet” of primitive volumes callled Geons which object are decomposed

19
Q

what properties are geons similar to

A

any 2D projection

20
Q

how many geons are there estimated to be

A

less that 36

21
Q

evidence for structural models

A

intersections of geons are particularly informative in object recognition e.g. it is hard to tell what an object is when straight lines or corners are removed

22
Q

what are IT cells

A

higher visual neurons in IT area

23
Q

Pros of structural description models

A

explain invariance well
recognition is description not matching
evidence that structural information matters to both humans and neurons

24
Q

cones of structural description models

A

extracting model parameters can be hard in real mages
structural description is difficult for some objects e.g. campfires
driven by theoretical desirability rather than behavioural evidence

25
Q

what does the the view-dependent model suggest

A

that the arbitrary image has a brute association with the object

26
Q

what are the primitives in the view-dependent model?

A

sub-regions of the image

consisting of lines, curves, textures, colour, shading etc

27
Q

what kind of model is the view-based model

A

simple feet forward activation

28
Q

evidence for view-baed models

A

human object recognition is not perfectly viewpoint invariant
Monkey IT neurons selective for particular viewpoints
simulations show good invariance and similar errors in human observers

29
Q

pros of view-dependent model

A

straightforward
newer models based directly on what we know of physiology
abstract feature units are recombines
good evidence

30
Q

cons of view-based models

A

humans often show good generalisation across viewpoints even for novel objects
more memory intensive than other models e.g. geon models
no understanding of underlying relationships