object recognition Flashcards

1
Q

Inverse projection problem:

A

The same object can project different images on the retina

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Viewpoint invariance:

A

Our ability to recognize an object from any viewpoint

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

middle vision:

A

Is a loosely defined stage of visual processing between basic feature extraction and object recognition and scene understanding

Identification of edges and surfaces

Grouping of different regions of an image into object

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Illusory contours:

A

We can perceive contours even though nothing changes from one side to the other … so can cats

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Gestalt grouping rules :

A

Describe when elements in an image will appear to group together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Good continuation is:

A

A Gestalt rule stating that two elements will tend to group together if they lie on the same contour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Good continuation can be detected by neurons with:

A

aligned receptive fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Middle vision rules:

A

common properties allow us to group parts of an image together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Gestalt rules:

A

good continuation, similarity, proximity, parallelism, symmetry, meaningfulness, and familiarity are examples of Gestalt grouping rules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

dynamic grouping properties (2):

A

o elements that share a common fate (move together) tend to group together
o elements that are synchronised tend to group together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do we use the features that we have extracted from a scene to apply the rules?

A

one metaphor for how our brain does this is that it’s like committees coming to a consensus decision
different and sometimes competing principles are involved and our perception reflects the consensus that emerges

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

committee rules

A

respect physics and avoid accidents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

an ambiguous figure is a visual stimulus that permits two or more possible interpretations of its identity or structure. In the case of ambiguity, our perceptual committees

A

Tend to obey the laws of physics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

an accidental viewpoint is a position that produces some regularity in the visual image not present in the world
our committees assume that viewpoints :

A

are not accidental

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The Bayesian approach allows us;

A

To calculate the probability of a particular hypothesis (interpretation) given an observation (stimulus)
P(H|O)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Bayesian techniques provide:

A

A formal way to model the perceptual decision making process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

ground =

A

background

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

figure =

A

object of interest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How do we decide what parts of an image belongs to the (back)ground and what parts belong to the figure (object of interest)?

A

gestalt principles:

size: smaller regions are likely to be seen as figure

symmetry symmetrical regions are likely to be seen as figure

parralelism: regions with parralel contours are likely to be seen as figure

meaningfulness: if we recognize a shape it is more likely to be seen as figure

extremal edges: if edges are shaded such that they seem to recede into the distance, they tend to be seen as figure

relative motion (depth): if one region moves in front of another, the closer region is figure

surroundedness: a surrounding region (border) is likely to be ground

20
Q

goals of middle vision

A

o Bring together that which should be brought together
o Split asunder that which should be split asunder
o Use what you know
o Avoid accidents
o Seek consensus and avoid ambiguity

21
Q

figure-ground assignment can be explained by

A

Gestalt Principles

22
Q
  • meaningful shapes are likely to be recognised as
A

“figure”

23
Q

Naïve template theory:

A

The proposal that the visual system recognizes objects by matching the neural representation of the image with a stored representation of the same “shape” in the brain

24
Q

Structural description:

A

A description of an object in terms of the nature of its constituent parts and the relationships between those parts

25
Q

the problem with receptive fields:

A

the problem with templates is that we would need a different template for every size, orientation, and style of the same “thing”
that’s a lot of templates

26
Q

recognition by components

A

-Biederman (1987) proposed that we recognise objects by the identities and relationships of their component parts

-their component parts are constructed from a finite set of geometric icons, or geons

-we can create a wide variety of objects from geons

-and this theory would allow us to perceive objects with viewpoint invariance – ie the description of the object isn’t affected by the angle we are looking from

27
Q

problems with structural descriptions (recognition by components)

A

these descriptions can be too broad – how to we distinguish between individual cups or suitcases?
Geons aren’t always the best primitives to describe things
our ability to recognise objects isn’t completely viewpoint invariant
the farther an object is rotated away from a learned view, the longer it takes to recognise

28
Q

A possible solution to the problems with structural descriptions

A

perhaps we have different object recognition processes that depend on the category level

entry-level category : the label that first comes to mind when we see an object

subordinate-level category : a more specific term for the object

superordinate-level category: a more general term for the object

29
Q

__components can perfectly explain visual object recognition

A
  • neither simple template matching nor recognition by components can perfectly explain visual object recognition
30
Q

*we take longer to recognise objects

A

that are rotated away from familiar viewpoints

31
Q

Occlusions (i.e., objects hiding other objects):

A

are common in the environment.

32
Q

By area V4, cells are interested in stimuli such as

A

fans, spirals, and pinwheels

33
Q

FFA: Fusiform Face Area. Responds
to

A

faces more than other objects

34
Q

PPA: Parahippocampal Place Area.
Responds preferentially to

A

places, such as pictures of houses

35
Q

EBA: Extrastriate Body Area.
Specifically involved in

A

the perception of body parts

36
Q

MT: Middle Temporal area.
Specialized for:

A

motion processing

37
Q

Figure–ground assignment:

A

The process of determining that some regions of an image belong to a foreground object (figure) and other regions are part of the background (ground)

38
Q

Reliability:

A

The degree to which two line segments appear to be part of the same contour

39
Q

Global superiority effect:

A

The properties of the whole object take precedence over the properties of parts of the object

40
Q

Nonaccidental feature:

A

A feature of an object that is not dependent on the exact (or accidental) viewing position of the observer

41
Q

T junctions:

A

indicate occlusion. Top of T
is in front and stem of T is in back

42
Q

Y junctions:

A

Indicate corners facing the
observer

43
Q

Arrow junctions:

A

: Indicate corners facing
away from the observer

44
Q

Prosopagnosia:

A

An inability to recognize faces

45
Q

Agnosia

A

A failure to recognize objects despite being able to see them

46
Q

Double dissociation:

A

When one perceptual function can be
damaged without affecting the other