object recognition Flashcards

1
Q

what is object recognition

A

perception of familiar items (memory)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

why is object recognition so difficult

A

environments contain hundreds of overlapping objects

yet perceptual experience is of structured, coherent object which we can recognise, use and usually name

apparent size and shape of object does not change despite large variations in retinal image
-our understanding of the object doesn’t change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

examples of object variability

A

translation invariance
rotation invariance
size invariance (could be bigger/smaller or closer/further)
colour
partial occlusion and presence of other objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

intra-class variation

A

all chairs vary widely, but we still know that they are all chairs

we can recognise them when only part of an object is visible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

viewpoint variation

A

we can recognise an object from many different view points

objects may be easier to recognise from some view points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

template theory

A

mini copy or template in long term memory of all known patterns

multiple templates are held in memory

compare stimuli to templates in memory for one with greatest overlap until a match is found

good match to template = recognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

questions about template theory

A

normalisation?
numerous templates?
what type of template?
how does this work for complex patterns?
how good a match is good enough?
what if an object has no template match?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

critique of the template theory

A

problem of imperfect matches
cannot account for the flexibility of pattern recognition system
comparison requires identical orientation, size, position of template to stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

prototype theories

A

modification of the template matching theory (flexible templates)

possess average of each individual characteristic (prototype)

no match is perfect; a criterion for matching is needed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Frank and Bransford 1971

A

evidence for the prototype theory

presented items based on prototypes
prototype not shown
yet participants were confident that they had seen the prototype
suggests the existence of prototypes

however prototypes cannot account for all objects/ patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

feature theories

A

pattern consists of a set of features or attributes
need to know the relationship between features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

structural distribution

A

describe the nature of components of a configuration and the structural arrangement of these parts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

2D pattern matching theories

A

template theories
prototype theories
feature theories
structural description

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

basics steps of 3D object recognition

A

early image processing: must first interpret input to the visual system as coherent structures, segregates from one another and from the background

then must be processes to give a description- which can then be matched to the descriptions of visual objects stored in memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

three questions for object recognition

A

what elements are used in the description? (primitives)

how is the relationship between these elements specified?

how is the overall description invariant across views?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Marr and Nishihara (1978)

A

objects are made of cylinders, must specify relationship between cylinders to give a structural description

expressed structural relations by a hierarchal organisation of cylinders

each cylinder has and axis and way in which other are joined are expressed as coordinates

the description of each cylinder is describes relative to its own axis, resulting in a description which is invariant across viewpoints

17
Q

Biederman (1987, 1989) recognition by components theory

A

objects comprised of basic shapes

geons = geometrical ions
-blocks, cylinders, arcs, wedges
-approximately 36 different voluntary shapes
-viewpoint invariant theory

relationship between geoms allows for object identification

small number of structural relationships

concave parts of an objects contour helpful in segmenting visual image into parts

geons specified in terms of ‘non-accidental’ properties

regularities in the visual image thought to reflect actual (non-accidental) regularities in the world

according to model, forms of degradation which disrupts the basis for identifying geons should make the object more difficult to recognise

18
Q

Biederman structural relationships

A

relative size
verticality
centring
relative size of surfaces at joint

19
Q

non-accidental properties of geons

A

curvature - points on a curve

parallel - set of points in parallel

co-termination - edges terminating in a common point

symmetry - versus asymmetry

co-linearity - points in a straight line

20
Q

biederman (1987)

A

slow an inaccurate at ‘non recognisable’ stimulus, but relatively good at recognisable

points of concavity more easy to recognise

deletion of component affects matching stage - reducing the number of components to match to
-at brief exposures (65ms) partial objects better recognised

midsegment deletion makes it more difficult to determine components
-at longer exposures (200ms) midsegment deletion led to less errors

21
Q

biederman support

A

vogels, biederman, bar and Lorincz (2001) found some cortical neurons in monkeys sensitive to geons

assessed the response of neurons in the inferior temporal cortex to change in geon or change in size of object

some neurons responded more to geon changes, providing support for geons

22
Q

evaluation of biederman

A

flexible and comprehensive system for describing objects, but why 36 geons?

experimental results consistent with model but doesn’t provide a critical test

doesn’t explain how descriptions are matches to those stored (how does object recognition occur?)

23
Q

advantages of biederman

A

recognises the importance of the arrangements of the parts

parsimonious: small set of primitive shapes
-structural description of relationships emphasised

24
Q

disadvantages of biederman

A

structure is not always the key to recognition
-texture not considered

which geons?

doesnt account for within category discrimination

de-emphasises the role played by context in object recognition

simplifies the contribution of viewpoint-dependence
-doesn’t explain why viewpoint may affect ease of recognition

25
Q

viewpoint dependent theory

A

assume that changes in viewpoint reduce the speed and/or accuracy of object recognition

evidence suggests sometime viewpoint invariant mechanisms are used, and other times viewpoint dependent mechanisms are used

viewpoint dependent mechanisms are more important for within category discriminations

26
Q

vanrie et al. (2002)

A

viewpoint dependent = complex within category decisions

viewpoint invariant = easy categorical decisions

27
Q

Tarr and Hayward (2018)

A

object representations are neither viewpoint-dependent nor viewpoint-invariant

28
Q

issues with object recognition theories

A

any theory of object recognition must address the binding problem = how do we integrate different kinds of information to produce object recognition?

when presented with several objects how do we decide which features belong to which objects?

29
Q

beyond recognition

A

once a structural description of an object is formed, it must be matched to stored representations

if there is a match then object is ‘recognised’

30
Q

evaluation Humphreys et al 1988

A

oversimplification? - as later processes may start before earlier ones have been completed? stages more integrated?

general support for model from patients with object recognition difficulties - associative agnosia