3 - Object Recognition Flashcards

1
Q

What is Object Recognition for humans

A

For humans: perception of familiar items.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

is perception of objects is different for humans than for computers

A

yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is object recognition for computers

A

For computers: perception of familiar patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

why is object recognition difficult

A
  • Environment contains hundreds of overlapping objects.
  • Yet perceptual experience is of structured, coherent objects which we can recognise, use and usually name.

• Apparent size & shape of an object does not change despite large variations in retinal
image.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

give 6 examples of variability

A
translation invariance
rotation invariance
size invariance
colour
partical occlusion
presence of other objects
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is a intra-class variation in object recognition

A
intra-class variation 
e.g. recognise chair - variation as to what a chair looks like
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is view point variation in object recognition

A

viewpoint variation
different viewpoints
recognition from unusual views

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is template theory an example of

A

2D pattern matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what happens in template theory

A
mini copy or template in LTM of all known patterns
normalisation?
numerous templates?
real life examples:
bar codes, fingerprints
compare against templates stored in LTM

see something new, determine what it looks like match with stored template

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

whats the problem with template theory

A

– Problem of imperfect matches
– Cannot account for the flexibility of pattern recognition system
– Comparison requires identical orientation, size, position of template to stimuli

looseness in argument
more compex - tricky
how mant templates? - fonts capitals small letters - where do we stop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is template theory

A

– Multiple templates are held in memory

– Compare stimuli to templates in memory for one with greatest overlap until a match is found

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is prototype theory part of

A

2d pattern matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is prototype theory

A
  • Modification of template matching (flexible templates)
  • Possesses the average of each individual characteristic
  • No match is perfect; a criterion for matching is needed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is the evidence for prototype

A
Franks & Bransford (1971)
– Presented objects based on prototypes
– Prototype not shown
– Yet participants are confident they had seen prototype 
– Suggests existence of prototypes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is feature theory a part of

A

2d pattern matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is feature theory

A
  • pattern consists of a set of features or attributes.
  • A = 2 straight lines & connecting cross bar.
  • But also need to know A
    a relationship between features.
    / \ - = A??
    define a letter a by features nd attributes that make a letter a
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what is structural description an example of

A

2d pattern matching

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what is structural description

A

“..describe the nature of the components of a configuration and the structural arrangement
of these parts” (Bruce & Green, 1990)

  • Capital letter T = 2 parts; 1 horizontal ; 1 vertical; vertical supports horizontal; vertical bisects horizontal.

know features and relationship between fetrues
what are consistuent parts and how are they organised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what is 3D object recognition

A

Similar but more complex process………
• Firstly must interpret input to the visual system as coherent structures, segregated
from one another & from background. (Early image processing)
determine one object - create structural description - match to familiar obejcts strored in LTM

• Must be processed to give a description – which can then be matched to the
descriptions of visual objects stored in memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

what happens in marrs computational approach

A

primal sketch
2 1/2 d sketch
3-d observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

what is a primal sketch

A

2-D description includes

changes in light intensity, edges, contours, blobs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

what happens in 2 1/2 D sketch

A

Includes information about depth, motion, shading. Representation is observer-centered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

what happens in 3-D representation

A

a representation of objects and their relationships

observer - independent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what are the 4 questions in object recognition

A

(1) What elements are used in the description? (primitives) - components features geons
(2) How is the relationship between these elements specified?
(3) How is the overall description invariant across views? recognise from different views
(4) What about viewpoint dependence? better at recognising in view points we usually see

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

what happens when processing objects comprise of cylinders

A

Objects comprised of cylinders - must specify relationship between cylinders = structural
description.

features and components
in a coordinate type system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

what did Marr & Nishihara (1978) propose

A

Marr & Nishihara (1978) expressed structural relations by a hierarchical organisation of
cylinders.
- each cylinder has axis & way in which others are joined are expressed as coordinates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

how does the hierarchical organisation of cylinders work

A

The position of each cylinder described relative to its own axis, resulting in a description which is invariant across viewpoints.

each cylinder - in realtionship with main cylinder - work out coordinates to match what you have in LTM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

what did Biederman suggest

A

Biederman (1987; 1989) provided alternative model to Marr & Nishihara (1978)…………..

• Biederman’s Recognition-by-components theory: Objects composed of basic shapes

  • GEONS = ‘geometrical ions’
  • blocks, cylinders, arcs, wedges
  • approximately 36 different volumetric shapes
  • Viewpoint invariant theory
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

what are GEONS

A

geometrical ions

components and features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

what are the structural relationships according to biederman

A

relative size - bigger or smaller
verticality - one thing on top of another
centring - off set
relative size of surfaces at join - joined at short or long side

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

what is the viewpoint-invariant theory of recognition

A

recognition using:
3D component parts e.g. 36 GEONS
structural relations between the parts

relationship between geons
match whats stored in LTM

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

what does Biederman say about concave parts of objects

A

Concave parts of an object’s contour helpful in segmenting visual image into parts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

what are geons specifiend in terms of ‘non-accidental’ properties

A
Curvature - points on a curve
Parallel - set of points in parallel
Co-termination - edges terminating in a common point
Symmetry - versus asymmetry
Co-linearity - points in a straight line
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

what does Biederman say about cylinders

A

cylinder posses curved edges & two parallel edges connecting the curved edges.
properties that define a cylinder

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

what does Biederman say about regularities in the visual image

A

Regularities in the visual image thought to reflect actual (non-accidental) regularities in
the world. E.g. 2D symmetry in the visual image indicates symmetry in 3D object.
e.g. bucket anf mug - same consistent geons but different relationships

36
Q

how do we see the object according to Biederman

A

edge extraction - see object
detection of non-accidental properties
parsing of regions of concavity- chop off one geon from another
determination of components
matching of components to object representations - create - match to stored descriptions in LTM

37
Q

what makes objects more diffiuclt to recognise according to Biederman

A

According to model, forms of degradation which disrupt the basis for identifying geons should make objects more difficult to recognise….
if take away points of concavity - where one stopps and starts - makes genon more difficult to recognise

  • Biederman (1987) deleted edges at points where easily reinstated or difficult to determine.
  • Stimuli presented for 100, 200 or 750 msec with 25%, 45% or 65% contours removed.
38
Q

what is curvature

A

points on a curve

39
Q

what is parallel

A

set of points in parallel

40
Q

what is co-termination

A

edges terminating to a common point

41
Q

what is symmetry

A

versus asymmetry

42
Q

what is co-linearity

A

points in a straight line

43
Q

when Biederman deleted edges at points where easily reinstated or difficult to determine

A

slow and inaccurate at non-recognisable but relatively good at recognisable
when take away points of concavity - difficult to recognise as harder to break into constituent parts

44
Q

biedermans stimuli

what is deletion and mid segment

A

Deletion of component affects matching stage -Reducing the number of components to match to.
Midsegment deletion makes it more difficult to determine components.

45
Q

what did biederman find

A

At brief exposures (65ms) Partial objects better recognised.

At longer exposures (200ms) midsegment deletion led to less errors.

46
Q

what is support for biedermans idea

A

Vogels, Biederman, Bar & Lorincz (2001) found some cortical neurons in monkeys sensitive to geons.

  • Assessed response of individual neurons in the inferior temporal cortex to change in geon or change in size of object.
  • Some neurons responded more to geon changes, providing support for geons.
47
Q

what is the evaluation of biederman

A

flexible & comprehensive system for describing objects. But why 36 geons?

  • Experimental results consistent with model but doesn’t provide critical test.
  • Doesn’t explain how description are matched to those stored. - creating a structural description- need to make an adequate match in order to recognise
48
Q

what are the advantages of biedermans model

A

Recognizes the importance of the arrangement of the parts

– Parsimonious: Small set of primitive shapes

49
Q

what are the disadvantages of Biedermans model

A

– Structure is not always key to recognition: Peach vs. Nectarine - cant make decision based on edge info

– Which geons?

– Within category discrimination (which chair?)

– De-emphasise the role played by context in object recognition (affects later stages of object recognition) - different context - top down context info

– Simplifies the contribution of viewpoint-dependence - creating invariant descriptins - easier to recognise from familiar viewpoints

50
Q

what theory did Biederman put forward

A

recognition by components theory

51
Q

viewpoint dependent theory

A

‘object representations are collections of views that depict the appearance of objects from specific viewpoints’ (Tarr & Bulthoff, 1995)

52
Q

what do view point invariance theories say

A

According to viewpoint invariance theories (Biederman, 1987) ease of object recognition
is not affected by the observer’s viewpoint.

53
Q

what do viewpoint dependent theories say

A

viewpoint-dependent theories (Tarr, 1995; Tarr & Bulthoff, 1995; 1998) assume changes in viewpoint reduce the speed and / or accuracy of object recognition.

54
Q

what does vidence suggest about viewpoint

A

Evidence suggests that viewpoint invariant mechanisms used sometimes in object recognition whereas viewpoint dependent mechanisms used at other times

55
Q

when is view point dependent more important

A

Evidence suggests that viewpoint invariant mechanisms used sometimes in object recognition whereas viewpoint dependent mechanisms used at other times

56
Q

what type of decisions is view point dependent used for

A

complex within category decisions

57
Q

what type of decisions is view point invariant used for

A

Viewpoint invariant = easy categorical decisions

58
Q

what did vanrie et al (2002) say

A

‘The key question is no longer if object recognition is viewpoint-dependent or viewpoint independent, but rather when, i.e. under which circumstances.’

59
Q

what does tarr and hayward (2018) say

A

‘object representations are neither viewpoint-dependent nor viewpoint-invariant’

60
Q

what is the binding problem

A

how do we integrate different kinds of information to produce object recognition?

61
Q

what problem must any theory of recognition address

A

the binding problem

62
Q

how are object recognised

A

Once structural description of an object is formed – it must be matched to stored representations.

  • If there is a match then object is ‘recognised’.
  • Several models which specify stages in ‘recognition’ process………
63
Q

what did humphreys say about object recognition

A

structural description
semantic representation
name representation

64
Q

what is structural description

A
face-recognise- match face
if stuck here cant get through 
see objects but cant create structura description
agnostic
cant match so dont recognise
65
Q

what is semantic representation

A

where know from
how do you use it
the context you see it

66
Q

what is name representation

A

remember name

67
Q

is there support for object recognition

A

General support for model from patients with object recognition difficulties. Associative agnosia
- for example Patient HJA; Patient JB

agnostic patients
unable to recognise objects

68
Q

why is object regonsition an oversimplification

A

as ‘later’ processes may start before earlier ones have been completed.
not the case you get stuck - not necessarily all or nothing

69
Q

which model did humphreys propose

A

the cascade model

70
Q

what is the cascade model

A

structural, semantic and name stages interact.

  • both within and between stages.
  • Makes different predictions about how subjects will perform in object naming task.

Problems at one stage will have ‘knock on’ effect….

71
Q

cascade model

A

structural description system
semantic representation
name representations
name

activations across and between stages
activations flowing around system

72
Q

is there evidence for the cascade model

A

Anecdotal and empirical evidence for a separation of structural, semantic and naming
processes in recognition.

73
Q

is there evidence for stages operating in cascade rather than independently

A

Humphreys et al (1988) propose processing across these stages operates in cascade
rather than independently.

– e.g. Patient JB. Naming visually confusable objects (birds, animals) had knock on effects, making it more difficult to identify their category.

74
Q

what is agnosia

A

Failure of knowledge or recognition = “agnosia”. (visual agnosia)

75
Q

what happens in visual agnosia

A

In visual agnosias, feature processing and memory remain intact, and recognition deficits are limited to the visual modality.
Alertness, attention, intelligence and language are unaffected.
Other sensory modalities (touch, smell) may substitute for vision in allowing objects to be recognized. - cant visually regonise but touch to determine

76
Q

what are the tasks in BORB to test agnosia

A

(A) Association match; (B) Item match;
(C) Foreshortened match;
(D) Object decision;
(E) Minimal feature match

77
Q

what is apperceptive agnosia

A
Apperceptive agnosia (lateral) – problems with early processing (shape extraction). 
vision - all objects look the same
78
Q

what happens in apperceptive agnosia

A

Perceptual deficit, affects visual representations directly, components of visual percept are
picked up, but can’t be integrated, effects may be graded, often affected: unusual views
of objects

79
Q

what is the diagnosis of apperceptive agnosia

A

ability to recognise degraded stimuli is impaired

80
Q

what are the tests for apperceptive agnosia

A

-would have trouble drawing a chair with missing contours when presented to them
would have trouble recognising a chair from a certain perspective

81
Q

what is associative agnosia (bilateral)

A

problems with later processing (recognition). - cant create structural description to map what they have in memory

Visual representations are intact, but cannot be accessed or used in recognition.
Lack of information about the percept.

“Normal percepts stripped of their meaning” (Teuber)

82
Q

what tests associative agnosia

A

Patients do well on perceptual tests (degraded images, image segmentation), but cannot access names (“naming”) or other information (“recognition”) about objects. Agnosics fail to experience familiarity with the stimulus.

When given names of objects, they can (generally) give accurate verbal descriptions.

fail to have familiarity with the objects

83
Q

Associative agnosics can copy drawings of objects but cannot …

A

name them (evidence for intactness of perceptual representations…)

84
Q

what happens when agnosia is restricted to specific categories

A

Specific deficits in recognizing living versus non- living things.

Warrington and Shallice (1984): patients with bilateral temporal lobe damage showed loss of knowledge about living things (failures in visual identification and verbal knowledge).

Their interpretation: distinction between knowledge domains – functional significance (vase-jug) versus sensory properties (strawberry-raspberry).

85
Q

what does Damasio say about inanimate objects

A

Many inanimate objects are manipulated by humans in characteristic ways.
Interpretation: inanimate objects will tend to evoke kinesthetic representations.

86
Q

what did Gaffan and Heywood (1993) say about agnosia related to specific categories

A

Presented images (line drawings) of animate and inanimate to normal humans and normal monkeys, tachistoscopically (20 ms). Both participant groups made more errors in identifying animate vs. inanimate objects.

Interpretation: Living things are more similar to each other than non-living things - “category- specific agnosia”