Chapter 4b: Object Recognition Flashcards

Question 1

Q

Poop Farts Picture

WHAT do you see in this image?

Answer

A

At the level of the retina, you “see” an array of point-lights bouncing off the page and exciting your rods and cones.

In early visual brain areas, you “see” a collection of oriented lines and a collage of red, green, yellow, and blue color patches.

But your response to this question was almost certainly not “light” or “lines” or “colors”; what we all perceive in this scene are “toys.”

The ability to organize visual sensations into coherent objects and then assign meaningful category labels to these objects is in many ways the ultimate accomplishment of vision.

Question 2

Q

The problem of object recognition:

What do you see?

Picture 1: Picture of the front of a red house.
Picture 2: Low-key abstract Watercolor painting of a house.
Picture 3: Picture of the same house from picture #1, but of the side of the house.

Answer

A

The problem of object recognition:

The pictures were just a bunch of pixels on a screen, but in each case you perceived a house

How did you recognize all 3 images as depicting a house?

How did you recognize the 1st and 3rd images as depicting the same house, but from different viewpoints?

How does your visual system move from points of light, like pixels, to whole entities in the world, like houses?

Question 3

Q

Processes in object recognition:

Answer

A

Determine features present in image
(“Low-level vision”)
Group features into objects
(“Middle vision”)
Match perceived representations to encoded representations
(“High-level vision”)

Question 4

Q

Object Recognition Challenges

Answer

A

How do we match a sensation to a memory ?

How is it possible to recognize objects from different vantage points when their optical projections can vary so dramatically?
*Pictures of tea points from different angles

Question 5

Q

Naïve template theory

Answer

A

Object Recognition Theory

The proposal that the visual system recognizes objects by matching the neural representation of the image with a stored representation of the same “shape” in the brain.
That is, maintain a memory of many different views for each object we need to recognize.

“Pandemonium” Oliver Selfridge (1959)

“Lock-and-key” representations: bar codes

Problem: You would need too many templates!
* Example of all the different A fonts.
Very many templates would be required to recognize the different ways that the letter A can be represented.

Question 6

Q

Describe this object

Answer

A

When asked to describe a novel object, observers typically do so by identifying different parts.

Question 7

Q

Structural description theory

Answer

A

Object Recognition Theory

A description of an object in terms of the nature of its constituent parts and the relationships between those parts.

I.E. exploit those properties that can distinguish most objects from one another, yet remain relatively stable over changes in view.

“Generalized Cones” David Marr (1977)

“Recognition-by-Components” Biederman (1987)

Question 8

Q

Object recognition by components

Answer

A

Biederman (1987)

Objects are defined as configurations of qualitatively distinct parts called Geons.

Geons are defined by configurations of non-accidental properties.

Question 9

Q

Geons

Answer

A

configurations of qualitatively distinct parts

defined by configurations of non-accidental properties.

Each type of geon is defined by a particular configuration of non-accidental properties.
(cone, cylinder, block, etc.)

Question 10

Q

Geons are distinguished by their non-accidental properties

Answer

A

the number of straight and curved edges

which edges are parallel to one another

the number of vertices of each type

the presence of symmetries

Question 11

Q

Meaning in the Edges

Answer

A

Non-accidental features provide clues to object structure

T junctions mostly signal OCCLUSION (One object in front of another)

Y and ARROW junctions signal a corners (and not occlusion) most of the time.

These rules FAIL when viewing objects from ACCIDENTAL VIEWPOINTS (that’s why we can have the wrong representation of an image).

Question 12

Q

Objects

Answer

A

Each type of object is defined by a particular configuration of geons.

Objects = Cup, telephone, suitcase, etc.

Geons = Cone, cylinder, block, etc.

Question 13

Q

Prediction Recognition by components:

Answer

A

Deletion of contours in an image should have the greatest effect on recognition performance if it masks non-accidental properties or geons.

Question 14

Q

Geon Theory Task

Answer

A

Subjects are presented with an intact or contour deleted object, and they are asked to name it as quickly as possible.

Recognition performance is more severely impaired by vertex deletion than by midsection deletion.

Recognition performance is more severely impaired by geon deletion than by midsection deletion.

Question 15

Q

Some evidence suggests that object recognition is only possible for viewpoints that are close to those that were observed during training,

Answer

A

This is the opposite of what Recognition by Components predicts.

Question 16

Q

Problems with structural-description theories

Answer

Study These Flashcards

A

Object recognition is not completely viewpoint-invariant.

Geons aren’t always the best descriptions of objects.

Observers show some viewpoint effects in object recognition.
- The farther an object is rotated away from a learned view, the longer it takes to recognize

Chapter 4b: Object Recognition Flashcards

(16 cards)