Chapter 2 - Perception Part 1 (Start - Perception & Action) Flashcards
What is the difference between perception and sensation?
Sensation refers to a PASSIVE process of bringing information from the outside world into the body and to the brain.
Perception is an ACTIVE process. Refers to the ACTIVE organization and interpretation of sensory information into something meaningful.
What are the 2 different types of theories we use to recognise patterns?
Templates and Features.
According to the template theory, what is a template?
Forms or patterns stored in long-term memory corresponding to each of the visual patterns we know.
Pattern is recognized when its template is matched to the stimulus input. -> provides closest match
List some pros and cons of using the template theory to recognise patterns.
Pros:
- Common-sense; a rather intuitive method
Cons:
- Matching will fail even with minor differences between the template and the input.
- Ill-equipped to account for the flexibility shown by people when recognizing alphabetical and numerical symbols –> becomes obvious when the stimulus belongs to an ill-defined category where no single template will suffice (eg building)
What are some assumptions you can make to improve template theories?
1) Assume that the visual stimulus undergoes a normalization process.
2) Assume that there is more than one template for each letter and digit.
How does the assumption that visual stimulus undergoes normalization help in pattern recognition?
Helps you to produce an internal representation of the visual stimulus in a standard position.
Makes you search for a matching template.
However, by assuming that visual stimulus undergoes normalization, are there any things you might be discounting?
You overlook the possibility that your visual stimulus might not always be consistent with your template.
Not possible for whatever stimulus you experience to always be consistent with your template.
With this (cannot assume normalization) being said, what other assumption should you make to make template theories more effective?
Should assume that there is more than one template for each letter and digit.
What are the advantages of assuming that there is more than one template for each letter and digit?
Permits accurate matching of stimulus and templates across a wider range of stimuli
Does considering the fact that each digit and letter has more than one template makes the template theory more complicated?
Yes.
What assumption is held by most feature theorists?
Pattern recognition first involves specific processing and is then followed by more global or general processing to integrate information from the features.
How did Navon prove that global processing CAN precede more specific processing?
Presented observers with stimuli (a big S formed with small S or small H or big H formed with small H and small S) - 4 conditions.
Found that performance speed with the small letters was greatly slowed when the large letter differed from the small letters. In contrast, decision speed with the large letter was not influenced by the nature of the small latter. In this case, we often see the forest/global structure/big letter before the tree/specific structure/small letter.
What did Neisser do that helped to lend support to the feature theory?
2 lists of non-words with a common target letter Z. For the first list, Z is beside unrounded alphabets that consisted of straight lines like Z itself, whereas in the second list, Z is beside unrounded alphabets.
Compared time taken to detect letter Z. Performance was faster in the latter condition because the distractors shared fewer features with target letter Z.
What types of cells did Hubel and Wiesel study?
Cells in parts of the occipital cortex (at the back of the brain) associated with the early stages of visual processing.
What did Hubel and Wiesel observe?
Some cells responded in 2 different ways to a spot of light depending on which part of the cell was affected.
- An “on” response with an increased rate of firing when the light was on.
- An “off” response with the light causing a decreased rate of firing.
What did Hubel and Wiesel discover?
There are 2 types of neurons in the primary visual cortex: Simple cells and complex cells.
What are the traits possessed by simple cells?
- Have “on” and “off” regions with each region being rectangular in shape.
- These stimuli respond most to dark bars in a light field, light bars in a dark field, or straight edges between areas of light and dark.
- Any given simple cell only responds strongly to stimuli in a particular orientation
Responses of these cells could be useful and relevant to feature detection.
What are the traits possessed by complex cells?
- Responds to moving contours
- We have more of these than simple cells.
The responsiveness of end-stopped cells depends on __________ and ___________.
stimulus length, orientation.
Pattern recognition involves both top-down processes and bottom-up processes. True or false?
True.
Use the word superiority effect to explain how pattern recognition involves top-down processes.
WSE: The finding that a target letter is detected faster when presented in words than in non-words. Suggests that information about the word presented can facilitate identification of its letters.
Top-down: Information about the word presented can facilitate identification of the letters.
What is the pseudoword superiority effect?
Letters are easier to identify when presented in pseudowords than in nonwords.
What patterns do you observe in medical diagnosis when it comes to pattern recognition?
Experts are more likely than non-experts to use implicit reasoning, whereas the non-experts made more use of analytic reasoning. However, other evidence indicates that medical experts often start with fast, automatic processes but cross-check their diagnoses with analytic ones.
Explain interactive activation model and top-down activation.
Negative example: If it’s a non-word, there’s no top-down effect, and hence words won’t be activated first before letters are activated.
What are some challenges one would face in object recognition?
Think:
- Number
- Camera lens and Facebook 360 function.
- Do all objects look the same?
- What did Gestalt psychologists come up with about perceptual segregation?
1) Numerous objects in the visual environment, many of which overlap, and are overlapped by other objects.
2) We recognize an object over a wide range of viewing distances and orientations.
3) Objects vary greatly in their visual properties.
4) Figure-ground segregation: You have to figure out which part of the visual field is the figure (more important), and which part is the ground (less important).
What are the main law associated with figure-ground segregation, and what are some examples of this law?
Explain the main law and these examples.
recall PREGNANT.
Law of Pragnanz: The notion that the simplest possible organization of the visual environment is what is perceived.
1) Law of Proximity: VIsual elements tend to be grouped together if they are close to each other.
2) Law of Similarity: Elements will be grouped together perceptually if they are similar.
3) Law of Good Continuation: We group together those elements requiring the fewest changes or interruptions in straight or smoothly curving lines.
4) Law of Closure: Missing lines are filled in to complete the figure.
Why is the figure more important than the ground?
The figure is perceived as having distinct form or shape, whereas the ground lacks form.
Figure is perceived in front of the ground, and the contour separating the figure from the ground belongs to the figure.
Does figure-ground segregation require past experience (at least what the Gestalt psychologists suggested)?
Is there any evidence to show this?
No, depends very little on past experience and knowledge –> hence de-emphasized the role of past experience and knowledge.
When observers were presented with atypical shape contours that were hard to interpret, they showed rapid and flexible learning, which led to a rapid improvement in performance.
What did the Gestalt psychologists assume about figure-ground segregation?
Hint: If little past experience is needed do you think it’s more innate or learned?
Occurs very early in visual processing and always precedes object recognition.
What did Irving Biederman argue in his recognition-by-components theory? What did he also assume that was found to be incorrect?
Suggested that all objects consist of basic shapes called geons (geometric ions).
Approx 36 geons which can be arranged in limitless combinations.
Object recognition is viewpoint-invariant.
While it can be said that we are most sensitive to those visual features of an object directly relevant to identifying its geons, is there any evidence to show that this isn’t always the case?
1) Himba: a seminomadic tribe in NW Namibia.
- Debunks the notion that we develop sensitivity to visual features of an object as these people have very little exposure to manufactured objects.
- However, they are still as sensitive to geon-relevant information as individuals living in the developed world.
- What seems to matter is exposure to a great variety of naturally-occurring objects in the world around us.
2) Intact figures with degraded line drawings either preserving or not preserving parts of the contour providing information about the concavities.
- Concavities in an object’s contour provide especially useful information.
- Object recognition was much harder to achieve when parts of the contour providing information about concavities were omitted than when other parts of the contour were deleted.
Illustrate how object recognition involves both top-down processing and bottom-up processing.
BTU: Identify geons first, then you identify object –> strongly emphasised in object recognition
TD: Depends on factors such as expectation and knowledge are often important, especially when object recognition is difficult.
- Observers relied more on top-down processes if an animal picture is blurred as compared to when it is not blurred.
- Happened because there was less information for bottom-up processes to make use of with the blurred photographs.
Having established that object recognition can be both viewpoint-variant and viewpoint-invariant, when are viewpoint-invariant mechanisms used?
When object recognition involves making easy discriminations.
When will viewpoint-variant mechanisms be used?
When object recognition requires difficult within-category discriminations.
What did Tarr et al obtain with regard to the point about object recognition being both viewpoint-invariant and viewpoint-variant?
Considered recognition of the same 3D objects under various conditions.
Performance was close to viewpoint-invariant when the object recognition task was easy (and detailed feedback was provided after each trial). However, it was viewpoint-variant when the object recognition task was difficult (no feedback given after the task).
Which brain areas are activated during visual processing?
Visual processing proceeds through several areas in the occipital lobes at the back of the brain and finishes up in the inferotemporal cortex, which is of crucial importance in visual object recognition.
What do neurons vary in?
Invariance or tolerance.
What does it mean when a neuron possesses high tolerance, and what does it mean when a neuron possesses low tolerance?
High: Neurons responding almost equally strongly to a given object regardless of its orientation, size and so on.
Low: Neurons responding most strongly to an object in a specific orientation or size.
Is there any link between a neuron’s tolerance and the invariance of viewpoints adopted during object recognition?
Inferotemporal neurons having high invariance or tolerance seem consistent with theories claiming that object recognition is viewpoint-invariant.
(High tolerance -> invariant)
Inferotemporal neurons having low invariance or tolerance seem consistent with theories claiming that object recognition is viewpoint-variant.
(Low tolerance -> variant)
Can object recognition be both viewpoint-invariant and viewpoint-variant at the same time?
Yes.
Observers saw pairs of simple 3D objects formed from connected cylinders. Task was to decided whether the 2 images show the same object or different objects.
When 2 objects were different, they could differ in terms of a viewpoint-invariant feature and/or viewpoint-dependent features.
Key finding was that observers used both kinds of information. Suggests that we make use of all available information in object recognition rather than confining ourselves to only some of the information.
What is agnosia?
When one cannot recognise objects despite information reaching the visual cortex.
These people have no problems with their eyesight and visual cortex, but they cannot recognise objects.
What are the 2 kinds of agnosia?
Apperceptive agnosia: Object recognition is impaired due to deficits in perceptual processing.
(cannot perceive)
Associative agnosia: Perception is fine, but there is difficulty retrieving knowledge about objects from long-term memory. (cannot retrieve_
Apperceptive –> cannot perceive
Associative –> cannot retrieve.
The problems with object recognition occur at a(n) ________ of processing in apperceptive agnosia than in associative agnosia.
earlier or later?
Earlier
maybe just think of it as you need to perceive before you retrieve. perceiving would thus be an earlier stage.
How do we distinguish between apperceptive agnosia and associative agnosia?
1) Assess patients’ ability to copy objects they can’t recognise
- Can copy: Associative agnosia
- Cannot copy: Apperceptive agnosia
2) Gollin picture test: patients are presented with increasingly complete drawings of an object.
- Apperceptive agnosia patients would require more drawings than healthy individuals to identify the objects.