chapter three: perception Flashcards
chapter 3: perception
–
template-matching theory
A template of a visual stimulus is stored in memory.
- Presentation of visual stimulus is compared with
templates stored in memory - Templates linked to information about that type of object
(e.g., its name, what it’s used
for, what it does etc.).
notes:
An early theory of recognition is that of template matching. As the name suggests a template of every object in our environment is thought to be stored in memory when presented with a visual stimulus this stimulus is matched against templates
for example according to template matching theory the presentation of the letter a is matched templates and memory when there is a match the item is recognized
but you might wonder not every apple looks exactly the same not every letter 8 looks the same so how can a template match all these different versions of the same object? do we have a template for each variation within a category?
problems with template matching theory
Problems:
- The way we see objects is highly variable.
- Requires a huge number of templates even for a single type of
object.
the problem with template matching theory is that it does not explain why we can recognize objects that do not match a template in memory exactly for example we can recognize variations within a category, objects that are out of focus or visually degraded does this mean that we have a template for all the different types of hats? a template perhaps out of focus or when it is visually degraded?
if recognition requires that a template for each variation within a category be stored in memory this would mean we have an infinite number of templates learned and stored in memory this seems unlikely
feature detection theory
Objects can be decomposed into parts (features)
The features are matched to the features of objects in
memory.
When there is large feature overlap between the stimulus and
item in memory this will lead to recognition.
notes:
Feature detection accounts for w recognition even when there is variation in objects by examining features of the object instead of the object as a whole like with template theory
feature detection theory proposes that objects such as the letter a can be decomposed into parts for example the letter a has 2 diagonal lines and a horizontal line when the features of an object match the features of a pattern in memory this will result in recognition
FEATURE DETECTION THEORY
Neisser’s study
Neisser’s (1964) study
- If recognition involves feature detection there should be perceptual
confusions. That is, impaired recognition of objects that share
overlapping features (e.g., angular features).
Results: * Slower to detect Z among items with similar features (List 2 – angular
features) compared to dissimilar features (List 1 – curved features).
notes:
based on feature theory you should find that searching for the letter Z in a pile of letters with angular features takes longer than if it were among letters with rounded features this is because it becomes difficult to distinguish whether the features detected or that of the letter Z or that of the letter N v or W
Feature Detection Theory
Distinctive features
Distinctive features: features that help to discriminate
between two patterns.
when learning our alphabets as children, children sometimes confuse letters that look similar like E&F or O and Q or M&N or K&X, however if children are taught the distinctive feature that helps one discern one letter another by highlighting the feature in red ink children are faster to learn letter discrimination even after the highlighting is removed
problems:
- Does not take spatial relationships into account.
- Detecting individual features can be inadequate for
recognizing some patterns.
the problem with feature detection is that objects such as the letter T and the + have the same features a lowercase B&P share similar features
how do we distinguish between the two? what seems to matter is not just the features of an object but also the relation between these features
structural theory
Biederman’s Recognition by Components Model (1987)
Biederman’s Recognition by Components Model (1987)
- A Structural theory
* Involves features and the arrangement of features. - All objects are constructed from a small number of 3-
Dimensional shapes referred to as GEONS
notes:
Structural theories focus on how features are combined one such structural theory is called recognition by components which suggests that it is not just the features that matter for recognition but how the features combine
furthermore unlike feature detection theory which does not specify how many in which features are used for recognition, the recognition by components theory proposes that objects can be recognized based on a subset of 36 GEONS or shapes
the limitation of recognition by components theory is that it does not explain how come we can tell a granny Smith apple from a golden delicious apple or a gala apple?
the components in the way these components are arranged are the same for all these apples yet apple farmers would be able to distinguish between the variety of apples
perhaps recognition is more than just the components of an object and how they are put together
problems with structural theory
Relying on features and how they combine is not
enough to explain how we can distinguish specific
members of a category.
- E.g., a granny smith apple versus a gala apple
the nature of perception
Perception: Conscious experiences as a result of sensory stimulation.
characteristics:
Modifiable: can change with experience
our perceptions are influenced by our experiences, knowledge and memory since our experiences can impact perceptions, if our experience is changed this can modify our perception
- Results from a reasoning process: knowledge, memory, and experience
influences perception - Reciprocal relationship between perception and action
lastly there is a reciprocal relationship between perception and action, when we are moving within our environment or taking actions upon an object the sensory information is changing and with it our perception
bottom-up processing
Bottom-Up Processing
* Also referred to as data-driven processing.
* Process of building a whole image from a set of features.
top-down processing
Prior knowledge and expectations influence perception and
pattern recognition in the absence of (or even despite)
sensory input.
- Recognition when bottom-up information is ambiguous.
- Speeds pattern recognition when patterns are in their usual
context.
top-down processing
Word Superiority Effect (Reicher, 1969)
Procedure:
1. Brief presentation of a word (e.g., WORK) or a non-word (OWRK) or a
letter (e.g., K)
2. Asked: “Was there a D or K in the display?
Results:
Greater accuracy recognizing a letter presented rapidly when it is part of
a word rather than when it is presented alone or as a non-word!
the challenges of designing a perceiving machine
Perceiving machines have difficulty with:
- Solving the inverse projection problem
- Recognizing hidden or blurred objects
- Achieving viewpoint invariance
- Recognizing scenes
THE CHALLENGES OF DESIGNING A
PERCEIVING MACHINE
INVERSE PROJECTION PROBLEM
- Inverse projection problem: When there is the SAME image on the retina
from various objects (e.g., rectangular paper, tilted trapezoid), how do you
determine what that object is?
THE CHALLENGES OF DESIGNING A
PERCEIVING MACHINE
. Recognizing hidden or blurred objects:
- Recognizing hidden or blurred objects: When objects are obscured or
blurred, computers have difficulty with recognition.
THE CHALLENGES OF DESIGNING A
PERCEIVING MACHINE
3) Viewpoint Invariance:
Computers have difficulty with recognizing the
same object from different viewpoints.
Computers have difficulty with recognizing both photos as the
same person if they are not front-on faces.
Computers can distinguish between two front-on faces
THE CHALLENGES OF DESIGNING A
PERCEIVING MACHINE
recognizing scenes
- Recognizing Scenes: Computers have difficulty perceiving a scene
Scene: real-world environment with items that are meaningfully organized
with each other and with the background.
information for human perception
Human Perception: Involves both bottom-up and top-down processing.
Bottom-up processing: perception is based on sensory information from
the environment. Process is also referred to as data-driven processing.
Top-down processing: prior knowledge and experience influences
perception.
notes:
INFORMATION FOR HUMAN PERCEPTION
TOP-DOWN PROCESSING
see example on slide
notes: Our experiences help us perceive the blob in the figure A l different ways in different scenes because of top down processing
for instance in figure B although the image is blurred we can rely on our prior knowledge of human form and the function of the table to interpret the same BLOB as some object on the table perhaps it’s a cup
in figure C the same BLOB is presented but within the context of being at the bottom of a person’s leg thus we perceive the BLOB as a shoe
in figure D once again our knowledge of the context in the image influences our perception of the BLOB to be a car and a person
a demonstration of top down processing
A B C
12 13 14
Very quickly read the first and second line did you read it as ABC and 12/13/14 do you notice anything interesting about these two rows?
notice the second item in the top row and the second item in the bottom row are identical yet you recognize them to be different items one was recognized to be the letter B whereas in the bottom row it was probably recognized as the digit 13
why? well in the first row the second item is surrounded by letters A and C this context probably influenced your recognition for the second item
in the bottom row the numbers 12 and 14 influenced your recognition of the second item that is you identified it as a digit
in other words top down processing occurred since your prior knowledge of letters and numbers had influenced your recognition of the stimulant
a demonstration of top down processing
image of a man
now look at this image what do you see this image this second image is ambiguous it can be perceived in more than one way either man or woman looking in a mirror if you report seeing a man you may have been influenced by prior knowledge in particular the prior presentation of a man influenced your perception of the ambiguous image to be that of a man
however what would happen if I had shown you an image of a lady looking in a mirror at time 1 but probably would have happened is that you would have perceived the ambiguous image to be that of a woman
since our recognition of the ambiguous image is influenced by what we had previously encountered this is an example of top down process is influencing our response
INFORMATION FOR HUMAN PERCEPTION
Hearing Words in a Sentence: acoustic signal is often continuous. Hearing
individual words requires the ability to segment speech.
continous acoustic signal (see image)
notes:
the acoustic Signal is the pressure changes in the air and the acoustic signal for speech sounds are created by air being pushed from the lungs through the vocal tract
if you look at the acoustic signal on the screen when someone says speech segmentation you will notice that the acoustic signal is continuous furthermore it’s difficult to determine the start and end of this word speech as well as the start and ending of the word segmentation yet we’re able to perceive that there are 2 words because of our ability to segment speech in two distinct words
speech segmentation
The ability to tell when one word in a conversation ends and the next one begins is a phenomenon called speech segmentation
The fact that a listener familiar only with English and another listener familiar with Spanish can receive identical sound stimuli but experience different perceptions means that each listener’s experience with language (or lack of it!) is influencing his or her perception
he continuous sound signal enters the ears and triggers signals that are sent toward the speech areas of the brain (bottom-up processing); if a listener understands the language, their knowledge of the language creates the perception of individual words (top-down processing).
INFORMATION FOR HUMAN PERCEPTION
Speech segmentation: discriminate words in a conversation based on
Context/sentence used
* Meaning of the word
* Knowledge of word structure to assess transitional probabilities
notes:
speech segmentation is the discrimination of words and speech and our ability to perceive words in speech involves top down influences such as context or even meaning and prior knowledge and also bottom up processing is also involved that is the processing of auditory sensory information
and even with background noise you are still able to discriminate words in a conversation often because of top down influences
INFORMATION FOR HUMAN PERCEPTION
Transitional Probabilities: Likelihood that one sound follows another
sound (based on our knowledge of that language).
- Pretty Baby: “pre” is followed by “tty” -> one word!
- Pretty Baby: “tty” is not followed by “ba” -> belong to different words!
notes:
speech segmentation or the ability to detect words in speech is influenced like I mentioned by top down processes such as prior knowledge and the retrieval of this prior knowledge to help us identify words in speech
in addition we rely on statistical learning or the process of learning what it’s called transitional probabilities
for instance if we have extensive knowledge of spoken English we would have learned that the likelihood that the sounds pre and t make one word because we know that the sound for the word pretty exists
however our experiences with spoken English we also know that T ba is not word so we wouldn’t combine these two sounds together rather we know that they belong to different words in that case in this case we had segmented these speech sounds into identifiable words pretty and baby
transitional probabilities
While segmentation is aided by knowing the meanings of words, listeners also use other information to achieve segmentation. As we learn a language, we are learn- ing more than the meaning of the words
Without even realizing it we are learn- ing transitional probabilities—the likelihood that one sound will follow another within a word. For example, consider the words pretty baby. In English it is likely that pre and ty will be in the same word (pre-tty) but less likely that ty and ba will be in the same word (pretty baby).
Every language has transitional probabilities for different sounds, and the pro- cess of learning about transitional probabilities and about other characteristics of language is called statistical learning. Research has shown that infants as young as 8 months of age are capable of statistical learning.