Chapter 2: Perception Flashcards

Question

Describe an "on-off" cell.

Answer 1

if light falls on a small region of the retina at the center of the cell’s receptive field, their spontaneous rates of firing will increase. If light falls in the region just around this sensitive center, however, the spontaneous rate of firing will decrease. Light farther from the center elicits no change in the spontaneous firing rate — neither an increase nor a decrease.

Answer 2

Light at the center decreases the spontaneous rate of firing, and light in the surrounding areas increases that rate.

Answer 3

Studied the primary visual field of cats. Found four patterns in the cortical cells.

Answer 4

Their receptive fields are elongated in shape (contrasted with the circular shape of the on-off/off-on cells.

Answer 5

Cells in the visual cortex that respond most to edges in the visual field. Split vertically. Respond positively to light on one side of the line; negatively to light on the other side of the line.

Answer 6

Cells in the visual cortex that respond most to bard in the visual field. Split vertically in three parts. Bar detectors with a positive center will respond most if the bar of light just covers its center. (positive in middle, negative on edges). This also works in reverse.

Answer 7

Position, Orientation, and Width.

Answer 8

Visual cortex is divided into 2x2mm regions (called hypercolumns)

Answer 9

NEED HELP!

Answer 10

Colours of objects and whether they are moving.

Answer 11

Form, colour, and movement are processed separately.

Answer 12

A representation of the spatial locations of a particular visual feature. There are separate maps for colour, orientation, and movement. (Ex. Moving red vertical bar: separate feature maps represent its colour as red, its orientation as vertical, and its movement as occurring in that location).

Answer 13

Complex patterns (like hands and faces).

Answer 14

Information is laid out on the retina is a 2-D image and it needs to be constructed into a 3-D image.

Answer 15

texture gradient, stereopsis, and motion parallax. (other important cues involve features such as: size, position, and lighting).

Answer 16

Items that we assume are equal in size and evenly spaced appear to regularly decrease in size and pack more closely together the further you move away. Ex., Standing on a balcony and looking over a crowd (The Pope example).

Answer 17

Ability to perceive 3D depth based on the fact that each eye receives a slightly different view of the world. Ex. 3-D glasses: turning two 2D images into a 3D image.

Answer 18

Provides information about 3D structure when the observer and / or the objects in a scene are in motion. Ex: The image of distant objects will move across the observer's retina more slowly than the images of closer objects.

Answer 19

2 1/2 D sketch

Answer 20

As proposed by David Marr, a visual representation that identifies where various visual features are located in space relative to the viewer.

Answer 21

As proposed by David Marr, a representation of objects in a visual scene.

Answer 22

How lines and bars go together to form objects.

Answer 23

Principles that determine how a scene is organized into components; the principles include: - Proximity - Similarity - Good Continuation - Closure - Good form.

Answer 24

Elements close together tend to be grouped together.

Answer 25

Elements that look alike tend to be grouped together.

Answer 26

Smoothest flowing line for continuation. No breaks in curvatures or lines that are already running in a particular direction.

Answer 27

Identifying what objects are

Answer 28

a retinal image of an object is faithfully transmitted to the brain, and the brain attempts to compare the image directly to various stored patterns, called templates.

Answer 29

The image could fall on the wrong part of the retina. The image could be the wrong size. The image could be in the wrong orientation. The image might be non-standard (the wrong shape).

Answer 30

Machine vision. | Brain fMRI imaging

Answer 31

“Completely Automated Public Turing test to tell Computers and Humans Apart.”

Answer 32

A theory of pattern recognition that claims that we extract primitive features and then recognize their combinations.

Answer 33

1. ) Because features are simplier, the system might try to correct for the kinds of difficulties faced by the template-matching model in recognizing full patterns. 2. ) Feature analysis makes it possible to specify those relationships among the features that are most important to the pattern. 3. ) The use of features rather than larger patterns reduces the number of templates needed. Because the same features tend to occur in many patterns, the number of distinct entities to be represented would be reduced considerably.

Answer 34

Showed how there is behavioural evidence of features used as components in pattern recognition. *Be able to explain study! The likeness of letters C & G (pg 52).

Answer 35

very slight eye tremors.

Answer 36

30 to 70 cycles / second.

Answer 37

It is critical for the perception of whatever it is that we are looking at. When techniques are used to keep an image in the exact same position on the retina regardless of eye movement, parts of the object start to disappear from our perception. If the exact same retinal and nervous pathways are used uninterruptedly, they become fatigued and stop responding.

Answer 38

The HB letter experiment Stabilized objects disappear slowly over time. Findings: 1. ) Features are the important units in perception. 2. ) The remaining features are then combined into recognizable patterns. Even though our perceptual system may extract features, what we actually perceive are patterns composed from these features.

Answer 39

Computerized systems typically applied to object recognition tasks (including face recognition), based on layers of successively more complex pattern recognizers.

Answer 40

- Image processing starts with a stimulus (pixel representation of an image) - This is followed by 5 layers of pattern recognition. - Layer 1 acts similar to bar & edge detectors in the primary visual cortex. - Layer 5 has elements that respond to more complex patterns, similar to cells in the inferior temporal lobe.

Answer 41

150+ layers.

Answer 42

Appear to have properties like those of the human visual system.

Answer 43

A neurological disorder (damage to the temporal lobe) characterized by the inability to recognize faces.

Answer 44

A region in the temporal cortex involved in recognition of complex patterns, such as: faces and words. - The response is much stronger in the right fusiform gyrus. - Responds when faces are present in the visual field.

Answer 45

People are much better at recognizing faces presented in their upright position Vs. other objects. -When faces are presented upside down (however), there is a dramatic decrease in recognition. BUT: this is not true of other objects.

Answer 46

The minimal units of speech that can result in a difference in a spoken message. - Ex. B/A/T. Each letter is a phoneme. - Letters and phonemes are not always one-to-one. Ex. Knight = n/i/t.

Answer 47

1. Difficult because words flow from one to the other. 2. Variety among speakers of even the same language. Ex. Women and children have higher pitched voices and men have lower voices. 3. Variations among speakers of different languages.

Answer 48

When one phoneme flows into the other phonemes in the word. The phonemes overlap. - The actual sound produced for one phoneme will be determined by the context of the surrounding phonemes. - Ex. 'a' can sound soft or hard depending on where it is in the word and what other phonemes are surrounding it.

Answer 49

- Patients have lost the ability to recognize speech as a result of injury to the left temporal lobe. - They could detect other sounds and speak still. Their deficit was specific to speech perception.

Answer 50

Among the features of phonemes are: the consonantal feature, voicing, and the place of articulation.

Answer 51

A consonant-like quality in a phoneme.

Answer 52

A feature of a phoneme produced by vibration of the vocal cords. For example, the phoneme /z/ in the word zip has voicing, whereas the phoneme /s/ in the word sip does not. (EXPLAIN Pg. 59).

Answer 53

The place at which the vocal tract is closed or constricted in the production of a phoneme.

Answer 54

/p/, /m/, and /w/ are considered to have a bilabial place of articulation because the lips are closed (or constricted, in the case of /w/) while they are being generated.

Answer 55

The phonemes /f/ and /v/ are considered labiodental because the bottom lip is pressed against the front teeth. Two different phonemes are represented by /th/ — one in thy (with voicing) and the other in thigh (without voicing). Both are dental because the tongue presses against the teeth.

Answer 56

The phonemes /t/, /d/, /s/, /z/, /n/, /l/, and /r/ are all alveolar because the tongue presses against the alveolar ridge of the gums just behind the upper front teeth.

Answer 57

The phonemes /sh/, /ch/, /j/, and /y/ are all palatal because the tongue presses against the roof of the mouth just behind the alveolar ridge.

Answer 58

The phonemes /k/ and /g/ are velar because the tongue presses against the soft palate, or velum, in the rear roof of the mouth.

Answer 59

had participants try to recognize phonemes such as /b/, /d/, /p/, and /t/ by distinguishing between the sounds ba, da, pa, and ta presented in noise.2 Participants exhibited confusion, thinking they had heard one sound in the noise when in reality another sound had been presented. The experimenters were interested in which sounds participants would confuse with which other sounds. Participants most often confused consonants that were distinguished by just a single feature. Ex. when presented with /p/, participants more often thought that they had heard /t/ than that they had heard /d/. The phoneme /t/ differs from /p/ only in place of articulation, whereas /d/ differs both in place of articulation and in voicing. Similarly, participants presented with /b/ more often thought they heard /p/ (differing only in voicing) than /t/ (differing in both features).

Answer 60

/b/, the release of air and the vibration of the vocal cords are nearly simultaneous, and the vocal cord vibration continues into the articulation of the following vowel /a/.

Answer 61

In the case of the unvoiced consonant /p/, the release occurs 60 ms before the vibration begins for the vowel.

Answer 62

The presence or absence of a 60-ms interval between release and voicing. - This period of time is referred to as: voice-onset time.

Answer 63

The delay between the release of air and the vibration of the vocal cords.

Answer 64

The perception of stimuli as belonging in distinct categories without gradual variation.

Answer 65

the delay between the release of air and the onset of voicing was varied from −150 ms (voicing occurred 150 ms before release) to +150 ms (voicing occurred 150 ms after release). The participant’s task was to identify which syllables began with /b/ and which with /p/. Figure 2.24 plots the percentage of /b/ identifications and /p/ identifications against voice-onset time. Throughout most of the continuum, participants agreed 100% on what they heard, but there was a sharp switch from /b/ to /p/ at about 25 ms. At a 10-ms voice-onset time, participants were in nearly unanimous agreement that the sound was a /b/; at 60 ms, they were in nearly unanimous agreement that the sound was a /p/. Because of this sharp boundary between identifications of the voiced and unvoiced phonemes, perception of this feature is referred to as categorical.

Answer 66

People are very poor at discriminating between pairs of syllables beginning with /b/ or pairs beginning with /p/ that differ in voice-onset time but are on the same side of the phonemic boundary. However, they are good at discriminating between pairs that have the same difference in voice-onset time when one item of the pair is on the /b/ side of the boundary and the other item is on the /p/ side. It seems that people can identify the phonemic category of a sound but cannot discriminate sounds within that phonemic category. Thus, people are able to discriminate two sounds only if they fall on different sides of a phonemic boundary.

Answer 67

1. (Weaker) The weaker view is that we experience stimuli as coming from distinct categories. There seems to be little dispute that the perception of phonemes is categorical in this sense. 2. (Stronger) A stronger viewpoint is that we cannot discriminate among stimuli within a category.

Answer 68

There is increased discriminability between categories (acquired distinctiveness) and decreased discriminability within categories (acquired equivalence). But, discriminability within categories is still possible.

Answer 69

- We perceive voicing by unconsciously determining how the consonants are spoken. - We determine how we would generate the speech sounds and that we recognize them in terms of the generation process. Thus, the reason for the categorical discrimination between voiced and unvoiced is that they are generated in distinct ways (i.e., with or without vocal cord vibrations, respectively).

Answer 70

Objective: There is evidence that categorical perception is not tied to human processing of language but rather reflects a general property of how certain sounds are perceived. - Categorical perception depends on neither the signal being speech (Pisoni, 1977) nor the perceiver having a human vocal or auditory system

Answer 71

Created nonlinguistic tones that had a distinguishing acoustic feature similar to the feature of voice-onset time in voicing — a low-frequency tone that was either simultaneous with a high-frequency tone or lagged it by 60 ms. His participants showed abrupt boundaries.

Answer 72

Trained chinchillas to discriminate between da (beginning with voiced /d/) and ta (beginning with voiceless /t/). Even though these animals do not have a human vocal tract, they showed the sharp perceptual boundary between these stimuli that humans do.

Answer 73

Context. *Perception can proceed successfully when only some of the features are recognized, with context filling in the remaining features.

Answer 74

Perceptual processing of a stimulus in which information from the general context is used to help recognize the stimulus.

Answer 75

Perceptual processing of a physical stimulus in which information from the stimulus, rather than from the general context, is used to help recognize the stimulus.

Answer 76

Participants were presented very briefly with either a letter (such as D) or a word (such as WORD). Immediately afterward, they were given a pair of alternatives and instructed to report which alternative they had seen. (The initial presentation was sufficiently brief that participants made a good many errors in this identification task.) If they had been shown the letter D, they might be presented with D and K as alternatives. If they had been shown WORD, they might be given WORD and WORK as alternatives. Note that both choices differed only in the letter D or K. Participants were about 10% more accurate in identifying the word than in identifying the letter alone. Thus, they discriminated between D and K better in the context of a word than as letters alone — even though, in a sense, they had to process four times as many letters in the word context. This phenomenon is known as the word superiority effect.

Answer 77

The superior recognition of letters when the letters are presented in a word context than when they are presented alone.

Answer 78

Massaro has argued that the perceptual information provided by the stimulus and the information provided by the context are independent sources of information about the identity of the stimulus, which are combined to provide the best inference about what the stimulus might be.

Answer 79

Massaro's theory of perception, which proposes that information provided by the stimulus and information provided by the context combine to determine perception. *STUDY

Answer 80

The tendency to hear phonemes that make sense in the speech context even if no such phonemes were spoken. -Originally demonstrated by Warren (1970).

Answer 81

Study presented participants with sentences such as the following: It was found that the *eel was on the axle. It was found that the *eel was on the shoe. It was found that the *eel was on the orange. It was found that the *eel was on the table. In each case, the * denotes a phoneme replaced by a nonspeech sound. For the four sentences above, participants reported hearing wheel, heel, peel, and meal, depending on context. The important feature to note about each of these sentences is that they are identical through the critical word and beyond, up to the last word. The identification of the critical word is determined by what occurs after it — that is, by the last word. Thus, the identification of words often is not instantaneous but can depend on the perception of subsequent words.

Answer 82

- Named after Harry McGurk The effect involves watching the lips of someone making a sound like ga while hearing the sound ba. Depending on various factors such as the quality of the acoustic input, listeners report hearing da (a fusion or compromise perception — this type of compromise is the McGurk effect

Answer 83

Context is also important in visualization as well! Especially when considering the identification of an object.

Answer 84

The inability to detect a change in a scene when the change matches the context.

Answer 85

In Marr's model, the level of visual processing in which the visual features have been extracted from a stimulus. - These features are combined with depth information to get a representation of the location of surfaces in space; this is Marr’s 2½-D sketch.

Answer 86

Pg 69. Diagram

Chapter 2: Perception Flashcards

(113 cards)