Topic 6 - Speech Perception Flashcards

1
Q

Acoustic Signal

A

Produced by air that is pushed up from the lungs through the vocal cords and into the vocal tract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Vowels

A

Vowels are produced by vibration of the vocal cords and changes in the shape of the vocal tract by moving the articulators

These changes in shape cause changes in the resonant frequency of the vocal tract and produce peaks in pressure at a number of frequencies called formants

Each vowel has a characteristic series of ‘formants’ (resonant frequencies)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Articulators

A

structures such as tongue, lips, teeth, jaw and soft palate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Formants

A

Resonant Frequencies

The first formant has the lowest frequency, the second has the next highest, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sound/Speech spectrograms

A

Sound/Speech spectrograms are a better way to show the changes in frequency and intensity for speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Consonants

A

Produced by a constriction of the vocal tract and air flow around articulators

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Phoneme

A

smallest unit of speech that changes meaning of a word

In English there are 47 phonemes - 13 major vowel sounds and 24 major consonant sounds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Variability problem

A

There is no simple correspondence between the acoustic signal and individual phonemes

  • Variability comes from a phoneme’s context
  • Acoustic signals that vary can be perceived categorically (/b/ as in ‘bat’ or ‘bite’) i.e., as ‘the same’ sound
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Coarticulation

A

overlap between articulation of neighbouring phonemes also causes variation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Variability between different speakers

A

Speakers differ in pitch, accent, speed in speaking, and pronunciation

  • This acoustic signal must be transformed into familiar words
  • People perceive speech easily in spite of the variability problems due to perceptual constancy
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Categorical perception

A

This occurs when a wide range of acoustic cues results in the perception of a limited number of sound categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Voice onset time (VOT)

A

time delay between when a sound starts and when voicing begins (vocal chords begin to vibrate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

VOT experiment Eimas & Corbit

A

VOT for /da/ is 17ms (short) and /ta/ is 91ms (long)

Computers were used to create stimuli with a range of VOTs from long to short

Listeners do not hear the incremental changes, instead they hear a sudden change from /da/ to /ta/ at the phonetic boundary

Thus, we experience perceptual constancy for the phonemes within a given range of VOT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The McGurk Effect

A

Visual stimulus shows a speaker saying “ga-ga.”

Auditory stimulus has a speaker saying “ba-ba.”

Observer watching and listening hears “da-da”, which is the midpoint between “ga” and “ba.”

Observer with eyes closed will hear “ba.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Physiological link between vision and speech

A

Calvert et al. showed that the same brain areas are activated for lip reading and speech perception.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

FFA activation

A

Von Kreigstein et al. showed that the FFA is activated when listeners hear familiar voices

This shows a link between perceiving faces and voices.

17
Q

Experiment by Rubin et al.

meaning and phoneme

A

Short words (sin, bat, and leg) and short nonwords (jum, baf, and teg) were presented to listeners.

The task was to press a button as quickly as possible when they heard a target phoneme

On average, listeners were faster with words (580 ms) than non-words (631 ms)

18
Q

Experiment by Warren

meaning and phoneme

A

Listeners heard a sentence that had a phoneme covered by a cough

The task was to state where in the sentence the cough occurred.

Listeners could not correctly identify the position and they also did not notice that a phoneme was missing - called the phonemic restoration effect

This did not happen for non-word sentences

19
Q

Experiment by Miller and Isard

meaning and phoneme

A

Stimuli were three types of sentences:

- Normal grammatical sentences 
- Anomalous sentences (made no sense) that were grammatical
- Ungrammatical strings of words

Listeners were to shadow (repeat aloud) the sentences as they heard them through headphones

Results showed that listeners were – 89% accurate with normal sentences – 79% accurate for anomalous sentences – 56% accurate for ungrammatical word strings

20
Q

Segmentation Problem

A

there are no physical breaks in the continuous acoustic signal

Top-down processing, including knowledge a listener has about a language, affects perception of the incoming speech stimulus

Segmentation is affected by context, meaning, and our knowledge of word structure

21
Q

Speech Segmentation

A

the perception of individual words in a conversation

22
Q

Word Structure, Segmentation

A

Transitional probabilities - the chance that one sound will follow another in a language

Statistical learning - the process of learning transitional probabilities and other language characteristics

23
Q

Statistical Learning experiment by Saffran et al.

A

Learning phase - infants heard nonsense words in two-minute strings of continuous sound that contained transitional probabilities

Nonsense words were in random order within the string

If infants use transitional probabilities, they should recognize the words as units even though the string of words had no breaks

24
Q

Indexical characteristics

A

characteristics of the speaker’s voice such as age, gender, emotional state, level of seriousness, etc

25
Q

Speaker characteristics experiment by Palmeri et al.

A

Listeners were to indicate when a word was new in a sequence of words.

Results showed that they were much faster if the same speaker was used for all the words, than when a different speaker was used for each

26
Q

Broca’s aphasia

A

Individuals have damage in Broca’s area in frontal lobe

Laboured and stilted speech and short sentences but they understand others

27
Q

Wernicke’s aphasia

A

individuals have damage in Wernicke’s area in temporal lobe

Speak fluently but the content is disorganized and not meaningful

They also have difficulty understanding others and word deafness may occur in extreme cases

28
Q

Brain damage

A

Some patients with brain damage can discriminate words but are unable to discriminate syllables (and vice versa)

29
Q

Brain scans found:

A

A “voice area” in the STS that is activated more by voices than other sounds

A ventral stream for recognizing speech and a dorsal stream that links the acoustic signal to movements for producing speech - called the dual stream model of speech perception

30
Q

Experience dependent plasticity

A

Before age one, human infants can tell difference between sounds that create all languages

The brain becomes “tuned” to respond best to speech sounds that are in the environment

Other sound differentiation disappears when there is no reinforcement from the environment