Speech Perception Flashcards
The challenges of speech perception
No clear gaps between words
accent, gender, speaking rate
time constraints - we hear up to 200 words per min
The speech chain
Linguistic level - Physiological level - acoustic level - physiological level - linguistic level
How do we produce speech?
lungs push air up the trachea
which vibrates the vocal chords in the larynx (voice box)
Sounds from the vocal chords are then shaped by the supralaryngeal vocal tract:
- pharynx
- oral cavity
- nasal cavity
Speech as sound waves
- a periodic displacement of air molecules, creating increases and decreases in air pressure
Source filter theory
Source: air vibrates vocal chords
Filter: shaped by supralaryngeal vocal tract
makes speech intelligible
Filtering appears as bands of energy at certain frequencies called formants
Acoustic cues to consonants
A stop consonant is characterised by a release burst of energy followed by silence.
Voice onset time: duration of silence between release burst and vibration of vocal chords determines whether consonant is perceived as b or p.
Acoustic cues to vowels
Changing from high to low vowels - F1 frequency increases. Changing from front to back vowels - F2 frequency decreases
How do we perceive phonemes?
Categorical perception: the tendency to perceive gradual sensory changes in discrete fashion
Yanny/Laurel example
The lack of invariance problem
the relationship between the acoustic signal and phonemes is messy
integrating information from different cues my increase detection of consistent patterns.
Context influences speech perception
Visual context ‘McGurk effect”.
Prior knowledge of speech content improves intelligibility of degraded speech
Brain basis of speech perception: classical model
Superior temporal gyrus for speech perception (wernicke’s area)
Inferior frontal gyrus for speech production (broca’s area)
Left hemisphere dominant
Ventral and Dorsal Streams
Ventral steam for mapping speech sounds onto lexical representations
- activated for tasks focussing on comprehension
bilateral (left and right hemispheres)
Dorsal stream for mapping speech sounds onto articulatory representations
- activated for tasks focusing on perception of speech sounds
left hemisphere dominant
Broca’s area part of the dorsal stream
fMRI- ventral and dorsal streams
evidence from fMRI
Dorsal stream task - repetition of nonsense words
Ventral stream task:
Listening to words
Cohort model
Uniqueness point (UP): time point in the speech signal when only one word becomes consistent with the speech unit
Words are activated immediately upon minimal input
Multiple words are activated
Words compete for recognition
TRACE model
information processing takes place through excitatory and inhibitory interactions.