Listening Flashcards
where do the initial stages of acoustic analysis happen?
superior temporal cortex
where does speech processing split into two streams
after inital cortical stages of acoustic analysis speech processing splits into ventral (speech recognition) and dorsal (speech perception) pathways
what is speech recognition
temporal regions of the brain map sound onto meaning (ventral processing stream)
lexical
what is speech perception
parietal/frontal regions map sound onto action (dorsal processing stream) phonological
what does the dual processing stream model predict
that damage to the ventral stream will impair comprehension while damage to the dorsal stream will impair processing repetition of speech sounds
dissociation between performance on auditory comprehension tasks and sub-lexical perception tasks
e.g unable to distinguish cat and cot but can appropriately match these to pictures
parallel pathways with some crosstalk
what is the segmentation problem
how are we able to perceive speech when there are few gaps in words visible in a waveform of a speech signal
e.g perceived gaps are difficult to identify within a waveform or spectrogram as there are no clear boundaries in a continuous stream yet we are able to break it down into words, syllables and phonetic features
thus there is no direct correspondence between auditory energy and perception (no correlation of amplitude and gaps)
how can we identify vowels in a spectrogram
the relative spacing of steady state portions of a spectrogram
how do we perceive speech
the continuous speech signal is perceived as segments
25-50 phonetic segments per second (4-5 syllables)
speech sounds should be identifiable by different combinations of formant frequencies and the shapes of these transitions
what information is there in a spectrogram
the frequencies of sounds produced over time
formants: bands of acoustic energy
steady state: vowel identity
transitions: consonant identity
what is meant by ‘steady state’
the steady state ratio of the first and second formant distinguishes vowels
what is meant by formant transitions
correspond to production of constant
second formant anticipates the frequency of the vowel that will be produced next
what is coarticulation
the features that characterise speech sounds occur non-linearly
we produce them in parallel rather than in sequence
anticipating what next production will be is reflected in production of a sound
overlap of movements in phoneme production
makes speech production fast and efficient, but means spectrogram will be different depending on the context of the following vowel etc
what are the problems for speech perception
non-linearity
lack of invariance
speaker variability
what is meant by non-linearity
acoustic features spread themselves across each other and are not procured as a linear sequence
what is meant by the lack of invariance
speech sounds have more than one waveform representation
the differences between phonemes are very small
there are big differences between different instances of the same phoneme
coarticulation exists whereby the articulation of one phoneme influences that of another