Listening Flashcards

1
Q

where do the initial stages of acoustic analysis happen?

A

superior temporal cortex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

where does speech processing split into two streams

A

after inital cortical stages of acoustic analysis speech processing splits into ventral (speech recognition) and dorsal (speech perception) pathways

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is speech recognition

A

temporal regions of the brain map sound onto meaning (ventral processing stream)
lexical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is speech perception

A

parietal/frontal regions map sound onto action (dorsal processing stream) phonological

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does the dual processing stream model predict

A

that damage to the ventral stream will impair comprehension while damage to the dorsal stream will impair processing repetition of speech sounds

dissociation between performance on auditory comprehension tasks and sub-lexical perception tasks
e.g unable to distinguish cat and cot but can appropriately match these to pictures

parallel pathways with some crosstalk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is the segmentation problem

A

how are we able to perceive speech when there are few gaps in words visible in a waveform of a speech signal

e.g perceived gaps are difficult to identify within a waveform or spectrogram as there are no clear boundaries in a continuous stream yet we are able to break it down into words, syllables and phonetic features

thus there is no direct correspondence between auditory energy and perception (no correlation of amplitude and gaps)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

how can we identify vowels in a spectrogram

A

the relative spacing of steady state portions of a spectrogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how do we perceive speech

A

the continuous speech signal is perceived as segments
25-50 phonetic segments per second (4-5 syllables)
speech sounds should be identifiable by different combinations of formant frequencies and the shapes of these transitions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what information is there in a spectrogram

A

the frequencies of sounds produced over time
formants: bands of acoustic energy
steady state: vowel identity
transitions: consonant identity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is meant by ‘steady state’

A

the steady state ratio of the first and second formant distinguishes vowels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is meant by formant transitions

A

correspond to production of constant

second formant anticipates the frequency of the vowel that will be produced next

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what is coarticulation

A

the features that characterise speech sounds occur non-linearly
we produce them in parallel rather than in sequence

anticipating what next production will be is reflected in production of a sound

overlap of movements in phoneme production

makes speech production fast and efficient, but means spectrogram will be different depending on the context of the following vowel etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what are the problems for speech perception

A

non-linearity
lack of invariance
speaker variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is meant by non-linearity

A

acoustic features spread themselves across each other and are not procured as a linear sequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is meant by the lack of invariance

A

speech sounds have more than one waveform representation
the differences between phonemes are very small
there are big differences between different instances of the same phoneme
coarticulation exists whereby the articulation of one phoneme influences that of another

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

how are the problems of speech perception solved

A

by perceiving a range of sounds categorically

17
Q

what is categorical perception

A

despite the lack of invariance in the acoustic signal we still perceive specific categories that correspond to phonemes (the smallest units of sound which change the meanings of words)

speech perception is categorical as the variation within phonemes is contained in a single category

18
Q

given categories can’t vary infinitely, what tasks have psychologists designed to better understand the nature of categorical perception?

A

Voice onset time e.g ga released same time as vocal chords start vibrating whereas ka has a much later voice onset time despite benign produced in the same place
Wig vs wick
Artificial speech syllables

19
Q

how is voice onset time a continuum

A

changing the time of consonant release alters the percept

this can happen for both the sound at the start or end e.g ba - ka and wig-wick

20
Q

what does a place of articulation continuum show

A

difference between ba and da is place of articulation
this is represented in the slope of the formant (usually second)
changing this incrementally produces a continuum

21
Q

what happens in categorical perception of speech

A

hallmarks: identification and discrimination tasks

identification tasks e.g on ba-da continuum produce steep sigmoidal slope characteristic of categorical perception

discrimination task: boundary established with identification task and take stimulus pairs (either same or different side of boundary: cross or within category pairs) - same acoustic unit range each time, asked if they are the same or different - looking for a peak in the discrimination function to demonstrate categorical perception (note this isn’t seen for non-speech sounds, we’re better at discriminating notes on a piano etc)

speech is special