Lecture 6 - Localisation and auditory scene analysis Flashcards
How do we localise sound?
- Binaural cues
- Monaural cues
What are some aspects of binaural cues?
-Require comparison of signals in left and right ears and are vital for signalling location of a sound in azimuth (left-right plane).
-Interaural time differences (ITDs).
-Interaural level differences (ILDs).
What are some aspects of monaural cues?
-Work with one ear and can help localise the elevation (up-down plane) and distance of a sound.
-Filter properties of the pinna (outer ear).
-Intensity and reverberation.
What are interaural time differences (ITDs)?
-Relative time at which a sound arrives at the two ears depends on its location in azimuth (left-right).
-If the sound source is straight ahead, the distance to each ear is the same and there is no difference in time.
-When the sound source is positioned to one side, the sound will reach the nearer ear first.
-The range of ITDs depends on speed of sound (typically 330m/s through air) and distance between the two ears (larger heads create a bigger range).
-Maximum ITD in humans is around 600μs (0.6ms).
-Requires precise signalling of timing.
-Most useful for low frequency or abrupt-onset sounds.
What are interaural level differences (ILDs)?
-Relative sound pressure level reaching the two ears also depends on the location of the source in azimuth.
-Reduction in sound level occurs for the far ear due to the acoustic shadow created by the head.
-This reduction occurs for high-frequency sounds but not low frequency sounds.
What is the physiology of binaural processing?
-Processing of ITDs and ILDs starts within the brainstem in the superior olivary complex (superior olive).
-Binaural localisation cues processed by different types of neurons, located in different parts of the superior olive.
-The lateral superior olive (LSO) contains neurons that are sensitive to ILDs
-The medial superior olive (MSO) contains neurons that are sensitive to ITDs.
What are some strengths of binaural cues?
-ITDs and ILDs provide complementary information about azimuth location.
-ITDs work particularly well for low-frequency sounds.
-ILDs provide information about high-frequency sounds.
What are some weaknesses of binaural cues?
- they provide ambiguous information about elevation and tell us nothing about distance.
-Cone of confusion – set of points from which a sound source will produce identical ITDs and ILDs.
Monaural localisation cues - elevation: What are the filter properties of the pinnae?
-When sound reflects off the nooks and crannies of your external ear, the relative intensity of different frequencies sound wave changes.
-This changes with sound source elevation (and azimuth*).
-Individuals have different ear shapes and will filter the frequency content of complex sounds in slightly different way.
-Artificially altering ear shape with plastic moulds impairs the ability to localise sound elevation.
Monaural localisation cues - distance: what is relative intensity?
Sound intensity decreases with distance, so closer object will tend to have greater amplitudes than farther ones.
Monaural localisation cues - distance: what is reverberation?
-The way in which sound reflects off object also provides a cue to distance.
-Multiple reflections combine to produce a persistence of sound called reverberation.
-The distance of a source alters the relative intensity and timing of direct and reverberant sounds.
What is the precedence effect?
-Similar sounds arriving in quick succession from different locations are localised according to the direction of the first sound.
-Provided the delay is short (<10 -20ms), only a single sound is perceived.
What is auditory scene analysis?
-Natural environments contain multiple sound sources, so the auditory system needs to make sense of the mixture of component sounds that makes up the auditory scene.
-It must segregate the components of the sound coming from different sound sources.
-It needs to group the components of the sound that come from the same sound source.
What are some strategies for auditory grouping/segregation?
-Follow many analogous principles e.g common fate, proximity, similarity, continuity.
-We also need to consider spectral grouping – combining different frequency sounds components that occur at the same time.
-Also need to consider sequential grouping – combining sequences of sounds over time.
What does spectral grouping consist of?
-Harmonicity
-Common frequency change