Unit 4 Flashcards

Question

What if we use paired clicks?

Answer 1

- Two pairs of clicks - In first pair, the first click is louder/weaker than the second one - The order is reversed in the second pair (or equal) - Two pairs are identical in frequency spectrum (different in amplitude) - Listeners differentiate them by detecting temporal order - Resolution is indicated by the minimal interval upon which the order can be told correctly

Answer 2

Temporal resolution

Answer 3

- The Weber’s fraction should be constant (or Weber’s law is well followed). - No difference related to bandwidth of signals

Answer 4

- Ask the subject to tell which signal is longer or shorter (only changing with duration, same signal) - The duration threshold is presented against the baseline duration (linear scale) - From this data we can predict that WL is largely followed in duration discrimination test

Answer 5

Ability to identify a (silent) gap between two sounds or an interruption of a sound by varied formats

Answer 6

Gap can be a silent period or one in which sound intensity is largely reduced

Answer 7

- Gap threshold (_t) is defined as minimal period of gap that can be identified. - Below that, the subject tells the sound to be continuous

Answer 8

Gap detection can be measured in behavior test, or in objective test like evoked potential

Answer 9

- Before the gap you have a pre-gap marker (first signal) and after the gap there is a post-gap marker (second signal) - The signal can be a tone burst or a noise burst (noise burst is better)

Answer 10

- Can use broadband noise and narrow band signals - Contamination of Frequency cues (if narrow band signal is used) - An issue when using narrow band signal and sudden on/off

Answer 11

- Masking with notch noise: can also splatter when turning masker on and off - Bandpass filter to get rig of contamination: cannot always eliminate contamination - Ramping: making it difficult to define gap duration - causes the gap to be unclear

Answer 12

Sometimes the notch noise can have frequency splattering

Answer 13

When you use tone burst, you need to think about the frequency splattering at onset and offset (provides frequency cues, not temporal cues), so you are unable to tell the temporal resolution as it is contaminated

Answer 14

- Pre- and post-gap marker can be different in terms of amplitude, duration and frequency. - If different in frequency, then tests cross-channel - If both are same in frequency, within channel test.

Answer 15

- The sensation change is due to the onset and offset of signal - When the signal turns on, our sensation takes time - When the signal turns off, our sensation of sound gradually goes down (it takes time) - If the gap is short enough, the sensation will not start from zero, but from the declined curve (whatever has not disappeared from the pre gap off) - Then, the sensation takes time to reach plateau

Answer 16

- Delta s depends upon the gap - If delta s is smaller, the off-set marker (in the second tone), will continue to to get higher up on the first tone - Shorter delta t, delta s is reduced (and eventually become 0) and the second tone will not be sensed (sensed as only one tone)

Answer 17

- The impact of sound level is seen near hearing threshold - Not changed by sound level well above threshold

Answer 18

- Gap marked with equal sounds - Markers: broadband signals and narrow signals - broader the masker BW, the better the gap detection threshold (smaller)

Answer 19

Temporal resolution

Answer 20

Gap marker BW

Answer 21

- High-fre HL: deteriorated gap threshold when using broadband markers - Attributed to the reduced audibility - Evidence: When high-pass masking is used in normal hearing subjects, similar changes were seen - High frequency channels have better temporal resolution

Answer 22

Hearing loss (especially SNHL)

Answer 23

- High pass filter just masks high frequency region, creating a fake hearing loss (if you compare artificial hearing loss and real hearing loss, there is no difference) - When you have HF HL, you naturally reduce the bandwidth of gap marker (if you use a broadband signal, the HFs will be useless due to HL) - This is why individuals with HF HL have poor temporal resolution

Answer 24

(a): within channel (same frequency band) (b) /(c) between channels (different) (c): diff in onset (must be “between channels”)

Answer 25

- % modulation: - Average amplitude/p-p% (peak to peak); - or p(peak)-t(trough)/average% dB: 20log(%), e.g., 10%~ 20log(0.1) = -20 dB

Answer 26

Minimal depth of modulation (that subject can detect)—detection threshold with modulation frequency—Modulation transfer function (MTF)

Answer 27

1. modulated vs. unmodulated 2. modulation with different MF 3. modulation with different depth

Answer 28

The MTF addresses the ability to detect the presence of amplitude modulation in a sound

Answer 29

- The MTF is typically low-pass function: larger modulation depth threshold at larger modulation frequency (MF). - In subjects with temporal processing deficits, larger modulation depth threshold is seen at high MF.

Answer 30

- Phase locking or synchronization - Envelope coding

Answer 31

- PSTH - ISIH - PRH - These require repeated stimulation

Answer 32

Image the real neuronal envelope coding by Volley principle (this is what really happens in your brain)

Answer 33

- Inter spike interval histogram - If phase locking is perfect, each individual auditory nerve will produce 1 spike per period - When the frequency of signal is high, the auditory nerve cannot follow that (can’t see a clear interval or period) - By chance, we are likely to see the interval the same as 1 period (1 spike per period) If the AN skips one period, you will see interval of 2 or 3 period - This is how we show periodicity of response

Answer 34

- Increasing sound level causes better sound locking (the spikes are more likely to occur at a certain phase) - Distribution becomes narrower and narrower with better phase locking

Answer 35

- Auditory nerves encode temporal information of sound by phase locking - Phase-locking is established by integration of responses from many neurons.

Answer 36

In reality, we don’t need to repeat stimuli many times to detect sound (volley principle)

Answer 37

- Modulation transfer function typically shows low pass function - However, by a single neuron, this is different (bandpass transfer function)

Answer 38

- Results show a very sharp MTF for each individual neurons - The peak points are the best modulation frequency - For each neuron, there is a typical best modulation frequency (can detect temporal modulation better at a specific frequency)

Answer 39

Low pass, band pass

Answer 40

- Concentric distribution of neurons with the same BMF - The neurons on the surface of the cone show the same BMF The iso-BMF surface is in a cone shape taping to dorsal side (low frequency) - Another example of place code in auditory processing

Answer 41

Flat plane

Answer 42

Cone shape

Answer 43

- 10 dB/decode = 3 dB/octave - Masking with white noise is more effective on the high frequency side due to white noise density (it will mask more)

Answer 44

- For a higher fre. signal, we need a larger SNR to hear. - Higher masking effect. - Remember the spectrum of the noise is flat.

Answer 45

- White noise has equal density across frequency - Density = power in unit frequency range. Total power in a frequency range = density * delta f - The energy/power effective for masking exists in critical band: P = density*CB

Answer 46

- For a particular signal, only the energy in a certain band around the frequency of this signal impacts the hearing of this signal. This band is called as critical band - It can also be defined as the frequency spectrum that one neuron will respond to. - Therefore, in a broad band masker, only the energy in CB will produce masking.

Answer 47

- Noise energy beyond CB is not useful - Only energy inside CB will produce masking

Answer 48

- Only energy within CB is effective - CB increases with CF in linear scale - But keep constant in ratio scale: 20% or 1/3 octave around CF - Therefore, the masker for pure tone should be narrowband noise of 1/3 octave.

Answer 49

Narrowband masker (to reduce the total level of masking so it is more acceptable by client)

Answer 50

white noise, CF

Answer 51

- Keep the total intensity of the masker the same - Increase bandwidth of the masker from zero - Within CB, masked threshold should be? Maintained - When beyond CB, masked threshold will be? Decreased (because some energy gets lost, and threshold goes down) - The turning point tells CB.

Answer 52

Threshold won't change

Answer 53

- Beyond CB, the masked threshold will decrease, because some masker energy got into other channel so that is not effective. - Outside the CB, energy from the masker is useless (only energy inside the CB is useful)

Answer 54

- Within CB: masker level should not be changed. - Beyond CB: masker level should be increased

Answer 55

The threshold will not change, energy is in the CB

Answer 56

Beyond CB, the power thins out so there is not enough energy to evoke a response within CB. Need to boost up the total sound level to increase threshold.

Answer 57

- Signal band: band around signal - Flanking band: band far apart from the signal band - Flanking bands does not change masked threshold because they are far away from CB

Answer 58

- Comodulated: see release or decrease in masked threshold - Uncomodulated: no change in masked threshold

Answer 59

Does not change the CB

Answer 60

Masking in the signal band

Answer 61

- Masking effect depends on the time relationship for signal in masker - Larger masking when the signal is close to the onset of masker - Up to 10-15 dB - Disappeared when delay (onset of masker-onset of signal)> 200 ms

Answer 62

Reduces and plateaus (>200ms)

Answer 63

The masker can be presented after signal (backward masking) or before (forward masking), or combined

Answer 64

- Monotic = signal and masker go to the same ear - Dichotic = signal goes to one ear, masker goes to other ear (no interaction between masker and signal in cochlea) - Masking occurs in the brain - Diotic = real life

Answer 65

Forward masking uses the line busy hypothesis (the vibration produced by the masker makes the cochlear partially occupied (this occupance declines with time after offset); this is why masking effect goes down with time (this doesn’t happen in backward masking because the signal occurs earlier`

Answer 66

Forward masking = largest masking occurs closer to the offset of the masker - Masking effect reduces with time as the cochlea occupation goes down

Answer 67

Backward masking = largest masking occurs closer to the onset of the masker

Answer 68

- Forward masking is relatively clear: - Overlap in BM vibration, - Neural adaptation, - Central masking (indicated by cochlear implant) - Backward masking: not sure if there is a central role

Answer 69

- Masker to contralateral ear - Similarities between central and peripheral masking: - Frequency relationship - The masking effect and time-relationship between masker and signal - Difference: much smaller threshold shift in central masking

Answer 70

Masking effect

Answer 71

Central masking causes a smaller threshold shift (the masking effect is not as larger for central masking as it is for peripheral masking)

Answer 72

Onset or offset

Answer 73

- Interaction between masker and signal at higher level of auditory pathway - No overlap between masker and signal - Opposite to peripheral masking (in cochlea), which is also called energetic masking - Depends upon the overlap of the masker and the signal - Targeted tone (or speech) in the presence of multi-frequency masker (similarity and uncertainty impacts performance) - Test masked threshold in 2IFC - Subject chooses which one contains a signal - There is a central component because it relies on context - Frequencies of the masker randomized - CB around targeted signal is “protected”—to avoid energetic masking

Answer 74

Informational masking, energetic masking

Answer 75

- The effect due to randomization - Larger the randomization, larger the effect - Difference becomes smaller with increasing number of components in the masker

Answer 76

Less components (harder to hear the signal)

Answer 77

More components (easier to hear the signal)

Answer 78

Masker (harder to hear the tone)

Answer 79

- To understand how masking changes our hearing. - To use masking as research tools. - Notice the gaps between what we have discussed and what we need for the signal detection under masking. - Further learning is required.

Answer 80

- Spatial filtering: Detect signals by differentiating the source from noise—depending on binaural process. - Spectral filtering/frequency selectivity: selectively filter out noise—but won’t work if the spectrum of noise is largely overlapped with that of signals. - Temporal filtering: distinguish signals based upon the time difference, e.g., signals in the trough of noise. - Cognitive processing: Detect signals by using experiences (familiarity to the signals)—depending on top-down process. This is shown in part of “attentional filtering”. - At a party you recognize familiar voices

Answer 81

- The function of binaural processing—related to temporal processing, inhibition, efferent etc; to spatial filtering. - The role of inhibition to spectral filtering and other process. - The role and the mechanisms of high temporal resolution in the auditory system. - The role of low-SR ANFs and efferent control of them on noise resistance in hearing. - The interaction between ascending and descending pathways. - The role of cochlear efferent control—the masking release effect. - Cognition and selective attention - This changes with HL and age

Answer 82

Pulsed signals and frequency modulation (FM)

Answer 83

- Use (slow) ramp - Masking (such as notch noise)

Answer 84

pitch fusion, continuous pedestal

Answer 85

Baseline frequency

Answer 86

Slowly increase volume to turn on and slowly decrease volume to turn off

Answer 87

Discrimination limen is the smallest change in frequency that you can detect

Answer 88

When your change in frequency is below 500 Hz, the DL is constant (easier to detect a difference in frequency at low frequency, below 500 Hz)

Answer 89

Above 500 Hz, DL will increase with frequency (larger difference to notice a difference)

Answer 90

Low frequency

Answer 91

- Slight decrease w/ intensity at high frequnency, much larger at low frequency - Weber’s law: correct above 500 Hz - WF = 0.7% = 0.007 - Below 500 Hz, delta F doesn’t change, but above 500 Hz, it does (pure tone)

Answer 92

Baseline frequency

Answer 93

Increasing frequency (its getting worse)

Answer 94

- Pulsed tone, or band noise with different cutoff - deltaf and deltaf/f are smaller than FM by factor of 3 - WF: 0.2% (versus 0.7% for FM) - Level dependent at low SL

Answer 95

- 3x larger - This difference is mainly applied to the low-middle frequency up to 2-3K, above that FM is better than pulsed signal

Answer 96

Low frequency, high frequency

Answer 97

- When the signal level is way above the threshold, the impact of level becomes smaller - The smallest delta F is around 1 Hz - Sound level way above threshold, we can discriminate the frequency as small as 1 Hz

Answer 98

Low SL, low frequency

Answer 99

Low frequency

Answer 100

At LF, increasing the sound level improves discrimination performance, but there is no more improvement well above threshold. This improvement does not have as large of an effect for HF.

Answer 101

- In intensity discrimination, the interval between the two pulses impacts the performance: larger the interval, larger the discrimination threshold—decay of short memory - In frequency discrimination, the increase of the interval improves the performance in a certain range. - Likely due to the reduction of pitch fusion with increasing interval. - For longer interval, performance will go down.

Answer 102

One pitch (this makes frequency discrimination difficult)

Answer 103

- Poorer performance using FM in frequency < 2000 Hz - Better performance using FM in frequency > 2000 Hz.

Answer 104

3 (in the middle frequency region)

Answer 105

DLF: difference limens for frequency, two tone pulses presented in sequence (two pairs), subjects indicate in which of the two successive pulses, the second pulse was higher in frequency.

Answer 106

DLC: difference limens change: subjects indicates which pair differed in frequency in two successive pairs of two tone pulses (2IFC).

Answer 107

Pulsed signals

Answer 108

Increasing duration

Answer 109

Temporal summation

Answer 110

BW is in CB

Answer 111

Not change

Answer 112

- Signal will leak to other bands and the signal will be wasted (it is below threshold and useless); therefore, the energy inside the CB becomes smaller and the threshold is lower (sound will not be heard) - Bandwidth beyond the CB, we need to increase the sound level, therefore increasing threshold - This is threshold testing

Answer 113

Only the energy of a masker in CB around probe tone makes contribution to masking

Answer 114

Effective masking

Answer 115

Within CB, sense of loudness is the same

Answer 116

Beyond CB, sense of loudness becomes louder (stimulating more of the cochlea)

Answer 117

- When the signal goes beyond CB, more auditory channels are activated and sound is louder (AR will be stronger) and threshold goes down. - When we test absolute threshold, the energy that goes to other bands is wasted and the threshold goes up (because AR is way above threshold)

Answer 118

- Hearing threshold goes up with BW - The AR threshold goes down with BW

Answer 119

20%CF, 1/3

Answer 120

Below 500 Hz

Answer 121

0-100Hz, 100-200 Hz

Answer 122

If you have 1 IHC in each frequency band, you are okay (as long as the OHCs are working) therefore, the number of IHCs are not huge (so having 6 IHCs in each step is redundant)

Answer 123

1. Place code starting from ANFs, auditory channels with frequency selectivity, inherited in CAS 2. OHCs (active amplification) increases the frequency selectivity of ANFs. 3. Temporal coding enhances frequency coding in cochlea. 4. Efferent control to cochlea enhance frequency selectivity. 5. Central inhibition enhances frequency selectivity by “masking” the response at edges, enhancing contrast. - Without central inhibition, there will be very broad tuning

Answer 124

Identical pitch (pitch fusion)

Answer 125

- Resolved = two signals that can be differentiated based upon their activation of auditory channels (they can produce distinguishable vibration upon the cochlea) - If the two signals are in two different CBs (widely different from each other) this is resolved; two different representations in cochlea - Unresolved = no corresponding vibration in cochlea, or the vibration produced by two signal cannot be differentiated (to close to each other and fall into the same CB)

Answer 126

- Harmonic: frequencies can be divided by the same integer number - Non-harmonic: components do not have that relationship, they are different in terms of generating pitch

Answer 127

- Entities: overall impression about the whole sound (music at a concert); we appreciate the overall pitch produced by the instruments - Partials: pitch represented by individual instruments (refers to the different frequency components of that instrument

Answer 128

- Integration: normally we get this (easy for everyone) - Segregation: need good training for this (a conductor tells when one person makes a mistake)

Answer 129

- Analytic pitch: can hear the different parts (segregation) - Synthetic pitch: focus on the whole sound (integration)

Answer 130

- Pitch: relating to the frequency component - Timbre: a concept that is not clearly defined, it is the quality of sound (takes into account temporal changes) - Takes in condsideration the frequency components and the temporal changes of sound

Answer 131

- Mel (stevens): 1000 mels by 1000Hz tone at 40 dB SL - Double or half: 2000 mels or 500 mels - No linear relationship with frequency

Answer 132

- Starting at 1000Hz, to increase the mel to 2000Hz, you will need a frequency change of roughly 3 times to feel the doubling of mels - Starting at slightly above 1000Hz (1200 Hz), you will need to increase the frequency more than 6 times to feel the doubling of mels - This shows the relationship is not linear

Answer 133

- Mid-frequency, no effect - High-frequency, pitch increase with intensity - Low-frequency, pitch decrease with intensity

Answer 134

- When we hear sound way above threshold, we have clear pitch perception - When sound is close to threshold, pitch is not clear

Answer 135

Intensity level (this is frequency dependent)

Answer 136

High frequency (7000Hz) – in order to maintain the same pitch, we have to decrease the frequency (or else pitch will become louder)

Answer 137

Low frequency, we have to increase frequency to maintain pitch (or else we will feel pitch decrease)

Answer 138

- Very short duration (<3 ms): tone sounds like click - > 3-4 ms or 6 cycles is required to have pitch sensation - >10 ms for f>1000, clear tonal sensation; improved over duration up to 250ms - > 250 ms, stable pitch sensation

Answer 139

3ms, 6 cycles

Answer 140

- Longer the rise/fall time, less frequency splattering - But poorer in transient - So, better tonal sensation with slow ramping, longer duration

Answer 141

Periodicity

Answer 142

Place code

Answer 143

- Missing fundamental: interaction across harmonics causing temporal fluctuation (periodically) - Residual pitch is not processed in the cochlea (not by place code) - Low frequency region of cochlea is not required in producing pitch - The missing fundamental relies upon the CF

Answer 144

Modulated (by low frequency signal)

Answer 145

CFs as carrier frequency

Answer 146

- Missing fundamental is well explained by periodicity theory. 60-Hz shift breaks the rule of common denominator. - Integrated pitch: does not require vibration in 200 Hz region - Shifting the frequency of each component by 60 Hz, we maintain the interval as 200 Hz, cannot be divided by common denominator, but pitch shifts to the higher frequency slightly (due to shifting of periodicity) - Because of the up shifting of 60Hz, I1 becomes shorter and pitch is increased (periodicity contributes to pitch perception)

Answer 147

Shortening

Answer 148

Beats (modulation frequency is the difference between the two tones); this is a way to produce amplitude modulation

Answer 149

Two tones in phase and out of phase periodically

Answer 150

- When diff is small, beat is produced (smooth modulation: 400+410 Hz) - When diff is widened (but CB): separated pitches (400 +600 Hz).

Answer 151

- Periodicity must be seen in cochlear, not only in the stimulation (in order to cause phase locking in ANF) - Separation of harmonics must not be larger than the width of critical band (or should be unresolved) - So that the two harmonics can interactive with each other, causing periodicity in the CB (unresolved). - If the separation is larger than CB, you will hear separated pitchs.

Answer 152

- Patterson: phase change leads temporal pattern change, but not pitch - Hall and Peters: Missing fundamental can be heard by sequential presentation of three harmonics (each 40 ms, interval 10 ms) in noise, but pitches of each harmonics in quiet - Goldstein: pitch can be established by presenting different harmonics dichotically. - Non-harmonic sounds can still produce pitch (such as dual-tone multi-frequency signal for telephone pads)

Answer 153

- Onset Time: two sets of partials have different onset, will be segregated as different partials - Harmonic Partials—Principle of dominant component 3rd, 4th and 5th harmonic component are dominant (if > 10 dB SL) Not fixed to harmonic number but to frequency range in which the sound is well resolved in cochlea

Answer 154

3, 4, and 5

Answer 155

Middle frequencies

Answer 156

- Modulation of one harmonic component break down entity - Fused pitch when tones to each ear commonly modulated - Coherent modulation of all harmonic partials, harmonic remain - Number of Harmonics: adding a new component increases the sense of entity

Answer 157

Hear the other pitches

Answer 158

Integrated pitch

Answer 159

More clear

Answer 160

Better sensation of pitch (if not smooth pitch perception is poor)

Answer 161

- Tone Duration - Sound Pressure Level - Relative Phases: least important - Spatial Origin (binaural hearing) - Context Effects--Stream Segregation (see in lecture of binaural hearing)

Answer 162

- Musical training - People with good training in music have a stronger ability to identify partials - Selective attention

Answer 163

- Spectral theory (two stages) - 1 frequency analysis - 2 pattern recognition (spectrum) - Temporal theory - Neither of those theories can account for all pitch perception

Answer 164

-Frequency analysis in the cochlea based upon place code -Pattern recognition based upon spectrum (all ranges of hearing)

Answer 165

Phase locking, temporal relationship

Answer 166

Pattern perception model

Answer 167

- Timbre is not clear, it is roughly emphasized by 2 factors: spectrum and dynamic characteristics - Music is the best example of timbre (different instruments)

Answer 168

- Spectral factor—steady-state feature or tone color - Dynamic characteristics (separating percussive from blown instrument)---the role of signal envelope (temporal pattern).

Answer 169

- Increase loudness - Improvement in differential limen - Better perception in noise: spatial filtering - Binaural fusion and beats

Answer 170

Binaural hearing is better than unilateral in discrimination, esp at low sensation levels (SL)

Answer 171

The binaural benefit can’t be attribute to binaural summation on loudness because it would require more than 30 dB diff in loudness to produce such difference in discrimination.

Answer 172

Binaural is better

Answer 173

- Seen in normal hearing subjects - Reduced in subject with aging and SNHL - Big benefit w/ binaural hearing aids and cochlear implants

Answer 174

- Separates target sound from noise (spatial filtering) - Improves discrimination - Improves stream tracking of target sound - Unmasking (via efferent control and others) - Reduced in aging and SNHL

Answer 175

- Binaural cues for acoustic image in space: Binaural differences in intensity, spectrum, and timing - Fused image: we do not feel that two ears work separately, but... - Dichotical signals can be different or similar, but should be connected in certain ways

Answer 176

- Binaural fusion from two ears receiving similar signals: commonalities - Example of commonality: - Co-modulation of harmonics presented dichotically (different components go to each ear). - Different speech components to each ear: complimentary for speech. - Residual pitch harmonics are presented dichotically.

Answer 177

- BB occurs in CAS, while MB in cochlea. - BB occurs in lower frequency range than MB. - BB can occur at larger level difference between the two tones; one tone can below audible level. - BB can occur at larger frequency difference between the two tones. - For MB, the two tones must be closer in terms of level

Answer 178

- Grouping units together - More than simple addition - Whole is larger than the simple sum of all parts

Answer 179

Example: tracking a target talker in a cocktail party—multiple cues may be used. - Spectrum profile of the talker’s speech - Temporal stream of the speech - Spatial separation/identification - Many more (such as familiarity, dynamic cues etc) - Bottom-up and top-down process involved.

Answer 180

- Two tones are played in sequency (high low high low high low) - Depending upon speed, when it is slow (one stream), when it is high (sense stream separately)

Answer 181

- When the F segregation is larger, you hear two streams at higher speed. However, you always hear one stream when the F segregation is small. - This shows the impact of frequency impact on the streams - Separation is high - Separation is small (higher speed merges it into one)

Answer 182

- Proximity (similarity): e.g.: similar signals for easy dichotic fusion - Common fate (e.g., on and off together - if the onset and offset are the same, we are more likely to attribute them into one stream - Good continuation - Primitive process - bottom up (based upon physical features of the sound)

Answer 183

Schema-based

Answer 184

- A: simple masking: on band of masker upon the signal - B: co-modulation masking release (CMR): reduced masking effect when the noise in the signal band and side bands are co-modulated. - C: CMR disappears due to the mismatched onset of noise between the signal band and the side bands.

Answer 185

FM (we won't feel that it's a vowel until it is frequency modulated)

Answer 186

- Continuation when blocking by a fence, not a blank gap. Top-down process is involved, especially in the shape perception in the most right graph. - If we replace the fence of block by a blank space, the continuity is no longer good - When the silent gap is filled with noise, you feel the tone continue without interruption

Answer 187

- Bottom-up and top-down in combination - We should be able to understand speech better when it is interrupted by noise rather than silent gap

Answer 188

- Horizontal plane/azimuth (the plane that we live) - Azimuth + vertical plane forms the location of any spot on 3D space

Answer 189

- Localization error: different between Apparent location and Physical location - Spatial discrimination: measured as minimal audible angle

Answer 190

- Open field: stereophony (sound comes to both ear from a speaker) leads to extracranial localization (we feel the sound source outside our head) - Close field: using headphones leads to intracranial lateralization (we feel this due to the loss of external resonance) - Reasons: loss of external ear resonances with earphone hearing

Answer 191

1. What are the cues for sound localization? 2. How are they used? Approaches (to answer the questions): behavior studies and neurological mechanisms

Answer 192

ITD/IPD - Determined by size of head, larger the head - Humans: 22-23 cm 660 micro seconds (90 azimuth) - Time difference sensitivity: 10 microsec difference across frequency - Time converts to angle and phase IID/ILD - From shadow effect of head - Larger for high frequency sound

Answer 193

- ITD = interaural time difference - IPD = interaural phase difference - Because of time difference, sound reaches both ears at different times - Has to do with size of head and location - Time difference can be converted into phase difference

Answer 194

- IID = interaural intensity difference - ILD = interaural level difference - Our head is an obstacle that blocks the flow of sound

Answer 195

- Middle line: 0 degree (no time/phase difference no matter the signal) - The difference becomes largest at 90 degrees (lateralized to your head) - Further increase past 90 degrees is less of a difference and goes back to 0 at 180 degrees

Answer 196

- Shadow effect is remarkable (largest) at high frequency, indicated by shorter wavelength - Near ear: no shadow effect - Far ear: shadow effect (sound attenuated by head)

Answer 197

Stronger signal

Answer 198

maximal time difference

Answer 199

Time difference

Answer 200

- To make the ½ period >MTD, Frequency must be smaller than a value. - In order to generate useful phase difference, signal frequency must be low - The time difference between both ears does not impact frequency, IPD does

Answer 201

- To make period/2 >0.7 ms, 1000/1.4 = 714 Hz - Therefore, max Fre for IPD < 180o is ~700 Hz

Answer 202

higher the frequency

Answer 203

- Temporal coding is better for low f - No ILD available at low f

Answer 204

- Identity circle - Must < 180 degree, change with frequency - At 90 azimuth, ITD 650 us = _ cycle of 770 Hz - At 45 azimuth, ITD 350 us = _ cycle of 1400 Hz - Close to 0 azimuth, ITD --> 0, frequency limit increase, but still low f signals make strong IPD cues

Answer 205

same frequency

Answer 206

- Always the best at 0 azimuth - No matter what types of cues are used - Largest ITD/IPD/ILD at 90 degree - But larger ITD/IPD/ILD dose not mean strong stimulation - Neurons for localization are so organized that they work best at 0 azimuth

Answer 207

- High frequency, rely on ILD - Low frequency, rely on ITD, predictable from sphere model - This is dependent on a study using pure tone (real life is complex tone)

Answer 208

- Get poorer performance in the middle frequency (which is unusual, because this is where performance is typically best) - But according to duplex theory, there is poor performance in the middle frequency

Answer 209

- We do not rely upon pure tone for localization - High frequency sound can have time cues, e.g., when modulated by low frequency - Break down front-back confusion and identical circle by pinnae cues

Answer 210

- Why use earphones: to change phase, level independently in each ear (e.g., one ear receives stronger sound but later phase than the other) - Halverson: 500 Hz tone, 0-180o change in phase converted to 0-90o azimuth - Phase change leads position image change when frequency < 1400 Hz

Answer 211

- Relative effectiveness of ITD and ILD can be evaluated - Localization versus lateralization - Sound trapped in head - Due to the loss of pinna effect - ITD: only works at onset and offset - IPD: works for continuous signals - ITD more important than IPD: Earphone test provide answers.

Answer 212

- In virtual hearing (via earphone): early onset in the near ear leads to sound coming from the nearer ear (the effect of onset discrepancy), whereas early offset in the near ear leads to sounds coming from the farther ear (the effect of offset discrepancy) - Overall, the onset ITD is dominant: in real hearing, we hearing sound based upon the onset discrepancy. - Also there are studies comparing the effect of onset ITD and ILD: in virtual hearing, near ear can have early onset but weak sound.

Answer 213

- From simple sound to complex signals - Use of headphones: lost spectrum cues - Digital technology can put back the spectrum cues

Answer 214

- Much more accurate than sound localization - Largest around 1-3 kHz (middle frequency) - Smallest at 0o azimuth Yost: - IPD shift required for image shift remains constant when f<900 Hz - IPD shift required for image shift increases with original IPD - Upper freq limit: 1200 Hz - Concurrent MAA (CMAA): two signals at same time

Answer 215

- Remains constant for up to 900 Hz - Proportional to original (or baseline) phase difference - At 500 Hz, a just detectable phase angle is 2 degrees or 11 microsec

Answer 216

- Just noticeable phase diff at 500Hz: 2 degrees or 11 microsec - At 1200 Hz: 12 degrees or 27 microsec

Answer 217

just noticeable phase difference at 100 Hz: 3 degrees or 5.83 microsec

Answer 218

localization error

Answer 219

At any point on the cone surface, binaural cues are the same.

Answer 220

- Sources - From ear canal resonance - From pinna effect - Head-related transfer function (HRTF) - Spectrum (HRTF) change with direction - Timbre changes - Roles: localization in vertical plane & avoiding error in azimuth

Answer 221

- HRTF: the difference of sound spectrum between what is measure in open space and that in real ear canal near eardrum. - HRTF is directionally related with sound source

Answer 222

- Dynamic versus stationary (referred to location) - Stationary sti can be dynamic when head is moving - Break front-back confusion by moving head - Head move helps monaural localization - Small effects reported

Answer 223

sound, spectrum cues

Answer 224

- Sound level - Ratio of direct-to-reverberant energy - Spectral shape - Binaural cues (ILD) - Familiarity or experience

Answer 225

- Sound that is heard first takes dominant role in localization - Classical click experiment - Fusion occurs when click interval is less than 5 ms. - Summing localization- fused when click interclick interval <1ms - Localization dominance, when click interval 2-5 ms, pair interval between 10-100 ms. - Discrimination suppression

Answer 226

Onset, offset

Answer 227

When T1, T2 and T3 are small enough (T1, T2 < 5 ms, 10ms

Answer 228

- The perceived location of the fused image is affected by the size of the delay between the two signals. - Summing localization occurs for delays shorter than 1 ms, in which case the perceived location of the fused image is affected by both the leading and lagging clicks - Localization dominance occurs when the location of the fused image is determined by the leading signal. This occurs when the delay between the first and second clicks is between about 1 to 5 ms.

Answer 229

a) monotically (signal and masker to same ear) = sound not audible (0 dB) b) diotically (signal and masker in both ears) = sound not audible (0 dB) c) similar to a), but noise is added (signal monotic, noise diotic) = previously masked signal is audible (9 dB) d) similar to b), but reverse the phase of noise = signal audible (13 dB) e) similar to b), but reverse the phase of signal = signal audible (15 dB)

Answer 230

Spectrum level

Answer 231

- Low F neurons in MSO and IC: EE type dominant - High F neurons in LSO: IE type dominant; in IC: EI type dominant

Answer 232

Time response to both ears (they are mimicking the sound wave)

Answer 233

Time difference

Answer 234

Characteristic delay

Answer 235

- Interaual delay is countered by neural delay to make coincident - Contra ear as near ear, it takes shorter time to get to that ear (longer for ipsi ear) - Longer delay in contra stimulation internally is compensated by shorter time delay externally - Coincident = certain neurons are excited at the same time because of an internal delay (mirrors external delay); put them together the stimuli to both ears arrive at the same time

Answer 236

1. modulated vs. unmodulated 2. modulation with different MF 3. modulation with different depth

Unit 4 Flashcards

(320 cards)