Wk4b - CI Speech Processing Strategies Flashcards

1
Q

What is a speech processing strategy?

A

An algorithm within the CI speech processor that converts sounds picked up by the microphone, into electrical signals sent to the implant electrodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Speech processing strategies turn a broadband acoustic signal into 12-22 _____ ______ pulse trains

A

Amplitude modulated - one pulse train for every CI channel/electrode
- Med-EL has the fewest (12 electrodes) and Cochlear Americas has the most (22)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How many accredited CI manufacturers are currently in Canada?

A

4: Advanced Bionics, Cochlear Americas, Med-EL, Oticon Medical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In an electrode array, we need both positive and negative sources to stimulate. How many of each?

A

Each electrode driver contains at least one positive and one negative source; there can be many positives, but only one negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the number of electrode drivers tell us?

A

How many electrodes can fire at once; may vary from 1 (cochlear americas) to 16 (advanced bionics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the maximum rate?

A

The max pulses per second that these CIs can fire
- Oticon Medical has lowest (19000)
Advanced Bionics has highest (83000)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the DSP-unit (of the CI’s external unit) do?

A
  • receives mic input
  • extracts features of the sound
  • converts features into bitstream
  • contains maps (pt specific info)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the External Unit (Speech Processor) of the CI consist of?

A

The DSP unit and Power Amplifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

In CI’s, first the signal is processed through the DSP unit, then the ____ ____, then sent to a __-______, which sends the signal through the skin to a receiver in the internal unit

A
Power amplifier
Radio Frequency (RFF)-Transmitter
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the internal unit of a CI consist of?

A

RF-receiver
Hermetically sealed stimulator
Telemetry System

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the Hermetically Sealed Stimulator of the CI’s Internal Unit do?

A
  • receives power from RF-signal
  • decodes RF-bitstream
  • conversion to electric currents
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the function of the telemetry system of the CI’s Internal unit?

A

To measure impedances and ECAP’s (action potentials)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does eCAP stand for?

A

Electric Compound Action Potential

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are Back Telemetry and ECAPs used for?

A
  • to check the status of the internal unit (e.g. voltages)
  • to measure and monitor critical info about the electrode-tissue interface (e.g. electrode impedance (non-audible stimulus), field potential, neural responses)
  • to conduct neural response telemetry (NRT) (AN response to electric stimulation) (like ABR, but response usually buried in electric artifact)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

T/F: It is possible for an audiologist to overwrite a CI’s safety checks

A

False - the CI performs these automatically and they cannot be overwritten

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some commonly implemented safety checks in CIs?

A
  • stimulation parameter check
  • max charge check
  • charge balance check
  • parity check to detect bit error from RF-transmission or decoding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does CIS stand for?

A

Continuous Interleaved Sampling

- most modern strategies are based on this method

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What speech features are important for the speech processor?

A

Voicing
Pitch
Spectrum/Formants

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Pitch is represented by the formant ____. What are the problems with encoding it?

A

F0
We could use pulse rate, but it is unlikely we would have an electrode so far apically, and pitch alone is not enough to understand what is being said

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Why can’t we encode F2 and F3?

A

Their frequencies are >700 Hz, which is above the rate-pitch limit (250-500 pps)
- for F3 we would need to use place coding (electrode location)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Why must the charges be balanced in a CI?

A

To avoid tissue damage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What do we need to consider when proposing signal processing strategies? Name 3

A
  • charge equalization to avoid tissue damage
  • spread of electric field in fluid (poor freq selectivity)
  • electric field interactions when stimulating 2 electrodes simultaneously
  • monopolar vs bipolar stimulation
  • preservation of AN function across CI users and across the array w/in one user
  • location of electrode relative to modulus
  • narrow dynamic range (10-20 dB vs 100 dB in NH listeners)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How does the spread of the electric field impact frequency?

A

Poor frequency selectivity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What were the names of the 2 first speech processing strategies?

A

F0/F2

F0/F1/F2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What was the goal of the first speech processing strategies?

A

To avoid overloading the auditory system by extracting speech info explicitly (using only F0, F1, and F2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

How were the formant encoded in the first speech processing strategies (F0/F2 and F0/F1/F2)?

A

F0 encoded with pulse rate (b/c lower freq)

Formant freq encoded by electrode placement

27
Q

With the old strategies (F0/F2, F0/F1/F2) how many electrodes were active at once?

A

One or two

one for pitch and one for the formant

28
Q

Why are F0/F2 and F0/F1/F2 no longer used?

A

Poor performance

29
Q

How do modern speech processing strategies differ from the original strategies?

A

They do NOT attempt to extract speech features explicitly

  • they encode speech features implicitly by accurately representing the spectro-temporal structure of the the speech signal
  • i.e. extract temporal envelope in several independent freq bands
30
Q

How does CIS work?

A
  • divides incoming sound into several freq bands
  • extracts temporal envelope in each band
  • uses compressed version of envelope to modulate a fixed-rate train of pulses on each electrode
  • pulses for each electrode are interleaved in time
31
Q

How does CIS divide incoming signals into diff freq bands (step 1)?

A

By using a set of bandpass filters

- typically one freq band per electrode

32
Q

What are the pros and cons of matching the frequency map to the electrode placement in the cochlea?

A

Pros: preserves natural BM place-to-frequency relationship
Cons: loss of low frequency info, since electrode cannot reach the apex

33
Q

What are the pros and cons of mapping frequencies so that the entire speech frequency range is represented on the electrode?

A

Pros: All frequencies represented
Cons: Unnatural frequency-to-place mapping (compressed or shifted)

34
Q

Which of the two theories on frequency mapping onto electrodes is better in practice? Why?

A

Mapping the frequencies so that the entire speech frequency range is represented, but in a condenses area of the cochlea
- better b/c CI listeners can adapt to some distortion, and frequency shifts still yield intelligible speech
HOWEVER, not all distortions can be understood

35
Q

What did the frequency reversal simulation show us?

A

That too much distortion makes a speech signal unintelligible

36
Q

Which scale do we use with CI’s: linear or logarithmic? Why?

A

Logarithmic; it is closer to the tonotopic organization of the cochlea

37
Q

What is the typical range covered by the CIS bandpass filters? What determines the filter widths?

A

120-8000 Hz

The number of electrodes determines the filter width

38
Q

T/F: The acoustic frequency range of an electrode typically matches its Greenwood frequency

A

False

39
Q

What is the second CIS step?

A

Envelope extraction

40
Q

What two things can the band-pass filter output be broken down into? Which one does CIS use?

A
  • Carrier (temporal fine structure)
  • Amplitude modulation (envelope)
  • CIS uses the envelope
41
Q

How is the envelope used in CIS?

A
  • each electrode is sent a pulse stream at a constant high rate (greater than or equal to 1000 pps)
  • the amplitude of the pulses is modulated by the envelope of the output of the corresponding filter bank
42
Q

What are the two methods of extracting the envelope?

A
  • Rectification and low-pass filtering

- Hilbert transform

43
Q

Describe rectification and low-pass filtering (Method 1 of envelope extraction)

A
  • The lower half of the filter output signal is erased (half-wave rectified - only positive sine waves kept b/c only positive peaks will trigger an Action Potential)
  • Low-pass filter the half-wave rectified signal (maintains the shape of the envelope and discards the fine structure)
44
Q

Describe the Hilbert transform (Method 2 of envelope extraction)

A
  • Generate a 90 degrees phase shifted signal “Y”(based on the original signal “X”)
  • develop an envelope based on the equation: envelope = sqrt (X^2 + Y^2)
    (- similar to modern FB cancellation, but not inverted)
    **envelope is more precise with Hilbert compared to Method 1)
45
Q

Which envelope extraction method is better for single tone waves?

A

Hilbert - the envelope is very precise

46
Q

Does Hilbert work well with narrow band-width signals?

A

Yes

47
Q

What type of signal does Hilbert not work well with?

A

Wideband signals - the envelope is half of the wave, including lots of fine structuring info
- Hilbert works better with as many electrodes as possible (shallower insertion -> smaller filter bank -> poorer results)

48
Q

How does the cutoff of low-pass filtering impact envelope extractions?

A
  • If we use a 50 Hz cutoff, envelope has limited amount of fine structure remaining (closer to Hilbert)
  • 200 Hz cutoff, we get a good amount of temporal fine structure, with extracted envelope, which can be useful
49
Q

Which is better: half-rectification and low-pass filtering OR Hilbert transform?

A

No clear winner

50
Q

What is the third step of CIS?

A

Compression and conversion to current level

51
Q

Why does the speech processing signal need to be compressed?

A
  • the envelope levels follow the signal SPL, and may vary over >80 dB range
  • the range of current levels b/w threshold (T) and max comfort level (C) is only 6-30 dB
    THEREFORE cannot convert amplitude signals directly
52
Q

Can we encode the entire dynamic range into the available “room” between T and C?

A

No - too much, even with compression

53
Q

Since we cannot encode the entire dynamic range of human hearing into electric hearing, which parts are discarded?

A

Anything below 20 dB
Anything above 90 dB (everything above this is set to the max amount)
(so we now have a dynamic range that is roughly twice that available to us)

54
Q

What does ASW stand for?

A

Adaptive Sound Window - an input dynamic range that changes with the environment
- adaptively reduces 75 dB range to 55 dB (25-80 in quiet and 45-100 in loud)

55
Q

What are the steps involved in the compression and conversion to current level (Step 3 of CIS)?

A
  1. Omit <20 dB and set everything >100 dB to max
  2. Map the input dynamic range (IDR) to the adaptive sound window (ASW)
  3. Map ASW to electric dynamic range (compress those 55 dB to whatever is available, or T -> C range)
56
Q

What is an alternative to ASW?

A
  • Discard everything below 20 dB
  • Infinitely compress everything above 90 dB
  • Set the M (most comfortable) and T
  • Mapping the IDR to the electrical DR by compressing the remaining input levels to fit
  • then, a sensitivity control will adjust the IDR, leading to more or less compression (e.g. b/w 40 and 80 dB, thereby reducing sensitivity) (this setting can be adjusted by the AUD or sometimes the user)
57
Q

How does “volume” in CIs differ from HAs?

A

Volume in CIs doesn’t make soft sounds louder, it increases the M level
e.g. decrease M from 1 to 0.8; output stays b/w M and T

58
Q

Summarize Step 3 of CIS

A
  • sound wave level obtained at mic
  • set IDR
  • implement ASW (alternative: manual sensitivity or skip)
  • map this reduced envelope to electric DR (knowing T, C and M)
  • simple compensation for summation effect is volume control through M-level reduction
  • output is now 12-22 channels in current units
59
Q

What are 2 reasons against sending the 12-22 channel output (end of CIS Step 3) to the electrodes?

A
  • safety?

- channel interaction?

60
Q

What is step 4 of CIS?

A

Generation of modulated pulse trains

- need to multiply a biphasic pulse train (e.g. 1000 pps) with amplitude corresponding to the envelope from Step 3

61
Q

How do we avoid electrode interaction?

A

Interleaved sampling and presentation (e.g. entire pulse burst lasts 1 ms or 1/pps)
- e.g. electrode 1 stimulated - pause - electrode 2 - pause - electrode 3….

62
Q

During Step 4 of CIS, the _____ from Step 3 are converted into AM pulse trains

A

Envelope outputs

63
Q

What is good about CIS?

A
  • interleaved stimulation avoids channel interactions
  • preservation of tonotopic organization (via bandpass filters and which electrodes they send info to)
  • high pulse rates allow representation of F0 and voiced/unvoiced info
  • better speech reception scores
64
Q

What are the problems with CIS?

A
  • fixed rate pulsatile stimulation -> unnecessary synchronization of neural response
  • severe distortions in temporal discharge patterns
  • no delivery of phase info