Lecture 6 - Feature Extraction Flashcards

1
Q

Give two applications of feature extraction?

A
  1. Classification; leave only relevant information
    - speech recognition, speaker recognition,
  2. Coding; leave only perceptually (who is speaking) important information
    - Real-time transmission of speech
    - Online content
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The cepstral deconvolution separates the speech signal into components varying slowly (envelope) and more rapidly (excitation). Which coefficients do we keep?

A

Usually keep the lowest 10-15 coefficients; related to vocal tract resonances and keep the ‘filter’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Speech has more energy at lower frequencies and therefore contain numerical problems in implementation. What can be done in order to boost higher frequencies?

A

Use a time-domain FIR filter with one free parameter
- P(z) = 1 - az^-1, a depends on the sampling frequency.

Can make information in higher formants more available to the acoustic model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly