Lecture 6 - Feature Extraction Flashcards
1
Q
Give two applications of feature extraction.
A
- Classification: keep only the relevant information (e.g. speech recognition, or speaker recognition, i.e. who is speaking)
- Coding: keep only the perceptually important information (e.g. real-time transmission of speech, online content)
2
Q
Cepstral deconvolution separates the speech signal into a slowly varying component (the envelope) and a more rapidly varying component (the excitation). Which coefficients do we keep?
A
Usually the lowest 10-15 coefficients. These are related to the vocal tract resonances, so keeping them retains the ‘filter’ part of the signal.
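Keeping the lowest coefficients can be sketched in code as follows. This is a minimal illustration, assuming NumPy and a real cepstrum computed per frame; the function names, the default of 13 coefficients, and the small log offset are assumptions made for the example, not taken from the lecture.

```python
import numpy as np

def real_cepstrum(frame):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.fft(frame)
    log_mag = np.log(np.abs(spectrum) + 1e-12)  # small offset avoids log(0)
    return np.fft.ifft(log_mag).real

def lifter_low(cepstrum, n_keep=13):
    """Keep the lowest n_keep coefficients (plus their mirror image), zero the rest."""
    out = np.zeros_like(cepstrum)
    out[:n_keep] = cepstrum[:n_keep]
    if n_keep > 1:
        # The real cepstrum is symmetric, so keep the mirrored coefficients too.
        out[-(n_keep - 1):] = cepstrum[-(n_keep - 1):]
    return out

def spectral_envelope(frame, n_keep=13):
    """Smoothed log-magnitude spectrum built from the low coefficients only."""
    kept = lifter_low(real_cepstrum(frame), n_keep)
    return np.fft.fft(kept).real
```

Calling spectral_envelope(frame, n_keep=13) on a windowed speech frame returns a smooth log-magnitude curve (the slowly varying part), while the discarded high coefficients carry the rapidly varying excitation detail.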
3
Q
Speech has more energy at lower frequencies, which can cause numerical problems in implementation. What can be done to boost the higher frequencies?
A
Use a time-domain FIR filter with one free parameter (pre-emphasis):
- P(z) = 1 - a z^-1, where a depends on the sampling frequency.
This can make the information in the higher formants more available to the acoustic model.
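In the time domain, P(z) = 1 - a z^-1 corresponds to the difference equation y[n] = x[n] - a x[n-1]. The sketch below assumes NumPy and an illustrative value a = 0.97, a commonly used choice that is not taken from the lecture; the function names are likewise hypothetical.

```python
import numpy as np

def pre_emphasis(signal, a=0.97):
    """Apply y[n] = x[n] - a * x[n-1]; the first sample is passed through unchanged."""
    x = np.asarray(signal, dtype=float)
    return np.append(x[:1], x[1:] - a * x[:-1])

def pre_emphasis_gain(omega, a=0.97):
    """Magnitude response |P(e^{j omega})| at normalized angular frequency omega."""
    return np.sqrt(1.0 + a * a - 2.0 * a * np.cos(omega))
```

The gain is 1 - a at omega = 0 and 1 + a at omega = pi, so a slowly varying signal is strongly attenuated while rapid sample-to-sample changes are boosted.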
4
Q
A