Lecture 4 - Time Domain Flashcards

1
Q

What are the three broad categories of speech sounds?

A
  1. voiced
  2. unvoiced
  3. silence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give the different intervals for short, medium and long in milliseconds and their applications.

A
  1. Short intervals; 5-20 msec
    - uncertainty due to small amount of data, varying pitch, varying amplitude.
  2. Medium intervals; 20-100 msec
    - uncertainty due to changes in sound quality, transitions between sounds, rapid transients in speech
  3. Long intervals; 100-500 msec
    - Uncertainty due to large amount of sound changes. Used in cases like finding the audio quality in a google hangout.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What’s the typical window overlap in speech.

A

50%; when we have a hamming window, we want 50% overlap as it gets the rest of the values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Give the difference between a rectangular and hamming window for speech signals.

A
  1. Rectangular is simple to implement but in frequency domain -> narrow wideband and large sidebands.
  2. Hamming window; BW of hamming twice rectangular and attenuation greater outside passband.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the downside to zero-crossing?

A

When there is DC-offset, the signal is shifted up. This can decrease the amount of ZC in a system.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What value is the most dominant energy for voiced and unvoiced speech?

A

Voiced speech; 700 Hz
Unvoiced speech; 2.5 kHz

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the range threshold between voiced and unvoiced.

A

voiced < 1.5 kHz
unvoiced > 1.5 kHz

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is autocorrelation?

A

Autocorrelation measures the relationship between a time series and a lagged version of itself over successive time intervals.

e.g. Tracking the temperature of a city every day. If today’s temperature is similar to yesterday’s, and yesterday’s temperature is similar to the day before, we say that the temperature data is autocorrelated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is pitch tracking useful

A
  1. Pitch helps distinguish speakers or emotions.
  2. Can be used in tone analysis; in languages like mandarin.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the challenges in pitch tracking?

A
  1. Noise
  2. Multiple fundamental frequencies
  3. Is there a speaker?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Give reliable algorithms for pitch tracking?

A
  1. YIN
  2. RAPT
How well did you know this?
1
Not at all
2
3
4
5
Perfectly