Speech production Flashcards

1
Q

3 sources of speech production

A

All three have wideband spectrum:

  • Voicing: vibration of the vocal folds, same type of aerodynamic mechanism as a flag flapping in the wind.
  • Frication or Aspiration: turbulence created when air passes through a narrow aperture
  • Burst: the “pop” that occurs when high air pressure is suddenly released
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 steps to produce speech

A
  1. initiation
  2. phonation
  3. articulation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

vocal folds aka cords

A

two bands of smooth muscle tissue found in the larynx (voice box).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

vocal tract

A

air cavity between glottis and lips

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

source-filter model of the vocal tract

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

source-filter model of the vocal tract (with details)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

formula for speech signal (through convolution of excitation signal and transfer function)

A

The speech signal s(t), is created by convolving (∗) an excitation
signal e(t) through a vocal tract transfer function h(t):
s(t) = e(t) * h(t)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fourier transform through excitation product times transfer function

A

The Fourier transform of speech is the product of excitation
times transfer function:

S(f) = H(f)E(f)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

formants

A

vocal tract resonances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what happens at resonant frequencies?

A

At the resonant frequencies, the resonance enhances the energy of
the excitation, so the transfer function H(f) is large at those
frequencies, and small at other frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

air stream (vowels)

A

unblocked air stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

air stream (consonants)

A

blocked / obstructed air stream

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

vowels scheme

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

vowel classification: tongue height

A

Tongue height:
– Low: e.g., /a/
– Mid: e.g., /e/
– High: e.g., /i/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

vowel classification: tongue advancement

A

Tongue advancement:
– Front : e.g., /i/
– Central : e.g., /ə/
– Back : e.g., /u/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

vowel classification: lip rounding

A

Lip rounding:
– Unrounded: e.g., /ɪ, ɛ, e, ǝ/
– Rounded: e.g., /u, o, ɔ/

17
Q

vowel classification: tense vs lax

A

Tense/lax:
– Tense: e.g., /i, e, u, o, ɔ, ɑ/
– Lax: e.g., /ɪ, ɛ, æ, ə/

18
Q

vowel scheme: dependence of formants 1 and 2 on tongue placement

A
19
Q

consonants classification: manners of articulation

A

Manner of articulation
– Stops: /p, t, k, b, d, g/
– Fricatives: /f, s, S, v, z, Z/
– Affricates: /tS, dZ/
– Approximants/Liquids: /l, r, w, j/
– Nasals: /m, n, ng/

20
Q

coarticulation

A
  • Coarticulation refers to changes in speech articulation (acoustic or visual) of the current speech segment (phoneme or viseme) due to neighboring speech. In the visual domain, this phenomenon arises because the visual articulator movements are affected by the neighboring visemes.
  • production of a speech sound becomes
    more like that of a preceding/following speech sound
21
Q

f0 and H1

A
  • fundamental frequency f(0) and first harmonic H(1) are the same thing.
  • The fundamental frequency, or f0, is the first harmonic, or H1. There is a harmonic at each interval of the f0 up to infinity. Vocal fold vibration produces many harmonics above f0, all the way up to 5000Hz in the adult human vocal tract. These harmonics decrease in amplitude as the frequency increases.
22
Q

magnitude spectrum and log magnitude spectrum formulas

A

magnitude spectrum: S(f) = H(f)E(f)

log magnitude spectrum = ln |S(f)| = ln |H(f)| + ln |E(f)|

23
Q

axes(spectrogram)

A

Spectrogram = time on the horizontal axis, frequency on vertical axis.

24
Q

spectral splatter

A

spectral splatter (also called switch noise) refers to spurious emissions that result from an abrupt change in the transmitted signal, usually when transmission is started or stopped.

For example, a device transmitting a sine wave produces a single peak in the frequency spectrum; however, if the device abruptly starts or stops transmitting this sine wave, it will emit noise at frequencies other than the frequency of the sine wave. This noise is known as spectral splatter.

When the signal is represented in the time domain, an abrupt change may not be visually apparent; in the frequency domain, however, the abrupt change causes the appearance of spikes at various frequencies.