Physics of speech Flashcards

Question 1

Q

Acoustic phonetics

Answer

A

study of sound waves made by the human
vocal organs for communication.

study of the physical properties of speech,
and aims to analyse sound wave signals that occur within speech
through varying frequencies, amplitudes and durations

Question 2

Q

Speech Waveform

Answer

A

transmitted through air (and other medium) as a regular
wave of pressure changes

changes in air pressure

can be heard
cannot see
can be measured
measurement can be visualized and used for statistical calculation

Question 3

Q

basic parameters of the speech signals

Answer

A

amplitude

* time (duration)

Question 4

Q

main derived parameters of speech signals

Answer

A

Intensity
noise vs. resonance
frequency and formants

Question 5

Q

methods used to analyse speech signals

Answer

A

analog-to-digital (A/D) conversion

* mathematical definitions of filters and transformations

Question 6

Q

Visualisation for Speech Signals

Answer

A

need to know

time
amplitude
oscillogram
spectogram
formants
fundamental frequency
pitch track

Question 7

Q

Time Domain

•

Answer

A

What actually appear in your waveform if you zoom it in

The positive or negative amplitude A of the speech signal at any given
point in time is the distance of the wave from zero at this point in
time

Question 8

Q

What can be derived from time domain analysis? Intensity

Answer

A

intensity of the speech signal at any given point
in time is the square of the amplitude of the wave
from zero at this point in time: I = square(A)

Question 9

Q

What can be derived from time domain analysis? Period or interval

Answer

A

duration of this single wave

signal is resonant if its period are regular in duration
signal is noisy if period is irregular in duration

Question 10

Q

wavelength λ (lambda)

Answer

A

speed of sound in m/sec divided by the number of periods per second

distance between successive peaks is called wavelength. measured in meters / cm

Question 11

Q

wave

speed

Answer

A

v = λ / T
v = λ f

Question 12

Q

Frequency Domain: Simple & Complex

Signals

Answer

A

frequency of a speech signal is the number of waves (periods) per second in the waveform

Question 13

Q

The source larynx

Answer

A

only valid for voiced sound

• Fundamental frequency
about 80 Hz - 150 Hz for men (greater range possible)
about 160 Hz – 300 Hz for women (greater range possible)

Several overtones/harmonics of the fundamental frequency
different intensities of overtones, relative to each other, which determines
the overall waveform

during voicing, the larynx generates a waveform which is rather like a “sawtooth” sequence

Question 14

Q

Frequency Domain: Complex Signals

Answer

A

Combination of several arbitrary sine waves resulting noise
But if it is not arbitrary, it produce harmonic wave

• The lower frequency is the F0
• The higher frequencies in a harmonic waveform are harmonics or
overtones of the F0

Question 15

Q

The filter system

Answer

A

consists of pharyngeal, nasal, oral cavities, with
resonant frequencies which amplify or damp the overtones with these frequencies

These filter frequency bands are called formants

Formant frequencies of the oral cavity can be modified by the variable filters (articulators such as tongue and lips)

Question 16

Q

Acoustic Resonance

Answer

A

understand how resonator (vocal tract) modified the complex periodic sound

Question 17

Q

How we get a speech sound

Answer

A

sound source

sets vocal tract into resonance depends on its shape

amplifies some freq and attenuates others

speech sounds come out

Question 18

Q

Source-Filter Theory

Answer

A

speech is a product of interaction between a sound source (vocal folds/phonation) and a filter (vocal tracts/resonance)

sound source (vocal folds/phonation) sometimes referred as source function

filter (vocal tract/resonance) sometimes referred as filter function / transfer function

Question 19

Q

Filter function

Answer

A

supralaryngeal vocal tract can be characterized by a filter function (length and shape of tube)
which specifies (for each input frequency from the source) the relative
amount of energy that is passed through the filter and out of the mouth

peaks in the vocal tract filter function are the resonances of the vocal tract
reflect the frequencies which are passed through the filter with the greatest
relative amplitude

Question 20

Q

Source and Filter: Independence

Answer

A

Formants = resonant frequencies of the vocal tract
- depends on shape of vocal tract to filter harmonics
- do not depend on f0 of the sound source
source and filter are independent of one another

Question 21

Q

Varying the Filter: Vocal Tract Shape

Answer

A

vocal tract = variable resonator

shape constantly changes over course of utterance
- space in oral cavity can be large/small
- space inpharyngeal can be L/s
air can resonate in nasal cavity / blocked from nasal cavity

changeing shape of vocal tract =

change resonating frequencies
change filter
different speech sounds

each sounds has its own filter made up of its own set of resonant frequencies (formants)

Question 22

Q

Source Filter Model

Answer

A

spikes (harmonics) are generated by SOURCE
peaks (formants) are generated by FILTER

single spectral analysis of an interval in a speech signal requires at least 1s

sequence of spectra is neede to track the changing structure of speech signal

the representation of sequence of spectra is called spectogram

Question 23

Q

Source Filter Model

Answer

A

spikes (harmonics) are generated by SOURCE
peaks (formants) are generated by FILTER

single spectral analysis of an interval in a speech signal requires at least 1s

sequence of spectra is neede to track the changing structure of speech signal

the representation of sequence of spectra is called spectogram