LING330: Quiz #5 Flashcards

Question

Quantization

Answer 1

How precise a measurement is (the higher the sampling rate (more decimal places), the more space it takes up on a computer) Must decide how much rounding error can be accepted Computer audio systems will default to 16 bits per sample, but 8 bits doesn't sound bad

Answer 2

In analog recordings: plastic or metal tape=stretches and distorts + noise from turning cogs and hissing tape travelling through heads could never be completely eliminated

Answer 3

Goal is to maximize this in recordings | Recording should be as clean and clear as possible (more signal, less noise)

Answer 4

Take full advantage of the system's DYNAMIC RANGE (adjust the range to match the sound)

Answer 5

Background noise in a recording due to representation of the continuous analog signal as a series of discrete levels **the higher the bit rate, the more levels available and the lower the quantization error

Answer 6

When the volume on the dynamic range is turned up too far and the amplitude peaks are cut off in a recording Result: distortion

Answer 7

- outside noise in the enviro - speakers raise and lower their voices, turn their heads, shift their bodies - papers crinkle with scripts * *head mounted microphone set to the side of speakers lips can reduce variation + watch level meter

Answer 8

Uni: designed for single talker Omni: pick up sound from all directions so best for recording multiple speakers on one channel

Answer 9

A waveform | Aka a graph of changes in air pressure (amplitude) over time

Answer 10

Vowels (bc mouth is open) Also complex repeating pattern (periodic) **diffs between absolute amplitude in vowels of different waveforms are just due to variation in how loudly each utterance is spoken

Answer 11

Lower amplitude | Less complexity

Answer 12

Periodic Lower amplitude than vowels and sonorants (sound of vocal fold opening and closing is beating through closed vocal tract) Transient burst when closure is released into the vowel In American English: [b, d, g]=periodic energy dies down during the closure unless stop is between other voiced sounds

Answer 13

No repeating pattern, appear as random noise Strident fricatives=high amplitude Non strident= may have very low amplitude, may be hard to distinguish from voiceless stops (clue: fricatives not followed by burst)

Answer 14

Easy to spit bc silent during closure phase (no amplitude) so appear as flat line in waveform (unless there is background noise) Usually followed by a burst Aspirated stops: followed by aspiration noise

Answer 15

Combine periodicity and noise | Voiced stops: periodicity can die out toward the end of the consonant

Answer 16

Segmentation | Speech analysis programs allow for this

Answer 17

In differences in VOT (voice onset time) aka the amount of time that elapses between the release of the consonant and the onset of periodicity for the vowel

Answer 18

Allows us to analyze segment quality (allows us to quantify, visualize and analyze component frequencies and thus to quantify, visualize and analyze the details of sound quality) Involves algorithms which mathematically analyze the signal in order to accomplish what the electronic filters in the sound spectrograph did: to test the strength of diff frequencies that might be present

Answer 19

A "sawtooth" wave aka steep upslope (because of pressure increase when the vocal folds are blown open) and then a gradual decrease (as they're pulled together by the Bernoulli effect) Periodic pattern

Answer 20

Harmonic frequencies will always occur at integer multiples of the fundamental frequency (the period of each sub-vibration has to fit exactly into the period of the fundamental) Ex: voice with f0 of 100 hz, harmonics occur at 200 hz, 300 hz, 400 hz etc

Answer 21

The more dense the harmonics

Answer 22

Typically higher f0 = less harmonics present + more breathiness = harmonics that are present may have lower amplitude (especially at higher frequencies)

Answer 23

APERIODIC sound | Pressure variations are totally random

Answer 24

Wide/broad band spectrogram: formant frequencies (regions of high amplitude energy; reflecting changes in resonance frequencies as vocal tract articulators change position) show up as broad bands (in wide band=spectra taken from short windows of speech signal at frequent intervals so the changes over short time periods are evident) Narrow band spectrogram: when windows at less frequent intervals are used; individual harmonics can be distinguished but time dimension is less precise

LING330: Quiz #5 Flashcards

(48 cards)