exam 2 Flashcards
motor equivalence
- the output of a vocalities, can be the same even if the physical qualities of the production differ in the source and filter
Different filter types
EXPLAIN WHAT HIGH PASS, LOW PASS, BAND PASS AND BAND REJECT.
Fourier transform
utilized to change domains, creates a spectrum of all the different parts of a wave. All periodic sounds are made up of a combination of sound waves, this theorem breaks down those parts, into amplitude, phase angles, frequencies.
all periodic sounds are made of a combination of sine waves
-amplitudes vary
-phase angles vary
-frequencies vary
**creates a spectrum from the time domain waveform, “analyze a cake to learn its ingredients”
Time domain display
shows time components of a wave, amplitude on y axis, and time on x axis
a waveform represents sound directly
-air pressure changes over time
Frequency domain display
shows frequency components of a periodic sound, does not show time, y axis is amplitude, and x axis is frequency
a line spectrum shows the frequency components of a periodic sound
-description of the signal
-has F0 that are multiples of the fundamental
-has nothing between the lines, the lines represent the F0 frequencies
Spectrum of noise – white, pink, brown – how is the energy distributed?
White noise
noise consisting of all audible frequencies in equal amounts
Brown noise
low frequencies to generate a deep rumble
Pink noise
noise whose lower frequencies are proportionally higher in amplitude than higher frequencies
Spectrum vs. spectrogram
a spectra is a single snapshot in time, a line spectrum. A spectrogram is the speech overtime, lining up single slices together. Shows over time how frequency and amplitude change
Wide and narrow band spectrograms
a wide band will show clear temporal detail, and the frequency resolution is poor. In a narrow band, the frequency detail is clear, but the time resolution is poor. In a wide band we will see vertical lines, glottal air pulses!
In a narrow band we will see horizontal lines, representing the harmonics
FFT and LPC spectra – what does each show?
FFT is a fast fourier transform, shows the range of harmonics, each peak is a harmonic, less clear of the vocal tract filter, but more revealing of the source. LPC is linear predictive coding, it is a spectral envelope, good at revealing the tract or filters, and how our vocal tract is shaping the sounds from our sound source, but does not show harmonics, no sound source,
What does a typical voice FFT look like?
Individually placed lines, representing the harmonic multiples. Each peak is a harmonic, the greater space between the lines represents noise.
11. Spectral slice –
Spectral slice
how is amplitude of the frequencies shown? No individual lines, the peaks of the line are represent formants, the spectral slice represents amplitude (x-axis)
1) Is a frequency domain display
2) x-axis s frequency
3) y-axis is amplitude
A spectra represent frequency components in a wave, sine waves would only have one. Noise does not have any relationships, very close lines, all different frequencies.
How is the amplitude of frequencies shown on a spectrogram?
Shows all 3 parameters, amplitude is shown by darkened spots on the graph representing intensity
High speed filming of phonation – typical frame rate standard
VHS video uses 30 frames/ sec- works best b/c of flicker fusion rate; ultra-high speed filming uses 4000-6000 frames/ sec (not clinically practical- expensive & requires massive amounts of data)
Spectrograms of normal and hoarse voices – how do they differ?
It is easier to see the location of glottal pulses and harmonics. Hoarse voices contain more noise within them, making it harder to receive a closer estimate
Stroboscopy – how to get a frozen vs. slow motion image; live stroboscopy in clinical practice
‘frozen’ image
-flash occurs at the same point in each cycle
-slow motion image
-flashes slightly delayed in successive cycles
when these are played back you have created an animation
capitalizes on an optical illusion
- light flashes illuminate a target, each illumination is a snapshot
- paste the snapshots together in succession, voila! a movie
- video has 30 frames/sec, vocal folds may oscillate 200 times/sec timing of the flashes is crucial
a slow-motion video image of vocal fold vibration
-light rapidly flashes
-captures vocal folds in different glottic cycles in a variety of positions
-not true slow-motion, simulated
-video has 30 sec/frames
-movie is a composite of stills from different cycles
*strobe movies are made of non-adjacent sample
*true vocal fold motion is not seen
-harmonics must be steady
-flashes cannot synchronize if harmonics are erratic
-severe dysphonia precludes stroboscopy