Quiz 3 Flashcards
Transcription | Vowel analysis | Consonants 1
What factors determine how much of the recorder audio gets transcribed
- The amount of time at hand
- Amount of data to transcribe
- Amount of $ to hire transcribers
- Quality of audio
T or F : MS-Word aligns the transcription text with the audio
F - it is a simple word processor but it doesn’t mean that it aligns
Types of software : ELAN - what are the pros
- Transcribing video data
- Allows many tiers
Types of software : PRAAT - what are the pros
- One of easiest ways to transcribe audio data
- Transcription is time-aligned with AUDIO
- Allows many tiers for diff levels to annotate
What are the 3 symbol systems for transcription
- IPA
- SAMPA
- ARPAbet
Symbol systems - what does SAMPA use and WHY pick it?
ASCII characters to rep. phonetic symbols
It simplifies IPA symbols for easy use in computer systems
Earlier software systems had limited ability to handle IPA symbols
Symbol systems - what does ARPAbet use and WHEN is it used? WHY pick it?
- ARPAbet uses a certain set of ASCII characters per phoneme - 2-3 LETTER CODE
- Used in text-to-speech systems
- Simplifies English phonemes for the use of speech tech
- Makes an easier integration of speech synthesis
When deciding how much of a recorded interview to transcribe, which of the following processes should be considered to ensure efficient and meaningful data usage?
A. The quality of the interviewer’s questions and how much they contribute to the conversation.
B. The type of symbol system chosen (e.g., SAMPA vs. ARPAbet).
C. The balance between data transcription depth and the project’s purpose, time
constraints, and budget.
D. The need for code-switching analysis within the transcript.
C. The balance between data transcription depth and the project’s purpose, time
constraints, and budget.
What are resonance frequencies?
Literally frequencies that are more resonant, they turn into what we call FORMANTS
T or F : our vocal folds generate a LOT of frequencies, but only some of them get through
True - some of them amplified and others get dampened
What are amplified frequencies?
Formants
F0 = ? = ?
F0 = fundamental frequency = harmonic 1
What is the difference in peaks between a broad and narrow band spectrum
Broad - peak of general formant
Narrow - peak of each harmonic (component)
Ella wants to know how many component frequencies are available under 1000 Hz. What type of spectrum would you suggest to try?
A. narrowband
B. wideband
C. either will be good
A. narrowband
James has generated a narrowband spectrum of an [u] vowel. Which of the following is true?
A. he has no way to estimate the frequency of the third formant
B. he has enough info to find the frequency of the F0
B. he has enough info to find the frequency of the F0
What is vowel normalization? How do we achieve it?
Techniques that account for the variation in our vocal tracts
- using Z-SCORES
What are two reasonings from speaker variability?
- Physiology
- Dialects
Goals of vowel normalization (4 main ones)
- Preserve sociolinguistic diffs in vowel quality
- Preserve phon. distinctions among vowels
- Eliminate physiological variation caused by mouth size
- Model cog processes in vowel normalization
What are the 4 types of vowel normalization techniques? think 2 V & 2S
- Vowel intrinsic method
- Vowel extrinsic method
- Speaker-Intrinsic
- Speaker-Extrinsic Method
Vowel intrinsic method + example
- Vowel intrinsic method - use info from a single vowel TOKEN (can be many speakers)
- Formant Frequency Adjustments : adjusting F values based on AVERAGE values for sim. speakers
Vowel extrinsic method + e.g.
use info of F values across diff vowels of the same person
- Grand Mean Norm : normali. vowels using a GRAND mean calculated from ALL V’s by the person
Speaker intrinsic + e.g.
normalize based on data from a SINGLE person’s V’s
- Individual Speaker Baseline : making a baseline for F freqs specific to an individual speaker
Speaker extrinsic + e.g.
Speaker-Extrinsic Methods: taking data from MANY speakers
- Mean Adjustment Method : normalize using mean values taken from MANY speakers
Stops : Hawaiian is know for what KIND of stops?
voiceless