Quiz 3 Transcription + Flashcards
1
Q
What factors determine how much of the recorded audio gets transcribed?
A
- what serves the purpose (interviewer’s speech might not be of interest, carrier phrase may be excluded ex. “Say __ again”
- how much time at hand, how much data to transcribe
- how much money to hire transcribers
- quality of audio
2
Q
What is the issue with using MS-Word to transcribe?
A
- does not align the transcription with the audio
3
Q
What is ELAN?
A
- a powerful, popular software tool for transcribing data
- especially useful when you want to transcribe video data
- allows multiple tiers
4
Q
How long does transcribing take, on average?
A
1 hour for 6-8 minutes of speech
5
Q
Which software is ideal for transcribing video data?
A
ELAN
6
Q
Describe Praat
A
- one of the easiest ways to transcribe audio data
- unlike word processors, transcriptions are time-aligned with audio
- allows use of multiple tiers to include different levels of annotations
7
Q
What are the symbol systems used in transcription?
A
- IPA
- SAMPA
- ARPAbet
8
Q
What does SAMPA stand for?
A
Speech Assessment Methods Phonetic Alphabet
9
Q
Why was SAMPA developed?
A
- to allow for transcription in electronic text formats
10
Q
What are the key features of SAMPA?
A
- uses standard ASCII characters to represent phonetic symbols (extension of IPA, so not just American English)
- designed to simplify IPA symbols for easy use in computational systems
- directly replaces IPA symbols with ASCII keys
- widely used in computational linguistics and speech synthesis applications
11
Q
Why was SAMPA created?
A
- computers and earlier software systems had limited ability to handle IPA symbols
- SAMPA provided a workaround using basic text, ensuring compatibility across different systems
12
Q
What does ARPAbet stand for?
A
Advanced Research Projects Agency for speech processing
13
Q
What is ARPAbet?
A
- a phonetic transcription system developed by the Advanced Research Projects Agency for speech processing
- like SAMPA, uses ASCII characters to represent phonemes, but is specifically designed for use in automatic speech recognition and speech synthesis
14
Q
What are the key features of ARPAbet?
A
- uses a distinct set of ASCII symbols for each phoneme, based on American English
- Each phoneme represented by capital letters and numbers (ex. /k/ represented as K)
15
Q
Why is ARPAbet used?
A
- simplifies the representation of English phonemes for use in speech technology
- easy to integrate into speech synthesis and recognition software without needing special characters