Définitions Flashcards
Define the notion of spontaneous speech and mention the names of seven sources for collecting spontaneous speech corpora.
Refers to any naturally occurring discourse in a social context in which the participants freely choose their words (vocabulary) and syntax.
Collected from the following sources:
- interview
- dialog/face to face conversation
- self talk
- storytelling
- jokes, the retelling of dreams etc as other types of spontaneous speech
- Sports reporting in radio/television broadcasts
- movie and theatre language
Explain what speech science is. What fields are related to speech science?
Speech science is the name of a general field of scientific research that investigates human speech in terms of physical processes involved in its production and perception: hence it is divided into study of speech production and speech perception.
The fields related to speech science are the following: child language acquisition, teaching speech, the study of pathological speech and speech disorders, speech synthesis, speech processing, speech technology, speech recognition, voice identification, speech-to-speech translation. All these fields benefit from the contributions of acoustic phonetics.
Explain what we study in acoustic phonetics.
In acoustic phonetics we study how the speech sounds are formed acoustically in terms of the acoustic manifestation of the linguistics features occurring in speech, and we investigate their relationship from the percepectives of their production and perception mechanisms.
Explain what is meant by the term speech corpus
A speech corpus is a database of speech consisting of speech audio files usually with corresponding written transcription of texts.
Explain the differences between read aloud speech and spontaneous speech.
Read aloud speech involves a more carefully articulated performance of the speech sounds and prosody during which the speaker tends to or has to be more formal = lacks the naturalness of spontaneous speech in terms of the natural articulation of speech sounds and prosody. The naturalness in speech includes the dimensions of speech rate variations of the utterance, the energy intensity variations of sounds throughout the utterance, as well as the hesitations and pauses or micro-pauses made by the speaker. A read aloud speech also lacks the context of situation in which natural spoken language is physically realized, as well as the speech style.
What are the sources of the corpora of read aloud speech?
1) - News broadcasts —
2) — News broadcasts and excerpts from books such as story reading are more suited for the supra-segmental analysis of speech than vowels and consonants.
3) Word lists: read-aloud isolated words and sentences in a phonetic laboratory — This type of corpus is the most preferred one for the acoustic analysis of the vowels and consonants of a language.
What is the objective of conducting a speech waveform analysis? What is required for a speech waveform analysis?
We conduct a waveform analysis to recognize certain elements/acoustic features of the physical composition of the spoken segment under analysis.
(1) the acoustic signal of the segment that has been recorded using a microphone, (2) a digital computer, and (3) speech acoustic software in order to derive the waveform from the signal and make it appear on the computer screen
Explain what is meant by the term voice source.
the term voice source of energy refers to the energy that is produced by the periodic pulsing of the vocal folds. This happens when the airflow enters from the lungs into the mouth while the vocal folds are vibrating
Explain what is meant by the term noise source.
The term noise source of energy refers to the energy of the airflow that is not pulsed by the vibrations of the vocal folds. The noise source involves random/irregular vibrations of the airflow
Explain what is meant by the term transient noise
This noise is produced in the vocal tract when there is an abrupt release of stopped air in the vocal tract. This noise is also called the noise burst. It is produced and perceived only in released stop consonants.
Explain what is meant by the term aspiration noise
This type of turbulence noise (of energy) is produced when the airflow is rapidly disturbed/modulated at the glottis
The aspiration noise is usually and perceived in the released aspirated stop consonants of English
Explain what is meant by the term spectrographic analysis? What acoustic parameters are studied in a spectrographic analysis?
The technique of analysis of sounds and sequences of sounds through their spectrograms is known as the spectrographic analysis. (1) frequency composition, (2) the intensity, and (3) the time dimension of the sound resonance
Explain what is meant by the term turbulence noise.
This type of turbulence noice (of energy) is produced when the airflow rapidly passes through a narrow constriction in the oral cavity. The friction noise is the sound source of all fricative consonants, whether they are phonetically voiced or voiceless.
What acoustic information can be provided in a waveform analysis? (Mention only four. Do not explain.)
- Voicing feature in vowels and consonants
- The voicing features in stop consonants occuring in syllable initial position in English word: positive VOT and zero VOT
- Occlusion in stop and affricate consonants.
- Stop burst of energy
- Friction noise in voiceless fricative consonants and voiceless affricates: aperiodic waveform
- Display of the energy intensity of sound segments in general.
What acoustic information is not observed in a speech waveform?
The information regarding (1) the frequency range of energy distribution and energy concentration in consonants, (2) the formants of vowels and sonorant consonants, (3) the formant transitions into or out of adjacent vowels, (4) the anti-formants in nasal and nasalized sounds, etc