Speech perception Flashcards
Phoneme
a unit of sound that distinguishes one word from another in a particular language
Number of speech sounds across all languages
over 850 different speech sounds.
3 steps of speech
Respiration, phonation, articulation
Respiration
The diaphragm pushes air out of lungs, through the trachea, up to the larynx.
Phonation
The process through which vocal folds are made to vibrate when air pushes out of the lungs, creating a certain frequency or sound (just a tone at this stage)
at the _____, the air must pass through two vocal folds
Larynx
What causes pitch differences in speech ?
More tension will cause more high-pitched sounds
- small vocal folds = high-pitched voices (chidren < women < men)
Harmonic spectrum
- spectrum of sound passing through the vocal folds (amplitude on y axis, frequencies on x axis)
- made up of multiple integers of the fundamental frequency
Articulation
act or manner of producing a speech sound using the vocal tract.
filter of speech
Articulation is the filter : amplifies certain frequencies, reduce others
vocal tract
area above the larynx (oral + nasal tract)
How can humans change the shape of their vocal tract ?
by manipulating their jaws, lips, tongue body, tongue tip, and velum (soft palate) – this is what we call “articulation” and what will produce the different phonemes
Resonance characteristics are created by …
changing the size and shape of the vocal tract to affect sound frequency distribution : increase/decrease energy at different frequencies.
formants
Peaks in the speech spectrum
- labelled by number, from lowest to highest frequency (F1, F2, F3)
- Formants have higher frequencies for people who have shorter vocal tracts. It is the relationship between the formants, usually first 3 to identify a phoneme, that counts.
Spectrograms
flipped spectrum with z axis as intensity
x: time
y: frequency
color: energy (amplitude), red = lot of intensity
How many sounds do we produce per second ?
10–15 consonants and vowels per second (can be doubled if you talk fast).
Coarticulation
Overlap in articulatory or speech patterns due to articulation in anticipation of the next consonant or vowel
Why can consonants preceding vowels be difficult to identify only from their formants ?
Because the vocal tract is preparing to produce a different vowel
- Even though we perceive the same consonant no matter the following vowel, the formants are actually quite different
“Motor theory” of speech perception
Motor processes used to produce speech sounds are used in reverse to understand the acoustic speech signal, “reading lips”
McGurk Effect
what someone sees can affect what they hear.
Seeing someone saying “ga” and reading “ba” at the same will lead you to hear something in between : “da”
- Using knowledge of articulation to perceive phonemes
Learning to listen
Already at four days old, infants, French babies prefer hearing French over Russian.
- However, the ability to distinguish phonemes starts off as almost universal, and decreases with time (e.g. after 10 months old, English infants lose the ability to distinguish Hindi phonemes not present in English)
Wernicke’s area
Area of the brain that is responsible with associating sound with meaning
Steps in associating sound with meaning
- Decode sound in primary auditory cortex and belt, para belt regions
- Brain sends this information to Wernicke’s area for meaning decoding
Wernicke or fluent (no problem speaking) aphasia
- Can hear well
- Lose word-sound-meaning connection
- Speech sounds non-sensical but fluent