Chapter 12: Perceiving Speech And Music Flashcards

1
Q

Speech Perception

A
  • Deals with how language sounds are perceived

- Involves relationship between perception and production

2
Q

Phonemes

A
  • smallest unit of sound that can change meaning of word

- sounds we can pronounce

3
Q

Morphemes

A

Smallest unit of language that carries meaning within a word

4
Q

International Phonetic Alphabet (IPA)

A
  • alphabet in which each symbol stands for a different speech sound
  • provides a distinctive way to write each phoneme in all human languages currently in use
5
Q

Producing the sounds of speech

A
  • speech starts in the brain

- after a speaker determines what to say, the other parts of the sound production system come into play

6
Q

Difference in fundamental frequency

A

Male (85 Hz)
Female (150-200 Hz)
Children (300+ Hz)

7
Q

Vocal Folds

A

Aka vocal cords

  • pair of membranes within larynx
8
Q

Larynx

A

Aka voice box

  • part of vocal tract that contains the vocal folds
9
Q

Pharynx

A

Uppermost part of throat

10
Q

Uvula

A

Flap of tissue that hangs off posterior edge of soft palate

- can close off nasal cavity

11
Q

Speech Production System

A

Influenced by contraction and relaxation of throat muscles and by tongue activity

12
Q

Vowels

A

Produced with relatively unrestricted flow of air through pharynx and oral cavity

  • uninterrupted, unrestricted flow
13
Q

Formants

A

Frequency bands with relatively high amplitude in the harmonic spectrum of a vowel sound

14
Q

Consonants

A

Produced by restricting the flow of air at one place or another along the path of airflow from the vocal folds

15
Q

Place of Articulation

A

In production of consonants, points in vocal tract at which airflow is restricted, described in terms of anatomical structures involved in creating restriction

  • closing of lips
  • top teeth and bottom lip
  • tongue behind upper teeth
16
Q

Manner of articulation

A

Nature of restriction of airflow in vocal tract

  • whether air is fully stopped or just restricted
17
Q

Voicing

A

Specifies whether the vocal folds are vibrating or not (whether the consonant is voiced or voiceless)
18
Q

Vowel Sounds: Production and Frequency Spectrum

A

Each vowel sound has its own pattern of formants

Formants = harmonics with increased amplitude for a specific sound

19
Q

Sound Spectrogram

A

Graph that includes dimensions of frequency, amplitude, and time, showing how frequencies corresponding to each sound in utterance change over time

20
Q

Phonemes cannot be identified by mapping […] to specific phonemes

A

Phonemes cannot be identified by mapping specific frequencies to specific phonemes

21
Q

Speech sounds vary, even for the same speaker, for a variety of reasons

A
  • sloppy enunciation
  • speaking with mouth full
  • coarticulation
22
Q

Coarticulation

A

Influence of one phoneme on acoustic properties of another due to articulatory movements required to produce them in sequence

23
Q

Variability in the Acoustics of Phonemes

A

Ex. The difference between “resisting arrest” and “resisting a rest”

24
Q

Categorical perception of phonemes

A
  • refers to the perception of different sensory stimuli as identical, up to a point at which further variation in the stimulus leads to a sharp change in the perception
  • means that a change in some variable along a continuum is perceived not as gradual but as an instance of discrete categories
  • opposite of continuous perception (no sharp changes in perception)
25
Categorical perception and voice onset time (VOT)
Categorical perception happens when the categories an observer possesses influence that observer’s perception. Ex. Ba vs. Da
26
Voice Onset Time (VOT)
In production of stop consonants, the interval between the initial burst of frequencies and the onset of voicing
* varies with the consonant’s place and manner of articulation
27
Phonemic Boundary
VOT at which a stop consonant transitions from being perceived as voiced to being perceived as voiceless
* VOT compares when the sound started with when voicing started
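A minimal sketch of how a phonemic boundary produces categorical labels. The 30 ms boundary and the function name `perceived_stop` are illustrative assumptions, not values from this deck; the actual boundary depends on the consonant pair and the language.

```python
def perceived_stop(vot_ms: float, boundary_ms: float = 30.0) -> str:
    """Label a stop consonant from its voice onset time (VOT).

    boundary_ms is a hypothetical phonemic boundary chosen for illustration.
    """
    return "voiced" if vot_ms < boundary_ms else "voiceless"

# VOT changes in equal 10 ms steps, but the label changes only once.
vot_steps = list(range(0, 61, 10))
labels = [perceived_stop(v) for v in vot_steps]
for vot, label in zip(vot_steps, labels):
    print(f"{vot:2d} ms -> {label}")
```

Stimuli on the same side of the boundary all receive the same label, however far apart their VOTs are, while two stimuli that straddle the boundary receive different labels even when their VOTs are close.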
28
McGurk effect
In perception of speech sounds, when auditory and visual stimuli conflict, the auditory system tends to compromise on a perception that shares features with both the seen and the heard stimuli
29
Knowledge takes 3 forms
1. Knowledge of the grammatical rules of the language and the context in which an utterance is produced
2. Knowledge about the probability of various sequences of phonemes within words or across words in the language they’re hearing
3. Knowledge of the specific words that are expected in a particular situation
30
[…] speech is easier to perceive than […] speech
Grammatical speech is easier to perceive than ungrammatical speech
* grammatical > anomalous > ungrammatical
* ungrammatical speech is characteristic of Wernicke’s area
31
Word Segmentation
- in general, listeners perceive a clear separation between words (segmentation)
- in reality, talking involves creating a continuous, connected stream of sounds, except when pausing
32
Word Segmentation: A different type of perceptual challenge relates to the […] between words in the sound stream of normal speech
Word Segmentation: A different type of perceptual challenge relates to the indistinct boundaries between words in the sound stream of normal speech
33
Infant Learning of Transition Probabilities
Infants can use transition probabilities to predict which sounds come next, which helps them find word boundaries
34
Phoneme Transition Probabilities
For any particular sequence of phonemes, the chances that the sequence occurs at the start of a word, at the end of a word, or across the boundary between two words
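A rough sketch of how transition probabilities could be estimated from a stream of speech. The syllable stream and the two made-up words ("pabiku", "golatu") are hypothetical, and the function name is an assumption for illustration. The idea is that transitions inside a word tend to be more predictable than transitions across a word boundary.

```python
from collections import Counter

def transition_probabilities(stream):
    """Estimate P(next unit | current unit) for every adjacent pair in a sequence."""
    pair_counts = Counter(zip(stream, stream[1:]))
    first_counts = Counter(stream[:-1])  # the last unit has no successor
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}

# Hypothetical continuous stream built from two made-up words, with no pauses.
stream = ("pa bi ku go la tu pa bi ku pa bi ku "
          "go la tu go la tu pa bi ku").split()
probs = transition_probabilities(stream)

within_word = probs[("pa", "bi")]   # always followed by "bi" in this stream
across_words = probs[("ku", "go")]  # "ku" is followed by either "go" or "pa"
print(within_word, across_words)
```

In this toy stream the within-word transition is fully predictable, while the transition across the word boundary is not, so a dip in transition probability is a plausible cue to where one word ends and the next begins.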
35
Phonemic Restoration Effect
Kind of perceptual completion in which listeners seem to perceive obscured or missing speech sounds
36
Results of Shahin and Miller’s (2009) study reinforce the conclusion that […]
Results of Shahin and Miller’s (2009) study reinforce the conclusion that knowledge is important in phonemic restoration
* knowledge of the mouth movements associated with specific words and their phonemes
37
Aphasia
Impairment in speech production, comprehension, or both, caused by damage to speech centers in the brain
Broca’s aphasia: speech production
Wernicke’s aphasia: speech comprehension
Globus aphasia: arcuate fasciculus
38
Speech Pathways in Brain
Ventral Pathway: meaning and combo of words (“what”)
Dorsal Pathway: production of speech using motor system (“where/how”)
39
Music: No other creature seems to have the ability to compose music other than humans
- music has the ability to evoke emotional responses from humans
- understanding music requires an appreciation of the combinations of pitch, loudness, timing, and timbre that composers can use to create a musical experience
40
Pitch
Perceptual basis of organization with notes separated by proportionally equivalent intervals
- notes separated by an octave are perceptually more similar than notes separated by some other intervals
- semitone intervals are perceptually equivalent to one another
41
Octave
Sequence of notes in which the fundamental frequency of the last note is double the fundamental frequency of the first note. Ex. A3 = 220 Hz, A4 = 440 Hz
42
Semitones
12 proportionally equivalent intervals between notes in an octave
- the interval between A1 and A1# is perceptually equivalent to the interval between A8 and A8#
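The arithmetic behind "proportionally equivalent" can be sketched as follows, assuming equal temperament with A4 tuned to 440 Hz (a standard tuning convention, not stated in this deck). Each semitone multiplies frequency by the same ratio, 2^(1/12), so 12 semitones double it. The function name is illustrative.

```python
def note_frequency(semitones_from_a4: int, a4_hz: float = 440.0) -> float:
    """Frequency of the note a given number of semitones above (or below) A4.

    Assumes equal temperament; a4_hz = 440.0 is the assumed reference tuning.
    """
    return a4_hz * 2 ** (semitones_from_a4 / 12)

semitone_ratio = 2 ** (1 / 12)   # about 1.0595

a3 = note_frequency(-12)         # one octave below A4
a4 = note_frequency(0)
a_sharp4 = note_frequency(1)     # one semitone above A4
a5 = note_frequency(12)          # one octave above A4

# The same ratio separates neighboring notes anywhere in the range,
# even though the difference in Hz grows as the notes get higher.
low_step = note_frequency(-35) / note_frequency(-36)
high_step = note_frequency(37) / note_frequency(36)
print(a3, a4, a5, round(semitone_ratio, 4))
```

Because the ratio rather than the difference in Hz is constant, a semitone low on the keyboard spans only a few Hz while a semitone high on the keyboard spans hundreds, yet the two are heard as the same size of step.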
43
C3 and C4 are perceptually […] than B4 and C4
C3 and C4 are perceptually more similar than B4 and C4
44
Pitch helix illustrates similarity among pitches […]
Pitch helix illustrates similarity among pitches geometrically
- Tone chroma
- Tone height
Distance between successive notes along the helix is constant, matching the perception of a constant difference in pitch
45
Tone Chroma
Difference in pitch within octave
46
Tone Height
Octave in which pitch appears
47
Dynamics
Manner in which loudness varies as a piece of music progresses
48
Rhythm
Temporal patterning of events in a musical composition - Tempo - Beat - Meter
49
Tempo
How fast/ slow overall piece is
50
Beat
Equally spaced pulses that can express fast or slow tempo
51
Meter
Temporal patterning of strong and weak pulses in beat over time
52
Dimensions of Music: Timbre
Attack and decay: ways in which harmonic components begin and then fade away
53
Melody
Sequence of musical notes arranged in a particular rhythmic pattern, which listeners perceive as a single, recognizable unit
Melodies can be recognized even when the notes are transposed to a different musical key
* infants recognize melodies at 6 months
54
Transposition
Two versions of the same melody, containing the same intervals but starting at different notes
55
Scales
Particular subset of notes in an octave
* major and minor scales
56
Key
Scale that functions as basis of musical compositions
57
Harmony
Consonance and Dissonance
- some combinations of notes are consonant while others are dissonant
- this is due to the harmonicity, or lack thereof, in the harmonics of the combined tones
58
Consonance
Quality exhibited by combo of 2+ notes from scale that sounds pleasant
59
Dissonance
Quality exhibited by combo of 2+ notes from scale that sounds unpleasant
60
Harmonicity
Extent to which harmonics of notes played in combo coincide with harmonics of notes with lower fundamental frequency
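One way to make this definition concrete is to count how many harmonics of the higher note land exactly on a harmonic of the lower note. This is a simplified toy measure written for illustration, not a formula from this deck; the function name, the choice of 12 harmonics, and the example frequencies are all assumptions, and a realistic measure would allow near misses rather than require exact matches.

```python
def harmonic_overlap(low_hz: int, high_hz: int, n_harmonics: int = 12) -> float:
    """Fraction of the higher note's first n harmonics that are also
    integer multiples (harmonics) of the lower note's fundamental."""
    high_harmonics = [high_hz * k for k in range(1, n_harmonics + 1)]
    coinciding = [f for f in high_harmonics if f % low_hz == 0]
    return len(coinciding) / n_harmonics

octave = harmonic_overlap(220, 440)        # frequency ratio 2:1
fifth = harmonic_overlap(220, 330)         # frequency ratio 3:2
tritone_like = harmonic_overlap(220, 311)  # roughly a tritone above 220 Hz
print(octave, fifth, tritone_like)
```

Under these assumptions the octave shows complete overlap, the fifth shows partial overlap, and the tritone-like pair shows none, which lines up with the ordering from consonant to dissonant described in the Harmony card.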
61
Gestalt Principles of Melody
- Proximity - Similarity - Closure
62
Neural Basis of Music Perception
Once neural information leaves the primary auditory cortex, the brain has areas that are more active when processing certain types of sounds
63
Fixed-pitched sequence vs silence
More active left and right auditory cortex
64
Changing-pitch sequence vs fixed-pitched sequence
Only right auditory cortex responds
65
Color Music Synesthesia
Perceiving color whenever music is played
* each note has its own color
66
Mirror Neurons in Music
When non-musicians listened to songs they learned on piano, brain areas for music perception and finger movements were activated
67
Absolute Pitch
Ability to listen to isolated notes and name them accurately and effortlessly
68
Antonia
Can’t match or identify pitch
69
Amusia
Profound impairment in perceiving and remembering melodies and in distinguishing one melody from another
- congenital or developed after brain damage
- 4% of population
- music can sound like pots and pans
70
Musical training and experience
Solidifies musical areas in brain
- magnitude of brain activity is higher in experienced musicians than in nonmusicians
- both groups showed equivalent patterns of activity
71
Language, Culture, and Music
- Music and Language: some languages are more lyrical than others
- Learning and Culture: both can affect music perception
72
Musical Illusions
- Shepard Tones
- Octave Illusion
- Tritone Paradox
73
Shepard Tones
Layered tones separated by one octave
- top line gets quieter, middle line stays the same, and bottom line gets louder
- pitch seems to keep increasing
74
Octave Illusion
2 notes that are one octave apart are played alternately
75
Tritone paradox
Sequentially paired Shepard tones that some people perceive as ascending and others perceive as descending
76
Automatic Speech Recognition
Accurate perception of human speech by machines
77
Speech Perception by Machines
1. Many modern devices incorporate automatic speech recognition (ASR) to allow user control via spoken commands and requests
2. The first step in the process of ASR is to convert the waveform of the utterance into a set of feature vectors that capture the spectral information in the speech
3. Then the acoustic parameters that characterize the speech sounds are computed
4. Finally, linguistic and lexical information is used to help guide the search for the most probable word sequence corresponding to the acoustic parameters (hypothesis search)
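The waveform-to-feature-vector step can be sketched in a simplified form: cut the signal into short overlapping frames and take the magnitude spectrum of each frame. Everything specific here is an assumption for illustration (the 8000 Hz sample rate, 25 ms frames, 10 ms hop, the Hann window, the synthetic 440 Hz tone, and the function names); real ASR systems use more elaborate features and this sketch does not cover the later steps.

```python
import cmath
import math

def frame_signal(samples, frame_len, hop):
    """Split a waveform into overlapping frames of frame_len samples."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, hop)]

def magnitude_spectrum(frame):
    """Feature vector for one frame: magnitudes of a Hann-windowed DFT."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(windowed)))
            for k in range(n // 2 + 1)]

SAMPLE_RATE = 8000   # assumed samples per second
FRAME_LEN = 200      # 25 ms frames (assumed)
HOP = 80             # 10 ms between frame starts (assumed)

# Stand-in for an utterance: 0.1 s of a pure 440 Hz tone.
samples = [math.sin(2 * math.pi * 440.0 * t / SAMPLE_RATE) for t in range(800)]

frames = frame_signal(samples, FRAME_LEN, HOP)
features = [magnitude_spectrum(f) for f in frames]   # one vector per frame

first = features[0]
peak_bin = max(range(len(first)), key=lambda k: first[k])
peak_hz = peak_bin * SAMPLE_RATE / FRAME_LEN
print(len(frames), len(first), peak_hz)
```

Stacking these per-frame vectors over time gives the same kind of frequency-by-time picture as a sound spectrogram; the direct DFT used here is slow but keeps the sketch free of external libraries.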