Study Blue Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the basic dimensions of consonants?

A

Voicing
Place
Manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The brain isn’t looking for ______: it’s a robust system.

A

a single cue to figure out if the sound is a vowel

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Lisker’s Rule?

A

Each of the distributed acoustic consequences of a gesture has some value as an acoustic cue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Consonants are:

A

Speech sounds characterized by obstruction of the vocal tract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Stop consonants are characterized by:

A

Closure in the vocal tract that is released rapidly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Articulation of stop consonant: vocal tract closure. Acoustic Correlate: _____________

A

Stop gap - energy drops a lot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Articulation of stop consonant: Release of the closure. Acoustic Correlate: _____________

A

Stop burst

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Articulation of stop consonant: Rapid articulatory movements. Acoustic Correlate: _____________

A

Relatively fast formant transitions (mostly F1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Articulation of stop consonant: Rapid opening or closing gesture. Acoustic Correlate: _____________

A

Rapid rise/fall in intensity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Envelope are cues to _____.

A

manner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Perception is: (2 things)

A
  1. Driven by what you expect to hear

2. What really comes up from the periphery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What would Siri look for to tell “dot” from “got”?

A
  1. Energy peak in the spectrum in the burst

2. Second formant transitions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the stop burst spectra look like for a labial?

A

Falling - energy falls with frequency; low energy peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the stop burst spectra look like for an alveolar?

A

Rising - energy increases with frequency; high energy peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the stop burst spectra look like for a velar?

A

Narrow (most intense portion is in the middle, narrow concentration of energy)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The spectrum of a labial burst has most of its energy under:

A

600 Hz, overall down tilt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

The spectrum of the alveolar burst has most of its energy at:

A

3000-4000 Hz, up tilt

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

The alveolar F2 is:

A

flat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

The labial F2 is:

A

rising

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

The velar F2 is:

A

falling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Articulators rapid movement =

A

rapid formant transition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Context condition variability:

A

The F2 is a variability; F1 stays approximately the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Context-conditioned cue:

A

how much F2 rises and falls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Give an example of the complexities of a stop burst:

A

“G” - burst anticipates coarticulation, and it is the reason for complexity
Even though the bursts look the same, we assume them to be different in perception

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are the cues to stop voicing in INITIAL position?

A
  1. VOT
  2. Low vs. high starting position of F1
  3. Relatively larger vs. small F1 change
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What does the “O” mean in VOT?

A

O = voice and burst start at the exact same time

- all languages have “O” in common

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

English is a _____ category language for voicing.

A

2 categories (voiced, voiceless); other languages do this differently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are the average VOTs for voiced and voiceless initial English consonants?

A
Voiced:
/b/ - 1 msec
/d/ - 5 msec
/g/ - 21 msec
(If the VOT is longer than 40 msec - it's perceived as voiceless)
Voiceless:
/p/ - 58 msec
/t/ - 70 msec
/k/ - 80 msec
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Typically, if the VOT is longer than about _____, the stop is perceived as voiceless.

A

40 msec

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Cues to stop voicing in MEDIAL position:

A
  1. Voicing during closure is the most salient cue
  2. Duration of stop gap
  3. Length of preceding vowel
  4. F1 transition for voiced stop
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

Duration of the stop gap is not ___

A

The VOT - it’s the burst

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Which sound has a short stop gap in medial position?

A

Voice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

As the stop gap gets shorter, the stop starts sounding ____.

A

Voiced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

Duration of the stop gaps goes the ______ in final position.

A

the opposite way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What are the cues to stopping voicing in the FINAL position?

A
  1. Voicing during closure (most salient)
  2. Duration of stop gap
  3. Length of preceding vowel
  4. F1 falls for voiced stop
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

Cues to final voiced stop:

A
  1. Voicing during closure
  2. Longer preceding vowel
  3. F1 falls at the end of the vocalic portion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

Cues to final voiceless stop:

A
  1. No voicing during closure
  2. Short preceding vowel
  3. No F1 fall
38
Q

If you do an FFT and nothing is there what does that mean?

A

It’s voiceless!

39
Q

List four manner cues for fricatives:

A

Aperiodic noise component
Duration of noise
No silence
No burst

40
Q

Difference between /s/ and /sh/:

A

Higher amplitude and higher peak in /s/

Peak: 4500-8000 Hz for /s/, 2500, 4500 Hz for /sh/

41
Q

Which phoneme has the highest pitch and energy peak for all of the consonants?

A

/s/

42
Q

/f/ and /th/ have a relatively:

A

flat spectrum

43
Q

What are the place cues for fricatives?

A
  1. Relative amplitude of noise

2. Formant transitions

44
Q

What are the cues for fricatives?

A

Manner Cues: Noise (hissy bit)
Place Cue: major frequency difference
Voicing cue: periodic and aperiodic component

45
Q

What are the cues for affricates?

A
  1. Silence
  2. Burst
  3. Rapid rise time
  4. Short stop + fricative
46
Q

What is rise time?

A

Time from the burst to the highest amplitude in the fricative

47
Q

What is steady state duration?

A

Length

48
Q

Rise time for affricates vs. fricative:

A

Fricatives: 76 ms

Fricative portion of an affricate: 33 ms

49
Q

Steady state duration for affricates vs. fricative:

A

Fricatives: 100 ms
Affricates: 48 ms

50
Q

What is the acoustic manner cue for fricatives?

A

Noise/aperiodic, visual will look like fuzzy stuff

51
Q

What are three place cues for fricatives?

A
  1. Spectral peak
  2. Amplitude
  3. Formant transitions
52
Q

What is the articulatory difference between fricatives and affricates?

A

There is a complete stop/closure of vocal tract for fricatives. Spectrally this is a stop gap

53
Q

You can make the /l/ and /r/ sound the same by:

A

Fiddling with F3
/l/ - has a high, flat F3
/r/ - F3 starts low and rises

54
Q

The acoustic correlate to duration of lip release is:

A

Duration of formant movement

55
Q

Manner cues for stops, the duration is:

A

Fast

56
Q

Manner cues for semivowels, the duration is:

A

Slow

57
Q

Manner cues for labial voiced stops:

A

30 ms or less

58
Q

Manner cues for semivowels:

A

100 ms or more

59
Q

What are the cues to nasal manner of articulation?

A

Characteristic nasal murmur - voiced
Low frequency
Relatively steady state formants
Resonance usually under 300 Hz

60
Q

Steady state formants occur in the nasal because:

A

Formants do not change in frequency because you’re not moving your articulators

61
Q

Nasal formants are usually below:

A

500 Hz

62
Q

How do you tell nasals apart?

A

Formant Transitions

63
Q

Malecot (1958) identification of isolated nasal murmur

A

Stimulus: /m/; responds /m/ 96%, /n/ 4%, /ng/ 0%
Stimulus: /n/; responds /m/ 42%, /n/ 56%, /ng/ 2%
Stimulus: /ng/; responds /m/ 60%, /n/ 28%, /ng/ 12%
NEED FORMANT TRANSITIONS AND RESONANCE TO BE MORE ACCURATE

64
Q

Formant transitions mainly provide _____ cues for articulation.

A

F2 place

65
Q

How can you tell /s/ from /sh/?

A

/s/ is around 4kHZ

/sh/ is around 2.5kHZ

66
Q

What are the three things that give information about the time/amplitude domain?

A

Envelope cues
Periodicity cues
Fine temporal cues

67
Q

Periodicity cues show:

A

Fricative, manner and voicing in the envelope

68
Q

Envelope cues:

A

Segmentation of signal into syllables and phoneme sized units, manner of articulation, strong weak-fricatives, minimal vowel information

69
Q

Fine temporal cues:

A

Frequency of F1

70
Q

Evidence that: Speech perception is not entirely auditory

A

Miller, Hise, and Lichen (1951) - sound plus visual equals 15 dB improvement SNR

71
Q

McGurk Effect:

A

Hear /ba/
See /ga/
Perceive /da/

72
Q

What are the three roles of vision?

A
  1. It directs attention and auditory analysis to the signal rather than the noise.
  2. Vision provides segmental info that is redundant to acoustics
  3. Vision provides info that complements acoustics
73
Q

What is one of the first features that is lost in noise?

A

Place of articulation

74
Q

Sumby and Pollack (1954)

A

1 dB change in SNR = ~5-10% change in intelligibility

75
Q

What does enveme mean?

A

Envelope Feature

76
Q

What does viseme mean?

A

Visual Feature

77
Q

You need ____ envemes and ____ visemes for a 95% correct perception.

A

4 envemes; 6 visemes

78
Q

______ activates primary and secondary _____ blank areas in ______.

A

Lip movements; auditory; superior temporal cortex

79
Q

Top down effects: Perceptual Restoration

A

Stimulus “the legislators will meet over the weekend”
Operation: remove the /s/ and replace with a cough or low-pass filer
Result: you “hear” the /s/

80
Q

What is the cohort activation model?

A

Lexical retrieval
Based on the concept that auditory and visual input to the brain activates a list of word possibilities
(Example: dog)

81
Q

Delayed commitment:

A

Will occur as the signal becomes more and more degraded, and certainty about the phoneme string is reduced
Example: nasals

82
Q

What are two temporal processes in word recognition?

A
  1. Left to right activation of cohorts and strategies to delete words in activation lexicon
  2. Delay of decision
83
Q

Periodicity

A

Strong weak syllables is used

84
Q

Most English words have what pattern?

A

Strong-weak pattern

85
Q

Articulation is in the _____ hemisphere.

A

Left

86
Q

Recognition occurs in _____ hemisphere.

A

Both

87
Q

Speech perception involves ____ and ____.

A

Dorsal and ventral (Duel stream model of activation)

88
Q

Envelop processing is in the _____ hemisphere.

A

Right

89
Q

Extraction of formant transitions is ______

A

bilateral

90
Q

What is the success of the modern CI dependent on?

A

Getting enough information correct to perform top down processing

91
Q

Where do the electrodes lie from a CI?

A

Scala tympani