Speech Recognition Flashcards

Question 1

Q

Segmentation is one problem listeners with a speech signal have. What is it?

Answer

A

Dividing the speech input into phonemes (units of sounds) and words

Question 2

Q

What is coarticulation?

Answer

A

Speech signal is variable because pronunciation of a phoneme depends on speaker’s pronunciation of preceding and following phonemes

Question 3

Q

What are speech signals due to?

Answer

A

Differences in speaker’s sex, dialect and speaking rate

Question 4

Q

How phonemes do speakers typically produce per second?

Answer

A

Around 10 - listeners must identify what is being said very rapidly. Non-native speakers often produce many speech errors

Question 5

Q

What is energetic masking?

Answer

A

Speech signal is hard to perceive due to distracting sounds e.g. other speakers

Question 6

Q

What are the 3 cue categories mattys et al identified in the hierarchical approach to segmentation?

Answer

A

Lexical, syntax, word knowledge -> used if listening conditions are good
Segmental -> coarticulation
Metrical prosody, word stress -> used only if other cues cannot be used

Question 7

Q

What is speaker’s variability?

Answer

A

Listeners use speaker characteristics e.g. American accent to form a speaker model. This speaker model then influence how listeners interpret speech signal

Question 8

Q

add in info from graph

Speech Recognition Flashcards

(8 cards)