Lecture 6 Flashcards

1
Q

Describe the main steps in text-to-speech synthesis

A
  1. Text-to-phoneme conversion: the text is converted into strings of phoneme symbols.
  2. Speech synthesis: the phoneme symbols computed in the previous stage are used to compute a digital waveform. The waveform then undergoes digital-to-analogue conversion, which turns it into an electrical current that generates sound from a loudspeaker.
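The two stages above can be sketched in a few lines of Python. This is only a toy illustration, not a real synthesiser: the lexicon, the tone assigned to each phoneme, and the function names are all hypothetical, and the final digital-to-analogue step is left to the sound hardware.

```python
import math

SAMPLE_RATE = 8000  # samples per second (assumed value)

# Hypothetical toy lexicon: word -> phoneme symbols.
LEXICON = {"hi": ["HH", "AY"], "me": ["M", "IY"]}

# Hypothetical tone per phoneme in Hz, so each symbol yields a distinct sound.
PHONEME_HZ = {"HH": 200.0, "AY": 300.0, "M": 150.0, "IY": 350.0}


def text_to_phonemes(text):
    """Stage 1: convert text into a string of phoneme symbols."""
    phonemes = []
    for word in text.lower().split():
        phonemes.extend(LEXICON[word])
    return phonemes


def phonemes_to_waveform(phonemes, duration_ms=100):
    """Stage 2: compute a digital waveform (a list of samples) from phonemes."""
    samples_per_phoneme = SAMPLE_RATE * duration_ms // 1000
    samples = []
    for phoneme in phonemes:
        freq = PHONEME_HZ[phoneme]
        for i in range(samples_per_phoneme):
            samples.append(math.sin(2 * math.pi * freq * i / SAMPLE_RATE))
    return samples


phonemes = text_to_phonemes("hi me")
waveform = phonemes_to_waveform(phonemes)
```

A real system would replace the lookup table with pronunciation rules or a full dictionary, and the sine tones with a proper synthesis method.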
2
Q

Compare formant speech synthesis (FSS) with concatenation speech synthesis (CSS)

A

FSS: uses two types of sound-source excitation, voice pulses for phonation and noise for aperiodic fricative sounds.

CSS: involves storing, concatenating (linking together), and smoothing sections of prerecorded speech.
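As a rough illustration of the contrast, the sketch below generates the two FSS excitation sources (a periodic pulse train and random noise) and joins two stored segments with a linear crossfade, which is one simple way of smoothing a concatenation. The function names and the crossfade method are assumptions for illustration. In a full formant synthesiser the excitation would then be shaped by resonant filters, which is omitted here.

```python
import random


def voiced_pulses(n_samples, period):
    """FSS voiced source: one unit pulse every `period` samples."""
    return [1.0 if i % period == 0 else 0.0 for i in range(n_samples)]


def fricative_noise(n_samples, seed=0):
    """FSS aperiodic source: uniform random noise in [-1, 1]."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n_samples)]


def concatenate_smoothed(first, second, overlap):
    """CSS: link two recorded segments, crossfading over `overlap` samples.

    Assumes 0 < overlap <= min(len(first), len(second)).
    """
    blended = []
    for i in range(overlap):
        weight = (i + 1) / (overlap + 1)  # rises from near 0 to near 1
        a = first[len(first) - overlap + i]
        b = second[i]
        blended.append((1 - weight) * a + weight * b)
    return first[:-overlap] + blended + second[overlap:]


pulses = voiced_pulses(8, 4)
noise = fricative_noise(8)
joined = concatenate_smoothed([1.0] * 4, [0.0] * 4, overlap=2)
```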

3
Q

Identify the main problems faced by automatic speech recognition systems

A

Segmenting the speech input into words and phonemes, and variability in the speech signal associated with coarticulation, talker differences, speaking rate, dialect, sentence prosody, and background noise level.

4
Q

Describe main steps in automatic speech recognition

A
  1. A-to-D conversion converts voltage levels of the analogue signal picked up by the microphone into digital values
  2. Acoustic processing (codes digital signal and provides the spectral pattern)
  3. Phonetic features are then extracted
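The first two steps can be sketched as follows, assuming an 8-bit converter and a plain discrete Fourier transform as the source of the spectral pattern. The function names, the bit depth, and the test signal are hypothetical, and the phonetic feature extraction of step 3 is not shown.

```python
import cmath
import math


def quantize(voltages, bits=8, v_max=1.0):
    """A-to-D conversion: map voltage levels to signed integer values."""
    levels = 2 ** (bits - 1) - 1  # 127 for 8 bits
    digital = []
    for v in voltages:
        clipped = max(-v_max, min(v_max, v))
        digital.append(round(clipped / v_max * levels))
    return digital


def magnitude_spectrum(samples):
    """Acoustic processing: DFT magnitudes for bins 0 .. n/2."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        total = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        spectrum.append(abs(total))
    return spectrum


SAMPLE_RATE = 64  # samples per second (assumed, kept small for clarity)

# Stand-in for the microphone signal: one second of an 8 Hz sine wave.
analogue = [math.sin(2 * math.pi * 8 * t / SAMPLE_RATE) for t in range(SAMPLE_RATE)]
digital = quantize(analogue)
spectrum = magnitude_spectrum(digital)

# With one second of signal, bin k corresponds to k Hz.
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
```

Here the strongest bin falls at the frequency of the input tone, which is the kind of spectral pattern later stages would work from.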
5
Q

Main ways speech recognition systems may be categorised

A
  1. Discrete (speaker must pause between words) vs continuous (no gaps necessary)
  2. Vocabulary size: small (200 words), often for specific tasks; large (1,000 words); very large (30,000 words), useful for dictation purposes.
  3. Speaker-dependent systems: require the gender of the person using the software to be set, and the software to be trained on or adapted to the user's voice.
6
Q

Identify two distinct clinical applications of speech synthesis and identify factors associated with the technology that need to be considered when they are clinically applied

A
  1. Reading instruction: potential to help children with reading difficulties (hearing the text whilst looking at the words has a positive effect on reading ability).
  2. Communication aids: can provide a means of oral communication for those who cannot speak.
7
Q

Identify two distinct clinical applications of speech recognition, and identify factors associated with the technology that need to be considered when they are clinically applied

A
  1. Dyslexia: narratives can be produced orally, so cognitive resources can be applied to meaning and self-expression rather than to spelling and writing letters.
  2. Physical disabilities that affect standard keyboard control.
8
Q

Advantages/disadvantages of formant speech synthesis

A

Advantages: relatively high efficiency; computational demands are not excessive; achieves fast output rates.

Disadvantages: speech sounds unnatural and robotic; expensive.

9
Q

Advantages/disadvantages of concatenation speech synthesis

A

Advantages: tends to sound more natural and intelligible; shorter development time and, in turn, relatively less expensive.

Disadvantages: expensive in terms of memory storage; limited variation in voice quality.
