Introduction to Speech Recognition Flashcards

1
Q

What probability model will we construct? How does this work

A

Hidden Markov Model.

Construct a statistical model of how signals are generated and use probability calculus in order to update of belief.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a sound wave? How do we extract useful features?

A

A measure of a change in air pressure over time. Signal processing.

  1. break signal up into a sequence of overlapping segments
  2. Use a Fourier transform to extract the dominant frequencies of the signal.
  3. Obtain a set of Mel-frequency cepstrum coefficients
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is an MFCC?

A

Mel-frequency cepstrum coefficients.
Numbers representing the contribution from different frequency bands obtained by Fourier transform.
Each segmented part of the speech signal has a vector of (13) MFCC features

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a phoneme? What do they help us to do?

A

small elementary utterances. Help us to represent longer words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a statistical language model?

A

Disambiguate cases by identifying simple patterns in language. i.e. which word is likely to follow another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How do we combine language models with evidence fro speech signals?

A

Probabilistic methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is training?

A

process whereby the parameters of a model are adapted to a particular problem domain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can sensory information be represented in useful ways?

A

Sound waves

Robust sensor data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly