Introduction to Speech Recognition Flashcards

Question 1

Q

What probability model will we construct? How does this work

Answer

A

Hidden Markov Model.

Construct a statistical model of how signals are generated and use probability calculus in order to update of belief.

Question 2

Q

What is a sound wave? How do we extract useful features?

Answer

A

A measure of a change in air pressure over time. Signal processing.

break signal up into a sequence of overlapping segments
Use a Fourier transform to extract the dominant frequencies of the signal.
Obtain a set of Mel-frequency cepstrum coefficients

Question 3

Q

What is an MFCC?

Answer

A

Mel-frequency cepstrum coefficients.
Numbers representing the contribution from different frequency bands obtained by Fourier transform.
Each segmented part of the speech signal has a vector of (13) MFCC features

Question 4

Q

What is a phoneme? What do they help us to do?

Answer

A

small elementary utterances. Help us to represent longer words

Question 5

Q

What is a statistical language model?

Answer

A

Disambiguate cases by identifying simple patterns in language. i.e. which word is likely to follow another.

Question 6

Q

How do we combine language models with evidence fro speech signals?

Answer

A

Probabilistic methods

Question 7

Q

What is training?

Answer

A

process whereby the parameters of a model are adapted to a particular problem domain

Question 8

Q

How can sensory information be represented in useful ways?

Answer

A

Sound waves

Robust sensor data

Introduction to Speech Recognition Flashcards

(8 cards)