Introduction to Speech Recognition Flashcards
What probability model will we construct? How does this work
Hidden Markov Model.
Construct a statistical model of how signals are generated and use probability calculus in order to update of belief.
What is a sound wave? How do we extract useful features?
A measure of a change in air pressure over time. Signal processing.
- break signal up into a sequence of overlapping segments
- Use a Fourier transform to extract the dominant frequencies of the signal.
- Obtain a set of Mel-frequency cepstrum coefficients
What is an MFCC?
Mel-frequency cepstrum coefficients.
Numbers representing the contribution from different frequency bands obtained by Fourier transform.
Each segmented part of the speech signal has a vector of (13) MFCC features
What is a phoneme? What do they help us to do?
small elementary utterances. Help us to represent longer words
What is a statistical language model?
Disambiguate cases by identifying simple patterns in language. i.e. which word is likely to follow another.
How do we combine language models with evidence fro speech signals?
Probabilistic methods
What is training?
process whereby the parameters of a model are adapted to a particular problem domain
How can sensory information be represented in useful ways?
Sound waves
Robust sensor data