SR Flashcards by Leonard F

What is Speech Recognition?

An interdisciplinary subfield of computational linguistics that develops methods and technologies enabling computers to recognize spoken language and translate it into text.

ASR

automatic speech recognition

STT

speech to text

Speech Signal:

Amplitude as a function of time

Fundamental problem
1 Given:
2 Wanted:
3 Search:

1: an observation sequence X = x1, x2, …, xT (obtained from the signal via ADC and FFT)
2: the corresponding word sequence W = w1, w2, …, wm
3: the most likely word sequence W’
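
One way to picture how the observation X = x1, x2, …, xT could come about is sketched below. This is only an illustration under stated assumptions: the sample values, the frame length, the hop size, and the helper names `dft_magnitudes` and `observation_sequence` are all hypothetical, and a naive DFT stands in for the FFT (same result, slower).

```python
import cmath
import math


def dft_magnitudes(frame):
    """Naive O(N^2) discrete Fourier transform; returns magnitudes of bins 0..N/2."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def observation_sequence(samples, frame_len, hop):
    """Cut the digitized samples into overlapping frames; each frame becomes one x_t."""
    frames = [samples[i:i + frame_len]
              for i in range(0, len(samples) - frame_len + 1, hop)]
    return [dft_magnitudes(f) for f in frames]


# Hypothetical "ADC output": 64 samples of a sine with 4 cycles per 16 samples.
samples = [math.sin(2 * math.pi * 4 * t / 16) for t in range(64)]

# X is the observation sequence; T = len(X), and each x_t is a spectral vector.
X = observation_sequence(samples, frame_len=16, hop=8)
```

Each element of `X` plays the role of one x_t; for this test tone the energy of every frame concentrates in a single frequency bin.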

W’

= argmax_W P( W|X )

P( W|X )

p( X|W ) * P( W ) / p( X )
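
Written out as a short derivation, Bayes' rule shows why the search only needs the acoustic model and the language model: p( X ) does not depend on W, so it can be dropped from the maximization. The notation follows the cards (p for the density of the observation, P for word-sequence probabilities).

```latex
\begin{aligned}
W' &= \arg\max_{W} P(W \mid X) \\
   &= \arg\max_{W} \frac{p(X \mid W)\, P(W)}{p(X)} \\
   &= \arg\max_{W} p(X \mid W)\, P(W)
\end{aligned}
```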

p( X|W ) The acoustic model

how likely it is to observe X when W is spoken

P( W ) The language model

how likely it is, a priori, that W is spoken (independent of the observation X)

What is X ?

The Problem of Pre-Processing (Vorverarbeitung)

What is p( X|W ) ?

The Problem of Acoustic Modelling (Akustische Modellierung)

What is P( W ) ?

The Problem of Language Modelling (Sprachmodellierung)

How do we find W’ = argmax_W P( W|X )?

The Search Problem (Suche)
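
A toy version of the search can be sketched as follows. It is a minimal illustration, not how a real recognizer works: the candidate word sequences and all probability values are invented, the function name `decode` is hypothetical, and exhaustive enumeration is only feasible here because there are two candidates. Scores are combined in the log domain, a common way to avoid numerical underflow.

```python
import math

# Hypothetical acoustic model scores p(X|W) for one fixed observation X.
acoustic = {
    ("recognize", "speech"): 0.30,
    ("wreck", "a", "nice", "beach"): 0.35,
}

# Hypothetical language model scores P(W).
language = {
    ("recognize", "speech"): 0.010,
    ("wreck", "a", "nice", "beach"): 0.001,
}


def decode(candidates, acoustic, language):
    """Return the W that maximizes log p(X|W) + log P(W) over the candidates."""
    def score(w):
        return math.log(acoustic[w]) + math.log(language[w])
    return max(candidates, key=score)


best = decode(list(acoustic), acoustic, language)
```

Here the second candidate fits the audio slightly better, but the language model makes the first candidate far more plausible, so the product p( X|W ) * P( W ) favors it.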

SR Flashcards

(13 cards)