Language Models Flashcards

1
Q

Language Models

A

How likely is this word sequence?

Probabilistic models of the likelihood of a string occurring

Compute the maximum likelihood estimate
(get probabilities by estimating relative frequency from the text)

For longer sequences, the counts are too sparse, so use the chain rule and the Markov assumption to estimate the probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

P (“the chair hopped away”)

A

Chain Rule:
P(“the”) * P(“chair”|”the”) * P(“hopped”|”the chair”) * P(“away” | “the chair hopped”)

Markov Assumption:
P(“away” | “the chair hopped”)
= P(“away”|”hopped”) if using bigrams
= P(“away”|”chair hopped”) if using trigrams

How well did you know this?
1
Not at all
2
3
4
5
Perfectly