N-gram Modeling Flashcards
Probability of a Sentence
The product of the probability of each word given its context (the words that precede it).
Count-based Models
To get the probability of a word, count how often it occurs in a corpus (optionally conditioned on the previous words) and divide by the count of that context (or by the total number of words for a unigram).
Count-based Model Formula
P(w_i \mid w_{i-n+1}, \ldots, w_{i-1}) = \frac{C(w_{i-n+1}, \ldots, w_{i-1}, w_i)}{C(w_{i-n+1}, \ldots, w_{i-1})}
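A minimal sketch of this count-based estimate for the bigram case (n = 2); the toy corpus and function name are illustrative, not from any particular library:

```python
from collections import Counter

def bigram_prob(corpus_tokens, prev, word):
    """MLE bigram estimate: P(word | prev) = C(prev, word) / C(prev)."""
    bigrams = Counter(zip(corpus_tokens, corpus_tokens[1:]))
    unigrams = Counter(corpus_tokens)
    if unigrams[prev] == 0:
        return 0.0
    return bigrams[(prev, word)] / unigrams[prev]

tokens = "the cat sat on the mat the cat ran".split()
print(bigram_prob(tokens, "the", "cat"))  # 2/3: "the" occurs 3x, "the cat" 2x
```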
Probability of a Sentence Formula
P(S) = P(w_1) \cdot P(w_2 \mid w_1) \cdot P(w_3 \mid w_2) \cdots P(w_n \mid w_{n-1})
(This is the bigram approximation; the full chain rule conditions each word on all preceding words.)
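Multiplying many small probabilities underflows for long sentences, so implementations usually sum log-probabilities instead; a minimal sketch with made-up per-word probabilities:

```python
import math

def sentence_prob(probs):
    """P(S) as a product of per-word probabilities, computed in log space
    to avoid floating-point underflow on long sentences."""
    return math.exp(sum(math.log(p) for p in probs))

# Hypothetical values for P(w1), P(w2|w1), P(w3|w2), P(w4|w3).
print(sentence_prob([0.1, 0.4, 0.25, 0.5]))  # ~0.005
```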
Perplexity
The inverse probability of the sentence, normalized by the number of words in the sentence: PP(S) = P(w_1, \ldots, w_n)^{-1/n}.
HIGHER perplexity means less confident: the model spread probability over more plausible next words.
LOWER perplexity means more confident: the model spread probability over fewer plausible next words.
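A minimal sketch, reusing the same made-up per-word probabilities as above:

```python
import math

def perplexity(probs):
    """PP(S) = P(S)^(-1/n): the geometric mean of the inverse
    per-word probabilities."""
    n = len(probs)
    return math.exp(-sum(math.log(p) for p in probs) / n)

print(perplexity([0.1, 0.4, 0.25, 0.5]))  # ~3.76 (less confident)
print(perplexity([0.9, 0.9, 0.9, 0.9]))   # ~1.11 (more confident)
```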
Ways to Evaluate Models
- Log-likelihood
- Per-word Log Likelihood
- Per-word (cross) Entropy *
- Perplexity ***
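These quantities are simple transformations of one another; a sketch with hypothetical per-token probabilities (cross-entropy is taken in bits here, i.e. log base 2, which is one common convention):

```python
import math

# Hypothetical probabilities a model assigns to each token of a test sentence.
probs = [0.1, 0.4, 0.25, 0.5]
n = len(probs)

log_likelihood = sum(math.log(p) for p in probs)      # higher (closer to 0) is better
per_word_ll    = log_likelihood / n                   # length-normalized
cross_entropy  = -log_likelihood / (n * math.log(2))  # bits per word
perplexity     = math.exp(-log_likelihood / n)        # equals 2 ** cross_entropy

print(log_likelihood, per_word_ll, cross_entropy, perplexity)
assert abs(perplexity - 2 ** cross_entropy) < 1e-9
```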
Ways to Sample Text with LMs
- Greedy decoding
- Beam search
- Nucleus sampling
- Top-k sampling
Greedy Decoding
Choose the next word with the highest probability given the previous word (argmax at every step).
Problem: the most probable next word at each step doesn't necessarily lead to the most probable sentence.
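A minimal sketch of greedy decoding over a toy bigram table (the table and all probabilities are made up for illustration):

```python
def greedy_decode(next_probs, start, max_len=10, eos="</s>"):
    """next_probs(word) -> dict mapping candidate next words to probabilities.
    Always pick the single most probable continuation."""
    sent = [start]
    while len(sent) < max_len:
        dist = next_probs(sent[-1])
        word = max(dist, key=dist.get)  # argmax: the greedy choice
        if word == eos:
            break
        sent.append(word)
    return sent

# Toy bigram distributions (hypothetical).
table = {
    "<s>": {"the": 0.6, "a": 0.4},
    "the": {"cat": 0.5, "dog": 0.3, "</s>": 0.2},
    "cat": {"sat": 0.7, "</s>": 0.3},
    "sat": {"</s>": 1.0},
}
print(greedy_decode(lambda w: table[w], "<s>"))  # ['<s>', 'the', 'cat', 'sat']
```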
Beam Search
Maintain several candidate paths (beams) for the most probable sentences. With a beam width of 2:
1. Choose the two most probable first words.
2. Calculate the total probability of each sentence so far.
3. Choose the two most probable next words for each sentence.
4. Eliminate the two least likely sentences.
5. Repeat.
Problem: the beams fill with many similar sequences, and the output stays too close to the single most probable (often generic) text.
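A sketch of beam search with beam width 2 over a toy bigram table (probabilities made up); note that it recovers "a dog" (probability 0.36), which greedy decoding would miss by committing to "the" first:

```python
import math

def beam_search(next_probs, start, beam_width=2, max_len=4, eos="</s>"):
    """Keep the beam_width highest-scoring partial sentences at each step,
    scored by cumulative log-probability."""
    beams = [([start], 0.0)]
    for _ in range(max_len):
        candidates = []
        for tokens, score in beams:
            if tokens[-1] == eos:  # finished hypotheses carry over unchanged
                candidates.append((tokens, score))
                continue
            for word, p in next_probs(tokens[-1]).items():
                candidates.append((tokens + [word], score + math.log(p)))
        # Keep only the beam_width most probable expansions.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

# Toy bigram distributions (hypothetical).
table = {
    "<s>": {"the": 0.6, "a": 0.4},
    "the": {"cat": 0.5, "dog": 0.3, "</s>": 0.2},
    "a":   {"dog": 0.9, "</s>": 0.1},
    "cat": {"</s>": 1.0},
    "dog": {"</s>": 1.0},
}
for tokens, score in beam_search(lambda w: table[w], "<s>"):
    print(" ".join(tokens), math.exp(score))  # "a dog" (0.36) beats "the cat" (0.3)
```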
Nucleus Sampling
Take the smallest set of words whose cumulative probability mass reaches p (the "nucleus"), renormalize, and sample within that set.
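A minimal sketch, assuming the next-word distribution is available as an explicit dict (all names and probabilities illustrative):

```python
import random

def nucleus_sample(dist, p=0.9):
    """Sample from the smallest set of words whose cumulative probability
    mass reaches p (the 'nucleus'), after renormalizing."""
    items = sorted(dist.items(), key=lambda kv: kv[1], reverse=True)
    nucleus, cum = [], 0.0
    for word, prob in items:
        nucleus.append((word, prob))
        cum += prob
        if cum >= p:
            break
    words, probs = zip(*nucleus)
    total = sum(probs)
    return random.choices(words, weights=[q / total for q in probs])[0]

dist = {"cat": 0.5, "dog": 0.3, "fish": 0.15, "zebra": 0.05}
print(nucleus_sample(dist, p=0.9))  # samples from {cat, dog, fish}; zebra excluded
```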
Top-k Sampling
Take the k most likely words, renormalize their probabilities, and sample from those.
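A sketch under the same assumption of an explicit distribution dict:

```python
import random

def top_k_sample(dist, k=2):
    """Restrict to the k most probable words, renormalize, and sample."""
    top = sorted(dist.items(), key=lambda kv: kv[1], reverse=True)[:k]
    words, probs = zip(*top)
    total = sum(probs)
    return random.choices(words, weights=[q / total for q in probs])[0]

dist = {"cat": 0.5, "dog": 0.3, "fish": 0.15, "zebra": 0.05}
print(top_k_sample(dist, k=2))  # samples only from {cat, dog}
```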