Topic 2: N-gram modeling Flashcards
N-gram
An N-gram is a sequence of N tokens (words). Example: a bigram is a two-word sequence such as "please turn", and a trigram is a three-word sequence such as "please turn your".
N-gram model
An N-gram model is a statistical language model: it predicts the next word from the previous N-1 words.
Applications of n-gram models
Examples:
- spelling correction
- speech recognition
- augmentative communication
- machine translation
Simple n-grams
The task is to compute the probability of a word w given its history h: P(w | h)
Relative frequency counts
Estimate P(w | h) from counts: out of the times the history h appears in the corpus, how often is it followed by w? Example: estimate P(transparent | its water was so) as C(its water was so transparent) / C(its water was so).
Corpus based estimation
Take the counts from a very large corpus (e.g. the web). Even a huge corpus is too sparse for long histories, because language is productive and many perfectly good sentences never occur in it.
Easier Estimation
This utilizes the chain rule of probability: decompose the joint probability into a product of conditional probabilities. Example: P(its water was so transparent) = P(its) * P(water | its) * P(was | its water) * P(so | its water was) * P(transparent | its water was so)
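In general the chain rule (standard form, consistent with the example above) is:
P(w_1 w_2 ... w_n) = \prod_{k=1}^{n} P(w_k | w_1 ... w_{k-1})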
Intuition of n-gram model
Approximate the history by just the last few words, instead of conditioning on the entire word history.
Markov assumption
The N-gram model makes the Markov (independence) assumption: the probability of a future unit can be predicted without looking too far into the past, i.e. only the last N-1 words of the history matter.
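Written out in standard notation, the approximation is:
P(w_n | w_1 ... w_{n-1}) ≈ P(w_n | w_{n-1})                  (bigram)
P(w_n | w_1 ... w_{n-1}) ≈ P(w_n | w_{n-N+1} ... w_{n-1})    (general N-gram)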
Exercise: bigram model with maximum likelihood estimation (MLE)
<s> I am Sam </s>
<s> Sam I am </s>
<s> I do not like green eggs and ham </s>
Calculate bigram probabilities from this corpus.
P(I|<s>) = 2/3 = 0.67   P(Sam|<s>) = ?   P(am|I) = ?
P(</s>|Sam) = ?   P(Sam|am) = ?   P(do|I) = ?
Answer: P(I|<s>) = 2/3 = 0.67, P(Sam|<s>) = 1/3 = 0.33, P(am|I) = 2/3 = 0.67, P(</s>|Sam) = 1/2 = 0.5, P(Sam|am) = 1/2 = 0.5, P(do|I) = 1/3 = 0.33
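A minimal Python sketch of this MLE bigram estimation (the corpus and expected values come from the exercise above; the function name bigram_prob and the use of Counter are just illustrative choices):

from collections import Counter

# Toy corpus from the exercise; <s> and </s> mark sentence boundaries.
corpus = ["I am Sam", "Sam I am", "I do not like green eggs and ham"]

unigram_counts = Counter()
bigram_counts = Counter()
for sentence in corpus:
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    unigram_counts.update(tokens[:-1])            # history counts (</s> is never a history)
    bigram_counts.update(zip(tokens, tokens[1:]))

def bigram_prob(word, prev):
    # Maximum likelihood estimate: P(word | prev) = C(prev word) / C(prev)
    return bigram_counts[(prev, word)] / unigram_counts[prev]

print(bigram_prob("I", "<s>"))     # 2/3 ≈ 0.67
print(bigram_prob("am", "I"))      # 2/3 ≈ 0.67
print(bigram_prob("</s>", "Sam"))  # 1/2 = 0.5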
Relative Frequency
The MLE probability is obtained by dividing the observed frequency (count) of a sequence by the observed frequency of its prefix.
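For a bigram, this relative-frequency (MLE) estimate takes the standard form:
P(w_n | w_{n-1}) = C(w_{n-1} w_n) / C(w_{n-1})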
Bi-gram count exercise
Refer to the slides: we have a table of unigram counts and a table of bigram counts.
Each bigram probability is calculated by
dividing the value in the bigram table by the unigram count of the first word (the history).
What knowledge can be captured by N-gram probabilities?
- world knowledge
- syntax
- discourse
Evaluating language models: what are the 2 types of evaluation?
Extrinsic evaluation - embed the language model in an application and measure how much the application improves (end-to-end performance); this is expensive.
Intrinsic evaluation - train on a training set and measure the quality of the model on a test set, independent of any application.
Training and testing paradigm
- used to evaluate and compare different model architectures
- the corpus is divided into training, development, and test sets
- a common split is 80:20 (training : held-out)
Intuition of perplexity
Given the task of predicting the next word, the better model is the one that assigns higher probability to the word that actually occurs.
Perplexity as evaluation metric
The best language model is the one that best predicts an unseen test set.
Perplexity is the inverse probability of the test set, normalized by the number of words.
Perplexity calculation:
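For a test set W = w_1 w_2 ... w_N, the standard formula (matching the definition above) is:
PP(W) = P(w_1 w_2 ... w_N)^{-1/N}
and with a bigram model:
PP(W) = ( \prod_{i=1}^{N} 1 / P(w_i | w_{i-1}) )^{1/N}
Lower perplexity means the model assigns higher probability to the test set.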
Generalization
The statistical model is highly dependent on the training corpus.
It is of little use if the training set and the test set come from different genres,
e.g. a model trained on business-meeting transcripts but tested on movie dialogue.
Challenges in language modeling
- dynamically adapting to different genres
Zeros
One kind of generalization problem: zeros -
n-grams that occur in the test set but never occur in the training set.
Incorrect estimation
This is the problem caused by zeros:
the model underestimates the probability of all sorts of words that could occur.
Worse, if any word in the test set has probability 0, the probability of the entire test set is 0,
so perplexity cannot be computed (it would require dividing by zero).
Smoothing (Laplace)
Introduced to overcome the zero problem.
How is it done?
Add 1 to all the n-gram counts; the denominator is adjusted as well by adding V extra observations, where V is the vocabulary size.
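In the standard add-one (Laplace) bigram form:
P_Laplace(w_n | w_{n-1}) = ( C(w_{n-1} w_n) + 1 ) / ( C(w_{n-1}) + V )
where V is the number of word types in the vocabulary.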
Laplace smoothing exercise
Recompute each probability with the added count:
take the value in the bigram table plus 1,
and divide by the unigram count of the history plus V (the vocabulary size).
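A minimal Python sketch of add-one smoothing, reusing bigram_counts and unigram_counts from the MLE sketch above (the name laplace_bigram_prob and the way V is computed here are illustrative assumptions; the slides may define the vocabulary slightly differently):

def laplace_bigram_prob(word, prev, bigram_counts, unigram_counts, vocab_size):
    # Add-one estimate: (C(prev word) + 1) / (C(prev) + V)
    return (bigram_counts[(prev, word)] + 1) / (unigram_counts[prev] + vocab_size)

# One common convention: V = all distinct word types seen, including </s>.
V = len(set(unigram_counts) | {w for _, w in bigram_counts})
print(laplace_bigram_prob("do", "I", bigram_counts, unigram_counts, V))    # (1 + 1) / (3 + V)
print(laplace_bigram_prob("Sam", "do", bigram_counts, unigram_counts, V))  # unseen bigram now gets a small nonzero probability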