Probabilistic Approach to NLP Flashcards
What is the logical or knowledge-based approach to NLP?
- rule-based
e.g. regular expressions and finite automata
What is the probabilistic approach to NLP?
- uses probability theory
e.g. neural networks, kernel methods
What is probabilistic modelling in NLP?
- a general framework for modelling NLP
- it uses random variables, random configurations, and reasoning about the probabilities of those configurations
What are independent variables?
- V1 and V2 are independent when P(V1 = x1, V2 = x2) = P(V1 = x1) P(V2 = x2)
What are conditionally independent variables?
- V1 and V2 are conditionally independent given V3 when P(V1 = x1, V2 = x2 | V3 = x3) = P(V1 = x1 | V3 = x3) P(V2 = x2 | V3 = x3), or equivalently P(V1 = x1 | V2 = x2, V3 = x3) = P(V1 = x1 | V3 = x3)
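The independence definition above can be checked numerically. A minimal sketch, with a toy joint table whose values are assumed for illustration: independence holds iff the joint probability equals the product of the marginals for every configuration.

```python
# Toy joint distribution over two binary variables V1, V2 (values assumed).
# This particular table factorizes, so V1 and V2 are independent.
joint = {
    (0, 0): 0.12, (0, 1): 0.28,
    (1, 0): 0.18, (1, 1): 0.42,
}

def marginal(joint, axis, value):
    """Marginalize: sum the joint table over the other variable."""
    return sum(p for config, p in joint.items() if config[axis] == value)

def independent(joint, tol=1e-9):
    """True iff P(V1=x1, V2=x2) = P(V1=x1) * P(V2=x2) for all x1, x2."""
    return all(
        abs(p - marginal(joint, 0, x1) * marginal(joint, 1, x2)) < tol
        for (x1, x2), p in joint.items()
    )

print(independent(joint))  # True: every cell equals the product of its marginals
```

For example, P(V1=0) = 0.12 + 0.28 = 0.4 and P(V2=1) = 0.28 + 0.42 = 0.7, and indeed 0.4 · 0.7 = 0.28 = P(V1=0, V2=1).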
What are the 4 computation tasks in probabilistic modelling?
- evaluation
- simulation
- inference
- learning
What is the evaluation task in probabilistic modelling?
- calculate the probability of a complete configuration
What is the simulation task in probabilistic modelling?
- generate random configurations
- i.e. produce full configurations according to a given model
What is the inference task in probabilistic modelling?
- 3 sub-tasks:
- marginalization
- conditioning
- completion
What is the learning task in probabilistic modelling ?
- learning parameters of a model from data.
What is marginalization in inference task ?
- computing a marginal probability
What is conditioning in inference task ?
- computing a conditional probability
What is completion in inference task ?
- finding the most probable assignment of some variables
What is the joint distribution model?
- stores the probability of each complete configuration, P(V1 = x1, …, Vn = xn), in a probability table
What is the fully independent model?
- all variables are independent:
P(V1 = x1, …, Vn = xn) = P(V1 = x1) · · · P(Vn = xn)
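Under full independence, the evaluation task reduces to multiplying per-variable probabilities. A minimal sketch with assumed toy distributions:

```python
# Fully independent model: one distribution per variable (toy values, assumed).
model = [
    {"a": 0.3, "b": 0.7},   # P(V1)
    {"a": 0.6, "b": 0.4},   # P(V2)
    {"a": 0.5, "b": 0.5},   # P(V3)
]

def evaluate(model, config):
    """Evaluation task: P(V1=x1, ..., Vn=xn) = P(V1=x1) * ... * P(Vn=xn)."""
    prob = 1.0
    for dist, x in zip(model, config):
        prob *= dist[x]
    return prob

print(evaluate(model, ("a", "b", "a")))  # 0.3 * 0.4 * 0.5 = 0.06
```

Note the trade-off versus the joint distribution model: the table above stores only 6 numbers instead of 2^3 = 8 joint entries, but it can only represent distributions where the variables are truly independent.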
Drawbacks of the joint distribution model?
- memory cost to store the table
- expensive running time
- sparse data problem (not enough data to cover all configurations)
What is Bayes' theorem?
P(a|b) = P(b|a) · P(a) / P(b)
What is the naive Bayes model?
P(V2, V3, …, Vn | V1) = P(V2|V1) · P(V3|V1) · … · P(Vn|V1)
so P(V1, V2, V3, …, Vn) = P(V1) · P(V2|V1) · P(V3|V1) · … · P(Vn|V1)
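The naive Bayes factorization turns classification (a completion task over the output variable V1) into an argmax over the class prior times the per-feature likelihoods. A minimal sketch, with all probabilities and the spam/ham labels assumed for illustration:

```python
import math

# Toy naive Bayes classifier (all probabilities assumed for illustration).
prior = {"spam": 0.4, "ham": 0.6}                 # P(V1 = class)
likelihood = {                                    # P(word | class)
    "spam": {"free": 0.3,  "meeting": 0.05},
    "ham":  {"free": 0.05, "meeting": 0.2},
}

def classify(words):
    """Completion over V1: argmax_c  P(c) * prod_i P(w_i | c).
    Log-space sums avoid underflow when there are many features."""
    scores = {}
    for c in prior:
        score = math.log(prior[c])
        for w in words:
            score += math.log(likelihood[c][w])
        scores[c] = score
    return max(scores, key=scores.get)

print(classify(["free"]))     # spam: 0.4*0.3 = 0.12  beats ham: 0.6*0.05 = 0.03
print(classify(["meeting"]))  # ham:  0.6*0.2 = 0.12  beats spam: 0.4*0.05 = 0.02
```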
What are the advantages of the NB model?
- efficiency: good running time and small memory footprint
- mitigates the sparse data problem: less data is needed for training
- good performance in practice, despite the unrealistic independence assumption
What are the disadvantages of the NB model?
- strong independence assumption
- only one output variable
What is smoothing, and why do we use it in probabilistic models?
- avoids zero probabilities for unseen events
- modifies estimated probabilities to compensate for sparse data
What are the smoothing techniques?
- add-one smoothing (Laplace smoothing)
- Witten-Bell smoothing
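Add-one (Laplace) smoothing can be sketched in a few lines: add 1 to every count and renormalize by the total count plus the vocabulary size, so unseen events get nonzero probability. The word counts and vocabulary below are toy values, assumed for illustration:

```python
from collections import Counter

def add_one_estimate(counts, vocab_size):
    """Laplace (add-one) smoothing:
    P(w) = (count(w) + 1) / (N + |V|), so unseen words get nonzero mass."""
    total = sum(counts.values())
    return lambda w: (counts.get(w, 0) + 1) / (total + vocab_size)

counts = Counter(["the", "the", "cat"])        # N = 3 observed tokens
p = add_one_estimate(counts, vocab_size=4)     # vocabulary assumed: {the, cat, dog, sat}
print(p("the"))   # (2 + 1) / (3 + 4) = 3/7
print(p("dog"))   # (0 + 1) / (3 + 4) = 1/7, nonzero even though "dog" was unseen
```

The smoothed probabilities still sum to 1 over the vocabulary: 3/7 + 2/7 + 1/7 + 1/7 = 1.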
What are the computation tasks in the Hidden Markov Model (HMM)?
- evaluation: use the HMM assumption formula
- simulation: generate in the order of the graphical representation
- inference: marginalization, conditioning, and completion
- learning: MLE if labelled data are given
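The evaluation task above can be sketched directly from the HMM assumption formula: the joint probability of a state sequence and an observation sequence is the initial-state probability times a product of transition and emission probabilities. All parameters below are toy values, assumed for illustration:

```python
# Minimal HMM evaluation sketch (toy parameters, all assumed).
start = {"H": 0.6, "C": 0.4}                        # initial state probabilities
trans = {"H": {"H": 0.7, "C": 0.3},                 # P(s_t | s_{t-1})
         "C": {"H": 0.4, "C": 0.6}}
emit  = {"H": {"1": 0.1, "2": 0.4, "3": 0.5},       # P(o_t | s_t)
         "C": {"1": 0.5, "2": 0.4, "3": 0.1}}

def joint_prob(states, obs):
    """Evaluation task: P(states, obs) under the HMM independence assumptions,
    i.e. start * emit, then (transition * emission) at each later step."""
    prob = start[states[0]] * emit[states[0]][obs[0]]
    for t in range(1, len(states)):
        prob *= trans[states[t - 1]][states[t]] * emit[states[t]][obs[t]]
    return prob

print(joint_prob(["H", "H"], ["3", "2"]))  # 0.6 * 0.5 * 0.7 * 0.4 = 0.084
```

Summing this quantity over all state sequences would give the marginal probability of the observations (a marginalization sub-task of inference), and taking the argmax instead gives the most probable state sequence (completion).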