probabilistic approach to NLP Flashcards

1
Q

what is the logical or knowledge-based approach to NLP

A
  • rule based

e. g regular expression and finite automata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is probabilistic approach to NLP ?

A
  • use of theory of probability

e. g neural networks, kernel methods,

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is probabilistic modelling in NLP ?

A
  • general framework for modelling NLP

- it use random variables, random configurations, and reasoning about the probabilities of the configurations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are independent variables ?

A

P(V1 =x1, V2 =x2) = P(V1 =x1)P(V2 =x2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are conditionally independence variable ?

A
P(V1 = x1 , V2 = x2 |V3 = x3 ) =
P(V1 =x1|V3 =x3)P(V2 =x2|V3 =x3)
or 
P(V1 = x1 |V2 = x2 , V3 = x3 ) = P(V1 = x1 |V3 = x3 )
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the 4 computation task in probabilistic modelling ?

A
  1. evaluation
  2. simulation
  3. inference
  4. learning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the evaluation task in probabilistic modelling ?

A
  • calculate probability of a complete configuration
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the simulation task in probabilistic modelling ?

A
  • generate random configuration
    or
  • producing full configurations according to a given model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the inference task in probabilistic modelling ?

A
  • 3 tasks
  • marginalization
  • conditioning
  • completion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the learning task in probabilistic modelling ?

A
  • learning parameters of a model from data.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is marginalization in inference task ?

A
  • computing a marginal probability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is conditioning in inference task ?

A
  • computing a conditional probability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is completion in inference task ?

A
  • finding the most probable assignment of some variables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is joint distribution model ?

A
  • the probability of each complete configuration

- P(V1=x1,…,Vn=xn) in the probability table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is fully independent model ?

A
  • all variable are independent.

P(V1 = x1 , …, Vn = xn ) = P(V1 = x1 ) · · · P(Vn = xn ).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Drawbacks of joint distribution model ?

A
  • memory cost to store table
  • expensive running time
  • space data problem(no enough data to cover all options)
17
Q

What is Bayes theory ?

naive babes theory ?

A

P(a|b) = P(b|a) · P(a)/P(b)

P(V2, V3, . . . , Vn|V1) = P(V2|V1) · P(V3|V1) · . . . · P(Vn|V1)

P(V1, V2, V3, . . . , Vn) = P(V1) · P(V2|V1) · P(V3|V1) · . . . · P(Vn|V1)

18
Q

What are the advantage of NB model ?

A

efficiency: good running time and small memory size
sparse data problem : enough data to train
good performance: unrealistic independent assumption

19
Q

what are the disadvantage of NB model?

A
  • strong independence assumption

- only one output variable

20
Q

What is smoothing ? and why we use it in probabilistic models ?

A
  • avoid 0 probability

- modify estimated probabilities to correct errors in the dataset.

21
Q

what are the smoothing techniques ?

A
  • add-one smoothing (Laplace smoothing)

- Bell-Witten smoothing

22
Q

What are the evaluation tasks in Hidden Markov Model(HMM) ?

A
  1. evaluation: use HMM assumption formula
  2. generation: generate in the order of graphical representation.
  3. inference: marginalization, conditioning, and completion
  4. learning: MLE if labeled are given.