Part-of-speech Tagging and Chunking Flashcards

1
Q

Part of speech tagging

A

Map words to part of speech tags

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Part of speech tagging approaches

A
  • Default tagger
  • Morphology tagger
  • Lookup tagger
  • N-gram tagger
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Default tagger

A

Set everything to a noun

Good baseline

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Morphology tagger

A

Regular expression tagger

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Lookup tagger

A

Store a dictionary of the most hundred frequent words and their most frequent tags

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

N-gram tagger

A

Look at sequences of words and tags

Deals with ambiguity and words that have more than one part-of-speech tag

Allows > 90% accuracy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Hidden Markov Models, description

A

For the given sequence of words, what is the most likely sequence of tags?

Probabilistic approach

Want
t1,…,tn = argmax P( t1,…,tn | w1, …, wn)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Hidden Markov Models, formula

A

For each tag sequence, maximize
the product of P (Ti | Ti-1) * P(Wi | Ti)

Probability of tag given previous tag * probability of word given tag

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hidden Markov Models, two tricks

A
  • Bayes’ Rule

P( t1,…,tn | w1, …, wn)
= P( w1,…,wn | t1, …, tn) * P(t1,…tn) / P(w1,…wn)

  • Markov Assumption

Each word depends on its own tag
P( w1,…,wn | t1, …, tn)
= P(w1 | t1) * … * P(wn | tn)

Look back two tags
P(t1,…tn) = P(t1) * P(t2 | t1) * P (t3 | t2)

Ignore the probability of the words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Dynamic Programming

A

Store intermediate results for shared subsequences rather than recomputing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Chunking

A

Finding sequences of part-of-speech tags, such as noun phrases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly