Part-of-speech Tagging and Chunking Flashcards

Question 1

Q

Part of speech tagging

Answer

A

Map words to part of speech tags

Question 2

Q

Part of speech tagging approaches

Answer

A

Default tagger
Morphology tagger
Lookup tagger
N-gram tagger

Question 3

Q

Default tagger

Answer

A

Set everything to a noun

Good baseline

Question 4

Q

Morphology tagger

Answer

A

Regular expression tagger

Question 5

Q

Lookup tagger

Answer

A

Store a dictionary of the most hundred frequent words and their most frequent tags

Question 6

Q

N-gram tagger

Answer

A

Look at sequences of words and tags

Deals with ambiguity and words that have more than one part-of-speech tag

Allows > 90% accuracy

Question 7

Q

Hidden Markov Models, description

Answer

A

For the given sequence of words, what is the most likely sequence of tags?

Probabilistic approach

Want
t1,…,tn = argmax P( t1,…,tn | w1, …, wn)

Question 8

Q

Hidden Markov Models, formula

Answer

A

For each tag sequence, maximize
the product of P (Ti | Ti-1) * P(Wi | Ti)

Probability of tag given previous tag * probability of word given tag

Question 9

Q

Hidden Markov Models, two tricks

Answer

A

Bayes’ Rule

P( t1,…,tn | w1, …, wn)
= P( w1,…,wn | t1, …, tn) * P(t1,…tn) / P(w1,…wn)

Markov Assumption

Each word depends on its own tag
P( w1,…,wn | t1, …, tn)
= P(w1 | t1) * … * P(wn | tn)

Look back two tags
P(t1,…tn) = P(t1) * P(t2 | t1) * P (t3 | t2)

Ignore the probability of the words

Question 10

Q

Dynamic Programming

Answer

A

Store intermediate results for shared subsequences rather than recomputing

Question 11

Q

Chunking

Answer

A

Finding sequences of part-of-speech tags, such as noun phrases

Part-of-speech Tagging and Chunking Flashcards

(11 cards)