Part-of-speech Tagging and Chunking Flashcards
Part of speech tagging
Map words to part of speech tags
Part of speech tagging approaches
- Default tagger
- Morphology tagger
- Lookup tagger
- N-gram tagger
Default tagger
Set everything to a noun
Good baseline
Morphology tagger
Regular expression tagger
Lookup tagger
Store a dictionary of the most hundred frequent words and their most frequent tags
N-gram tagger
Look at sequences of words and tags
Deals with ambiguity and words that have more than one part-of-speech tag
Allows > 90% accuracy
Hidden Markov Models, description
For the given sequence of words, what is the most likely sequence of tags?
Probabilistic approach
Want
t1,…,tn = argmax P( t1,…,tn | w1, …, wn)
Hidden Markov Models, formula
For each tag sequence, maximize
the product of P (Ti | Ti-1) * P(Wi | Ti)
Probability of tag given previous tag * probability of word given tag
Hidden Markov Models, two tricks
- Bayes’ Rule
P( t1,…,tn | w1, …, wn)
= P( w1,…,wn | t1, …, tn) * P(t1,…tn) / P(w1,…wn)
- Markov Assumption
Each word depends on its own tag
P( w1,…,wn | t1, …, tn)
= P(w1 | t1) * … * P(wn | tn)
Look back two tags
P(t1,…tn) = P(t1) * P(t2 | t1) * P (t3 | t2)
Ignore the probability of the words
Dynamic Programming
Store intermediate results for shared subsequences rather than recomputing
Chunking
Finding sequences of part-of-speech tags, such as noun phrases