Named Entity Recognition Flashcards
what is a named entity
a real world object that can be named, e.g. person, location, time, money, organisation
How long is an entity mention
can be either a single token or a span of text
What are the possible approaches to named entity recognition
- dictionary lookup
- rule based
- machine learning
How can we treat named entity recognition as a ml problem
treat it as a tagging problem
use BIO for entity mention and then find the category
How many classes do we consider with n entity types
2n + 1
What is the local approach to NER
tags are independent of each other
What methods are local approaches to NER
rnn, lstm, bilstm
What does a global approach to NER mean
tags are dependent on each other
What methods are global approaches to NER
hmm (hidden markov model)
crf (conditional random fields)
How does a HMM perform NER
establish sequence by arranging output variables in a chain
sequence input x, sequence of states y
yt depends only on yt-1
xt depends only on yt
Give the equation for sequences P(y,x) using a HMM
p(y,x) = multiply for each t p(yt-1|yt)p(xt|yt)
What is a conditional random field crf?
a discriminative model for sequence labelling
finds the most probable sequence y’ given observation sequence x
What is the equation for y* using a crf model
y* = argmax p(y|x)
what is p(y|x) given a linear chain crf
1/normalisation factor *
exp( sum for t sum for feature f weight * feature function(yt, yt-1, xt)
What is a feature function?
characterises the input (based on certain features)