Relation Extraction Flashcards
What is relation extraction?
Identifying relationships amongst entities in a piece of text
what are the three main relationship types?
affiliations, geospatial, part of
How can we approach relation extraction using a pattern based approach?
regular expressions
how can we modify the expression /* has a hub at */ to include entity constraints?
/[organisation] has a hub at [location]/
How can we reduce false negatives when using regex for relation extraction?
relax the pattern or extend the set of patterns using bootstrapping
What is bootstrapping?
We start with a set of seed patterns and seed tuples, e.g. /* has a hub at */ and (ryanair, charleron).
We search for the tuple terms in the seed tuple
Use search results to extract new patterns
Search for additional tuples using the new patterns
How can we perform relation extraction using a statistical/ml approach
- requires labelled data
a) detect the relationship
b) classify the relationship
How do we detect the relationship using a statistical/ml approach?
do two entities participate in a relationship- binary classification
How do we classify the relationship using a statistical/ml approach?
Multiclass classification
How can we combine detecting the relationship and classifying the relationship into one ml task?
multiclass classification, where one of the labels is no relationship
what types of features can we use in our ml approach?
- Named entity features: entity type, head word
- Context: words between, words before the first entity, words after the second entity (within a fixed window)
- Syntactic structure: dependency paths
What is a dependency path?
Follow dependencies to create a path
e.g. easyjet at liverpool