Week 7 - Span Extraction Flashcards

1
Q

What can be difficult about span extraction

A

Not clear which collection of classes apply in large amounts of docs
So we need to identify which tokens in the text are relevant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 5 ways to identify relevant tokens in a text

A
  • Keyword extraction
  • Relation extraction
  • Open information Extraction
  • Machine Reading Comprehension
  • Question Answering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the definition of span extraction

A

extracting 0,…,n contiguous spans from a piece of text

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are Keywords

A

contiguous spans that represent and summarise the essential content of the document

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Relation extraction

A

Relation classification without entity mention
Extraction of entities that are related by a fixed set of relations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is keyword extraction

A

open-class document categorisation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is open information extraction

A

relation extraction without relations
domain-independent

Allows us to convert information that is conveyed in textual, unstructured form into a machine readable format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is machine reading comprehension

A

query-conditioned information extraction
finding a span in a passage that best answers a question referring to the passage

often employed as an endpoint of Question Answering systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Relation extraction v open information extraction

A

fixed vs open vocabulary of relations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

issues with open IE

A

has to make syntactical assumptions on data which can result in not ideal relations extracted (some being missed)

should perform some sort of normalisation to achieve a fixed set of relations “munch on” “chew” “eat” = same relation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is Aggregation

A

gather and summarize information from many sources, providing a comprehensive view of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

open IE: precision vs recall

A

to increase precision -> need stricter rules on relations which results in more false positives (higher recall) and vice versa
this is a tradeoff

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Question answering pipeline

A

User query “what is…?”
Evidence retrieval makes query to wikipedia/search engines/etc
Question and retrieved documents are send to MRC model
MRC model extracts answer from evidence and sends it back

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are pattern based relation extractions

A

can define high precision/low recall patterns to extract relations of interest
these patterns can rely on syntax/semantics/additional knowledge

eg X/NP, (RB) found in Y/NP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do we get hold of RE patterns

A
  • linguistic knowledge (human)
  • bootstrapping
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the three steps of relation extraction bootstrapping

A

1) For known triples, search corpus to find other ways the triple is described -> find new expressions “Eartch contains Uranium”/”Uraniam found in the Earth”

2) extract patterns and select the top-K patterns that occur as “New”

3) apply “New” patterns to same corpus to extract new triples (these new patterns represent previously unknown relationships)

17
Q

What is an RE triple

A

subject
predicate: type of relationship “lies on” “eats”
object

18
Q

Bootstrapping - why only the top K patterns

A

Dont want to drift away from original relationships
new found expression can be too general

19
Q

What is open IE expected output

A

set of extracted relations

20
Q

Open IE motivation

A

applicable to Diverse Data
cannot resort to a specific domain
scalable to large data so cannot take ages for a single extraction

21
Q

What is the ReVerb algorithm

A

open IE solution
Uses POS tagging and finds verbs
extracts relationships form surrounding verb phrases
filters out low confidence relations
scalable, flexible

22
Q

what is OLLIE

A

Open IE bootstrapping solution
similar to normal bootstrapping
takes high confidence extractions from ReVerb
maps to large corpus (contains extraction words) and generates patterns
(paths in dependency parse)
apply patterns to corpus to extract new triples

23
Q

Open IE as sequence labelling

A

a solution to open ie challenges
for each verb, expand the predicate
for each word: label as argument or non participating

24
Q

How does MRC transformer work at a high level

A

Question and passage passed to encoder
forms a probability distribution to predict a token being at beginning/end of a span

25
Q

How do get contextualised embeddings for MRC

A

As with many tasks, contextual embeddings from pre trained models are expressive enough to be directly plugged into this classifier (eg from BERT)

26
Q

What is MRC transformer architecture

A

(almost identical to text classification)
contextualised embeddings of questions and passage are concatenated together
passed as input to LM
Multiply output of final layer with two vectors to obtain probability distributions over tokens (denoting start/end)
minimise the CE loss between predicted distribution and ground truth start and end positions

27
Q

MRC transformers pre-processing

A
  • discards spans that go over special tokens (eg over [SEP]
  • discard spans where Pe (end ) comes before Ps(start)
  • cut out spans that are too long
  • rank based on probabilities
  • pick the best one
28
Q

MRC for longer documents

A

similar to text classification
split in n chunks of size m and stride s
question is prepended to each of the chunks
process chunks independently (no contextualization layer)

29
Q

what is MRC transformer post-processing

A

for longer documents, needs to take into account all chunks per question
rank them and pick best probability from candidates over all of the chunks

30
Q

what does open ie allow us to perform

A

allows us to perform search, linking, aggregation