Information Extraction Flashcards

1
Q

Information Extraction

A

Turn unstructured information into structured data, in order to make the information more accessible to machines and humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Named entity recognition

Definition and Two Approaches

A

Labeling of categories such as
people, organizations, locations

Handled as a supervised learning task

A sequence labeling task much like part-of-speech tagging (using Hidden Markov Models) to describe connections between tags and words

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Relation extraction

A

Extraction of relations between entities

Use textual patterns (as with hypernymy)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Semantic drift

A

Compute a confidence score for each pattern, based on:

  • The number of already known tuples it finds (hits & misses)
  • Productivity: the overall number of tuples that the pattern produces
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What information is used by named entity extraction?

A
  • orthographical shape (capitalized?)
  • predictive tokens (Mrs.)
  • bags of words
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Bootstrapping in relation extraction

A
Use bootstrapping
- Start with seed tuples
- Acquire patterns from a corpus
OR
- Start with seed patterns
- Acquire tuples
How well did you know this?
1
Not at all
2
3
4
5
Perfectly