Hoorcollege 10 Natural language inference Flashcards

1
Q

SICK-NL

A

dataset for dutch natural language inference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RTE (recognizing textual entailments) / NLI

A

Recognizing textual entailment is an established task in NLP
* RTE covers strictly speaking more than logical entailment

SICK dataset -> sentences involving compositional knowledge
* SNLI is lager, like sick, description of scenes
* MultiNLI, more diverse text ‘genre’ sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Entailment datasets

A

At the word level entailment is hypernymy (dog -> animal) more general

Categorical vs graded entailment
* Chemistry -> science (10.0)
* Enemy -> crocodile (0.33)
Phrase level
* Parrot -> pet
* Dead parrot -/> pet
Sentence level
what a sentence entails: Entailment, contradiction, neural

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Dataset formats

A
  • Entailment
  • Non entailment: contradiction, UNKNOWN / Neural

(Hella) SWAG
* Different focus: commonsense reasoning
* Different format: multiple choice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

NLI in language models

A
  • LLMs are fine-tuned on NLI data (don’t transfer well to new inference data)
  • Achieve good accuracy
  • textual entailment different from logical since it isn’t exact
  • issue with variation between logical contradiction and referential contradiction in datasets

a man is smoking vs a man is not smoking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Interannotator agreement

A

expected agreement & Cohen’s kappa (for two annotators) &
Fleiss’s kappa (for more than two annotators) measures agreement between annotators

How well did you know this?
1
Not at all
2
3
4
5
Perfectly