Week 1 - Intro to NLU and NLU tasks Flashcards

Question 1

Q

What is NLP

Answer

A

Natural language processing
Converts unstructured data into a structured (possibly machine-readable) form

Question 2

Q

What is NLU

Answer

A

Natural language understanding
A specialisation (subfield) of NLP
Determines the intended meaning of natural language expressions; focuses on the comprehension of human language by machines
Makes agents more intelligent

Question 3

Q

What is NLG

Answer

A

Natural language generation
A specialisation (subfield) of NLP
Produces natural language expressions

Question 4

Q

What are the 4 main NLU task categories

Answer

A

Sequence classification
Pairwise sequence classification
Sequence labelling
Span-based operations

Question 5

Q

What are single-problem applications of NLU tasks

Answer

A

Focus on addressing a specific task or problem within natural language understanding

Question 6

Q

What are multi-problem applications of NLU tasks

Answer

A

Involves addressing multiple NLU tasks within a single application or system
Relate complex applications to underlying NLU tasks, by decomposing them into subtasks

Question 7

Q

What is sequence classification and what is its output

Answer

A

Input: a sequence of tokens (eg a sentence, tweet or document)
Output: a classification category for the whole sequence

Question 8

Q

What is pairwise sequence classification and what is its output

Answer

A

Classify the relationship between 2 input sequences
Output: a relation label, eg entails, contradicts or neutral

Question 9

Q

What is sequence labelling and what is its output

Answer

A

Classification at the level of the individual tokens in the sequence
Output: one label per token, eg noun/verb
Can also label a subsequence of tokens if the context is relevant
eg "John Smith" = person

Question 10

Q

What is a Unit

Answer

A

A subsequence of tokens

Question 11

Q

What is the BIO scheme

Answer

A

B: beginning of a subsequence of interest

I: Inside the subsequence of interest
- Will also be used for last token in subsequence

O: outside of a subsequence of interest
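
A minimal sketch of how BIO tags could be produced for a tokenised sentence. The function name `to_bio`, the `(start, end, label)` span format with an exclusive end, the `PER` label and the `B-LABEL` / `I-LABEL` tag spelling are assumptions made for illustration, not part of the original notes.

```python
def to_bio(tokens, spans):
    """Return one BIO tag per token.

    spans: list of (start, end, label) with end exclusive (assumed format).
    """
    tags = ["O"] * len(tokens)           # everything starts outside
    for start, end, label in spans:
        tags[start] = f"B-{label}"       # first token of the subsequence
        for i in range(start + 1, end):  # remaining tokens, including the last
            tags[i] = f"I-{label}"
    return tags


tokens = ["John", "Smith", "works", "for", "United", "Airlines"]
spans = [(0, 2, "PER"), (4, 6, "ORG")]
tags = to_bio(tokens, spans)

for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Note that the last token of each subsequence ("Smith", "Airlines") receives an I tag, matching the rule above.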

Question 12

Q

What are span-based operations

Answer

A

Analysis of a span - not necessarily a full sentence nor full document

Takes a sequence, finds all possible spans

Question 13

Q

What is a span

Answer

A

A contiguous sequence of tokens

Question 14

Q

What is the total number of spans in a sequence of T tokens (maximum span length = T)

Answer

A

Total = (T(T+1)) / 2
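
The count can be checked by brute force: a sequence of T tokens has T spans of length 1, T-1 of length 2, and so on down to 1 span of length T, which sums to T(T+1)/2. The sketch below assumes T is the number of tokens and represents a span as a hypothetical `(start, end)` pair with an exclusive end.

```python
def enumerate_spans(tokens):
    """List every contiguous span as (start, end), end exclusive."""
    n = len(tokens)
    return [(i, j) for i in range(n) for j in range(i + 1, n + 1)]


tokens = "her best groundstroke is her two-handed backhand".split()
spans = enumerate_spans(tokens)

T = len(tokens)           # 7 tokens
print(len(spans))         # 28
print(T * (T + 1) // 2)   # 28
```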

Question 15

Q

What are the 3 subtasks of span operations

Answer

A

Identification
Classification
Relation Classification

Question 16

Q

What is Identification: span-based

Answer

A

Identifying spans of interest as binary classification
Input: sequence and question (eg find the keyphrases)
“her best groundstroke is her two-handed backhand”
output: “groundstroke” “two-handed backhand”

Question 17

Q

What is classification: span-based

Answer

A

classifying spans according to a set of labels
Eg “United Airlines” : ORG

Question 18

Q

What is an embedded entity

Answer

A

Eg ORG “United Airlines Holdings”,
ORG “United Airlines”
Here we have an ORG within an ORG

Question 19

Q

Why can span-based classification be better than sequence labeling

Answer

A

It can identify embedded entities
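
A rough sketch of why this works: a span-based output is a set of labelled spans, so two spans may overlap or nest, whereas a flat tag sequence gives each token exactly one tag. The helper `nested_pairs` and the `(start, end, label)` format with an exclusive end are assumptions for illustration.

```python
def nested_pairs(spans):
    """Return (outer, inner) pairs where inner lies fully inside outer."""
    pairs = []
    for outer in spans:
        for inner in spans:
            if outer != inner and outer[0] <= inner[0] and inner[1] <= outer[1]:
                pairs.append((outer, inner))
    return pairs


tokens = ["United", "Airlines", "Holdings"]
# Both entities are kept side by side in a span-based output.
spans = [(0, 3, "ORG"), (0, 2, "ORG")]

for outer, inner in nested_pairs(spans):
    print(" ".join(tokens[inner[0]:inner[1]]), "is inside",
          " ".join(tokens[outer[0]:outer[1]]))
```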

Question 20

Q

What is relation classification: span based

Answer

A

Classifying relations between spans
eg Output
EMPLOYEE-OF(“Jane Vickers”, “United Airlines Holdings”)

Question 21

Q

Underlying NLU task: Sentiment Analysis

Answer

A

Identify the overall sentiment of the text: negative, positive or neutral
Sequence classification

Question 22

Q

Underlying NLU task: Emotion Recognition

Answer

A

Identify the emotions in the input text (eg: sad, lonely)
Sequence classification

Question 23

Q

Underlying NLU task: Hate Speech Detection

Answer

A

Determine whether text contains hate speech
Some also aim to define the type: race/gender/etc
Sequence classification

Question 24

Q

Underlying NLU task: NLI

Answer

A

Natural language inference: given a premise and a hypothesis
Decide whether the hypothesis is true (entailed), false (contradicted) or neutral with respect to the premise
Pairwise sequence classification

Question 25

Q

Underlying NLU task: Paraphrase Identification

Answer

A

Determine whether one sequence is a paraphrase of another (degree of similarity)
eg plagiarism
Pairwise sequence classification

Question 26

Q

Underlying NLU task: NER

Answer

A

Named entity recognition: identify subsequences that correspond to entity categories
Sequence labelling or span-based classification

Question 27

Q

Underlying NLU task: Entity Linking

Answer

A

Link a subsequence to its standard form in a vocabulary
Pairwise sequence classification or span-based classification (find the span in the document)

eg linking a name in wikipedia article to its own wikipedia page

Question 28

Q

Underlying NLU task: Semantic Role Labelling (SRL)

Answer

A

Identify predicate-argument structures: who did what to whom; which bit is linked grammatically (labelled as Pred (verb) or Arg (noun))
Sequence labelling or span-based classification

Question 29

Q

Underlying NLU task: Relation extraction

Answer

A

Identify relation type that holds between two spans
EG Bill was born on April 13th in Seattle
Bill, Seattle: BORN-IN relationship

Span based relation classification

Question 30

Q

Underlying NLU task: Coreference Resolution

Answer

A

Determine if the spans refer to the same real-world entity or concept
EG If “Tom” and “he” refer to the same real-world man

Span based relation classification

Question 31

Q

Multi-underlying NLU task: Aspect-based sentiment Analysis

Answer

A

Identify the target and then aspect category
Eg “Horrible services. The room was dirty and unpleasant.”
Target: Room
Aspects: price, location, comfort, cleanliness → cleanliness
span-based classification(target)
span-based classification(aspect)
span-based relation classification

Question 32

Q

Multi-underlying NLU task: Fact Verification

Answer

A

Determine whether information is supported by facts or not
involves 3 tasks:

Claim identification
determine whether piece of text is worth fact checking
sequence classification or span based classification
Evidence Retrieval
find (within a pre-existing support corpus) pieces of text which are relevant to a given claim
pairwise sequence classification
Automated Verification
determine if a piece of text contains information that is supported or refuted by provided pieces of evidence
pairwise sequence classification

Question 33

Q

Multi-underlying NLU task: Argument mining

Answer

A

Identify argumentative structures
involves 2 tasks:

Argument component identification
identify claims and premises
span-based classification
Argument relation classification
classify whether a premise supports a claim, i.e., whether the relationship between them is support or oppose
span-based relation classification

Question 34

Q

Multi-underlying NLU task: Question answering (Extractive)

Answer

A

Given two pieces of text: a passage (context) and a question
Identify the span of text in the passage that answers the question

(combination) pairwise, span-based identification
(identifying whether span is of interest in relation to the question)
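
One common way to turn this into a concrete decision, shown here only as a sketch: assume a model has already produced a start score and an end score for every passage token, then pick the span whose start and end scores sum highest. The scores below are invented for illustration, and `best_span` and `max_len` are hypothetical names, not something given in the notes.

```python
def best_span(start_scores, end_scores, max_len=5):
    """Return inclusive (start, end) indices of the highest-scoring span."""
    best, best_score = None, float("-inf")
    for i, start in enumerate(start_scores):
        # Only consider spans that begin at i and are at most max_len tokens long.
        for j in range(i, min(i + max_len, len(end_scores))):
            score = start + end_scores[j]
            if score > best_score:
                best, best_score = (i, j), score
    return best


passage = "Bill was born on April 13th in Seattle".split()
# Made-up scores for the question "Where was Bill born?"
start_scores = [0.1, 0.0, 0.0, 0.0, 0.3, 0.1, 0.2, 2.5]
end_scores = [0.0, 0.0, 0.1, 0.0, 0.2, 0.4, 0.0, 2.8]

start, end = best_span(start_scores, end_scores)
answer = " ".join(passage[start:end + 1])
print(answer)  # Seattle
```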

Question 35

Q

Multi-underlying NLU task: Event extraction

Answer

A

Given a sequence and list of named entities
To identify events, i.e., the event trigger and event participants
has 2 subtasks:

Event Trigger Detection
identify the word that denotes the event and its type
span-based classification or
sequence labelling
Event Participant Identification (which named entities are actually involved in the event)
to determine the relationship that holds between a named entity and the event trigger
span-based relation classification

eg given: "Tom was hit by Harry", Tom = person, Harry = person
want to identify the event trigger (“hit”), which indicates a conflict event
and who is involved (Harry as attacker, Tom as victim)
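
The output of the two subtasks can be pictured as a small record. This is only an illustrative data structure: the class names `Event` and `Participant` and the role labels are assumptions, not a standard schema.

```python
from dataclasses import dataclass, field


@dataclass
class Participant:
    text: str   # the named entity, eg "Tom"
    role: str   # its relation to the trigger, eg "Victim"


@dataclass
class Event:
    trigger: str      # output of event trigger detection
    event_type: str   # type assigned to the trigger
    participants: list = field(default_factory=list)  # output of participant identification


# "Tom was hit by Harry"
event = Event(
    trigger="hit",
    event_type="Conflict",
    participants=[Participant("Harry", "Attacker"), Participant("Tom", "Victim")],
)

for p in event.participants:
    print(f"{p.role}({event.trigger}, {p.text})")
```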

Question 36

Q

What is the difference between pairwise sequence classification and span-based relation classification

Answer

A

Pairwise: the two sequences are usually taken from two different sources (each sequence is the entire source)

Span-based: the spans are shorter excerpts, usually taken from the same source
