Week 1 - Intro to NLU and NLU tasks Flashcards
What is NLP
Natural language processing
Converts unstructured data into a structured (possibly machine-readable) form
What is NLU
Natural language understanding
A specialisation (subfield) of NLP
Determines the intended meaning of natural language expressions; focuses on the comprehension of human language by machines
makes agents more intelligent
What is NLG
Natural language generation
A specialisation (subfield) of NLP
Produces natural language expressions
What are the 4 main NLU task categories
- Sequence classification
- Pairwise sequence classification
- Sequence labelling
- Span-based operations
What are single-problem applications of NLU tasks
Focus on addressing a specific task or problem within natural language understanding
What are multi-problem applications of NLU tasks
Involves addressing multiple NLU tasks within a single application or system
Relate complex applications to underlying NLU tasks, by decomposing them into subtasks
What is sequence classification and what is its output
Takes a series of words (tokens) (eg sentence, tweet, document)
Output: classification category
What is pairwise sequence classification and what is its output
Classifies the relationship between 2 input sequences
Output: relationship category, eg entails, contradicts, neutral
What is sequence labelling and what is its output
Classification at the level of the individual tokens in the sequence
output: eg noun/verb
Can also label a subsequence of tokens if the context is relevant
eg John Smith = person
What is a Unit
A subsequence of tokens
What is the BIO scheme
B: beginning of a subsequence of interest
I: Inside the subsequence of interest
- Will also be used for last token in subsequence
O: outside of a subsequence of interest
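A minimal sketch of how the BIO scheme could be applied to a tokenised sentence. The function name `bio_tags`, the `(start, end, label)` representation of a unit, and the `B-PER` / `I-LOC` style of attaching a label to each tag are assumptions made for illustration; they are not part of the flashcards.

```python
def bio_tags(tokens, units):
    """Return one BIO tag per token.

    `units` is a list of (start, end, label) triples with `end` exclusive.
    This representation is an assumption made for the example.
    """
    # Every token starts outside any subsequence of interest.
    tags = ["O"] * len(tokens)
    for start, end, label in units:
        # First token of the unit gets B.
        tags[start] = f"B-{label}"
        # Every later token, including the last one, gets I.
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


tokens = ["John", "Smith", "lives", "in", "New", "York", "City"]
units = [(0, 2, "PER"), (4, 7, "LOC")]
tags = bio_tags(tokens, units)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

Note that "City", the last token of its unit, receives an I tag rather than a dedicated end tag.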
What are span-based operations
Analysis of a span - not necessarily a full sentence nor full document
Takes a sequence, finds all possible spans
What is a span
A contiguous sequence of tokens
What is the total number of spans in a sequence of T tokens (max span length = T)
Total = (T(T+1)) / 2
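The formula follows from counting spans by length: a sequence of T tokens has T spans of length 1, T-1 of length 2, and so on down to 1 span of length T, which sums to T(T+1)/2. A small sketch that enumerates every span and checks the count; the helper name `all_spans` and the `(start, end)` pair with exclusive `end` are assumptions for illustration.

```python
def all_spans(tokens):
    """Enumerate every contiguous span as a (start, end) pair, end exclusive."""
    T = len(tokens)
    return [(i, j) for i in range(T) for j in range(i + 1, T + 1)]


tokens = ["the", "cat", "sat", "down"]
T = len(tokens)
spans = all_spans(tokens)

# The enumerated count matches the closed-form total.
assert len(spans) == T * (T + 1) // 2

for start, end in spans:
    print(tokens[start:end])
```

For T = 4 this gives 10 spans: 4 of length 1, 3 of length 2, 2 of length 3, and 1 of length 4.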
What are the 3 subtasks of span operations
Identification
Classification
Relation Classification