Lecture 6 Flashcards

1
Q

What is Natural Language Generation (NLG)?

A

NLG is the process of transforming structured data into human-readable text, also known as Data-to-Text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name three applications of NLG.

A

Structured Report Generation (e.g., BabyTalk project), Weather Reporting, and Question Answering Systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the main types of NLG systems?

A

Classical (rule-based), Template-Based, Statistical/Neural (data-driven), and Hybrid Systems.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Template-Based NLG System?

A

A system that uses fixed text templates with slots for variable content, often used in predictable domains.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe the advantage and disadvantage of Rule-Based NLG Systems.

A

Advantage: Produces highly accurate text for specific domains. Disadvantage: Requires extensive manual effort to define rules.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the key components of a Classical NLG System?

A

Content determination, discourse structuring, aggregation, referring expression generation, lexical choice, realization, and fluency ranking.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Content Determination in NLG?

A

The process of deciding what information to include in the generated text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Discourse Structuring in NLG?

A

Organizing the information into a coherent and logical flow within the generated text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Name three Decoding Strategies in NLG.

A

Greedy Sampling, Beam Search, and Top-K Sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Top-K Sampling?

A

A decoding method where only the top
𝑘
k most probable words are considered for generation, adding diversity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does Temperature Sampling work in NLG?

A

Adjusts the probability distribution with a temperature parameter to control creativity and focus; lower values make generation more focused.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What types of data are commonly used to train LLMs?

A

Web text from Common Crawl, Colossal Clean Crawled Corpus (C4), Wikipedia, news sites, and patents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why is Data Quality important in NLG?

A

Low-quality data can introduce biases, toxicity, and unsafe content in the generated text.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is Prompting in the context of LLMs?

A

Using an input prompt to guide a language model to generate relevant text, sometimes known as in-context learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the difference between Zero-Shot and Few-Shot Prompting?

A

Zero-shot prompting includes no examples, while few-shot prompting includes labeled examples to improve model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Instruction Tuning?

A

A finetuning phase where the model is trained on tasks framed as instructions, enhancing its ability to handle new tasks.

17
Q

What are Intrinsic Evaluation Metrics?

A

Metrics that measure model quality directly, focusing on fluency, coherence, and grammaticality of generated text.

18
Q

What is BLEU?

A

A reference-based metric used to evaluate text similarity, especially in machine translation.

19
Q

What does BERTScore evaluate in generated text?

A

It considers semantics to evaluate paraphrasing, capturing distant dependencies and word order.

20
Q

When is MAUVE used in evaluation?

A

For open-ended tasks, MAUVE measures how similar the distributions of human-written and model-generated texts are.

21
Q

What is the purpose of Human Evaluation in NLG?

A

To assess generated text based on fluency, relevance, coherence, diversity, and usefulness in task-specific contexts.