[NLP] Lecture 2: Large Language Models (Anna Rogers) Flashcards
What kinds of LMs do we have?
Autoregressive Language Models:
Predict the next token based on previous tokens
Masked Language Models:
- Predict masked (hidden) tokens within a sequence
- Can use both left and right context to make predictions
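The two objectives can be sketched with toy bigram counts. This is a minimal illustration (the corpus and scoring rule are made up, not how real LMs work): the autoregressive predictor uses only left context, while the masked predictor scores candidates by how well they fit both the left and the right neighbor.

```python
from collections import Counter, defaultdict

# Tiny made-up corpus for illustration only.
corpus = "the cat sat on the mat the cat ate the fish".split()

# Count bigrams: how often word w2 follows word w1.
follows = defaultdict(Counter)
for w1, w2 in zip(corpus, corpus[1:]):
    follows[w1][w2] += 1

def autoregressive_predict(left_word):
    """Predict the next token from left context only."""
    return follows[left_word].most_common(1)[0][0]

def masked_predict(left_word, right_word):
    """Predict a masked token using both left AND right context."""
    candidates = Counter()
    for w, count in follows[left_word].items():
        # Score each candidate by (left -> w) counts times (w -> right) counts.
        candidates[w] = count * follows[w][right_word]
    return candidates.most_common(1)[0][0]

print(autoregressive_predict("the"))   # next-token prediction, left context only
print(masked_predict("the", "sat"))    # fill-in-the-blank, both contexts
```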
Difference between a corpus model and a language model?
It would help to call it a corpus model, so it is more obvious that the model is based on a specific corpus and therefore not unbiased. Calling it a "language model" makes us "forget" that it is not trained on language in general, but on a specific corpus.
Explain the difference between pre-training and fine-tuning
Pre-training: done on unlabelled data, with autoregressive or masked objectives (e.g., BERT); this produces the base model
Fine-tuning: to make the model do something other than predicting tokens, we fine-tune it for a task
The biggest difference is the size of the available data
What happens during fine-tuning?
The final layers change the most
What is pre-fine-tuning?
An intermediate stage between pre and fine-tuning.
What is instruction tuning?
The model is trained on ~20 different text tasks before fine-tuning (e.g., the T5 model)
What is few shot learning?
Give worked examples of the task in the prompt; the model infers the task from them without any weight updates
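A few-shot prompt is just string construction. A minimal sketch (the sentiment task and examples below are made up for illustration):

```python
# Few-shot prompting: put worked examples in the prompt itself so the
# model can infer the task format, with no weight updates involved.
examples = [
    ("The movie was wonderful", "positive"),
    ("I hated every minute", "negative"),
]

def build_few_shot_prompt(examples, query):
    lines = ["Classify the sentiment of each review."]
    for text, label in examples:
        lines.append(f"Review: {text}\nSentiment: {label}")
    # The query repeats the format but leaves the label blank for the model.
    lines.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(lines)

prompt = build_few_shot_prompt(examples, "A delightful surprise")
print(prompt)
```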
What is instruction tuning and RLHF?
Instruction tuning focuses on teaching the model to follow instructions
RLHF uses human feedback to refine the model’s understanding of what constitutes a good response
Instruction tuning is about capability, RLHF is about aligning the model with human values and expectations
Explain basics about ChatGPT
- Dialogue version of InstructGPT
- New OpenAI in-house data (humans both writing and rating model responses)
- New ranking data for RLHF
- Keeps changing under the hood
- We don't know anything else about the models
What is RAG?
Retrieval-Augmented Generation: the model retrieves relevant documents and conditions its answer on them, which also lets us see where it got the information. Bing does it; it provides sources
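The retrieve-then-generate idea can be sketched in a few lines. This is a toy illustration (the documents and the word-overlap retriever are made up; real systems use dense embeddings): retrieve the best-matching document, then build a prompt that includes it as a citable source.

```python
# Hypothetical document store for illustration.
documents = {
    "doc1": "Copenhagen is the capital of Denmark.",
    "doc2": "The transformer architecture was introduced in 2017.",
}

def retrieve(question):
    """Return the id of the document with the most word overlap."""
    q_words = set(question.lower().split())
    def overlap(doc_id):
        return len(q_words & set(documents[doc_id].lower().split()))
    return max(documents, key=overlap)

def build_rag_prompt(question):
    """Prepend the retrieved document so the model can answer from it and cite it."""
    doc_id = retrieve(question)
    return (f"Source [{doc_id}]: {documents[doc_id]}\n"
            f"Answer the question using the source above and cite it.\n"
            f"Question: {question}")

print(build_rag_prompt("What is the capital of Denmark?"))
```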
Why do LMs get better the bigger they are?
As long as you keep adding more weights and more training data, they keep improving ("neural scaling laws")
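The scaling-law claim is usually stated as a power law: loss falls smoothly as parameter count grows, with diminishing returns. A sketch with illustrative constants (the values below are hypothetical, not fitted numbers from the lecture):

```python
# Neural scaling laws (illustrative): predicted loss falls as a power law
# in parameter count N:  L(N) = (N_c / N) ** alpha + L_inf
N_C = 8.8e13     # hypothetical "critical scale" constant
ALPHA = 0.076    # hypothetical scaling exponent
L_INF = 1.69     # hypothetical irreducible loss floor

def scaling_loss(n_params):
    """Predicted test loss for a model with n_params parameters."""
    return (N_C / n_params) ** ALPHA + L_INF

# Bigger models -> lower predicted loss, but with diminishing returns.
for n in [1e8, 1e9, 1e10, 1e11]:
    print(f"{n:.0e} params -> predicted loss {scaling_loss(n):.3f}")
```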
What is data contamination?
When the model has already seen the test data (or something very similar to it) in its training data
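One common (rough) way to detect this is n-gram overlap between test examples and the training corpus. A toy sketch with made-up data (real contamination checks use much larger n-grams and corpora):

```python
# Toy data-contamination check: flag a test example if any of its
# word n-grams also appears somewhere in the training corpus.
def ngrams(text, n=3):
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def is_contaminated(test_example, training_corpus, n=3):
    train_ngrams = set()
    for doc in training_corpus:
        train_ngrams |= ngrams(doc, n)
    return bool(ngrams(test_example, n) & train_ngrams)

train = ["the quick brown fox jumps over the lazy dog"]
print(is_contaminated("a quick brown fox appeared", train))          # shares "quick brown fox"
print(is_contaminated("completely unrelated sentence here", train))  # no shared 3-gram
```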
What are emergent properties?
When a model can do something it was not trained on. This is difficult to say for sure, because we often don't know exactly what data the model has seen, so we can't tell whether it was in fact trained on it
What is the Eliza effect?
The tendency to attribute human-like understanding to a program that only manipulates text (named after the 1966 ELIZA chatbot)
Caveat 2: Fine-tuning vs few-shot performance
Since GPT-3, most big models have been presented with few-shot evaluations only