Week 5 - (part 1) LLMs And Gen AI Flashcards
What sort of data is used to train the model?
Common Crawl, books, Wikipedia, etc.
What is Generative Pretraining?
The model is trained to predict the next token, based on probabilities of co-occurrence learned from the training data.
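A minimal sketch of the idea using bigram co-occurrence counts over a toy corpus (the corpus and function names are illustrative, not from the lecture):

```python
from collections import Counter, defaultdict
import random

# Toy corpus; real pretraining uses web-scale text (Common Crawl, books, Wikipedia).
corpus = "the cat sat on the mat the cat ate the fish".split()

# Count how often each token follows each other token (co-occurrence).
counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

def predict_next(token):
    """Sample the next token in proportion to its co-occurrence probability."""
    followers = counts[token]
    return random.choices(list(followers), weights=list(followers.values()))[0]

print(predict_next("the"))  # "cat" with probability 2/4, else "mat" or "fish"
```

Real LLMs replace the count table with a neural network, but the output is still a probability distribution over possible next tokens.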
Generative language models do sampling + classification over tokens. Do you have to classify over all the words in the dictionary?
No, only over the sub-words. This gives a smaller class size to predict over, which is done for efficiency: a full word vocabulary would mean too many classes, many of which are rarely used.
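A sketch of what classifying over sub-words looks like; the vocabulary and the greedy longest-match tokenizer below are purely illustrative, not any specific library:

```python
# Hypothetical sub-word vocabulary; real ones have ~30k-100k entries,
# far fewer than the millions of distinct surface words in a corpus.
vocab = ["un", "happi", "ness", "play", "ing", "ed", "s", "<unk>"]

def tokenize(word, vocab):
    """Greedy longest-match segmentation into sub-words (illustrative only)."""
    pieces = []
    while word:
        match = next((p for p in sorted(vocab, key=len, reverse=True)
                      if word.startswith(p)), None)
        if match is None:
            return ["<unk>"]
        pieces.append(match)
        word = word[len(match):]
    return pieces

print(tokenize("unhappiness", vocab))  # ['un', 'happi', 'ness']
print(tokenize("playing", vocab))      # ['play', 'ing']
```

The model only ever has to score entries of `vocab`, yet it can still emit any word the sub-words compose.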
Why don’t we use character classification?
With characters, the class size is too small: each word takes many prediction steps, so sequences become very long and generation takes too long.
How do they decide to group the sub-words?
Through frequency and statistics, e.g. by repeatedly merging the most frequently co-occurring symbol pairs.
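The lecture only says "frequency and statistics"; one common scheme that fits this description is byte-pair encoding (BPE). A minimal sketch of a single BPE merge step, assuming BPE-style merging:

```python
from collections import Counter

# Words as character sequences, with toy corpus frequencies.
words = {("l", "o", "w"): 5, ("l", "o", "w", "e", "r"): 2,
         ("n", "e", "w", "e", "s", "t"): 6, ("w", "i", "d", "e", "s", "t"): 3}

def best_pair(words):
    """Count every adjacent symbol pair and return the most frequent one."""
    pairs = Counter()
    for word, freq in words.items():
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += freq
    return pairs.most_common(1)[0][0]

def merge(words, pair):
    """Replace each occurrence of the pair with a single merged symbol."""
    merged = {}
    for word, freq in words.items():
        out, i = [], 0
        while i < len(word):
            if word[i:i + 2] == pair:
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged[tuple(out)] = freq
    return merged

pair = best_pair(words)     # ('e', 's'): occurs 6 + 3 = 9 times
words = merge(words, pair)  # 'newest' -> ('n', 'e', 'w', 'es', 't'), etc.
```

Repeating the merge step grows the sub-word vocabulary until it reaches the desired size.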
Instruction fine-tuning and human feedback
Instruction fine-tuning (step 1): a labeller demonstrates the desired output behaviour by showing the machine the correct answer for a given input.
Human feedback (step 2): a labeller ranks the model's outputs from best to worst.
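A sketch of what the training data for the two steps might look like; the field names and example prompt are illustrative, not from the lecture:

```python
# Step 1 (instruction fine-tuning): a labeller writes the desired output
# and the model is trained to reproduce it directly.
sft_example = {
    "prompt": "Explain the moon landing to a 6-year-old.",
    "demonstration": "Some people flew a rocket to the moon and walked on it.",
}

# Step 2 (human feedback): a labeller only RANKS several model outputs;
# the ranking is then used to steer further fine-tuning.
preference_example = {
    "prompt": "Explain the moon landing to a 6-year-old.",
    "outputs_ranked_best_to_worst": [
        "Some people flew a rocket to the moon and walked on it.",
        "The moon landing took place in 1969 during the Apollo 11 mission.",
        "Moon landing.",
    ],
}
```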
Pros and cons of feedback
[+] the model trains and adapts to come up with better answers
[-] it can become overly compliant
[-] it may not be safe
Why does the model hallucinate?
The model is always trained as if there is one right answer, so it predicts to the best of its ability based on probability. That probability is baked into its weights and parameters.
How can hallucination be mitigated through preferential learning?
Train the model to output sub-word tokens equivalent to "I don't know", and ask it questions with no answer during training. This manipulates the output probabilities.
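A sketch of what such fine-tuning pairs could look like; the examples are invented for illustration:

```python
# Pairs that raise the probability of an "I don't know" continuation
# for unanswerable questions, while keeping real answers where they exist.
idk_examples = [
    {"prompt": "What is the capital of France?",
     "target": "Paris."},                        # answerable: keep the answer
    {"prompt": "What did Napoleon eat on 3 March 1807?",
     "target": "I don't know."},                 # unanswerable: train IDK tokens
    {"prompt": "What number am I thinking of?",
     "target": "I don't know."},
]
```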
Why can’t we just use LLMs for everything and rely on probability?
Even though they are effective and good at synthesising information, they suffer from:
Low efficiency
Lack of updatability
Issues of provenance
What are the benefits of retrieval-based NLP?
It is efficient, updatable, provides provenance, and is effective and good at synthesis (see the sketch below).
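A minimal retrieval-then-generate sketch to make those benefits concrete; the word-overlap scorer and the generate() stub are placeholders, not a real retriever or model API:

```python
# Knowledge lives in an editable document store (updatable), and the
# retrieved text is shown alongside the answer (provenance).
documents = [
    "The Eiffel Tower is 330 metres tall.",
    "Mount Everest is 8,849 metres tall.",
    "The Great Wall of China is over 21,000 km long.",
]

def retrieve(query, docs, k=1):
    """Rank documents by naive word overlap with the query (toy retriever)."""
    def score(doc):
        return len(set(query.lower().split()) & set(doc.lower().split()))
    return sorted(docs, key=score, reverse=True)[:k]

def generate(prompt):
    """Stand-in for an LLM call that synthesises an answer from the prompt."""
    return f"<answer conditioned on: {prompt}>"

query = "How tall is the Eiffel Tower?"
context = " ".join(retrieve(query, documents))
print(generate(f"Context: {context}\nQuestion: {query}"))
```

Updating the system's knowledge is just editing `documents`, and the retrieved sentence doubles as the answer's provenance.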
Do autoregressive LMs simply predict the next token?
Yes, that's all they do (to a certain extent). They predict scores over the entire vocabulary at each step, and we then use those scores to choose one token or another. They also represent data in their internal and output representations.
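A sketch of that loop: at every step the model scores the whole vocabulary and we pick one token to append. The five-token vocabulary and the random scoring stub stand in for a real model:

```python
import math
import random

vocab = ["the", "cat", "sat", "mat", "<eos>"]

def logits(context):
    """Stand-in for the model: returns one score per vocabulary entry."""
    random.seed(" ".join(context))  # deterministic toy scores per context
    return [random.uniform(-1, 1) for _ in vocab]

def softmax(scores):
    exps = [math.exp(s) for s in scores]
    return [e / sum(exps) for e in exps]

# The autoregressive loop: score the ENTIRE vocabulary at each step,
# then append the chosen token (greedy here; sampling also works).
context = ["the"]
for _ in range(4):
    probs = softmax(logits(context))
    next_token = vocab[max(range(len(vocab)), key=probs.__getitem__)]
    if next_token == "<eos>":
        break
    context.append(next_token)
print(" ".join(context))
```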
How is generative pre-training different from predictive text?
Generative pre-training looks through far more data to produce more useful outputs; the goal is to learn USEFUL knowledge, not just locally plausible next words.
Emergent abilities: what can AI chatbots do that autocorrect cannot?
Play chess: this requires an internal model of chess rules and strategy, which the chatbot acquires from training rather than having it programmed in in the first place.