[NLP] Lecture 2: Large Language Models (Anna Rogers) Flashcards
WHat kind og LM do we have?
Auto regressive (auto finish?), masked model,
DIfference between corpus model and language model?
It would hep to call it corpus model, so it is more obvious the model is based on an exact corpus, so we don’t think it is not biased, compared to calling it language model, we “forget” it is not just langauge, it is trained on a specfific corpus.
Explain difference between pre-training and fine tuning
Pre-training: Pre-taining is not labelled, it is trained with regressive and masking (BERT), this is the base model
Fine-tuning: To make to do something other than predicting tokens, we fine tune it for at task
Biggest difference is the size of the available data
What happens during fine tuning?
Final layers gets changed the most
Can we fine-tune on many tasks at once?
Yes, “pre-fine-tuning”
What is instruction tuning?
It is trained on 20 different text tasks before fine-tuning (the T5 model)
WHat is few shot learning?
Give examples in the prompt
What is instruction tuning and RLHF?
Fine tuning: you take the same data as ect sentiment data, but you form it as an instruction
Explain basics about ChatGPT
- Dialogue version og InstructGPT
- New OpenAI in-house data (humans both writing and rating model response)
- New ranking data for RLHF
- Keeps changing under the haude
- We dont know anything else about the models
What is RAG
Retrieval augmented generation, a way to find out where the model it got the information. Bing does it, it provides sources
Why do the LM get better the bigger they are?
As long as you add more weights and data set, they will get better “neural scaling laws”
WHat is data contamination?
When the model has seen something in the training data, and we test it on something similar?
What is emergent properties
When a model can do something, it is not trained on. It is difficult to day, because we don’t know how much data it is allowed to see, to say it has been trained on
Can we say that ChatGPT has emergent properties with the way it plays chess`
What is the Eliza effect?