Handout #11 - Large Language Models Flashcards
What are large language models?
Large transformer networks, with billions of weights anbd trained on all text available on the internet.
What method do LLM’s rely on?
Auto-regressive models -> models that predict from the pas.
Explain tokenisation
This is when words are mapped into vectors.
The english language is too big -> reduce size to something small
GPT-3 uses ‘Reinforcement learning from human feedback’ (RLHF). Explain what this is
- Humans are asked to rank (lots) of results.
- Ranking are then used to score outputs.
- Score used to train a reward/preference model
- Fine-tune the original model through a reinforcement learning update.
Explain the concept of jail-breaking and prompt injection attacking in LLMs
This is the concept of asking harmful questions through other means -> role-playing. ‘Ignore previous instructions.’
Explain the concept of zero-shot or few-shots in LLMs
One can just ask LLMs to describe a task, without any example
In terms of LLM scaling laws, what determines the performance of LLMs
The number of parameters in the network N and the size of the training set D.
Give the definition of Artifical General Intelligence
The threshold where an agent can accomplish any intellectual task that human beings or animals can perform.