Handout #11 - Large Language Models Flashcards

1
Q

What are large language models?

A

Large transformer networks, with billions of weights anbd trained on all text available on the internet.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What method do LLM’s rely on?

A

Auto-regressive models -> models that predict from the pas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain tokenisation

A

This is when words are mapped into vectors.

The english language is too big -> reduce size to something small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

GPT-3 uses ‘Reinforcement learning from human feedback’ (RLHF). Explain what this is

A
  1. Humans are asked to rank (lots) of results.
  2. Ranking are then used to score outputs.
  3. Score used to train a reward/preference model
  4. Fine-tune the original model through a reinforcement learning update.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain the concept of jail-breaking and prompt injection attacking in LLMs

A

This is the concept of asking harmful questions through other means -> role-playing. ‘Ignore previous instructions.’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the concept of zero-shot or few-shots in LLMs

A

One can just ask LLMs to describe a task, without any example

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In terms of LLM scaling laws, what determines the performance of LLMs

A

The number of parameters in the network N and the size of the training set D.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give the definition of Artifical General Intelligence

A

The threshold where an agent can accomplish any intellectual task that human beings or animals can perform.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly