Introduction Flashcards

1
Q

Power of LLM

A

Use the ChatGPT API as a developer tool to quickly build software applications.

2
Q

Types of LLM

A

Base LLM
Instruction-tuned LLM

3
Q

Base LLM

A

It predicts the next word based on text training data from the internet and other sources. For example, if you write the start of a sentence, it will complete it. If you write a question, it may continue with more questions instead of an answer, because that is what it has learned from text on the internet.
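
Next-word prediction can be illustrated with a toy bigram model. This is only a minimal sketch: a real base LLM is a neural network trained on far more text, and the names train_bigram, complete and the three-sentence corpus are made up for illustration. It shows why a model trained on lists of questions tends to continue a question with question-like text.

```python
from collections import Counter, defaultdict


def train_bigram(corpus):
    """Count, for each word, which words follow it in the training text."""
    follows = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for current, nxt in zip(words, words[1:]):
            follows[current][nxt] += 1
    return follows


def complete(follows, prompt, max_words=5):
    """Extend the prompt by repeatedly appending the most frequent next word."""
    words = prompt.lower().split()
    if not words:
        return ""
    for _ in range(max_words):
        candidates = follows.get(words[-1])
        if not candidates:
            break  # the last word was never followed by anything in training
        words.append(candidates.most_common(1)[0][0])
    return " ".join(words)


# Hypothetical training text: a tiny "internet" made only of questions.
corpus = [
    "what is the capital of france ?",
    "what is the capital of spain ?",
    "what is the population of france ?",
]

model = train_bigram(corpus)
completion = complete(model, "what is")
print(completion)  # what is the capital of france ?
```

The toy model never answers the question. It only reproduces the most common continuation it has seen, which is the behaviour the card describes for a base LLM.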

4
Q

Instruction Tuned LLM

A

It is trained to follow instructions.
These LLMs start from a base LLM and are then further fine-tuned on examples of instructions (inputs) and responses that follow them (outputs), often with a technique called Reinforcement Learning from Human Feedback (RLHF), so that the system follows instructions better and is helpful, honest and harmless.
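
The "inputs and outputs that are instructions" can be pictured as instruction and response pairs rendered into training text. The sketch below is an assumption for illustration only: the InstructionExample class, the to_training_text helper and the "### Instruction / ### Response" template are hypothetical and are not the format of any particular model, and the RLHF step is not shown.

```python
from dataclasses import dataclass


@dataclass
class InstructionExample:
    """One fine-tuning example: an instruction and a response that follows it."""
    instruction: str
    response: str


def to_training_text(example):
    """Render one example with a hypothetical prompt template."""
    return (
        "### Instruction:\n"
        f"{example.instruction}\n\n"
        "### Response:\n"
        f"{example.response}"
    )


# Hypothetical fine-tuning data: the response answers the question directly,
# unlike the base-model behaviour of continuing with more questions.
examples = [
    InstructionExample("What is the capital of France?", "The capital of France is Paris."),
    InstructionExample("Translate 'hello' to Spanish.", "Hola."),
]

training_texts = [to_training_text(e) for e in examples]
print(training_texts[0])
```

Fine-tuning on many such pairs is what shifts the model from continuing text to answering the instruction.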
