Introduction Flashcards

Question 1

Q

Power of LLMs

Answer

A

Use the ChatGPT API as a developer tool to quickly build software applications.

Question 2

Q

Types of LLMs

Answer

A

Base LLM
Instruction-tuned LLM

Question 3

Q

Base LLM

Answer

A

It predicts the next word based on text training data from the internet and other sources. For example, if you write the start of a sentence, it will complete it; if you write a question, it may continue by adding more questions, because that is the pattern it has learned from the internet.
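Next-word prediction can be illustrated with a toy example. The sketch below is not how a real LLM is built (those use neural networks trained on far more text); it is a minimal bigram counter, with a made-up three-line corpus and hypothetical helper names `train_bigram` and `complete`, that only shows the idea of continuing a prompt with the most likely next word seen in training data.

```python
from collections import Counter, defaultdict


def train_bigram(corpus):
    """Count, for each word, which words follow it in the corpus."""
    follows = defaultdict(Counter)
    for line in corpus:
        words = line.lower().split()
        for current, nxt in zip(words, words[1:]):
            follows[current][nxt] += 1
    return follows


def complete(follows, prompt, max_words=3):
    """Extend the prompt by repeatedly appending the most frequent next word."""
    words = prompt.lower().split()
    if not words:
        return ""
    for _ in range(max_words):
        candidates = follows.get(words[-1])
        if not candidates:
            break  # the last word was never followed by anything in training
        words.append(candidates.most_common(1)[0][0])
    return " ".join(words)


# Illustrative training text (an assumption, not real training data).
corpus = [
    "the cat sat on the mat",
    "the cat sat on the sofa",
    "the cat slept",
]
follows = train_bigram(corpus)
print(complete(follows, "the cat"))
```

The model has no notion of answering anything: it only continues the prompt with whatever most often came next in its training text, which is why a base LLM given a question may respond with more questions.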

Question 4

Q

Instruction-Tuned LLM

Answer

A

It is trained to follow instructions.
These LLMs start from a base LLM and are then fine-tuned on inputs and outputs that are instructions and good attempts to follow them. They are often further refined with a technique called Reinforcement Learning from Human Feedback (RLHF), so that the system follows instructions better and is helpful, honest, and harmless.
