Large Language Models Flashcards
How do neural networks aim to emulate the brain’s manner of computation?
They use many simple processors, analogous to neurons, that communicate with one another.
what do Classifier Networks do?
They learn to map inputs onto categories by generalizing from training examples.
how do we build a classifier?
we train it on many training examples
supervised learning
each training example is an input paired with the correct category
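The cards above can be sketched in code. Below is a minimal, illustrative supervised-learning loop: a single perceptron-style unit (not a full neural network) adjusted on labelled input/category pairs. The data, learning rate, and number of passes are invented for the example.

```python
# Each training example is an input paired with the correct category (0 or 1).
# Data is a made-up, linearly separable toy set.
examples = [((0.0, 1.0), 1), ((1.0, 0.0), 0), ((0.2, 0.9), 1), ((0.9, 0.1), 0)]
w = [0.0, 0.0]  # weights, one per input feature
b = 0.0         # bias

for _ in range(20):                       # repeated passes over the training set
    for (x1, x2), label in examples:
        pred = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
        err = label - pred                # 0 when the guess is already correct
        w[0] += 0.1 * err * x1            # nudge weights toward the right answer
        w[1] += 0.1 * err * x2
        b += 0.1 * err

print(w, b)
```

After training, the unit classifies every example correctly; real networks do the same kind of error-driven weight adjustment, just with many units and layers.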
what are neural networks similar to?
A probabilistic model: their outputs can be interpreted as a probability distribution over the possible categories.
when training a network inputs cause what to happen?
An input generates activity in all of the output units; training adjusts the weights so that more of that activity goes to the correct output unit, increasing the chance of the right answer.
How could you interpret the outputs of a neural network?
As a probability distribution, since the activities over all possible outputs can be normalized to sum to 1.
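The normalization mentioned in this card is usually done with a softmax function, which turns raw output activities into probabilities that sum to 1. A minimal sketch (the example activities are invented):

```python
import math

def softmax(logits):
    """Turn raw output activities into probabilities that sum to 1."""
    exps = [math.exp(x - max(logits)) for x in logits]  # subtract max for numerical stability
    total = sum(exps)
    return [e / total for e in exps]

probs = softmax([2.0, 1.0, 0.1])  # illustrative output-unit activities
print(probs)       # the unit with the highest activity gets the highest probability
print(sum(probs))  # sums to 1, so the outputs read as a distribution
```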
How does a network for language processing work?
By taking a sequence of words (the prompt) as its input and learning to predict the next word.
what is self-supervised learning?
The network pretends it doesn't know the next word, predicts it, and then checks whether its guess matches the actual next word; the text itself supplies the correct answers, so no human labelling is needed.
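The self-supervised setup on this card amounts to slicing raw text into (prompt, next-word) training pairs. A tiny sketch, with an invented example sentence:

```python
# Turn unlabelled text into (prompt, next-word) training pairs:
# the text itself provides the "correct answer" for each prediction.
text = "the cat sat on the mat".split()

pairs = []
for i in range(1, len(text)):
    prompt, target = text[:i], text[i]  # everything so far, and the word to guess
    pairs.append((prompt, target))

for prompt, target in pairs:
    print(prompt, "->", target)
```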
what does a language processing network need to predict a good distribution?
Good word representations, and the ability to handle long prompts.
How should we represent words in a neural network?
One-Hot Encoding
A sparse vector where each word in the vocabulary is represented by a unique vector with a single high (1) value and all other values low (0).
“cat” 1 0 0 0 0
“dog” 0 1 0 0 0
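One-hot encoding is easy to sketch in code. The toy vocabulary below is invented; real vocabularies contain tens of thousands of words, so these vectors are very long and very sparse.

```python
# Hypothetical toy vocabulary for illustration.
vocab = ["cat", "dog", "fish", "bird", "mouse"]

def one_hot(word):
    """Return a sparse vector: 1 at the word's index, 0 everywhere else."""
    vec = [0] * len(vocab)
    vec[vocab.index(word)] = 1
    return vec

print(one_hot("cat"))  # [1, 0, 0, 0, 0]
print(one_hot("dog"))  # [0, 1, 0, 0, 0]
```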
Contextualized Word Embeddings
Word representations that take the surrounding context into account, providing different embeddings for the same word in different contexts.
what are embeddings?
Distributed word representations: each word is 'stuck' at a learned point in an n-dimensional space, so that words with related meanings end up near one another.
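To make the "points in space" idea concrete, here is a sketch with hand-made 2-dimensional embeddings; real embeddings are learned, have hundreds or thousands of dimensions, and these particular values are invented.

```python
import math

# Invented 2-D embeddings: related words sit at nearby points.
embeddings = {
    "cat": (0.9, 0.8),
    "dog": (0.85, 0.75),  # close to "cat": similar meaning, nearby point
    "car": (-0.7, 0.1),   # far from both: unrelated meaning
}

def distance(a, b):
    """Euclidean distance between two words' embedding points."""
    return math.dist(embeddings[a], embeddings[b])

print(distance("cat", "dog"))  # small: semantically similar
print(distance("cat", "car"))  # large: semantically unrelated
```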
how are LLMs’ input word sequences processed?
by an encoder network
how are LLMs’ output word sequences processed?
by a decoder network