Andrej Karpathy Flashcards

1
Q

What library does GPT use to tokenize text?

A

tiktoken (OpenAI's byte-pair encoding tokenizer library)

2
Q

How many tokens does GPT have in its vocabulary?

A

About 50,000 (50,257 for GPT-2)

3
Q

What is the length of each chunk of text sampled for training called?

A

Block size (also called context length)
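
A minimal sketch of how block size shapes a training batch, assuming the text is already a list of token ids. The helper name `get_batch` and the stand-in data are illustrative assumptions, not taken from the cards.

```python
import random

def get_batch(tokens, block_size, batch_size, rng):
    """Sample batch_size chunks of block_size tokens each.

    Each target chunk y is the input chunk x shifted one position right,
    so the model learns to predict the next token at every position.
    """
    xs, ys = [], []
    for _ in range(batch_size):
        # Pick a start index that leaves room for the shifted target.
        i = rng.randrange(len(tokens) - block_size)
        xs.append(tokens[i:i + block_size])
        ys.append(tokens[i + 1:i + block_size + 1])
    return xs, ys

tokens = list(range(100))  # stand-in token ids (assumption)
rng = random.Random(0)
xs, ys = get_batch(tokens, block_size=8, batch_size=4, rng=rng)
```

Every row of `xs` and `ys` has exactly `block_size` tokens, which is the longest context the model sees at once.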

4
Q

What is the simplest neural network for learning text patterns and generating text?

A

Bigram Language Model
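
A rough illustration of the bigram idea: the next character depends only on the current one. This sketch swaps in a count-based table for the trainable neural lookup table, so it shows the modelling assumption rather than a neural network. The function names and the toy text are assumptions.

```python
import random
from collections import defaultdict

def train_bigram(text):
    """Count how often each character follows each other character."""
    counts = defaultdict(lambda: defaultdict(int))
    for a, b in zip(text, text[1:]):
        counts[a][b] += 1
    return counts

def generate(counts, start, length, rng):
    """Sample up to `length` characters, each conditioned on the previous one."""
    out = [start]
    for _ in range(length):
        followers = counts.get(out[-1])
        if not followers:
            break  # the last character was never seen with a successor
        chars = list(followers)
        weights = [followers[c] for c in chars]
        out.append(rng.choices(chars, weights=weights)[0])
    return "".join(out)

counts = train_bigram("hello hello help")  # toy corpus (assumption)
sample = generate(counts, "h", 10, random.Random(0))
```

A neural version would replace the count table with a learned table of logits of shape vocabulary size by vocabulary size, trained by gradient descent.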

5
Q

The loss function typically paired with softmax outputs is…

A

Cross-entropy
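
A small sketch of cross-entropy computed on softmax outputs, in plain Python. It also shows the well-known result that the gradient with respect to the logits is the softmax probabilities minus the one-hot target. The function names and example logits are assumptions.

```python
import math

def softmax(logits):
    """Convert logits to probabilities; subtracting the max avoids overflow."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    """Negative log-probability assigned to the correct class index."""
    return -math.log(softmax(logits)[target])

def grad_logits(logits, target):
    """Gradient of the loss w.r.t. the logits: softmax minus one-hot target."""
    probs = softmax(logits)
    return [p - (1.0 if i == target else 0.0) for i, p in enumerate(probs)]

logits = [2.0, 1.0, 0.1]  # example values (assumption)
loss = cross_entropy(logits, target=0)
grad = grad_logits(logits, target=0)
```

The gradient entries sum to zero, and only the entry for the correct class is negative, so a gradient step raises that class's logit.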
