RL Concepts Flashcards
1
Q
What does the “Step Size” variable “alpha” do ?
A
Influences the rate of learning
2
Q
Difference between stationary vs non-stationary reward ?
A
Stationary rewards are constant. Non-stationary rewards vary.
3
Q
What does DeepMind’s AlphaTensor algorithm do ?
A
Speeds up matrix multiplication via Reinforcement Learning
4
Q
What is Bootstrapping?
A
Using one or more estimated values of a variable to update estimates of the same variable