RL Concepts Flashcards

1
Q

What does the “Step Size” variable “alpha” do ?

A

Influences the rate of learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Difference between stationary vs non-stationary reward ?

A

Stationary rewards are constant. Non-stationary rewards vary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does DeepMind’s AlphaTensor algorithm do ?

A

Speeds up matrix multiplication via Reinforcement Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Bootstrapping?

A

Using one or more estimated values of a variable to update estimates of the same variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly