RL Concepts Flashcards by Kevin LoGuidice

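What does the step-size parameter alpha do?

It controls how far each update moves the current estimate toward the new target, and so sets the rate of learning. A larger alpha adapts faster but is more sensitive to noise; a smaller alpha learns more slowly but more smoothly.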
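A minimal sketch of the idea, assuming the common incremental update form (new estimate = old estimate + alpha * (target - old estimate)). The function name `update_estimate` and the fixed target of 1.0 are hypothetical, chosen only for illustration.

```python
def update_estimate(estimate, target, alpha):
    """Move `estimate` a fraction `alpha` of the way toward `target`."""
    return estimate + alpha * (target - estimate)


# Same starting estimate and same repeated target, two step sizes.
slow = 0.0
fast = 0.0
for _ in range(10):
    slow = update_estimate(slow, 1.0, alpha=0.1)
    fast = update_estimate(fast, 1.0, alpha=0.5)

# The larger step size ends much closer to the target after 10 updates.
print(slow, fast)
```

With alpha = 0.5 the estimate closes half of the remaining gap on every update, while alpha = 0.1 closes only a tenth of it.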

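What is the difference between stationary and non-stationary rewards?

With stationary rewards, the reward distribution for each action stays fixed over time (individual rewards may still be random). With non-stationary rewards, that distribution changes over time.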
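One practical consequence can be sketched as follows, assuming a hypothetical noise-free reward sequence whose value jumps from 1.0 to 5.0 halfway through. The helper `track` and the step-size choices are illustrative assumptions: a sample average (step size 1/n) weights all past rewards equally, while a constant step size weights recent rewards more heavily.

```python
def track(rewards, step_size):
    """Incrementally update an estimate; step_size(n) is alpha for the n-th reward."""
    q = 0.0
    for n, r in enumerate(rewards, start=1):
        q += step_size(n) * (r - q)
    return q


# Hypothetical non-stationary rewards: the true value shifts from 1.0 to 5.0.
rewards = [1.0] * 100 + [5.0] * 100

sample_avg = track(rewards, lambda n: 1.0 / n)  # plain running mean
constant = track(rewards, lambda n: 0.1)        # recency-weighted average

print(sample_avg, constant)
```

In this sketch the running mean settles between the old and new values, whereas the constant step size follows the shift, which is why a constant alpha is often preferred when rewards are non-stationary.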

What does DeepMind’s AlphaTensor algorithm do?

It uses reinforcement learning to discover faster algorithms for matrix multiplication.

What is Bootstrapping?

Updating an estimate of a quantity using other estimates of that same quantity, for example updating a state's estimated value from the estimated value of its successor state.
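A minimal sketch of bootstrapping, using a one-step temporal-difference (TD(0)) value update as the example. The state names, the reward, and the alpha and gamma values are hypothetical; the point is that the target is built from `values[next_state]`, which is itself only an estimate.

```python
def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.9):
    """One TD(0) update of values[state], in place."""
    # Bootstrapped target: observed reward plus the discounted *estimate*
    # of the next state's value, not its true return.
    target = reward + gamma * values[next_state]
    values[state] += alpha * (target - values[state])
    return values


values = {"A": 0.0, "B": 2.0}
td0_update(values, "A", reward=1.0, next_state="B", alpha=0.5, gamma=0.9)
print(values["A"])
```

Here the estimate for state A is revised using the current estimate for state B, without waiting to observe the full return that follows.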
