Lecture 6 Flashcards
1
Q
What is the state-action value function?
A

2
Q
What is the optimal Q function? What is the optimal V function?
A

3
Q
What is Q-learning?
A

4
Q
What does ρ mean in the context of Q-learning?
A

5
Q
What is the Q-learning algorithm?
A
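A minimal sketch of tabular Q-learning in its common textbook form, which may differ from the lecture's notation. The names `q_learning`, `chain_step`, `alpha` (learning rate), `gamma` (discount factor) and `epsilon` (exploration rate) are assumptions, and the 4-state chain environment is a hypothetical toy example used only to make the update rule concrete.

```python
import random


def q_learning(n_states, n_actions, step, episodes=500, alpha=0.5,
               gamma=0.9, epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy.

    `step(s, a)` is an assumed environment interface that returns
    (next_state, reward, done). Every episode starts in state 0.
    """
    rng = random.Random(seed)
    Q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                # Explore: pick a uniformly random action.
                a = rng.randrange(n_actions)
            else:
                # Exploit: pick a greedy action, breaking ties at random.
                best = max(Q[s])
                a = rng.choice([u for u in range(n_actions) if Q[s][u] == best])
            s_next, r, done = step(s, a)
            # Bootstrapped target: reward plus discounted best next-state value.
            target = r if done else r + gamma * max(Q[s_next])
            # Move the current estimate a fraction alpha toward the target.
            Q[s][a] += alpha * (target - Q[s][a])
            s = s_next
            if done:
                break
    return Q


def chain_step(s, a):
    """Hypothetical 4-state chain: action 1 moves right, action 0 moves left.

    Reaching state 3 yields reward 1 and ends the episode.
    """
    s_next = min(s + 1, 3) if a == 1 else max(s - 1, 0)
    done = s_next == 3
    return s_next, (1.0 if done else 0.0), done


Q = q_learning(4, 2, chain_step)
# Greedy action in each non-terminal state under the learned Q table.
greedy = [max(range(2), key=lambda u: Q[s][u]) for s in range(3)]
```

In this toy setting the learned greedy policy should move right in every non-terminal state, since that is the only way to reach the reward. The fixed episode count stands in for a stopping rule here; it is a design shortcut, not a claim about the criterion the lecture uses.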

6
Q
When is Q-learning typically used? What is the stopping criterion?
A
