Lecture 6 Flashcards
1
Q
What is the State-action Value Function?
A
2
Q
What is the optimal Q function, What is the optimal V function?
A
3
Q
What is Q-learning?
A
4
Q
What does ρ mean, when looking at Q-learning?
A
5
Q
What is the Q-learning Algorithm?
A
6
Q
When is Q-learning typically used? What is the stopping critereon?
A