Lecture 5 Flashcards
1
Q
What is the definition of the minimal expected total discounted cost for a finite horizon?
A
2
Q
What is the algorithm of value iteration for the discounted critereon?
A
3
Q
What is a stopping critereon for value iteration for the discounted critereon?
A
4
Q
How to do value iteration for the long-run average critereon?
A