Lecture 4 Flashcards
1
Q
What notation is used for an infinite-horizon decision problem with discounted costs?
A

2
Q
What is the theoretical improvement step for the discounted cost criterion?
A

3
Q
How is the improvement step for the discounted cost criterion applied in practice?
A

4
Q
What is the policy iteration algorithm for a discounted MDP?
A
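The card's answer is not included here. As a study aid, the following is a minimal sketch of the standard textbook form of policy iteration for a discounted-cost MDP, not necessarily the lecture's notation. All names (`solve_linear`, `policy_iteration_discounted`, `P`, `c`, `alpha`) and the two-state example are assumptions made for illustration.

```python
def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [[float(x) for x in A[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def policy_iteration_discounted(P, c, alpha, max_iter=1000):
    """P[a][i][j]: transition probability, c[i][a]: one-step cost, 0 <= alpha < 1."""
    n, n_actions = len(c), len(c[0])
    policy = [0] * n  # arbitrary starting policy
    for _ in range(max_iter):
        # Evaluation step: solve (I - alpha * P_pi) v = c_pi for the current policy.
        A = [[(1.0 if i == j else 0.0) - alpha * P[policy[i]][i][j]
              for j in range(n)] for i in range(n)]
        b = [c[i][policy[i]] for i in range(n)]
        v = solve_linear(A, b)
        # Improvement step: in each state pick the action minimizing
        # c(i, a) + alpha * sum_j p_ij(a) v(j); keep the old action on ties.
        new_policy = []
        for i in range(n):
            q = [c[i][a] + alpha * sum(P[a][i][j] * v[j] for j in range(n))
                 for a in range(n_actions)]
            best = min(range(n_actions), key=lambda a: q[a])
            if q[policy[i]] <= q[best] + 1e-12:
                best = policy[i]
            new_policy.append(best)
        if new_policy == policy:  # no change: the policy is optimal
            return policy, v
        policy = new_policy
    return policy, v


# Hypothetical example: action 0 stays put, action 1 switches state.
P = [[[1, 0], [0, 1]],   # action 0
     [[0, 1], [1, 0]]]   # action 1
c = [[2, 3],             # state 0: staying costs 2, switching costs 3
     [0, 1]]             # state 1: staying is free, switching costs 1
policy, v = policy_iteration_discounted(P, c, alpha=0.9)
```

In this example the algorithm stops after two evaluation steps: it is cheaper to pay 3 once to leave state 0 than to pay 2 forever, so the returned policy switches in state 0 and stays in state 1.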

5
Q
What is the theoretical improvement step of policy iteration for the long-run average cost criterion?
A

6
Q
How is the improvement step of policy iteration applied for the long-run average cost criterion?
A

7
Q
What is the policy iteration algorithm for the long-run average cost criterion?
A
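The card's answer is not included here. As a study aid, the following is a minimal sketch of the standard textbook form of average-cost policy iteration, assuming every stationary policy is unichain so that the evaluation equations have a unique solution once one relative value is fixed at zero. The names (`solve_linear`, `policy_iteration_average`, `P`, `c`) and the two-state example are assumptions made for illustration and may differ from the lecture's notation.

```python
def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [[float(x) for x in A[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(n):
            if r != col:
                f = M[r][col] / M[col][col]
                M[r] = [x - f * y for x, y in zip(M[r], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def policy_iteration_average(P, c, max_iter=1000):
    """P[a][i][j]: transition probability, c[i][a]: one-step cost (unichain assumed)."""
    n, n_actions = len(c), len(c[0])
    policy = [0] * n  # arbitrary starting policy
    g, v = 0.0, [0.0] * n
    for _ in range(max_iter):
        # Evaluation step: solve g + v(i) = c(i, pi(i)) + sum_j p_ij v(j)
        # with the normalization v(n-1) = 0. Unknowns: v(0..n-2), then g.
        A = [[(1.0 if i == j else 0.0) - P[policy[i]][i][j]
              for j in range(n - 1)] + [1.0] for i in range(n)]
        b = [c[i][policy[i]] for i in range(n)]
        x = solve_linear(A, b)
        v, g = x[:n - 1] + [0.0], x[n - 1]
        # Improvement step: minimize c(i, a) + sum_j p_ij(a) v(j);
        # keep the old action on ties so the algorithm terminates.
        new_policy = []
        for i in range(n):
            q = [c[i][a] + sum(P[a][i][j] * v[j] for j in range(n))
                 for a in range(n_actions)]
            best = min(range(n_actions), key=lambda a: q[a])
            if q[policy[i]] <= q[best] + 1e-12:
                best = policy[i]
            new_policy.append(best)
        if new_policy == policy:  # no change: the policy is average-cost optimal
            return policy, g, v
        policy = new_policy
    return policy, g, v


# Hypothetical example: state 0 is expensive, state 1 is free;
# action 1 costs 1 extra and moves to the other state with probability 0.9.
P = [[[0.5, 0.5], [0.5, 0.5]],   # action 0
     [[0.1, 0.9], [0.9, 0.1]]]   # action 1
c = [[4, 5],
     [0, 1]]
policy, g, v = policy_iteration_average(P, c)
```

Here the starting policy (action 0 everywhere) has average cost 2. One improvement step switches to action 1 in state 0, which lowers the average cost to 25/14, and the next pass confirms that no further improvement is possible.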

8
Q
What are the optimality equations for the long-run average cost criterion?
A
