Lecture 4 Flashcards

1
Q

What is the notation with an infinite horizon decision problem where you discount the costs?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the theoretical improvement step of the discounted cost critereon?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the actual application of the improvement step?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the Policy Iteration Algorithm for Discounted MDP?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the theoretical improvement step for policy iteration for the long-run average cost critereon?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How to apply the improvement step for policy iteration for the long-run average cost critereon?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the algorithm for policy iteration for the long-run average cost critereon?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the optimality equations for the long-run average cost critereon?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly