Lecture 3 Flashcards

1
Q

How to calculate the average costs per week when dealing with a Markov decision process?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a stationary policy?

A

Every time you are in state i you choose action a

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the Markov property and Time Homogeneity?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the definition of discounted costs or discounted rewards?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is a stationary policy optimal (in case of minimization)?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What follows for any stationary policy if the Markov property is satisfied? V.. = c(i,…)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How follows that the minimal discounted costs are finite?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the optimality equations?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does solving the optimality equations look like? (Do not solve)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is a stationary policy just as good? When is a stationary policy better?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the definition of an long term average cost per time user problem?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the definition for an optimal long term policy?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is a policy optimal for discounted and long term costs?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly