Lecture 3 Flashcards by M d Heijer

How to calculate the average costs per week when dealing with a Markov decision process?

How well did you know this?

Not at all

Perfectly

What is a stationary policy?

Every time you are in state i you choose action a

How well did you know this?

Not at all

Perfectly

What is the Markov property and Time Homogeneity?

How well did you know this?

Not at all

Perfectly

What is the definition of discounted costs or discounted rewards?

How well did you know this?

Not at all

Perfectly

When is a stationary policy optimal (in case of minimization)?

How well did you know this?

Not at all

Perfectly

What follows for any stationary policy if the Markov property is satisfied? V.. = c(i,…)

How well did you know this?

Not at all

Perfectly

How follows that the minimal discounted costs are finite?

How well did you know this?

Not at all

Perfectly

What are the optimality equations?

How well did you know this?

Not at all

Perfectly

What does solving the optimality equations look like? (Do not solve)

How well did you know this?

Not at all

Perfectly

When is a stationary policy just as good? When is a stationary policy better?

How well did you know this?

Not at all

Perfectly

What is the definition of an long term average cost per time user problem?

How well did you know this?

Not at all

Perfectly

What is the definition for an optimal long term policy?

How well did you know this?

Not at all

Perfectly

When is a policy optimal for discounted and long term costs?

How well did you know this?

Not at all

Perfectly

Lecture 3 Flashcards

(13 cards)