L6 - AAA Flashcards

1
Q

1 / 1-gamma

A

Effective Horizon.

The time to find an optimal policy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the relationship of gamma in value iteration?

A

Gamma tells you what your horizon is.

Smaller gamma stays more in the present.

Larger gamma pushes out into the future.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What happens with a gamma close to 0?

A

Agent becomes short sighted.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In what time does Value Iteration solve MDPs.

A

> than polynomial but it is possible to get close to the optimal policy in polynomial time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly