Intelligent Agents Flashcards by Love Broman

Kan MDPs ha kända states?

How well did you know this?

Not at all

Perfectly

When is it best to use Q-Learning?

When the optimal action depend on the current state and we dont beforehand know the reward of each state.

How well did you know this?

Not at all

Perfectly

Is Q-learning modelfree?

Yes

How well did you know this?

Not at all

Perfectly

What are the advantages of thompson samling over UBC?

It is extensible for contexutal bandits

How well did you know this?

Not at all

Perfectly

Why is state factorization important?

Allows us to handle combinatorial explosion of states

How well did you know this?

Not at all

Perfectly

What does mixed policies mean?

That we assign propabilities to policies rather than choose policiy entirely.

How well did you know this?

Not at all

Perfectly

What are need for a policy to be differentiable?

That it’s mixed

How well did you know this?

Not at all

Perfectly

How does policicy gradients work?

How well did you know this?

Not at all

Perfectly