Introduction to RL. Multi-armed bandits Flashcards

1
Q

Reinforcement Learning is both a class of ___ and a class of ___

A

problems

algorithms

2
Q

Policy

A

Mapping between what the agent is seeing and what the agent chooses to do
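
To make the idea of a mapping concrete, here is a minimal Python sketch. The state names, action names, and probabilities are hypothetical and chosen only for illustration; they do not come from the course material.

```python
import random

# Hypothetical states and actions, for illustration only.
# A deterministic policy: each observed state maps to exactly one action.
deterministic_policy = {
    "low_battery": "recharge",
    "high_battery": "search",
}

# A stochastic policy: each observed state maps to action probabilities.
stochastic_policy = {
    "low_battery": {"recharge": 0.9, "search": 0.1},
    "high_battery": {"recharge": 0.2, "search": 0.8},
}

def act_deterministic(policy, state):
    """Look up the single action the policy prescribes for this state."""
    return policy[state]

def act_stochastic(policy, state, rng):
    """Sample an action according to the policy's probabilities for this state."""
    actions = list(policy[state].keys())
    weights = list(policy[state].values())
    return rng.choices(actions, weights=weights, k=1)[0]

rng = random.Random(0)
chosen = act_deterministic(deterministic_policy, "low_battery")
sampled = act_stochastic(stochastic_policy, "high_battery", rng)
```

In both cases the policy only answers the question "given what I see, what do I do?"; it says nothing about how good that choice is.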

3
Q

Rewards

A

An (immediate) numerical signal that tells the agent how good or bad its actions are
The agent's goal is to get as much reward as possible

4
Q

Value Functions

A

Long-term functions of reward

We need to see how the agent fares in the long term, not just immediately
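
One common way to turn a stream of immediate rewards into a long-term quantity is the discounted return. The sketch below assumes a discount factor `gamma` (a parameter not mentioned on the card) and a hypothetical list of rewards; it illustrates the idea rather than giving a full value-function definition.

```python
def discounted_return(rewards, gamma=0.9):
    """Sum of rewards where the reward t steps ahead is weighted by gamma**t.

    `gamma` is an assumed discount factor between 0 and 1:
    smaller values make the agent care less about distant rewards.
    """
    total = 0.0
    # Work backwards so each step is: reward now + gamma * (everything after).
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total

# Hypothetical reward sequence: 1 + 0.5 * 1 + 0.25 * 1
example = discounted_return([1.0, 1.0, 1.0], gamma=0.5)
```

A value function is then the expected value of such a return from a given state, under a given policy.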

5
Q

Models

A

(of the problem/environment)

6
Q

State

A

Represents the information relevant to solving the task

7
Q

Actions

A

what the agent can do

8
Q

Goal

A

Drives the behaviour of the agent (expressed through rewards)

9
Q

Dynamics

A

Describes how the actions of the agent influence the environment

10
Q

The agent does not know the ___ and the ___

A

Goal

Dynamics

11
Q

The agent should interact with (explore / exploit) the environment and figure out what the goal and the dynamics are

A
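
As an illustration of exploring and exploiting, here is a minimal sketch of an epsilon-greedy agent on a multi-armed bandit. Epsilon-greedy is one common strategy, not necessarily the one used in the course; the function name, the Gaussian reward noise, and the parameter values are all assumptions made for this example.

```python
import random

def run_epsilon_greedy(true_means, steps=1000, epsilon=0.1, seed=0):
    """Play a bandit whose arm means are hidden from the agent.

    The agent only sees sampled rewards and keeps a running average
    per arm as its estimate of how good that arm is.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # agent's current guess of each arm's mean reward
    counts = [0] * n_arms       # how many times each arm was pulled
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm to learn more about it.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pull the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: true mean plus unit-variance Gaussian noise.
        reward = rng.gauss(true_means[arm], 1.0)

        counts[arm] += 1
        # Incremental update of the running average for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward

# Hypothetical three-armed bandit; the agent never sees these means directly.
estimates, counts, total_reward = run_epsilon_greedy([0.2, 0.5, 0.8])
```

The agent starts with no knowledge of which arm pays best and learns it only from the rewards it receives, which is the sense in which it has to discover the goal through interaction.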
