Reinforcment learning Flashcards

1
Q

What does the agent do at each timestep t?

A

Take action a_t
Recieve observation o_t
Recieve reward r_t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does the environment do at each timestep t?

A

Recieve action a_t
Emit observaton o_t
Emit reward r_t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the environment state?

A

The environment state is the internal representation of the environment and usually not vissible to the agent

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the agent state?

A

The agent state is the agents internal representation, the information the agent uses to make decissions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a fully observable environment?

A

Agent state = Environment state = Information state. This is a MDP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the markov assumption?

A

s_t and a_t are independent of former states and actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the goal of reinforcment learning?

A

Find a policy that maximises the sum of rewards.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the value function in reinforcment learning?

A

The predicted expected reward of all future actions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the idea behind DQN (Deep Q learning)

A

Use a neural network to estimate the Q(s,a) function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some tricks for training the network in DQN?

A

1) Experience replay, store old action/state/reward tuples and sample from them for training
2) Periodic updates.
3) Clip rewards.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some challenges of DQN?

A

1) Non- idd. data
2) Rapid policy changes
3) Unknown reward range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are two applications of reinforcment learning to imaging?

A

Anatomical landmark detection

Standard plane detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a multi scale agent?

A

A agent that can choose to change the resolution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly