Reinforcement Learning Flashcards

1
Q

Base components of every RL framework

A

Action, environment(state), reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a discount rate?

A

The discount factor essentially determines how much the reinforcement learning agents cares about rewards in the distant future relative to those in the immediate future.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a policy?

A

A factory mapping states to actions, Often denoted using PI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a greedy policy?

A

A policy where the agent always chooses the best expected return

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a discrete space?

A

Discrete spaces has finite states and finite actions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a continuous space?

A

Continuous spaces can have a wide range of numbers and most physical space actions are continuous by nature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When do you want to use MDPs like Monte carlo/TD-learning?

A

For finite spaces where actions are limited

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When do you want to use Deep reinforcement learning?

A

For continuous space tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly