Reinforcement Learning Flashcards

Question 1

Q

What are the basic components of every RL framework?

Answer

A

Action, environment (state), and reward. The agent chooses an action, and the environment responds with a new state and a reward.
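
As a rough illustration of how these components interact, here is a minimal sketch of an interaction loop. The `CoinFlipEnv` class, its `step` method, and `run_episode` are hypothetical names invented for this card, not part of any library.

```python
import random


class CoinFlipEnv:
    """Hypothetical toy environment: the agent guesses a coin flip (0 or 1)."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.state = 0  # the state is simply the last coin outcome

    def step(self, action):
        """Apply an action and return (next_state, reward)."""
        outcome = self.rng.randint(0, 1)
        reward = 1.0 if action == outcome else 0.0
        self.state = outcome
        return self.state, reward


def run_episode(env, policy, num_steps=10):
    """Run the loop: observe state, pick action, receive reward."""
    total = 0.0
    state = env.state
    for _ in range(num_steps):
        action = policy(state)            # the agent picks an action
        state, reward = env.step(action)  # the environment responds
        total += reward
    return total


# A trivial policy that always guesses 0.
total_reward = run_episode(CoinFlipEnv(seed=0), policy=lambda state: 0)
```

The total reward depends on the random coin flips, so it is only bounded between 0 and the number of steps.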

Question 2

Q

What is a discount rate (discount factor)?

Answer

A

The discount factor (often written γ, with a value between 0 and 1) determines how much the reinforcement learning agent cares about rewards in the distant future relative to those in the immediate future.
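
To make the effect concrete, here is a small sketch that computes the discounted return G = r_0 + γ·r_1 + γ²·r_2 + ... for a short reward sequence. The function name `discounted_return` and the sample rewards are assumptions chosen for illustration.

```python
def discounted_return(rewards, gamma):
    """Sum rewards, weighting the reward at step t by gamma ** t."""
    g = 0.0
    # Work backwards so each step is: reward now + gamma * (return after it).
    for r in reversed(rewards):
        g = r + gamma * g
    return g


rewards = [1.0, 1.0, 1.0]

short_sighted = discounted_return(rewards, gamma=0.0)  # only the first reward counts: 1.0
balanced = discounted_return(rewards, gamma=0.5)       # 1 + 0.5 + 0.25 = 1.75
far_sighted = discounted_return(rewards, gamma=1.0)    # all rewards count equally: 3.0
```

A γ near 0 makes the agent short-sighted, while a γ near 1 makes it weigh future rewards almost as much as immediate ones.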

Question 3

Q

What is a policy?

Answer

A

A function mapping states to actions, often denoted π (pi).

Question 4

Q

What is a greedy policy?

Answer

A

A policy where the agent always chooses the action with the highest expected return.
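
A minimal sketch of this idea, assuming the expected returns are already known and stored in a table. The states, actions, and values in `q_values` are made up for illustration.

```python
# Hypothetical expected returns for each (state, action) pair.
q_values = {
    "s0": {"left": 0.2, "right": 0.8},
    "s1": {"left": 0.5, "right": 0.1},
}


def greedy_policy(state):
    """Return the action with the highest expected return in this state."""
    actions = q_values[state]
    return max(actions, key=actions.get)
```

Here `greedy_policy("s0")` picks `"right"` and `greedy_policy("s1")` picks `"left"`. A purely greedy policy never explores, which is why it is often combined with some randomness during learning.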

Question 5

Q

What is a discrete space?

Answer

A

A discrete space has a finite number of states and a finite number of actions.

Question 6

Q

What is a continuous space?

Answer

A

A continuous space takes real-valued states or actions within a range, so there are infinitely many possible values. Most actions in physical environments are continuous by nature.

Question 7

Q

When do you want to use tabular methods for solving MDPs, such as Monte Carlo or TD learning?

Answer

A

For finite (discrete) spaces where the number of states and actions is limited, so a value can be stored for each one.
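
As one example of a TD method on a finite space, here is a sketch of a single Q-learning update, with values stored in a table keyed by (state, action). Q-learning is only one TD variant; the function name, the step size `alpha`, and the sample states and actions are assumptions for illustration.

```python
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, actions,
                      alpha=0.1, gamma=0.9):
    """Move Q(state, action) a step toward the TD target."""
    # Best value the table currently predicts for the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


q = defaultdict(float)  # every unseen (state, action) pair starts at 0.0
actions = ["left", "right"]

# One observed transition: in s0, going right gave reward 1.0 and led to s1.
new_value = q_learning_update(q, "s0", "right", 1.0, "s1", actions)
```

The table needs one entry per (state, action) pair, which is why this approach suits small finite spaces.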

Question 8

Q

When do you want to use deep reinforcement learning?

Answer

A

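For tasks with continuous (or very large) state or action spaces, where a table of values is impractical and a neural network approximates the value function or policy instead.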

(8 cards)