RL2 Flashcards

1
Q

Backward view TD(lamba) - pseudo

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Sarsa(lamba) - pseudo

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Gradient MC for estimating v_hat

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Semi-gradient TD(0) for estimating v_hat

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Semi-gradient n-step for estimating v_hat

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Episodic semi-gradient Sarsa for stimating q_hat

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

MC policy gradient method for estimating pi_theta

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

QAC

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

QAC with advantage function

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly