Topic 5: Temporal Difference Learning Flashcards
1
Q
Temporal difference learning
A
A learning process in which changes in the environment over time cause changes in expectations, which in turn cause changes in behaviour.
2
Q
TDL model vs. Rescorla-Wagner (R-W) model
A
- Multiple steps vs. one step
- Continuous vs. discrete
- Temporal (within a trial) vs. per trial
- Driven by changes in the expectation of reward vs. by reward itself
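The contrast above can be sketched in code. This is a minimal illustration, not a definitive implementation: the function names, the learning rate `alpha`, and the discount factor `gamma` are assumptions introduced here, and the TD rule shown is the simple one-step form, TD(0).

```python
def rescorla_wagner_update(v, reward, alpha=0.1):
    """One per-trial update (R-W style).

    The error is the reward received on the trial minus the current
    expectation v. alpha is an assumed learning rate.
    """
    error = reward - v
    return v + alpha * error


def td0_update(values, t, reward, alpha=0.1, gamma=1.0):
    """One within-trial update at time step t (one-step TD sketch).

    The error compares the reward plus the discounted expectation at
    the next time step with the expectation at the current step, so a
    change in expectation alone can produce an error.
    """
    # Expectation after the last step of the trial is taken to be 0.
    next_value = values[t + 1] if t + 1 < len(values) else 0.0
    error = reward + gamma * next_value - values[t]
    updated = list(values)  # copy so the caller's list is not mutated
    updated[t] = values[t] + alpha * error
    return updated, error
```

For example, `td0_update([0.0, 1.0], 0, 0.0, alpha=0.5)` produces a positive error at step 0 even though no reward is delivered there, because the expectation rises from step 0 to step 1. The R-W update has no such term: with a reward of 0 and an expectation of 0, its error is 0.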
3
Q
Dopamine codes prediction errors
A
Neurons that release dopamine appear to mimic the error function from temporal difference learning.
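A small simulation can illustrate what that error signal looks like over learning. This is a sketch under stated assumptions, not a model of real neurons: a cue is followed by a fixed number of time steps with reward on the last one, the expectation before the cue is fixed at 0 (the cue arrives unpredictably), and the names `simulate_td_errors`, `alpha`, and `gamma` are hypothetical.

```python
def simulate_td_errors(n_trials=200, n_steps=5, alpha=0.5, gamma=1.0, reward=1.0):
    """Return a list of (cue_error, reward_error) pairs, one per trial."""
    values = [0.0] * n_steps  # expectation at each time step after cue onset
    history = []
    for _ in range(n_trials):
        # Error at cue onset: pre-cue expectation is assumed to stay at 0.
        cue_error = gamma * values[0] - 0.0
        reward_error = 0.0
        for t in range(n_steps):
            r = reward if t == n_steps - 1 else 0.0  # reward only on the last step
            next_value = values[t + 1] if t + 1 < n_steps else 0.0
            error = r + gamma * next_value - values[t]
            values[t] += alpha * error
            if t == n_steps - 1:
                reward_error = error
        history.append((cue_error, reward_error))
    return history


history = simulate_td_errors()
first_cue, first_reward = history[0]
last_cue, last_reward = history[-1]
```

On the first trial the error occurs at the reward and not at the cue. After many trials the error at the reward shrinks toward 0 and the error at the cue grows toward the reward size, which is the pattern the card compares dopamine neurons to.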
4
Q
Reward coding
A
The dopamine neuron response is proportional to R(pS)
5
Q
Delay coding
A
The greater the delay from cue onset, the weaker the response of dopamine neurons. When the reward is delivered, dopamine neurons fire.