Reinforcement Flashcards
If the probability of choosing action i is:
p_i = exp(ßm_i) / sum_j exp(ßm_j)
What are the variables?
m_i is the estimated reward for action i.
ß is the choosiness or exploitation parameter.
If ß=0, all actions are chosen with equal probability, so there is no discrimination between them. If ß is large, the action with the highest estimated reward is chosen with probability near one, so there is little exploration.
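The effect of ß can be checked numerically. The following is a minimal Python sketch, not taken from the cards: the function name softmax_probs and the example reward estimates are made up for illustration, and subtracting the largest value before exponentiating is a standard numerical precaution that does not change the probabilities.

```python
import math

def softmax_probs(m, beta):
    """Return p_i = exp(beta*m_i) / sum_j exp(beta*m_j) for each action i."""
    scaled = [beta * m_i for m_i in m]
    # Shifting by the maximum leaves the ratios unchanged but avoids overflow
    # when beta is large.
    top = max(scaled)
    weights = [math.exp(s - top) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical reward estimates for three actions.
estimates = [1.0, 2.0, 3.0]

uniform = softmax_probs(estimates, 0.0)   # beta = 0: every action equally likely
greedy = softmax_probs(estimates, 50.0)   # large beta: best action dominates

print(uniform)
print(greedy)
```

With ß=0 every weight is exp(0)=1, so each of the three actions gets probability 1/3. With ß=50 almost all of the probability goes to the third action, which has the highest estimate.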
How is m_i updated?
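m_i -> m_i + eta∂
where eta is the learning rate and
∂ = r - m_i
is the prediction error: the difference between the reward r received for action i and the current estimate.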
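The update rule can be sketched in a few lines of Python. This is an illustration only: the function name update_estimate, the constant reward of 1.0 and the learning rate of 0.5 are assumptions chosen for the example, not values from the cards.

```python
def update_estimate(m_i, r, eta):
    """Apply m_i -> m_i + eta * delta, where delta = r - m_i."""
    delta = r - m_i          # prediction error
    return m_i + eta * delta

# Hypothetical setting: the action always pays a reward of 1.0.
m = 0.0
for _ in range(20):
    m = update_estimate(m, r=1.0, eta=0.5)

print(m)  # the estimate has moved from 0.0 to very close to the reward of 1.0
```

Each update moves the estimate a fraction eta of the way towards the observed reward, so with a constant reward the estimate converges on that reward. A smaller eta gives slower but smoother learning.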
What is the basal ganglia and where is it located?
The basal ganglia is a collection of sub-cortical brain areas, or nuclei, found near the centre of the brain and connected to the cortex, thalamus and brain stem.
What three parts of the brain is the basal ganglia connected to?
Thalamus, cortex, brain stem.
What is thought to be the function of the basal ganglia?
It is thought to be important in decision making, action selection and the regulation of some routine behaviours, such as eye movements.