Reinforcement Flashcards

1
Q

If the probability of choosing action i is:
p_i = exp(β m_i) / Σ_j exp(β m_j)
What are the variables?

A

m_i is the estimated reward for action i.
β is the choosiness or exploitation parameter.
If β = 0, all actions are chosen with equal probability and there is no discrimination between them; if β is large, the action with the highest estimated reward is chosen with probability near one and there is little exploration.
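
A minimal sketch of this action-selection rule, to make the two limiting cases concrete. The function name softmax_probs and the reward estimates are made up for illustration; subtracting the maximum before exponentiating is a common numerical safeguard, not part of the card's formula, and it leaves the probabilities unchanged.

```python
import math

def softmax_probs(m, beta):
    """Return p_i = exp(beta*m_i) / sum_j exp(beta*m_j) for each action i."""
    z = [beta * x for x in m]
    # Shifting by the largest value cancels in the ratio but avoids overflow in exp.
    z_max = max(z)
    weights = [math.exp(v - z_max) for v in z]
    total = sum(weights)
    return [w / total for w in weights]

m = [1.0, 2.0, 3.0]  # hypothetical reward estimates for three actions

print(softmax_probs(m, 0.0))   # beta = 0: every action equally likely
print(softmax_probs(m, 1.0))   # moderate beta: better actions favoured, others still explored
print(softmax_probs(m, 50.0))  # large beta: almost all probability on the best action
```

With these estimates, raising beta moves probability steadily towards the third action, which is the exploration versus exploitation trade-off the card describes.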

2
Q

How is m_i updated?

A

m_i → m_i + η δ
where η is the learning rate and
δ = r - m_i
is the prediction error: the reward r actually received minus the current estimate m_i.
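
A short sketch of this update rule, assuming a single action that always pays the same reward. The function name update_estimate, the reward of 1.0 and the learning rate of 0.1 are illustrative choices, not values from the card.

```python
def update_estimate(m_i, r, eta):
    """Move the estimate m_i a fraction eta of the way towards the reward r."""
    delta = r - m_i           # prediction error
    return m_i + eta * delta  # updated estimate

m = 0.0  # initial estimate (assumed)
for trial in range(50):
    m = update_estimate(m, r=1.0, eta=0.1)

print(m)  # the estimate has moved close to the true reward of 1.0
```

Each step shrinks the remaining error by a factor of (1 - eta), so a larger learning rate tracks the reward faster but makes the estimate noisier when rewards vary from trial to trial.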

3
Q

What are the basal ganglia and where are they located?

A

The basal ganglia are a collection of sub-cortical brain areas, or nuclei, found near the centre of the brain and connected to the cortex, thalamus and brain stem.

4
Q

What three parts of the brain are the basal ganglia connected to?

A

Thalamus, cortex, brain stem.

5
Q

What is thought to be the function of the basal ganglia?

A

Important in decision making, action selection and the regulation of some routine behaviours, such as eye movements.
