Credit Assignment Flashcards

1
Q

What is the credit assignment problem?

A

The credit assignment problem deals with how to assign credit for an outcome over a sequence of steps that leads to the outcome with maximal reward (optimal policy).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Credit assignment equation

A

Vi = p1x(V1) + p2x(V*2)…..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the optimal policy in credit assignment need to account for?

A

The optimal policy (Vi) needs to take into consideration current state, the value of state for each step taken (Vn) as well as the probability (p) of the state transition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How will an algorithm determine the optimal policy?

A

The algorithm will take decisions that exploit to estimate the steps with greatest reward and explore to identify the values of different paths in order to find the optimal policy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the temporal credit assignment problem?

A

The temporal credit assignment problem takes into consideration that the value of the reward diminishes with increasing time through the discount factor (gamma). Gamma tends to 1 if the value of the reward minimally diminishes over time and tends to 0 if the value of the reward greatly diminishes over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the equation for the temporal credit assignment problem?

A

V(t) = r(t) + yV(t+1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Outline the processes involved in perceptual decision making

A

a decision variable evolves until a stopping mechanism commits the process to a particular choice (Schall 2001)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What first evidence suggested that the LIP was involved in decision making?

A

studies demonstrating that neuronal firing in the LIP could predict the direction, timing and magnitude of saccadic eye movement in monkeys viewing random-dot motion for reward (Shadlen and Newsome, 2001)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What variation to initial LIP studies suggested that LIP encodes temporal properties?

A

When the visual stimulus was presented only briefly, LIP activity persisted for up to 800ms until a decision was made which corresponded to the saccade (Huk and Shadlen, 2005), indicating that the LIP is involved in time integration of the response.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What have the spike rates of LIP neurones been interpreted as?

A

The spike rates of single LIP neurons have been interpreted as direct neural correlates of an evolving decision variable (Gold & Shadlen 2007).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What regions besides the LIP have been implicated in decision making?

A

Meanwhile, neuronal firing in the middle temporal visual area (MT) have been implicated in representing the motion stimulus (Britten et al., 1993).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Outline a model for decision making by LIP

A

Together, these properties have given rise to a model where LIP neurons either integrate, or reflect the integration of, motion evidence from area MT in favour of a decision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What study opposes LIP as the decision maker?

A

However, that silencing the LIP does not impact decision making (Katz et al.,2016) suggests that direction-related signals in the LIP may be a result of feedback or extensive training.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly