Reward and motivation Flashcards

Question 1

Q

State-dependency of reward

Answer

A

The value of rewards depend on the state of the receiver. E.g. a bottle of water is much more rewarding if you’re stuck in the desert.

Thus, it is dependent on subjective utility

Question 2

Q

Reward

Answer

A

Stimulus that elicits approach behavior

Question 3

Q

Primary and secondary reinforcers

Answer

A

Primary reinforcers elicit approach behavior because of its implicit, unconditioned value (e.g. food)

Secondary reinforcers are rewards that are conditioned, but they don’t have intrinsic value. E.g. money is a reward that is conditioned, you can’t really use it for anything unless you spend it.

Question 4

Q

negative reinforcement

Answer

A

Reinforcement that is due to the removal of a punishment.

Question 5

Q

Reward pathways

Answer

A

Nigrostratal pathway
Mesolimbic pathway
Mesocortical pathway

These pathways are characterised by having cells that synthesize dopamine. Thereby, most of the dopamine in the brain stems from one of the two.
The nigrostriatal pathway has dopamine synthesising cells in the substantia nigra.
The Mesolimbic and mesocortical pathways have dopamine synthesising neurons in the ventral tegmental area.

See dedicated cards for further explanations.

Question 6

Q

Nigrostriatal pathway

Answer

A

The nigrostriatal pathway is a bilateral dopaminergic pathway in the brain that connects the substantia nigra pars compacta (SNc) in the midbrain with the dorsal striatum (i.e., the caudate nucleus and putamen) in the forebrain. It is one of the major dopamine pathways in the brain, and is critical in the production of movement as part of a system called the basal ganglia motor loop. Death of neurons in this pathway can lead to Parkinson’s disease.

Using single cell recordings from the substantia nigra of monkeys, Romo and Schultz (1990) showed elevated activation when presented with food reward, and no activation when presented with no reward.

Question 7

Q

Mesocortical pathway

Answer

A

Like the mesolimbic pathway, the mesocortical pathway originates in the ventral tegmental area (VTA). The mesocortical pathway connects the VTA and the prefrontal cortex.

The mesocortical pathway is essential to the normal cognitive function of the dorsolateral prefrontal cortex (part of the frontal lobe), and is thought to be involved in cognitive control, motivation, and emotional response.

Dysfunction of the pathway is hypothesised to be involved in psychosis and schizophrenia.

Question 8

Q

Mesolimbic pathway

Answer

A

The mesolimbic pathway originates in the ventral tegmental area (VTA). The VTA has been called the reward center in the brain, but prof. John is skeptical (as always).

The pathway connects the VTA in the midbrain to the ventral striatum of the basal ganglia in the forebrain. The ventral striatum includes the nucleus accumbens and the olfactory tubercle.

The release of dopamine from the mesolimbic pathway into the nucleus accumbens regulates incentive salience (e.g. motivation and desire for rewarding stimuli) and facilitates reinforcement and reward-related motor function learning.

The dysregulation of the mesolimbic pathway and its output neurons in the nucleus accumbens plays a significant role in the development and maintenance of an addiction.

Question 9

Q

What is the role of the orbitofrontal cortex in reward processing?

Answer

A

The orbitofrontal cortex (OFC) is a zone of multimodal processing. Studies have shown that pleasurable stimuli from visual, auditory, gustatory or olfactory modalities will increase activity in the OFC, compared to unpleasant stimuli.

Monkeys with lesions in the OFC will select a boring capsule more often than a delicious banana, which shows that they aren’t able to evaluate the rewarding value of stimuli.

Question 10

Q

Medial vs lateral OFC

Answer

A

Medial OFC show increased activation in response to positive rewards

Lateral OFC show increased activation in response to punishment

Question 11

Q

Classical conditioning

Answer

A

Unconditioned stimulus = e.g. food
unconditioned response = drooling
Without conditioning, dogs will drool when they see food.

Neutral stimulus = e.g. a sound

During conditioning, the neutral stimulus is paired with the unconditioned stimuli.

When conditioned, the sound will now be a conditioned stimulus activating a conditioned response. Thus, the dogs will drool based on the sound, because it is conditioned to do so.

Question 12

Q

Where does reward anticipation show activation in the brain?

Answer

A

Midbrain and striatum

Question 13

Q

How is reward expectancy evaluated in the OFC?

Answer

A

Like in V1 or M1, it seems like the reward expectancy is encoded by many different neurons which have different tunings to different probabilities of rewards. These tuned activations all contribute to the reward expectancy.

Question 14

Q

How does the response of dopamine neurons change from unpredicted reward to predicted reward?
And what happens if the reward is predicted but doesn’t show up?

Answer

A

If a reward occurs without any cues, dopamine neurons in the VTA will fire.

If a reward is preceded by a conditioned stimulus in a way where the stimulus allows the brain to predict the reward (E.g. sound precedes food), the dopamine neurons will fire when the conditioned stimulus occurs, but not when the reward is presented.

If the conditioned stimulus is shown without reward, dopamine neurons will fire when the conditioned stimulus is shown, and there will be a decrease in firing when no reward is given.

I.e. the cells aren’t coding the pleasure of an experience, rather they are coding the reward prediction error.

Question 15

Q

How do scientist measure pleasure in mice?

Answer

A

They look at their facial expressions, ofc

Question 16

Q

Delay of gratification

Answer

A

E.g. in the marshmallow test, capability of delaying gratification was predictive for a later success in life.

However, the idea of temporal discounting should be taken into account, as the value of a reward decreases with increasing delay time.

Question 17

Q

Temporal discounting

Answer

A

The value of a reward decreases as delay time increases. E.g. one marshmallow now is as good as two marshmallows in 15 mins.

Question 18

Q

What is the ‘reward curve’?

Answer

A

The reward curve is a plot showing how different porportions of reward at different delay times create different preferences for ‘sooner-smaller rewards’ and ‘larger-later rewards’.

Can be used to find the point of subjective equality, where 50% chooses sooner-smaller and 50% chooses larger-later, because the temporal discounting equalizes the value of different sized rewards.

Question 19

Q

What does the discounted value of reward show?

Answer

A

How much the temporal discounting affects the value of a reward.