Instrumental Conditioning II Flashcards

1
Q

What is continuous reinforcement?

A
  • Reinforcement provided every single time after the desired behaviour is performed
  • Association is easy to make - learning occurs quickly
  • Extinction also occurs quickly after reinforcement is no longer provided
2
Q

What are partial reinforcement schedules?

A

Only reinforce the desired behaviour occasionally rather than all the time

  • slower learning = more difficult to make the association between behaviour and reinforcement
  • produces behaviour that’s more resistant to extinction
  • participants are tempted to persist in their behaviour in the hope that they will eventually be rewarded
3
Q

What are the two criteria we adjust for reinforcement schedules?

A

Interval - we present an outcome following a response once a set amount of time has passed, for example every minute

Ratio - we present an outcome whenever the animal accumulates a certain number of responses

4
Q

What can both interval and ratio be?

A

Fixed or variable

5
Q

What are fixed interval schedules?

A

The animal gets the outcome for responding after a fixed period of time since the last outcome was presented

  • the animal learns to time the interval, so responding isn’t uniform over the minute-long interval
6
Q

What are variable interval schedules?

A

The animal gets the outcome for responding after a non-fixed period of time

  • animal can’t learn to time the interval between the outcomes
    = over testing, we see uniform responding regardless of the length of the interval between outcome delivery
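
The difference between the two interval schedules comes down to how the waiting time is chosen. A minimal Python sketch, assuming a hypothetical helper `interval_schedule`, a steady response every 10 s, and an illustrative 30-90 s range for the variable case (none of these values come from the cards):

```python
import random

def interval_schedule(response_times, next_interval):
    """Return the response times that earn an outcome.

    An outcome becomes available once next_interval() seconds have passed
    since the last outcome; the first response after that point is reinforced.
    """
    reinforced = []
    last_outcome = 0.0
    wait = next_interval()
    for t in sorted(response_times):
        if t - last_outcome >= wait:
            reinforced.append(t)
            last_outcome = t
            wait = next_interval()  # draw the next waiting time
    return reinforced

# Hypothetical animal responding once every 10 s for 5 minutes.
responses = list(range(0, 301, 10))

# Fixed interval: the wait is always 60 s, so it can be timed.
fi = interval_schedule(responses, lambda: 60.0)
print(fi)  # -> [60, 120, 180, 240, 300]

# Variable interval: the wait changes after every outcome (assumed 30-90 s).
rng = random.Random(0)
vi = interval_schedule(responses, lambda: rng.uniform(30.0, 90.0))
print(vi)
```

In the fixed case the reinforced responses fall at perfectly regular times, which is what lets the animal time the interval. In the variable case the spacing is unpredictable, so there is nothing to time.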
7
Q

What are fixed ratio schedules?

A

The animal gets the outcome after a fixed number of responses have been performed
- a cumulative count of responses
- animals don’t respond uniformly
- they learn the number of responses required to generate the outcome

8
Q

What are variable ratio schedules?

A

The animal gets an outcome after a non-fixed number of responses have been performed
- animals get the outcome after making an average of 3 responses, but the exact number varies during training.

  • like the variable interval schedule, animals respond uniformly across testing
  • animals cannot learn the number of responses required to generate the outcome
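
The ratio schedules can be sketched the same way, counting responses instead of time. This is a rough Python illustration, assuming a hypothetical helper `ratio_schedule` and a requirement drawn uniformly from 1 to 5 so that it averages 3 (the 1-5 range is an assumption, only the average of 3 comes from the card):

```python
import random

def ratio_schedule(n_responses, next_requirement):
    """Return the 1-based response numbers that earn an outcome."""
    reinforced = []
    count = 0
    required = next_requirement()
    for i in range(1, n_responses + 1):
        count += 1  # cumulative count since the last outcome
        if count >= required:
            reinforced.append(i)
            count = 0
            required = next_requirement()  # draw the next requirement
    return reinforced

# Fixed ratio 3: every third response is reinforced, so the count is learnable.
fr3 = ratio_schedule(12, lambda: 3)
print(fr3)  # -> [3, 6, 9, 12]

# Variable ratio 3: the requirement changes after each outcome (assumed 1-5).
rng = random.Random(1)
vr3 = ratio_schedule(3000, lambda: rng.randint(1, 5))
print(3000 / len(vr3))  # responses per outcome, close to 3 on average
```

Under the variable schedule the next outcome could follow any response, so no fixed count can be learned.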
9
Q

Describe the graph of cumulative responses.

A

The little lines indicate delivery of the outcome
Fixed ratio - animals get the outcome after a set number of responses, which leads to responding in bursts of activity

10
Q

What does the line for variable ratio look like?

A

Longer than variable interval and has 4 lines coming out of it

11
Q

What does the line of fixed ratio look like?

A

Staggered like steps, and at each step there’s a line

12
Q

What does the line for variable interval look like?

A

Shorter than variable ratio (VR) and has only 3 lines coming out of it

13
Q

What does the fixed interval line look like?

A

Banner and at each banner there is a line

14
Q

What can we notice about the interval responses compared to ratio responses?

A

Interval responses accumulate much more slowly than ratio responses

15
Q

How do these apply to the real world?

A

Reinforcement schedules are used to control behaviour in the real world
Companies with performance bonuses operate a fixed ratio reinforcement schedule

  • common in banking
16
Q

Explain fixed odds betting terminals.

A

Fixed odds betting machines allow users to gamble on virtual events. Most popular = roulette

Programmed with fixed odds to give a certain % of winners
Machines return between 90-95% of the money put into the machine
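
The 90-95% return figure implies a long-run loss for the player. A small worked example in Python, assuming a hypothetical helper `expected_loss` and an illustrative total stake of 100 (the stake is not from the card):

```python
def expected_loss(total_staked, return_rate):
    """Long-run average amount kept by the machine for a given total stake."""
    return total_staked * (1.0 - return_rate)

# Illustrative stake of 100 at the two ends of the 90-95% return range.
loss_at_90 = expected_loss(100, 0.90)  # about 10
loss_at_95 = expected_loss(100, 0.95)  # about 5
print(round(loss_at_90, 2), round(loss_at_95, 2))
```

On average the machine keeps roughly 5 to 10 of every 100 staked, even though individual wins arrive unpredictably.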

17
Q

What are variable reinforcement schedules?

A

People respond uniformly and the outcome does not alter responses

18
Q

What is ratio reinforcement?

A

The schedule under which responses accumulate the fastest

19
Q

What signals reward in the brain?

A

The ventral tegmental area (VTA)

20
Q

What does the VTA do?

A

Contains neurons that synthesise the neurotransmitter dopamine
This population of dopamine cells is similar to the cells in the substantia nigra, the midbrain neurons that are lost in Parkinson's disease

21
Q

What is the mesolimbic pathway?

A

Consists of dopamine-producing neurons that release dopamine into the cells in the nucleus accumbens

The major pathway by which reward is mediated by the brain

22
Q

What is the mesocortical pathway?

A

Consists of dopamine-producing neurons that release dopamine into the cells in the prefrontal cortex (PFC)

The PFC is involved in conscious decision making and inhibition of action

23
Q

What are reward stimuli?

A

• Natural rewards that increase the survival and fitness of a species activate the reward circuit
• These behaviours and stimuli include certain foods (like those containing high sugar or fat levels), social bonding, etc.
• Most drugs of abuse also activate the reward circuit and dopamine signalling, which plays a critical role in the formation of addiction

24
Q

What is extinction?

A

The gradual weakening and eventual disappearance of a learned response when reinforcement or association no longer occurs

25
Q

What is spontaneous recovery?

A

• After extinction, the conditioned response may reappear temporarily if the CS is presented again after a period of rest.
• Learning is not erased: extinction suppresses the response but does not completely erase the original association. Re-conditioning is typically faster than initial learning.

26
Q

What is generalisation?

A

The tendency for a learned response to occur in the presence of stimuli that are similar to the original stimulus