Learning & Motivation Flashcards

1
Q

What are the 4 different schedules of partial reinforcement (PRF) and their effect on response rate

A

Ratio (Rft is based on number of responses) and interval (Rft is based on time) - fixed or variable

Fixed ratio (FR)                    Fixed interval (FI)
Variable ratio (VR)               Variable interval (VI)

Fixed & variable ratio produce the highest rate of response
Fixed interval produces a higher response rate than variable interval, however less than ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is extinction and the partial reinforcement extinction effect (PREE), and what are two explainations of this effect

A

Where the established response tends to decline when the reinforcer is removed (zero contingency between response and reinforcement)

PREE - partial reinforcement is more resistant to extinction compared to continuous reinforcement

motivation - subjects in a PRF schedule become more resilient to frustration due to a weak contiguity, so extinction is slower

discriminative stimuli - CRF is very distinguishable from extinction compared to PRF, so extinction is a more effective discriminative stimulus in CRF leading to more effective extinction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a discriminative stimuli and some examples

A

An event/context in the presence of which a response is either reinforced, or not (S+ & S-)

Usually lights, tones etc. but can also be contexts, emotional/physiological states or the passage of time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Generalisation & Discrimination

A

Generalisation - the extent to which a response transfers to similar novel stimuli (generalisation gradient, decreases with extremely different stimuli)

Discrimination - the extent to which a response changes to a different stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is discrimination learning

A

The ability to produce a different response in the presence of different stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the role of motivation in learning, what is an example

A

Can affect the performance of previously learned responses

Frustration as a result of the omission of reinforcement can paradoxically increase a response (e.g. pressing the button more at traffic lights)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is drive reduction & the Premack principle

A

Drive reduction - a form of reinforcement where the strength of a biological drive (e.g hunger, thirst) is reduced, actions that coincide with the reduction of a drive are reinforced (e.g. you seek food when you are hungry, when you eat, both eating and seeking food are reinforced by the reduction of hunger)

Premack principle - more preferable behaviours can be used to reinforce less preferable behaviours (give sufficient freedom to determine which behaviours are most/least preferable)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the difference between habitual vs. goal-directed behaviour

A

Goal directed behaviour - performing a response in order to receive an outcome

Habitual behaviour - stimuli produces a response, and a motivating outcome reinforces the S-R association (the response is insensitive to changes in motivation for the outcome)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Does punishment involve the delivery of aversive stimuli

A

Punishment involves the delivery of aversive stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What factors effects the efficacy of punishment

A

Path dependence - transitioning from a weak to a strong punishent leads to a less effective behaviour change and vice versa

Delay - shorter delay (better temporal contiguity) leads to a more effectvie behaviour change

Punishment schedule - CRF leads to a more effective behaviour change, but PRF will reduce extinction (PREE) when punishment cannot be delivered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Describe omission as a negative punishment

A

There is a negative contingency between a response and an outcome (i.e performing a response prevents a positive outcome)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Explain the relationship between punishment & fear conditioning

A

Classical conditioning can be used to associate a conditioned stimulus with fear

Tone (CS) —-> Shock (US)
| |
Fear (CR) Fear (UR)

E.g. little albert and the white rat, generalised fear onto other stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is negative reinforcement

A

Negative reinforcement (avoidance) - a response developed/learnt due to earlier aversive stimuli

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the two-process theory of avoidance

A
  1. Classical conditioning - Warning signal (CS) paired with shock (US) -> CS elicits fear
  2. Instrumental conditioning - Avoidance behaviour (R) leads to a reduction in fear (Rft)

e.g. fear conditioning of a tone associated with an electric shock will result in the subject initially escaping the shock, but later learning to avoid it once the tone is heard

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are safety signlas, explain their role in avoidance

A

Safety signals - stimuli which signal safety from an aversive outcome (e.g. offset of the warning signal, stimuli associated with escape - doorway)

  • Improve the acquisition of avoidance
  • Maintain avoidance in the absence of overt fear (with avoidance, there is no more aversive stimuli, so the helps maintain the response)
  • Protect warning signals from extinction
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the issue of the treatment of avoidance behaviours

A

In order to treat avoidance, the subject must learn that the CS is safe (extinguish CS-fear association), but the subject avoids the CS!

To overcome this, we must enforce exposure to the CS and prevent the avoidance response from occuring

17
Q

What is the peak shift effect?

A

Displacement of the peak away from S+ in the direction opposite S-

18
Q

What is latent learning?

A

Tolman (rat in maze with/without food experiment)

  • learning occurs without reinforcement or motivation (e.g rats learnt to navigate the maze faster even with no food present)
  • behaviour changes when there is sufficient motivation
  • reinforcement provides an impetus to perform
19
Q

What are covert factors?

A

Cognitive, internalised, not measurable factors

20
Q

What are overt factors?

A

Directly interacting with the environment, measurable and observable factors

21
Q

Explain two ways in which outcome devaluation can occur

A
  1. Devaluate the outcome by pairing it with an aversive outcome (e.g. poisoning food)
  2. Devaluating through saturation of the outcome (free-feeding, long exposure)
22
Q

What are the two variables contributing to behaviour?

A
  1. Consequences
    - reinforcers/punishments
  2. Antecedents
    - triggers of habits
    - structure of environment facilitating/preventing a behaviour
    - triggers of emotional states that motivate behaviour
23
Q

What are the 4 factors contributing to the success of token economies (e.g classroom stickers)

A
  1. Do not interfere with behaviour
  2. Not subject to satiety
  3. Good temporal contiguity
  4. Universal reinforcers - cater to individual tastes (e.g money)
24
Q

Two alternatives to punishment

A
  1. Extinction

2. Differential reinforcement of other behaviours

25
Q

What are the 3 factors of the Self-Determination Theory (Deci & Ryan, 1985)

A
  1. Competency - desire to achieve mastery
  2. Relatedness - the desire to relate, be connected to or care for others
  3. Autonomy - universal urge to be causal agents of one’s own life
26
Q

Explain the 2 impacts of reinforcement on performance and their effects on intrinsic motivation

A
  1. Task dependent reward (regardless of quality) - reduces intrinsic motivation
    - performance independent (engage in task)
    - task-completion dependent (complete task)
  2. Performance dependent reward (quality exceeds some meaningful standard) - increases intrinsic motivation
27
Q

Describe the matching law and how we can use it to extrapolate

A
  • Behaviour lawfully follows the distribution of reinforcement of that particular behaviour
  • States that the relative rate of responses equals (matches) the relative rate of reinforcement for that behaviour
  • The history of distribution of reinforcement can be used to predict the distribution of behaviour
28
Q

What are the 3 reasons delayed rewards lose value?

A
  1. Risk of losing reward
  2. Expected transaction costs
  3. Concave utility effects