W8 Learning 2: Operant conditioning Flashcards

Question 1

Q

Action outcome framework

Answer

A

Classical and operant conditioning are experimental paradigms that have lead to highly influential frame-work for associative learning.

Question 2

Q

Classical conditioning (pavlov)

Answer

A

stimulus-response-associations. Involves the pairing of two stimuli. Conditioning stimulus (CS) and Unconditioned Stimulus (US), US is associated with a hardwired response. Response becomes associated with the CS-through conditioning. CS and US can be temporally segregated or overlapping.

Question 3

Q

Operant (instrumental) Conditioning

Answer

A

US is contingent on behaviour of animal (e.g., only occurs when a lever has been pressed), need for action. Action-outcome association (action will determine the outcomes)

It goes beyond hard-wired unconditioned responses and incorporates more complex behaviour.

Question 4

Q

Learning of action-outcome associations

Answer

A

‘Response’ (in operant conditioning): pressing a lever, opening a door, pushing a button etc.
Operant behaviour: under stimulus control, so that the action can be a response to a certain stimulus/situation

The outcome can be a ‘reinforcement’ or a ‘punishment’
Action => Outcome

Question 5

Q

Law of Effect

Answer

A

“… responses that create a typically pleasant outcome in a particular situation are more likely to occur again in a similar situation, whereas responses that produce a typically unpleasant outcome are less likely to occur again in the situation” (Thorndike, 1911)

Action is driven by reward (pleasant outcome).

Question 6

Q

Skinner Box (Operant chamber)

Answer

A

Allows for variety of operant conditioning paradigms.
Lights – Speakers – stimulus : generate action
Lever for responses
Food dispenser – appetitive stimuli/rewards: outcomes (reward)
Electrified grid – aversive stimuli/punishment: outcomes (punishment)
Used with rodents – very good at responding to these paradigms.

Question 7

Q

Skinner’s Terminology: Reinforcer

Answer

A

an event that increases the likelihood of the action

Question 8

Q

Skinner’s Terminology: Punishment

Answer

A

an event that decreases the likelihood of the action. (prevent you to do something again.)

Question 9

Q

Skinner’s terminology: Positive

Answer

A

Something has been introduced

Question 10

Q

Skinner’s Terminology: Negative

Answer

A

Something has been removed.

Question 11

Q

Punishment

Answer

A

Decreases Behaviour
Less beneficial than Reinforcement
Temporary changes in behaviour – based on coercion
Creates negative/adversarial relationship
When the person who provide punishment leaves – unwanted behaviour returns

Question 12

Q

Reinforcement

Answer

A

Increases Behaviour
More beneficial than punishment
More likely to result in long-term changes in behaviour
Creates positive relationship with the person providing reinforcement

Question 13

Q

Classical condition: partial reinforcement:

Answer

A

Classical condition: partial reinforcement: intersperse trials in which the CS is not followed by the US. This is done randomly so that the CS is followed by the US with a certain probability (here 75%). Slows down both acquisition and extinction learning.

Question 14

Q

Partial reinforcement: reinforcement schedules

Answer

A

responses are sometimes reinforced and sometimes not.
Slower initial learning: but greater resistance to extinction
As reinforcement does not appear after every behaviour, it takes longer for learner to determine a lack of reward. Extinction is slower.

Question 15

Q

Fixed ration

Answer

A

behaviour is reinforced after a specific number of responses. (e.g. giving a child a sweet after reading 5 pages of a book.)

Question 16

Q

Variable ratio

Answer

Study These Flashcards

A

behaviour is reinforced after an average, but unpredictable number of responses. (e.g. Payoffs from slot machines and other games of
chance)

Question 17

Q

Variable interval

Answer

Study These Flashcards

A

behaviour is reinforced for the first response after an average but unpredictable, amount of time has passed (e.g. periodically checking email)

Question 18

Q

Fixed interval

Answer

Study These Flashcards

A

behaviour is reinforced for the first response after a specific amount of time has passed (e.g. receiving a monthly salary for work)

Question 19

Q

Fixed Interval (FI) Schedule

Answer

Study These Flashcards

A

First response after a designated amount of time is followed by reinforcement. FI 60s – every 60 seconds.
Produce characteristic pattern of responding observable across species.
PRP followed by slow rates of responding and high rates of responding toward the end of the interval. “Scallop” (monthly salery)

Question 20

Q

Variable Ratio (VR) Schedule

Answer

Study These Flashcards

A

Responding reinforced after a randomly determined number of responses have been emitted. (VR). VR15 – average number of responses for reinforcement – but could be anywhere between 1 – 29. Rate of responding for VR is typically faster than FR, no PRP in VR schedule. Response rates relatively constant over time. (quicker than fixed ration)

Question 21

Q

Fixed ratio (FR) schedule

Answer

Study These Flashcards

A

The number of responses required for reinforcement is describes the schedule. Continuous reinforcement is technically FR1. Probability of reinforcement increases with successive responses. Brief pause in responses (Post-reinforcement Pause – PRP) after each reinforcement before responses begin again. ‘Stair-step’ pattern

Question 22

Q

Variable Interval (VI) Schedule

Answer

Study These Flashcards

A

Responding reinforced after a randomly determined amount of time. VI 60s – average of 60 seconds between reinforcements – but individual intervals will differ from one another.
Relatively constant, no PRP (except at unusually low rates of reinforcement). Most commonly used schedule in operant research – produces steady predictable performance

Question 23

Q

Shaping

Answer

Study These Flashcards

A

process of guiding behaviour to the desired outcome through the use of intermediate stages.

W8 Learning 2: Operant conditioning Flashcards

(23 cards)