Unit 3: Instrumental Conditioning Flashcards
What is instrumental / operant conditioning
Learning a contingency between a behaviour and a consequence
What does operant mean
You operate on the environment and directly produce the effect through your voluntary behaviour
What is the Law of Effect
Behaviours followed by a positive consequence are stamped in = increase in frequency
Behaviours followed by a negative consequence are stamped out = decrease in frequency
What is a reinforcer
Stimulus presented after a response that changes the frequency of that response
What is a primary reinforcer
A reinforcer with intrinsic (not taught) value
ex. food, water
What is a secondary reinforcer
A reinforcer with learned value
ex. money, a toy
Two training types that increase behaviour
Reward (present positive reinforcer)
Escape (remove negative reinforcer)
Two training types that decrease behaviour
Punishment (present negative reinforcer)
Omission (remove positive reinforcer)
When to present consequence
Immediately following behaviour
What is successive approximation and when to use this method
Learning gradually, with reinforcement for each behaviour that comes closer and closer to the desired behaviour
- For complex behaviours (ex. animal trick training)
What is chaining and when to use this method
Each behaviour is reinforced with the opportunity to perform the next behaviour
- For a very complex sequence of behaviours (ex. a routine of tricks, learning the alphabet)
What is the difference between shaping through successive approximation vs. chaining
With shaping: rewarded for improvement
With chaining: rewarded for correct order
What is an SD / S+
Remember: SD = signals ___
It is a discriminative stimulus that signals the contingent relationship IS valid
ex. you will be rewarded for eating veggies only when mom (SD) is here or only at grandma’s house (SD)
What are the 4 schedules for reinforcement
Fixed ratio
Fixed interval
Variable ratio
Variable interval
What are the graphs for the 4 schedules of reinforcement
Fixed ratio:
Steps -> due to post-reinforcement pause
(think rAE sho /***)
Fixed interval:
Scalloped
(think V = uuuuuu the shape of a scallop)
The other two (variable ratio and variable interval) are straight lines
Slope = response rate