Positive Reinforcement, Negative Punishment Flashcards
Delay of Reinforcement
Conditioned reinforcers in contextual environment allows for delays of reinforcement of up to 20 minutes
r(G)-s(G) mechanism
fractional anticipatory goal responses : energizes behaviour, results in conditioned reinforcer presented (sG), sG guides behaviour
Proprioceptive Conditioned Reinforcers
Body recognizes when you move, sensation of turning right paired with food
Everytime rat turns, salivate, enter box, salivated even more
Each stimuli prolong presentation of final reinforcer (food)
Stimulus Generalization gradient
Pigeon pecking at light, reinforced every time they peck at a yellow light
Change wavelength of light a little (i.e orange) pigeon still pecks at it
Change wavelength even more, doesn’t peck anymore
Stimulus Generalization
Keep reinforcing at each wavelength so pigeon pecks so matter what colour the light is
Stimulus Discrimination Training
Only pecks at one colour
Progressive Ratio
of responses needed to get reinforcer increases exponentially across trials
Break Point
Measures incentive value of reinforcer (max number of responses an animal is willing to give to get a reinforcer)
i.e. break point for chocolate = 500 responses, break point for strawberries = 1000 responses
Second Order Schedules
One schedule controls primary reinforcer, second schedule controls conditioned reinforcer
FR- 5 responses-> food
VR- around every response -> light
Animal responds all the time, sometimes for light, sometimes both
2 schedules combined (any type of schedules can be used)
Effectiveness of Partial Schedules
VR schedules have higher rates of of responding than VI
Vi schedules have more stead responding when reinforcement is infrequent
Encourages persistent behaviour
Partial Reinforcement Extinction Effect (PREE)
Example of Paradoxical reaction effect
No food in goal box 1, animal runs faster in alley 2
Animal experiences frustration in alley 1, runs faster in alley 2 to get food in goal box 2. Running and frustration is reinforced
Frustration occurs again when they reach goal box 1 again….repeated reinforcement of both running and frustration
When you take reinforcer away, frustration is still present, and that is enough to get the rat to run
At first, frustration is a reaction, but then becomes a stimulus
Magnitude of Reinforcement Extinction Effect
Behaviour reinforced by large rewards extinguish faster
Over learning extinction effect
Behaviour extensively reinforced can extinguish faster
Maintaining Behaviour
Move from continuous to partial reinforcement
Reinforce in a variety of settings
Fading- when removing reinforcer, do so gradually
Move from primary to conditioned or social reinforcers
Premark Principle
Access to preferred behaviour reinforces less preferred behaviour and punishes more preferred behaviour
Originally there was some motivation for the less preferred behaviour , but higher motivation for preferred behaviour results in initial motivation to be eradicated
i.e. Parent wants kid to do dishes, knows kid like playing outside. Uses playing outside as a reinforcer for doing the dishes (doing the dishes reinforced, playing outside punished)
Undermining Intrinsic Motivation
Punished by Reward phenomenon
Tell child to go play outside (which they probably like they do), and tell them you’ll give them a prize for playing outside… they’ll become less interested in playing outside (only interested in playing outside if they get a prize)