Reinforcement in Contingency-Based Learning Flashcards
Thorndike’s Law of Effect states
Behavior -> Satisfying Event = Stamped In
Behavior -> Annoying Event = Stamped Out
Skinner introduced what type of learning?
Consequence-based learning
Skinnerian Reinforcement is defined as
An increase in target behavior responding due to a contingent relationship between target behavior and a consequence
Skinnerian Punishment is defined as
A decrease in target behavior responding due to a contingent relationship between the target behavior and a consequence
Primary reinforcers are…?
Natural or unconditional reinforcers such as food, water, sex, shelter.
Secondary reinforcers are…?
Conditioned reinforcers such as money, grades, tokens, awards, praise.
Parts of Operant Conditioning Paradigm
Antecedent
Behavior
Consequence
Operant conditioning terminology: positive
Add a stimulus
Operant conditioning terminology: negative
Remove a stimulus
Operant conditioning terminology: reinforce
Increase a behavior
Operant conditioning terminology: punish
Decrease a behavior
A positive punishment would involve…
Adding an aversive to decrease behavior
A positive reinforcement would involve…
Adding an appetitive to increase behavior
A negative punishment would involve…
Removing an appetitive to decrease behavior
A negative reinforcement would involve…
Removing an aversive to increase behavior
Two components to “shaping” a behavior:
- Reinforce successive approximations to required response
2. Non-reinforce earlier response forms
“Chaining” involves…
a discriminitive stimulus that leads to a behavior as well as a reinforcing quality of the next event in the chain
Four variables affecting reinforcement:
Contingency
Contiguity
Magnitude
Deprivation
_ is a good determiner of the strength of conditioning or the effectiveness of association
Extinction
_ is an extreme, spontaneous recovery of a “forgotten” association
Extinction burst
Hullian / Drive Reduction Theory
Special stimuli that satisfy or reduce a specific drive state act as reinforcers. Sandwiches satisfy hunger state, and so are reinforcing.
Problem with Hullian / Drive Reduction Theory:
No explaination for secondary reinforcers
Premack’s Relative Value Theory
Behaviors are reinforcing, in contrast to the stimuli. Eating is satisfying, sandwiches only satisfy because they can be eaten.
Premack’s Principle / Differential Probability Theory states
High probability behaviors can serve to reinforce Low probability behavior
Response Deprivation Theory states
Any behavior deprived below its free-running baseline can serve as a reinforcer
Problem with Premack’s Relative Value Theory:
No explanation secondary reinforcers
High or low probability behavior is relative to the individual
B.F. Skinner was inspired by Thorndike’s _ and Watson’s _.
The Law of Effect
Behaviorism
Four theories of positive reinforcement include:
Hullian / Drive Reduction Theory;
Premack’s Relative Value Theory;
Premack’s Differential Probability Theory;
Response Deprivation Theory
Theories of negative reinforcement:
Avoidance & Escape Theory;
Four types of simple contingent reinforcement schedules:
Ratio
Interval
Duration
Differential
Tween simple fixed ratio and simple variable ratio, which creates stronger associations?
Variable ratio reinforcement schedule
Phenomena associated with fixed ratio reinforcement schedules?
Post reinforcement pause
Three explanations for post reinforcement pause:
Fatigue
Satiation
Feedback loop
_ involves teaching an animal to associate an operant reinforcer with a secondary reinforcer.
“Bridging”
Differential schedules involves…
providing reinforcement only after a target behavior has been limited or increased to a minimum point
Differential reinforcement low aims to…
reduce behavior
Differential reinforcement high aims to…
increase behavior
Which simple reinforcement schedule results in scalloped response/time graph?
Fixed interval
Humphrey’s Paradox
Partial reinforcement, found in variable ratio schedules, thin, or stretched schedules, have stronger associative strength than fixed ratio. Extinction takes longer.
Theories explaing Humphrey’s Paradox
Discrimination Theory
Frustration Theory
Sequential Theory
Response Unit Theory
Discrimination Theory states
Partial reinforcement works better bc it’s more difficult to distinguish between extinction and interval ratio scheduling.
Frustration Theory states
Partial reinforcement works better bc removing frustration of a failed previously reinforced behavior reinforcers that behavior.
Sequential Theory states
SSS