Operant Conditioning Pt. 1 Flashcards
Classical vs Operant Conditioning
- In classical conditioning the response occurs at the
end of the stimulus chain - For example:
- Shock → Fear
- Tone : Shock → Fear
- Tone → Fear
- Study of reflexive behaviors
Operant conditioning
- Goal-oriented behavior
- Consequences!
- Organism learns to respond to environment in way that
produces (+) consequences & avoids (-) ones - Instrumental Conditioning – response is instrumental in producing the
consequence
Thorndike’s Law of Effect
- If a stimulus is followed
by satisfaction, the
response is more likely
to occur the next time
the stimulus is
encountered, but an
“unpleasant state of
affairs” leads makes the
response less likely
Skinner
- Didn’t like Thorndike’s emphasis of internal events
(satisfaction, annoying) - Subcategories of behavior
- Respondant – involuntary, reflexive (CC)
- Operant – voluntary, controlled by consequences
- Proposed that voluntary behaviors are controlled
by their consequences (rather than by preceding
stimuli) - Operant conditioning
- The future probability of a behavior is affected by the
consequences of the behavior
Operant Behavior
- The consequences that follow a certain response, affect
the future probability (or strength) of the response - Press Lever → Food
- Press Lever → Shock
- Operant behaviors are emitted by the organism
(voluntary) not elicited by stimuli (CC) - Voluntariness may be illusion
- Operant behavior defined as a class of response
- Exact response could be fast, slow, hard, soft, etc.
- Easier to predict class of behavior than exact response
Operant Conditioning
- Reinforcer
- Anything that increases the likelihood that a
behavior will be repeated - Press Lever (R) → Food (SR
) - Punisher
- Decreases likelihood that a behavior will be repeated
- Press Lever (R) → Shock (SP
) - Reinforcement and punishment refer to
- Process or procedure
- Reinforcers and punishers refer to
- Consequences
Consequences
- The consequences of the behavior can either be
- Appetitive: a consequence that the organism wants
- Aversive: a consequence the organism wants to avoid
- Reinforcer – increases behavior
- Punisher – decreases behavior
- Positive – adding stimulus
- Negative – removing stimulus
Operant Conditioning
- Positive reinforcers (SR+)
- Rewards
- Increase behavior
- Negative reinforcers (SR-)
- Removal of unpleasant
stimuli - Increase behavior
- Escape behavior
- End aversive stimulus
- Avoidance behavior
- Before aversive stimulus
is presented
Punishment
Presentation of an
aversive stimulus
(positive) or removal
of pleasant
(negative) stimulus
* Reduces
frequency of
behavior
* Often confused
with negative
reinforcement
* Positive punishment
* Fines, shock,
spanking
* Negative punishment
* Time out, loss of
privileges
Negative
Reinforcement
DOES NOT =
Punishment
Negative
Reinforcement
DOES NOT =
Punishment
Reinforcement &
Punishment
Positive
Reinforcement
ADD something
GOOD
Negative
Reinforcement
TAKE AWAY
something BAD
Positive
Punishment
ADD something
BAD
Negative
Punishment
TAKE AWAY
something GOOD
Finding a reinforcer
a) Consumable Reinforcers: What do you like to eat or drink?
1. What do you like to eat most?
2. What do you like to drink most
b) Activity Reinforcers: What things do you like to do?
1. What do you like to do in your home or residence?
2. What do you like to do in your yard or courtyard?
3. What activities do you like to do in your neighborhood?
4. What passive activities (e.g., watching TV) do you like to do?
c) Manipulative Reinforcers: What kinds of games do you like?
d) Possessional Reinforcers: What kinds of things or objects do
you like to possess?
e) Social Reinforcers: What social rewards do you like?
1. What types of praise statements do you like to receive?
2. What type of physical contact do you enjoy (e.g., hugging)?
Operant Contingencies
- Behavior modification is often more effective with
positive reinforcement than with punishment - Example
- If attempting to stop a child’s tantrums, it is better
to positively reinforce behavior when the child is
not misbehaving, than to punish the child when she
is misbehaving. The attention he receives during
the punishment might also be rewarding.
Immediate vs. delayed
reinforcement
- The more immediate the reinforcer, the stronger its
effect on behavior - Dickinson, Watt & Griffith (1992)
- Rats were trained to press a lever to obtain food
- Delayed the time between pressing lever and
obtaining food between 2 & 64 seconds - An increase in the delay of food for just
a few seconds resulted in considerably
less responding - Responding almost ceased by 64
seconds - Results initially interpreted as a memory
deficit – i.e., the rats forgot which
response produced the food - Subsequent studies have shown that
rats have excellent memory. The
problem is that the rats could not figure
out which response produced the food - Reinforcement delay allowed rats time
to engage in other behaviors
- Long-delay reinforcements
- Strengthened due to rules or instructions either from ourselves or others
- Learn delayed consequences
- E.g., working on my essay early will probably get me a better
grade than if I write it the night before. You won’t get the
consequence until you get your grade - Example 1
- You are potty training your puppy. When they go to the
bathroom outside. It is best to immediately reinforce
the behavior; delaying the reinforcement enables the
puppy to engage in some other inappropriate behavior,
which could inadvertently be reinforced by the delay. - Example 2
- May explain why people smoke cigarettes. Short-term
reinforcing properties (e.g., reduced anxiety) outweigh
delayed reinforcing properties (e.g., live longer)
Primary reinforcers
- Do not require special training for their properties to be
reinforcing - Naturally appetitive reinforcers are those that are
necessary for the survival of the species (e.g., food,
water, sex) - Effectiveness is influenced by deprivation & satiation
- Researchers in the 1950s accumulated evidence that
not all primary reinforcers were necessary for survival
(and this type of primary reinforcer was NOT influenced
by deprivation & satiation) - Psychological stimulation
- Butler (1954)
- Monkeys placed in enclosed cage with two wooden
doors - Blue door = can see experimental room for 30 seconds
(visual stimulation) - Yellow door = opaque screen (no sensory stimulation)
- Monkeys continually pushed blue door (one subject
responded on every trial for 19 hours straight!!!) - Suggests that sensory stimulation is also a primary
reinforce
Secondary reinforcers
Learned by being associated with some other
reinforce
* Generalized reinforcers are associated with
multiple other reinforcers
* Examples: money, points to redeem for reward,
tickets to redeem for prizes, prestige/status, etc.
- Wolfe (1936)
- Trained 6 chimpanzees to place tokens (poker chips) in a
machine (“chimp-o-mat”) to obtain grapes& bananas, etc. - They had to operate a heavy lever to obtain tokens
- Wolfe found that they would work as hard to obtain tokens
as they would to obtain direct access to grapes - Chimps would also hoard their chips (save them for later)
- Blue token for 2 grapes, and white token for 1 grape –
chimps learned to value blue tokens more - When tested in pairs the dominant ape would push aside
subordinate ape to work lever; dominant ape would also
steal tokens from subordinate ape
- Operant behaviors can also become reinforcing
- Example
- Aid workers who visit foreign countries in times of
crisis (R) are praised in the media for their work
(SR). Over time, the act of helping becomes
reinforcing even in the absence of external praise.
- Strength of Backup Reinforcers
- Variety of Backup Reinforcers
- Simple Conditioned Reinforcer – paired with a single backup
reinforcer - Generalized Conditioned Reinforcer – paired with many
different kinds of backup reinforcers - Strength depends in part on number of different backup
reinforcers available for it. - Schedule of Pairing with Backup Reinforcer
- More effective if does not follow each occurrence
- Extinction of the Conditioned Reinforcer
- Must continue to pair conditioned reinforcer with backup
reinforcer, at least occasionally
- Strength of Backup Reinforcers
- Variety of Backup Reinforcers
- Simple Conditioned Reinforcer – paired with a single backup
reinforcer - Generalized Conditioned Reinforcer – paired with many
different kinds of backup reinforcers - Strength depends in part on number of different backup
reinforcers available for it. - Schedule of Pairing with Backup Reinforcer
- More effective if does not follow each occurrence
- Extinction of the Conditioned Reinforcer
- Must continue to pair conditioned reinforcer with backup
reinforcer, at least occasionally
Intrinsic and extrinsic
reinforcement
- Intrinsic reinforcement
- Reinforcement is provided by the act of performing the
behavior - Example: do quilting because you find is
satisfying/enjoyable - Extrinsic reinforcement
- The reinforcement provided by the external
consequences of the behavior - Example: Child who cleans up his room in order to
receive praise from parents. - Example: Juggle balls 20 times in a row to receive 25
cents.
- Natural reinforcer
- Expected within certain settings
- Contrived (artificial) reinforcer
- Arranged to modify a behavior; not typical consequence
- Natural reinforcers are more efficient
- Contrived reinforcers used until natural ones can take over
Theories of reinforcement
- In the effort to answer the question, “What makes
reinforcers work?”, researchers have developed
some theories. - Drive reduction theory
- The Premack principle
- Response deprivation hypothesis
- Behavioral bliss point approach