Topic 4: Reinforcement Flashcards
Reinforcement
the process in which the occurrence of a behavior is followed by a consequence that results in an increase in the future probability of the behavior
Consequence
the stimulus or event occurring immediately after a behavior
Operant Behavior
behavior that acts on the environment to produce an immediate consequence and, in turn, is strengthened by that consequence
Reinforcer
a stimulus or event that increases the future probability of a behavior when it occurs contingent on the occurrence of the behavior
Positive Reinforcement
a type of reinforcement in which, contingent on a behavior, a stimulus or event is presented and the probability of the behavior increases in the future
Negative Reinforcement
a type of reinforcement in which the occurrence of the behavior is followed by the removal or avoidance of an aversive stimulus
it results in an increase in the future probability of the behavior
Stimulus
an environmental event that can be detected by one of the senses
Positive Reinforcer
a stimulus that will increase the future probability of a behavior when the stimulus is delivered contingent on the occurrence of the behavior
Aversive Stimulus
a stimulus that will decrease the future probability of a behavior when the stimulus is delivered contingent on the occurrence of the behavior, also called a punisher
Premack Principle
one type of positive reinforcement in which the opportunity to engage in a high-probability behavior is made contingent on the occurrence of a low-probability behavior to increase the low-probability behavior
Escape Behavior
behavior that results in the termination of an aversive stimulus
the termination of aversive stimulus negatively reinforces the behavior
Avoidance Behavior
a behavior that prevents an aversive event
the behavior is negatively reinforced by the avoidance of the aversive event
Unconditioned Reinforcers
a stimulus that is naturally reinforcing because the capacity for our behavior to be strengthened by the stimulus has survival value
no prior conditioning is necessary for an unconditioned reinforcer to be a reinforcer
examples include food, water, escape from extreme stimulation, and sexual contact
Conditioned Reinforcers
a previously neutral stimulus that has been paired a number of times with an established reinforcers and consequently functions as a reinforcer itself
Token
a conditioned reinforcer used in a token economy
the token is something that can be given to another person and accumulated by the person
the token is a conditioned reinforcer because it is given to the person after a desirable behavior and is exchanged for established reinforcers called backup reinforcers
Backup Reinforcers
reinforcers used in a token economy
a client receives tokens for desirable behaviors and exchanges a specified number of token for any of a variety of backup reinforcers
Generalized Conditioned Reinforcer
a conditioned reinforcer that has been paired with a variety of other reinforcers
money and praise generalized are conditioned reinforcers for many people
Response
a response is one instance or occurrence of a particular behavior
Contingency
a relationship between a response and a consequence in which the consequence is presented if and only of the response occurs
when such a relationship exists, the consequence is said to be contingent on the response
Motivating Operations (MO)
an antecedent stimulus or event that alters the value of a reinforcer and alters the probability of the behavior that produces that reinforcer
establishing operations and abolishing operations are two types of MOs
Establishing Operation (EO)
one type of motivation operation, an event that increases the potency of a particular reinforcer at a particular time and evokes the behavior that produces that reinforcer
deprivation is a type of establishing operation
Abolishing Operation (AO)
one type of motivating operation, an event that decreases the potency of a particular reinforcer at a particular time and makes the behavior that produces that reinforcer less likely to occur
satiation is a type of abolishing operation
Deprivation
a condition in which the person has gone without a particular reinforcer for a period of time
deprivation is a type of establishing operation, it makes the reinforcer the person has gone without more potent
Satiation
progressive (and ultimately total) loss of effectiveness of a reinforcer
satiation occurs when you have recently consumed a large amount of a particular reinforcer or when you have had substantial exposure to a reinforcing stimulus
Schedule of Reinforcement
specifies which responses will be followed by delivery of the reinforcer
Continuous Reinforcement (CRF) Schedule
a schedule of reinforcement in which each instance of the behavior is followed by the reinforcer
Intermittent Reinforcement Schedule
a schedule of reinforcement in which not every instance of the behavior is followed by the delivery of the reinforcer
includes fixed ratio, fixed interval, variable ratio, and variable interval schedules
Acquisition
the development of a new behavior through reinforcement
Maintenance
continuation of the behavior change for a long period after the termination of a behavior modification program
also continuation of an operant behavior with intermittent reinforcement
Fixed Ratio (FR) Schedule
a schedule of reinforcement in which a specific number of responses must occur before the reinforcer is delivered
the number of responses needed for reinforcement does not change
Variable Ratio (VR) Schedule
a schedule of reinforcement in which a specified number of responses is needed for the delivery of the reinforcer
the number of responses needed variables around an average number
Fixed Interval (FI) Schedule
a schedule of reinforcement in which the reinforcer is delivered for the first response that occurs after an interval of time has elapsed
the interval is the same each time
Variable Interval (VI) Schedule
a schedule of reinforcement in which the first response that occurs after a specified time interval is reinforced
the time interval varies around an average value
Concurrent Schedules of Reinforcement
schedules of reinforcement that exist at the same time for two or more different behaviors (which are called concurrent opertants)
which particular behavior at a particular time depends on the relative schedule of reinforcement, magnitude of reinforcement, delay of reinforcement, and response effect for the available behaviors
Concurrent Opertants
two or more different behaviors or response options are concurrently available for the person
each response option is associated with a specific reinforcement schedule
Response Effect
the amount of force, exertion, or time involved in executing a response
with an increase in response effect for one behavior, the probability of that behavior decreases relative to the probability of a functionally equivalent alternative behavior
What was Thorndike’s reinforcement experiment?
placed hungry cat into an escapable “puzzle box”, with a plate of fish outside the box
cat could eventually open the box, using trial and error
behaviors became quicker over time
law of effect
What is law of effect?
behavior followed by pleasant consequences is more likely to occur again in that situation
What is the definition of operant?
functioning or tending to produce effects: effective; of or relating to the observable or measurable
that is, a behavior that operates on the environment
What is operant (or instrumental) conditioning?
entails manipulating consequences of behavior
the consequences may increase or decrease a behavior can itself be a stimulus or event that leads to further behavior (e.g. eat salty snack –> get thirsty –> drink sugary pop)
consequences occur immediately after a behavior
What is reinforcement?
the process in which the consequence of a behavior strengthens the behavior
behavior is more likely to occur in the future (frequency), or occurs more quickly (latency)
What is a reinforcer?
a stimulus, object, or event that strengthens a behavior, often is an appetitive stimulus (characterized by a natural desire to satisfy bodily needs)
e.g. after a dog follows your command, you give it a treat: reinforcement is giving the treat to increase the behavior, the reinforcement is the treat itself
What is positive reinforcement?
a situation in which a behavior is followed by the presentation of an appetitive (pleasant) stimulus that increases the behavior
e.g. I tell a joke –> you laugh; this makes me more likely to tell more jokes in the future
What is negative reinforcement?
a situation in which a behavior is followed by the removal of an aversive (unpleasant) stimulus that increases behavior
e.g. putting up an umbrella –> stops cold rain falling on you; this makes you more likely to use an umbrella in the future when it’s raining
What is escape behavior?
causes removal of existing aversive stimulus
e.g. when you feel cold, you put on a sweater
What is avoidance behavior?
prevents presentation of aversive stimulus
e.g. before you go outside, you put on a sweater
What is natural reinforcement?
occurs spontaneously as part of everyday life
e.g. your friend laughs when you tell a joke
What is programmed reinforcement?
planned and systematic; given as part of a behavioral treatment
e.g. giving yourself rewards as part of a self-management program
What is social reinforcement?
involves another person to deliver reinforcing consequences
e.g. teacher praises a student for completing her homework
What is automatic reinforcement?
the individual gets reinforcing consequences directly from the environment
e.g. scratching an itch makes it go away
What is tangible (or material) reinforcement?
access to a preferred object (includes consumable reinforcement)
e.g. getting toys, stickers, or snacks after good behavior
What is activity reinforcement?
engaging in a preferred behavior after doing a non-preferred behavior
e.g. the pomodoro technique
What is the Premack principle?
a high-probability behavior can serve as positive reinforcement for performing a low-probability behavior, thus increasing it
e.g. after studying for an hour, you play video games for an hour; making it more likely you’ll study again in the future
What is an unconditioned (or primary) reinforcer?
stimulus or event that has natural reinforcing effects (i.e. not due to prior conditioning or learning), may enhance survival
e.g. food, water, absence of pain
What is a conditioned (or secondary) reinforcer?
previously neutral stimulus that has become associated with an unconditioned reinforcer
e.g. money is a generalized condition reinforcer which can be used to obtain almost any other primary reinforcer
e.g. in token reinforcement, tokens can be exchanged for backup reinforcers (like money, food, or TV time)
e.g. animal clicker training: animals are reinforced with food which is paired with a click sound; eventually the sound becomes reinforcing on its own
What is reward value?
quantity and quality of the reinforcer, and its value to the individual
What are motivating operations (MOs)?
antecedent events that can (temporarily) alter the effectiveness of reinforcement, and thus affect behavior; also called setting events
What are some types of setting events or MOs?
social: e.g. presence of an attractive person
physiological: e.g. having a headache
environmental: e.g. quiet library for studying
What is an establishing operation (EO)?
establishes/increases the effectiveness of reinforcement
e.g. caloric deprivation is EO for food
What is an abolishing operation (AO)?
decreases the effectiveness of reinforcement
e.g. satiation (fullness) is AO for food
What is the timing of reinforcement?
reinforcement should occur soon after the behavior
What is the contingency of reinforcement?
consequences should consistently follow the behavior
What is a continuous schedule of reinforcement?
reinforcement given for each response
leads to rapid acquisition (performing a new behavior)
e.g. putting money into a vending machine –> getting candy every time
What is an intermittent schedule of reinforcement?
only some response are reinforced
acquisition phase is longer
asking random strangers out on a date –> someone accepts only occasionally
What is a fixed ratio schedule of reinforcement?
reinforcer given after a set number of responses
FR 10: every 10 bar presses –> 1 food pellet
high response rate; brief post-reinforcement phase
e.g. salesperson gets a bonus every time they sell 10 cars
What is a variable ratio schedule of reinforcement?
reinforcer given after a random number of responses (number deviates around a mean)
VR 20: on average, every 20 bar presses –> 1 food pellet
high response rates
e.g. slot machine
What is a fixed interval schedule of reinforcement?
reinforcer given when response occurs after a certain length of time
FI 5”: first responses after every 5 seconds –> 1 food pellet
responses increase as reinforcement time nears
e.g. checking mailbox behavior increases as typical delivery time approaches
What is a variable interval schedule of reinforcement?
reinforcer given when response occurs after a variable length of time (length deviates around a mean)
VI 30”: first response after an average of 30 seconds since the last reinforcement –> 1 food pellet
slow, steady responding
e.g. paddling out to surf