b mod final Flashcards
unconditioned/primary reinforcer
consequence that functions as a reinforcer without prior learning (e.g., food, water, sex)
infant test
a reinforcer is an unconditioned reinforcer if it functions as a reinforcer for a newborn infant (exceptions: things that don’t become natural reinforcers until puberty, like sex)
conditioned reinforcer
consequence that acquires the capacity to function as a reinforcer through learning (association with unconditioned/primary reinforcer or other conditioned reinforcer)
backup reinforcer
reinforcer that gives the originally neutral stimulus its ability to reinforce behaviour
token economy
system in which tokens that can be accumulated are given as reinforcers for behaviour, then exchanged for backup reinforcers at a later point in time (often used with groups)
advantages of conditioned reinforcers, such as praise (2)
(1) can be delivered more immediately than a backup reinforcer and (2) can be used to bridge the delay between a behaviour and a backup reinforcer
conditioned punisher
consequence that acquires the ability to punish behaviour through association with (predictive of) backup punishers (e.g., demerit, threat)
unconditioned punisher
consequence that functions as a punisher without prior learning (e.g., pain, electric shock)
factors influencing effectiveness of conditioned reinforcement (4)
(1) powerful backup reinforcers produce powerful conditioned reinforcers, (2) variety (the more backup reinforcers the better), (3) schedule of pairing with backup (more frequent is better) and (4) respondent extinction
simple conditioned reinforcer
associated with a single backup reinforcer
generalized conditioned reinforcer
associated with many backup reinforcers
pitfalls of conditioned reinforcers (2)
(1) unknowingly misapply principle of conditioned reinforcement and (2) cease pairing a conditioned reinforcer with the backup reinforcer
liking for licks
spanking becomes a conditioned reinforcer because it is predictive of a treat/toy provided by a parent that feels guilty about spanking (no longer a punishment)
unknowingly misapplying principle of conditioned reinforcement
pairing backup reinforcer with stimuli meant to be punishing; a consequence meant by the parent to be a punisher may actually come to function as a reinforcer for the undesirable behaviour because of the attention (conditioned reinforcement) it brings
extinction
non-reinforcement of a previously reinforced behaviour leads to a decrease in the likelihood of that behaviour occurring in that context; gradual and can be preceded by an initial increase in the behaviour frequency
factors that influence extinction (8)
(1) control the occurrence of reinforcers for the behaviour, (2) combine extinction with reinforcement of an alternative behaviour, (3) control the setting in which extinction is carried out, (4) use instruction, (5) Humphrey’s Paradox, (6) extinction bursts, (7) extinction elicits aggression and (8) spontaneous recovery
controlling reinforcement during extinction (4)
(1) make sure behaviour is not inadvertently reinforced by others or the environment, (2) make sure you control the correct reinforcer, (3) understand that sensory/automatic reinforcement is difficult to control and (4) be prepared for criticism
Humphrey’s Paradox
extinction occurs more quickly for a behaviour that is reinforced every time it occurs than for a behaviour that is intermittently reinforced, because it is easier to discriminate that the contingency of reinforcement has changed
resistance to extinction = ____ in extinction = ?
persistence; index of the strength of a behaviour
extinction bursts
behaviour gets worse before it gets better; when extinction procedure is initiated, behaviour increases in frequency/intensity before it gradually declines; DO NOT give up
extinction elicits aggression
placing a behaviour on extinction can produce an emotional reaction that can take the form of aggression directed toward other people or things in the context; less likely when extinction is combined with reinforcement of an alternative behaviour
spontaneous recovery
after a delay, a behaviour that has been extinguished can recover spontaneously (at a lower level than originally)
remedy for spontaneous recovery
conduct further extinction sessions, because recovery will be less following each session
extinction: positively reinforced behaviour
extinction involves withholding a reinforcer that was previously delivered contingent upon a behaviour
extinction: negatively reinforced behaviour
extinction involves preventing avoidance or escape from aversive stimulus (aversive stimulus is no longer removed by behaviour)
common misconception:
extinction = ____ the behaviour (ONLY true if reinforcer is attention)
ignoring
pitfalls of extinction (2)
(1) distribute attention unwisely (only to problem behaviours) and (2) apply extinction unknowingly to the behaviour of others in your context
shaping
development of a new behaviour through reinforcement of successive approximations to a final desired behaviour
shaping combines ___ and ____
reinforcement; extinction
what does extinction do in shaping?
induces variability in a response that was previously reinforced; variability increases likelihood that the behaviour will meet the new criterion
factors influencing the effectiveness of shaping (4)
(1) precisely specifying the terminal behaviour, (2) choosing the starting behaviour, (3) choosing the shaping steps and (4) moving along at the correct pace
requirements for starting behaviour in shaping (2)
(1) occurs frequently enough to be reinforced within the session time and (2) approximates the final behaviour
pitfalls of shaping (2)
(1) can inadvertently shape a harmful behaviour and (2) failing to shape a desirable behaviour
intermittent reinforcement
not every response produces reinforcement
continuous reinforcement
every response produces reinforcement
intermittent reinforcement is used to ____ behaviour
maintain
advantages of intermittent schedules of reinforcement (4)
(1) reinforcer is effective for longer due to slower satiation (especially consumables), (2) behaviour is more resistant to extinction, (3) individuals work more consistently for reinforcement and (4) behaviour will be more readily transferred to control by natural reinforcers in the environment
fixed ratio (FR) schedule
reinforcement occurs after set number of responses
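A minimal sketch of the fixed ratio rule, assuming responses are simply numbered 1, 2, 3, ... in the order they occur. The function name `fixed_ratio_reinforced` is hypothetical and used only for illustration.

```python
def fixed_ratio_reinforced(total_responses: int, ratio: int) -> list[int]:
    """Return the 1-based response numbers that earn reinforcement on an FR schedule."""
    if ratio < 1:
        raise ValueError("ratio must be at least 1")
    # Every `ratio`-th response is reinforced; FR 1 is continuous reinforcement.
    return list(range(ratio, total_responses + 1, ratio))


print(fixed_ratio_reinforced(12, 5))  # [5, 10]
```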
advantages of fixed ratio (FR) (2)
(1) high resistance to extinction and (2) produces high steady rate of responding (after PRP)
post-reinforcement pause (PRP)
no responses immediately after reinforcement; the more responses required to obtain reinforcement, the longer the PRP will be
which intermittent reinforcement schedules have a PRP and why?
fixed ratio and fixed interval; because they are predictable
ratio strain
increasing the number of responses required for reinforcement too quickly causes deterioration in responses (too rapid of a ratio change)
avoiding ratio strain
gradually increase the number of responses required to obtain reinforcement
variable ratio (VR) schedule
reinforcement after varied number of responses (average out to a fixed number)
advantages of VR (2)
(1) high steady rate of response and (2) no PRP
VR maintains behaviour at ____ ratio values than FR
higher
VR has ____ resistance to extinction than FR
greater
fixed interval (FI) schedule
reinforcement produced by the first response after a fixed interval of time (response before interval has elapsed has no effect)
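A minimal sketch of the fixed interval rule, assuming response times are given in seconds from the start of the session and that each new interval is timed from the last reinforced response (one common convention, not stated on the card). The function name is hypothetical.

```python
def fixed_interval_reinforced(response_times: list[float], interval: float) -> list[float]:
    """Return the response times that earn reinforcement on an FI schedule."""
    reinforced = []
    interval_start = 0.0  # assumption: the first interval starts with the session
    for t in sorted(response_times):
        if t - interval_start >= interval:
            # First response after the interval has elapsed is reinforced.
            reinforced.append(t)
            interval_start = t  # assumption: next interval is timed from this reinforcer
        # Responses before the interval has elapsed have no effect.
    return reinforced


print(fixed_interval_reinforced([10, 50, 65, 70, 130], 60))  # [65, 130]
```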
PRP in fixed interval (FI) schedules varies directly with _____ of the ____
duration; interval
variable interval (VI) schedule
reinforcement produced by the first response after an interval of varying length (averages out)
in FI schedules, responses increase as the ___ of the interval nears
end
advantages of VI (2)
(1) moderately steady rate of response and (2) no PRP
in practice, ____ schedules are less common than ___ schedules
because?
interval; ratio
because:
- FI has long PRP
- VI has lower response rate than VR
what schedule type is a proposed model for procrastination?
fixed interval (FI)
limited hold
interval when reinforcement is available if response is made; if response does not occur during the limited hold interval, reinforcement is lost
limited hold schedules are most like ___ schedules
ratio
abstract example of an interval limited hold schedule
FI 2min/ LH 10sec
_____FI_____ { LH } ______FI______ { LH }
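A minimal sketch of the FI/LH pattern drawn above, assuming the cycles run back to back (FI, then LH, then the next FI) regardless of whether a response occurred, and that at most one reinforcer is available per hold window. Times are in seconds; the function name is hypothetical.

```python
def fi_lh_reinforced(response_times: list[float], fi: float, lh: float) -> list[float]:
    """Return the response times reinforced on an FI/LH schedule with back-to-back cycles."""
    cycle = fi + lh
    reinforced = []
    paid_cycles = set()  # cycles in which the reinforcer has already been collected
    for t in sorted(response_times):
        cycle_index = int(t // cycle)
        phase = t - cycle_index * cycle  # time elapsed within the current cycle
        # A response pays off only inside the hold window, once per cycle.
        if phase >= fi and cycle_index not in paid_cycles:
            reinforced.append(t)
            paid_cycles.add(cycle_index)
    return reinforced


# FI 2 min / LH 10 sec: windows are 120-130 s, 250-260 s, ...
print(fi_lh_reinforced([50, 121, 125, 135, 255], fi=120, lh=10))  # [121, 255]
```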
with a ___ FI, an FI/LH produces responses similar to a FR
small/short
limited hold schedules are used to produce ___-like responding with ____ schedules
ratio; interval
real example of a limited hold schedule
bus
are LH schedules more or less common than basic interval schedules in the real-world?
more
ratio schedule with limited hold
setting a deadline for required number of responses
what does an FR(30)/ LH(2min) schedule mean?
reinforcement requires 30 responses in 2 minutes
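A minimal sketch of the ratio-with-deadline rule, assuming the limited hold is timed from 0 seconds and that a response exactly at the deadline still counts. The function name is hypothetical.

```python
def fr_lh_met(response_times: list[float], required: int, hold: float) -> bool:
    """Return True if at least `required` responses occur within the hold period."""
    in_time = [t for t in response_times if 0 <= t <= hold]
    return len(in_time) >= required


# FR(30)/LH(2 min): one response every 4 s finishes in time, one every 5 s does not.
print(fr_lh_met([4 * n for n in range(1, 31)], required=30, hold=120))  # True
print(fr_lh_met([5 * n for n in range(1, 31)], required=30, hold=120))  # False
```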
duration schedule
must produce behaviour for the full duration (fixed or variable) for reinforcement
real example of duration schedule
hourly wage
fixed duration (FD) produces a ____ but variable duration (VD) does not
PRP
when are duration schedules used?
when behaviour can be monitored continuously and reinforcement is based on duration
concurrent schedule
more than one reinforcement schedule is used at the same time and the individual can respond on any schedule to get reinforcement
matching law (allocation of behaviour)
distribution of responses in concurrent schedules tends to match rate of pay off
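A minimal sketch of the strict form of the matching law, in which the proportion of responses on each schedule equals that schedule's proportion of the total reinforcement. Real behaviour often deviates from strict matching; the function name and the example rates are hypothetical.

```python
def matching_allocation(reinforcement_rates: list[float]) -> list[float]:
    """Return the predicted share of responses for each concurrent schedule."""
    total = sum(reinforcement_rates)
    if total <= 0:
        raise ValueError("at least one schedule must deliver reinforcement")
    # Strict matching: response share equals reinforcement share.
    return [rate / total for rate in reinforcement_rates]


# Two schedules paying 30 and 10 reinforcers per hour.
print(matching_allocation([30, 10]))  # [0.75, 0.25]
```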
components of a situation in which a behaviour occurs (3)
ABC’s : (1) Antecedent stimuli, (2) the Behaviour itself and (3) the Consequences of the behaviour
ABC assessment
identifying the antecedents and consequences of a behaviour
stimulus control
the degree of correlation between the occurrence of a particular antecedent stimulus and the occurrence of a subsequent response
discriminative stimulus: S^D
a stimulus in the presence of which a response will be reinforced; cue that a particular response will pay off (availability of reinforcement)
discriminative stimulus: S^delta
a stimulus in the presence of which a response will not be reinforced; cue that a particular response will NOT pay off (non-availability of reinforcement)
stimulus discrimination training
the procedure of reinforcing a response in the presence of an S^D and extinguishing that response in the presence of an S^delta
effects of stimulus discrimination training (2)
(1) good stimulus control (strong correlation between a particular stimulus and response) and (2) stimulus discrimination
stimulus generalization
the procedure of reinforcing a response in the presence of a stimulus or situation and the effect of the response becoming more probable in the presence of another stimulus or situation
reasons for stimulus generalization (3)
(1) unlearned (strong physical similarity), (2) learned (limited physical similarity) and (3) learned (no physical similarity)
conceptual behaviour
emitting appropriate responses to all members of a common-element stimulus class and not to stimuli that do not belong to the class (e.g., all red things)
stimulus equivalence class
a set of completely dissimilar stimuli that an individual has learned to group or match together or respond to in the same way
factors influencing the effectiveness of stimulus discrimination training (4)
(1) choosing distinct signals, (2) minimizing the opportunities for error, (3) maximizing the number of trials and (4) using rules: describe the contingencies
contingency
an if-then type of arrangement
rule-governed behaviour
controlled by the statement of a rule
behaviour chaining
sequence of discriminative stimuli and responses in which each response produces the S^D for the next response, with the last response producing a terminal reinforcer (S^r+)
link
each S^D->R is a link in the chain; if an R fails to produce a S^D then the chain fails at that point (weak link)
methods of behavioural chaining (3)
(1) total task presentation, (2) backward chaining and (3) forward chaining
total task presentation
learner attempts to do all the behaviours in the sequence and continues until all steps in the chain are mastered
when is total task presentation used?
when sequence is fairly short and simple with discrete tasks (then it’s the BEST choice)
advantages of total task presentation (3)
(1) teacher spends less time in partial assembly, (2) can produce results quicker and (3) maximizes learner independence (particularly if steps are already familiar)
backward chaining
systematically construct the chain in reverse order starting with the S^D and R that produce the terminal reinforcement
advantage of backward chaining
always strengthening the S^Ds as conditioned reinforcers by associating them with the terminal reinforcement
forward chaining
teach initial link first using the terminal reinforcement, then train the initial and second link followed by the terminal reinforcement… and so on
what is the most common chaining method in the natural environment?
forward chaining and then total task presentation
factors influencing the effectiveness of chaining (6)
(1) task analysis, (2) encourage use of prompts by learner, (3) conduct a preliminary modeling trial, (4) begin training the chain, (5) use ample social praise and other reinforcers and (6) decrease extra assistance at each step as quickly as possible
error correction in chaining
provide necessary instructions to prompt the response or use physical guidance to help learner perform step correctly
pitfalls of chaining (2)
(1) unaware misapplication and (2) partial knowledge misapplication
adventitious chaining
contains response that is not necessary for reinforcement
adventitious chaining has a _____ component
superstitious
superstitious component of adventitious chaining
unnecessary response component that is not functional for reinforcement
partial knowledge misapplication in chaining
solution
learner learns to make errors when instructor’s response is to repeat the question and give answers (engage in an imitation trial)
solution: increase reinforcement for correct response on question trials and lower reinforcement for imitation
differential reinforcement of low rates (DRL)
reinforcement only if the interval between successive responses is > x seconds; goal is to reduce but not eliminate the rate of response
types of DRL (2)
(1) spaced-responding and (2) limited-responding
spaced-responding DRL
requires that a behaviour NOT occur during a specified period/interval, but after interval, a response will produce reinforcement
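A minimal sketch of spaced-responding DRL, assuming times are in seconds from the start of the session, that every response (reinforced or not) restarts the timer, and that a gap exactly equal to the required interval counts. The function name is hypothetical.

```python
def spaced_drl_reinforced(response_times: list[float], min_gap: float) -> list[float]:
    """Return the response times that earn reinforcement under spaced-responding DRL."""
    reinforced = []
    previous = 0.0  # assumption: the first gap is measured from the session start
    for t in sorted(response_times):
        if t - previous >= min_gap:
            reinforced.append(t)  # waited long enough since the last response
        previous = t  # any response, reinforced or not, restarts the timer
    return reinforced


print(spaced_drl_reinforced([5, 8, 20, 22, 40], min_gap=10))  # [20, 40]
```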
limited-responding DRL
max number of responses that can occur in an interval; if the max is exceeded, then no reinforcement is provided
differential reinforcement of zero responding/ other responding (DRO)
a reinforcer is presented only if a specified response does not occur during a specified period of time
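A minimal sketch of DRO, assuming the session is divided into consecutive intervals of equal length and that the reinforcer is delivered at the end of each interval that contains no target response. Times are in seconds; the function name is hypothetical.

```python
def dro_outcomes(target_times: list[float], interval: float, n_intervals: int) -> list[bool]:
    """Return, for each interval, whether the reinforcer is delivered at its end."""
    occurred = [False] * n_intervals
    for t in target_times:
        index = int(t // interval)
        if 0 <= index < n_intervals:
            occurred[index] = True  # the target behaviour happened in this interval
    # Reinforcement only for intervals with zero occurrences of the target behaviour.
    return [not hit for hit in occurred]


# Five 10 s intervals; the target behaviour occurs at 12 s, 14 s and 47 s.
print(dro_outcomes([12, 14, 47], interval=10, n_intervals=5))
# [True, False, True, True, False]
```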
size of DRO should increase until: (2)
(1) behaviour is occurring very rarely/not at all and (2) a minimum amount of reinforcement is being given for its nonoccurrence
differential reinforcement of incompatible behaviour (DRI)
withholding reinforcers for a target behaviour and reinforcing an incompatible response
differential reinforcement of alternative behaviour (DRA)
a procedure that involves the extinction of a problem behaviour combined with reinforcing a behaviour that is topographically dissimilar to, but not necessarily incompatible with, the problem behaviour
punisher (aversive stimulus)
an immediate consequence of an operant behaviour that causes that behaviour to decrease in frequency
principle of punishment
if, in a given situation, someone does something that is immediately followed by a punisher, then that person is less likely to do the same thing again when she or he next encounters a similar situation
ways “punisher” is different than the common meaning (3)
(1) occurs immediately after the problem behaviour, (2) it is not a form of moral sanction, vengeance or retribution and (3) it is not used to deter others from engaging in the target behaviour
types of punishers (4)
(1) physical punisher, (2) reprimand, (3) timeout and (4) response cost
physical punishers
stimuli that activate pain receptors (nociceptors) or otherwise cause discomfort (without prior learning)
reprimand
a strong negative verbal stimulus immediately contingent on behaviour; generally a conditioned punisher
timeout
a period of time immediately following a particular behaviour during which an individual loses the opportunity to earn reinforcers
types of timeout (2)
(1) exclusionary and (2) nonexclusionary
exclusionary timeout
removing an individual briefly from a reinforcing situation immediately following a behaviour
nonexclusionary timeout
introducing into the situation, immediately following a behaviour, a stimulus associated with less reinforcement
response cost
immediate removal of a valued stimulus contingent upon a behaviour (negative punishment)
factors influencing the effectiveness of punishment (5)
(1) maximizing conditions for desirable alternative behaviour, (2) minimizing the causes of the problem behaviour, (3) selecting punisher, (4) add antecedents for punishment and (5) delivering punisher
selecting a punisher (3)
(1) punisher must be effective, (2) potentially a verbal reprimand and (3) effectiveness is increased if the punishment is varied
antecedents for punishment (S^DP)
will lead to a faster decrease in problem behaviour and increase in desirable behaviour; rules, warnings… etc
delivering punishers (4)
(1) the sooner the better, (2) intermittent punishment is LESS effective (be consistent), (3) delivery should not be paired with positive reinforcement (liking for licks; attention) and (4) be calm and matter of fact!!
physical punishment is associated with … (6)
(1) increased aggression, (2) increased antisocial behaviour, (3) poor academic achievement, (4) poor parent-kid relationships, (5) mental health problems and (6) diminished moral internalization
in Canada, parents and teachers can use physical punishment with “ ___ ___”
reasonable force (no objects or on face)
potential harmful side effects of punishment (6)
(1) aggressive behaviour, (2) emotional behaviour, (3) escape and avoidance behaviour, (4) no new behaviour, (5) modeling of punishment and (6) overuse of punishment
principle of escape conditioning (negative reinforcement)
the removal of certain, already present, stimuli (aversive stimuli) immediately after the occurrence of a behaviour will increase the likelihood of that behaviour
escape extinction
reversing escape conditioning by not allowing the behaviour to cause the aversive stimulus to be removed
principle of avoidance conditioning
a contingency in which a behaviour prevents an aversive stimulus from occurring thereby resulting in an increase in the frequency of the behaviour
warning stimulus (conditioned aversive stimulus)
stimulus that signals a forthcoming aversive stimulus (in avoidance conditioning)
discriminated avoidance conditioning
avoidance conditioning that includes a warning signal that enables the individual to discriminate a forthcoming aversive stimulus
respondent behaviour
reflexive; aka unconditioned reflexes
unconditioned stimulus
elicits a response (unconditioned response) without prior learning or conditioning
conditioned reflex
a stimulus-response relationship in which a stimulus elicits a response because of prior respondent conditioning
factors influencing respondent conditioning (5)
(1) the greater the number of pairings of a CS with a US, the greater the ability of the CS to elicit the CR, (2) stronger conditioning if the CS precedes the US by about half a second, (3) a CS acquires greater ability to elicit a CR if the CS is always paired with the US than if it is only occasionally paired with the US, (4) when several neutral stimuli precede a US, the stimulus that is most consistently associated with the US is the one most likely to become a strong CS and (5) respondent conditioning will develop more quickly and strongly when the CS or US or both are intense rather than weak