Topic 4: Operant conditioning: reinforcement Flashcards
Operant (instrumental) conditioning
Learning that is controlled by the consequences of the organism's behavior
Reinforcement
-Process in which a behavior is strengthened by the immediate consequence that reliably follows its occurrence
-Strengthened=more likely to occur in future
-Thorndike's law of effect
-Skinner + operant boxes
Thorndike’s law of effect
-“If a response, in the presence of a stimulus, is followed by a satisfying state of affairs, the bond between stimulus and response will be strengthened”
-Satisfaction=Stamping in
-Discomfort=Stamping out
Positive reinforcement
-Adding something good/desirable that makes the behavior more likely to occur in the future
-Eg) a kid who enjoys the attention of being yelled at will yell again in the future
-Adding in the yelling (the "positive," i.e., added, element in this scenario) leads to more yelling in the future (behavior reinforced)
Negative reinforcement
-Taking away something bad/undesirable that encourages behavior to occur more often
-Eg) turning off a loud buzzing noise when a child finishes cleaning their room
-Negative reinforcement does NOT equal punishment
Positive punishment
-Adding something bad/undesirable to make behavior happen less often
-Eg) adding more chores when a child misbehaves
Negative punishment
-Taking away something good that leads to behavior occurring less in the future
-Eg) a kid is having fun but is too loud, so they get a timeout away from the fun. When out of timeout, they won't be loud again, to avoid being sent away from the fun
Antecedent
-The conditions you are in that determine whether a behavior will occur or not
-Could also be called a stimulus
Operant behavior
-A behavior that is strengthened through the process of reinforcement
-Acts on environment to produce a consequence
Operant learning
Change in a behavior as a function of the consequence that followed it
Reinforcement
The procedure of providing consequences for a behavior that increases or maintains the probability of that behavior occurring in the future
Reinforcer
Any event or stimulus that follows an operant response and increases or maintains its future probability
Positive reinforcement #2
Any event or stimulus that, when presented as a consequence of a behavior, increases or maintains the future probability of that behavior
Negative reinforcement #2
Any event or stimulus that, when removed as a consequence of a behavior, increases or maintains the future probability of that behavior
Escape behavior
-When operant behavior increases by removing an ongoing event/stimulus
-Eg) turning off alarm clock or pressing lever to stop electric shock
Avoidance behavior
-When operant behavior increases by preventing the onset of the event or stimulus
-Eg) pressing a lever to prevent an electric shock
Discrete trial procedure
-Instrumental response produced once per trial
-Each training trial ends with removal of animal from the apparatus
-Each trial is done as an isolated chunk
Free-Operant procedure
-Animals remain in apparatus and can make many responses
-No intervention by the experimenter
-Developed by Skinner
-Continuous trials
Cumulative record
-Based on old cumulative recorder device
-Constant paper output; the pen jumps with each response
-Plot of cumulative responses (y-axis) over time (x-axis)
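A minimal plotting sketch (Python; the response times below are made up for illustration, not course data) of how a cumulative record is drawn:

```python
# Cumulative record sketch: hypothetical response times (seconds) plotted as
# cumulative response count (y-axis) against elapsed time (x-axis).
import matplotlib.pyplot as plt

response_times = [2, 5, 6, 9, 15, 16, 17, 25, 26, 30]      # hypothetical data
cumulative_counts = list(range(1, len(response_times) + 1))

# A step plot mimics the recorder pen: flat while nothing happens, a jump at each response.
plt.step(response_times, cumulative_counts, where="post")
plt.xlabel("Time (s)")
plt.ylabel("Cumulative responses")
plt.title("Cumulative record (steeper slope = faster responding)")
plt.show()
```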
Unconditional (primary) reinforcer
-A reinforcer that acquired its properties as a function of the species' evolutionary history
-Stimuli and events have biological importance
-Usually depends on some amount of deprivation
conditional reinforcer
-Otherwise neutral stimuli or events that have acquired the ability to reinforce due to a contingent relationship with other, typically unconditional, reinforcers
Immediacy
-A stimulus is more effective as a reinforcer when it is delivered immediately after the behavior
Specific reinforcer used
-Certain reinforcers are preferred over others
-Chocolate > Sunflower seeds
Task characteristics
Eg) it is easier to reinforce pecking for food in a pigeon than in a hawk, because pecking fits the pigeon's natural repertoire
Contingency
-A stimulus is more effective as a reinforcer when it is delivered contingent on the behavior
-The degree of correlation between a behavior and its consequence
Contiguity
-Nearness of events in time (temporal contiguity) or space (spatial contiguity)
-High contiguity is referred to as pairing
-Less contiguity (longer delays) between the operant response and the reinforcer diminishes the effectiveness of the reinforcer
-Hyperbolic decay function
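A common way to formalize this decay (a sketch, assuming Mazur's hyperbolic discounting equation, the usual form given for delay of reinforcement): V = A / (1 + kD), where V is the effective value of the delayed reinforcer, A is its value if delivered immediately, D is the delay, and k is a discounting-rate parameter. Eg) with A = 10, k = 0.5, and D = 4 s: V = 10 / (1 + 2) ≈ 3.3.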
Motivating operations
-Establishing operations
-Abolishing operations
Establishing operations
-Make a stimulus more effective as a reinforcer at a particular time
-Eg) deprivation
Abolishing operations
-Make a stimulus less potent as a reinforcer at a particular time
-Eg- Satiation
Reinforcer magnitude
-Generally, a more intense stimulus is a more effective reinforcer
-The relationship between size and effectiveness is not linear
-As magnitude increases, each additional increase yields less added benefit (diminishing returns)
-The effectiveness of unconditional reinforcers tends to diminish quickly
Schedule of reinforcement
-A rule describing the delivery of reinforcement
-Different schedules produce unique schedule effects
Schedule effects
-Particular pattern and rate of behavior over time
-Over the long-term, effects are very predictable
-Occur in numerous species (humans too)
Continuous reinforcement schedule
-Behavior is reinforced each time it occurs
-Rate of behavior increases rapidly
-rare in natural environment
intermittent reinforcement schedule
-Four different types
1) Fixed ratio
2) Variable ratio
3) Fixed interval
4) Variable interval
1) Fixed ratio schedule
-Behavior is reinforced after a fixed number of responses
-Generates a post-reinforcement pause (PRP)
-Generates steady run rates following the post-reinforcement pause
Post-reinforcement pause
Pausing typically increases with ratio size and reinforcer magnitude
2) Variable ratio schedule
-Number of responses needed varies each time
-Ratio-requirement varies around an average
-PRP’s are rare and very short, they are influenced by the lowest and/or average ratio
-Produces higher rates than a comparable fixed-ratio schedule
-Common in natural environments
-2 common variations: 1) random ratio, 2) progressive ratio
1) random ratio
-Scheduling is controlled through a random number generator
-Produces similarly high rates of responding
-eg) casinos or video games
2) Progressive ratio
-Ratio requirements move from small to large
-PRPs increase with ratio size
-Creates a "break point," a measure of how hard an organism will work
Fixed-Interval schedule
-Behavior is reinforced the first time it occurs after a fixed period of time
-Produces PRPs
-Responding increases gradually as the interval elapses, creating a scallop shape
-Uncommon in natural environment
Variable interval schedule
-The time that must pass before a response can be reinforced varies each time
-Interval varies around an average
-PRPs are short and rare
-Steady rates of responding, though not as high as VR
-Common in natural environments
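A minimal code sketch (Python; the function names and parameter values are illustrative, not from the course) contrasting how a fixed-ratio rule and a variable-interval rule decide whether a given response earns a reinforcer:

```python
import random

def make_fixed_ratio(n):
    """FR-n: reinforce every n-th response, regardless of timing."""
    count = 0
    def reinforce(time_s):
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True   # deliver reinforcer
        return False
    return reinforce

def make_variable_interval(mean_s):
    """VI: reinforce the first response after an interval that varies around a mean."""
    next_setup = random.expovariate(1.0 / mean_s)   # when the next reinforcer becomes available
    def reinforce(time_s):
        nonlocal next_setup
        if time_s >= next_setup:
            next_setup = time_s + random.expovariate(1.0 / mean_s)
            return True   # deliver reinforcer
        return False
    return reinforce

# Usage: responses at hypothetical times (seconds). FR-3 pays off on every 3rd response;
# VI-10 pays off only if enough time has elapsed, no matter how fast responding is.
fr3, vi10 = make_fixed_ratio(3), make_variable_interval(10.0)
for t in [1, 2, 3, 5, 8, 13, 21, 34]:
    print(t, fr3(t), vi10(t))
```

This also shows why response rate matters so much on ratio schedules (more responses means more reinforcers) but adds little on interval schedules once responding is steady.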
Premack principle
-In nature different behaviors have different probabilities of occurring
-Low-to-high: following a low-probability behavior with a high-probability behavior reinforces the low-probability behavior
-High-to-low: following a high-probability behavior with a low-probability behavior does not reinforce the high-probability behavior
-Any high-probability response can serve as a reinforcer for a lower-probability response
How to test premack principle
1) Establish baseline responding for the different behaviors
2) Run an instrumental conditioning procedure with low-to-high and high-to-low contingencies
Example of premack principle
-If a child prefers playing pinball to eating veggies you can reinforce eating veggies by letting them play pinball each time they eat veggies
-high-probability behavior reinforces low-probability behavior
Problems with premack principle
-Does not nicely account for conditional reinforcement
-A low-probability behavior can reinforce a high-probability behavior when the organism has been deprived of the low-probability behavior
Antecedents/controlling stimuli
-Controlling stimulus is a stimulus that changes the probability of an operant behavior
-2 types
1) Discriminative stimulus
2) Extinction stimulus / S-delta
1) Discriminative stimulus/occasion setter
-A stimulus or event that precedes an operant and sets the occasion for its reinforcement
-Makes behavior more likely to occur in the moment
2) Extinction stimulus
-A stimulus or event that precedes an operant and sets the occasion for non-reinforcement
-Makes behavior less likely to occur in the moment
Antecedents
-Include establishing and abolishing operations as well as control stimuli
-Evoke a behavior
-Alter the current probability of behavior
Consequences
-Include reinforcers and punishers
-Strengthen or weaken the behavior
-Alter the future probability of behavior
When does discrimination occur?
-When the presence or absence of stimuli is the occasion on which a response will be followed by reinforcement
eg) Pecking is only reinforced when the green light is on
-The green light IS the occasion when pecking will be reinforced
What does discrimination refer to?
-The effect an occasion setting contingency has on behavior
-Refers to the effect of the response being more likely to occur in the presence of the SD than its absence
Stimulus control
A change in operant behavior that occurs when either S^D or S-Delta is presented
Discrimination index (ID)/Discrimination ratio
A measure of the stimulus control exerted by an S^D or S-Delta
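A common way to compute it (a sketch, assuming the standard discrimination-ratio formula): ID = responses during S^D / (responses during S^D + responses during S-Delta). An ID of 0.5 indicates no stimulus control; values approaching 1.0 indicate strong control. Eg) 80 responses during S^D and 20 during S-Delta gives ID = 80 / 100 = 0.8.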
Generalization
-Less precise control
-Obtained by training in a wide array of settings/stimuli
Stimulus generalization
1) Process where once a CS has been established, similar stimuli may also produce a CR
2) Process by which, once an operant response occurs to one discriminative stimulus, it also occurs to other similar stimuli
Stimulus discrimination
1) Process where we exhibit a less pronounced CR to CSs that differ from the original CS
2) Process where less responding occurs to stimuli that are different from the original trained stimulus
-Operant responses occur to the trained stimulus but not to others
Concept formation
1) The generalization within classes of stimuli; and
2) The discrimination between classes of stimuli
Generalization in practice
-Occurs when the target behavior occurs in situations other than the specific training conditions
-Ideally it involves having the target behavior occur in all relevant situations
Promoting generalization
-Reinforce occurrences of generalization
-Initially, the training setting and the criterion setting should be quite similar
-Training and criterion settings should then gradually become more dissimilar
Stimulus exemplar
Stimuli that represent the range of relevant stimulus situations in which the response should occur after training
General case programming
Different S^D’s may require different responses to obtain the same reinforcer