Chapter 5: Operant Reinforcement Flashcards

1
Q

The statement that behaviour is a function of its consequences. So called because the strength of a behaviour depends on its past effects on the environment. Implicit in the law is the notion that operant learning is an active process, since it is usually the behaviour of the organism that, directly or indirectly, produces the effect

A

Law of effect

2
Q

What did Thorndike speculate about reinforcement’s neural effect? What is this view called?

A

Thorndike speculated that reinforcement strengthened the bonds or connections between neurons, a view that became known as connectionism

3
Q

How does reinforcement of a response give that response momentum? Explain Nevin’s use of the metaphor of momentum to describe the effects of reinforcement

A

Just as a heavy ball rolling down a hill is less likely than a light ball to be stopped by an obstruction in its path, behaviour that has been reinforced many times is more likely to persist when obstructed in some way, for example when one confronts a series of failures

4
Q

In what way did Thorndike’s work depart from previous conceptions of the learning process? How did Page and Neuringer show that randomness is a reinforceable property of behavior?

A

Philosophers had long debated the role of hedonism, the tendency to seek pleasure and avoid pain, in behaviour. But Thorndike was the first person to show that behaviour is systematically strengthened or weakened by its consequences. Prior to Thorndike, learning was thought to be primarily a matter of reasoning; Thorndike shifted our attention from inside the organism to the external environment

Even the randomness of behaviour can be modified with reinforcement. Page and Neuringer provided reinforcers to pigeons for a series of eight key pecks, but only when the series was different from each of the previous 50 sequences. Under these conditions, the key-peck patterns became almost truly random
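
For illustration only, here is a minimal Python sketch of this kind of “lag” contingency. It assumes the criterion is simply that the current eight-response sequence must differ from each of the previous 50 sequences; the key labels, function name, and parameters are hypothetical and are not part of Page and Neuringer’s actual procedure.

    from collections import deque
    import random

    def lag_variability_session(n_trials=200, seq_len=8, lag=50, seed=1):
        """Deliver a 'reinforcer' only when a sequence of key pecks differs
        from each of the previous `lag` sequences."""
        rng = random.Random(seed)
        recent = deque(maxlen=lag)       # memory of the last `lag` sequences
        reinforced = 0
        for _ in range(n_trials):
            sequence = tuple(rng.choice("LR") for _ in range(seq_len))  # left/right pecks
            if sequence not in recent:   # novel relative to the last 50: reinforce
                reinforced += 1
            recent.append(sequence)      # remember the sequence either way
        return reinforced

    print(lag_variability_session())     # number of reinforced trials out of 200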

5
Q

Describe the essential components of a Skinner box. How did the Skinner box get its name?

A

Skinner designed an experimental chamber in which a food magazine could automatically drop a few pellets of food into a tray. After a rat became accustomed to the noise of the food magazine and readily ate food from the tray, Skinner installed a lever; when the rat pressed the lever, food fell into the tray. Under these conditions, the rate of lever pressing increased dramatically

Clark Hull, a psychologist at Yale University, dubbed the chamber the Skinner box; Skinner himself preferred the term operant chamber

6
Q

Any procedure in which a behaviour becomes stronger or weaker depending on its consequences. Also called instrumental learning

A

Operant learning

7
Q

How does operant conditioning differ from Pavlovian conditioning?

A

Operant learning is not S-R learning; the principal behaviour involved is not reflexive and is often complex. In operant learning the organism acts on the environment and changes it, and the change that’s produced strengthens or weakens that behavior. Whereas the organism undergoing Pavlovian conditioning may be described as passive, in operant learning the organism is necessarily active

8
Q

The procedure of providing consequences for a behaviour that increase or maintain the strength of that behavior.

A

Reinforcement

9
Q

Name the three essential features of reinforcement

A

  1. The behaviour must have a consequence
  2. The behaviour must increase in strength or occur more often
  3. The increase in strength must be the result of the consequence

10
Q

A reinforcement procedure in which a behaviour is followed by the presentation of, or an increase in the intensity of, a stimulus. Sometimes called reward training, although the term reward is problematic

A

Positive reinforcement

11
Q

A reinforcement procedure in which a behaviour is followed by the removal of, or a decrease in the intensity of, a stimulus. Sometimes called escape training

A

Negative reinforcement

12
Q

Because what reinforces behaviour in negative reinforcement is escaping from an aversive stimulus, this procedure is also called

A

Escape training

13
Q

An operant training procedure in which performance of a behaviour defines the end of the trial

A

Discrete trials procedure

Example: each time a cat escapes from a box, that marks the end of the trial

14
Q

An operant training procedure in which a behaviour may be repeated any number of times

A

Free operant procedure

Example: placing a rat in an operant chamber equipped with a lever. Pressing the lever might cause a bit of food to fall into a tray, but the rat is free to return to the lever and press it again and again

15
Q

Explain why scientists often simplify problems to study them. What are the advantages and disadvantages of this approach?

A

Laboratory researchers simplify problems so they can identify functional relationships between independent and dependent variables. If the relations so identified are valid, they will enable the researcher to predict and control the phenomenon in future experiments. They will also lead to hypotheses about how real-world problems may be solved

16
Q

Compare and contrast operant and Pavlovian conditioning. Describe the parallel Skinner drew between natural selection and reinforcement

A

The most important difference is that in Pavlovian conditioning one stimulus, the US, is contingent on another stimulus, the CS, whereas in operant learning, a stimulus, the reinforcing or punishing consequence, is contingent on a behavior.

The two also usually involve different kinds of behaviour: Pavlovian conditioning typically involves involuntary or reflexive behaviour, whereas operant learning usually involves voluntary behaviour.

Skinner likened operant learning to the process of natural selection: useful behaviours “survive,” and others “die out”

17
Q

Any reinforcer that is not dependent on another reinforcer for its reinforcing properties

A

Primary reinforcer or unconditioned reinforcer

Examples: food, water, sexual stimulation, weak electrical stimulation of certain brain tissues, relief from heat and cold, and certain drugs. Primary reinforcers are powerful but probably play a limited role in human learning, and they are relatively few in number

18
Q

Any reinforcer that has acquired its reinforcing properties through its association with other reinforcers.

A

Secondary reinforcer or conditioned reinforcer

Examples: praise, recognition, smiles, and positive feedback. These reinforcers are secondary to or are derived from other reinforcers

19
Q

What four advantages do conditioned (secondary) reinforcers have over unconditioned or primary reinforcers? What key disadvantage do conditioned reinforcers have?

A

  1. Primary reinforcers lose much of their reinforcing value very quickly, whereas conditioned reinforcers sometimes become less effective with repeated use, but this occurs much more slowly
  2. It is often much easier to reinforce behaviour immediately with conditioned reinforcers than with primary reinforcers
  3. Conditioned reinforcers are often less disruptive
  4. Conditioned reinforcers can be used in many different situations

Main disadvantage of conditioned reinforcers: their effectiveness depends on their association with primary reinforcers. Primary reinforcers are much more resilient

20
Q

Any secondary reinforcer that has been paired with several different reinforcers

A

Generalized reinforcers

Example: money

21
Q

In operant training, the procedure of reinforcing successive approximations of the desired behaviour

A

Shaping

Shaping makes it possible to train, in a few minutes, behaviour that might otherwise never occur spontaneously

22
Q

What five factors are responsible for the effective use of shaping?

A
  1. Reinforce small steps
  2. Provide immediate reinforcement
  3. Provide small reinforcers
  4. Reinforce the best approximation available
  5. Back up when necessary
23
Q

Explain how adults often unwittingly shape undesirable behaviour in children

A

Tantrums are typically the product of shaping. A tired parent may give in to a child’s repeated requests just to “shut him up.” On the next occasion, the parent may resist giving in to the child’s usual demands, and the child might respond by becoming louder or crying; the parent yields to avoid causing a scene. On a subsequent occasion, determined to regain control, the parent may refuse to comply when the child cries or shouts, but gives in when the child produces bugle-like wails

The parent gradually demands more and more outrageous behaviour for reinforcement, and the child obliges, eventually engaging in full-fledged tantrums

24
Q

A series of related behaviors, the last of which produces reinforcement

A

Behaviour chain

Example: competing on the balance beam where the person must perform a number of acts in a particular sequence

25
Q

Training an animal or person to perform a behaviour chain

A

Chaining

26
Q

What is the first step in a chaining procedure?

A

Breaking the task down into its component elements, a procedure called task analysis

27
Q

What are the two types of chaining procedures?

A

Backward chaining and forward chaining

28
Q

A chaining procedure in which training begins with the last link in the chain and adds preceding links in reverse order

A

Backward chaining

Example: first training a rat to drop a marble down a tube, then training it to carry the marble to the tube and drop it, then moving on to the next preceding link, and so on. The chain is never performed backward; the parts of the chain are always performed in their proper sequence. The procedure is backward only in the sense that links in the chain are added from back to front

29
Q

A chaining procedure in which training begins with the first link in the chain and adds subsequent links in order

A

Forward chaining
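
As a rough illustration of the two training orders, here is a short Python sketch. The task analysis and step names are invented for the example, not taken from the text.

    # Hypothetical task analysis for a simple three-link chain.
    steps = ["pick up marble", "carry marble to tube", "drop marble in tube"]

    def backward_chaining_order(steps):
        """Train the last link first, then add preceding links one at a time.
        Each stage still performs its links in the normal forward order."""
        return [steps[i:] for i in range(len(steps) - 1, -1, -1)]

    def forward_chaining_order(steps):
        """Train the first link first, then add subsequent links in order."""
        return [steps[:i] for i in range(1, len(steps) + 1)]

    print(backward_chaining_order(steps))
    # [['drop marble in tube'],
    #  ['carry marble to tube', 'drop marble in tube'],
    #  ['pick up marble', 'carry marble to tube', 'drop marble in tube']]
    print(forward_chaining_order(steps))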

30
Q

What reinforces each link of a behaviour chain?

A

Each link in the chain is reinforced by the opportunity to perform the next step in the chain.

The external reinforcer at the end of the chain is crucial; without it, the chain is not likely to be performed

31
Q

Describe the concept of contingency in reinforcement. How and why does it influence reinforcement?

A

Where operant learning is concerned, the word contingency refers to the degree of correlation between a behaviour and its consequences. The rate at which learning occurs varies with the degree to which a behaviour is followed by a reinforcer

32
Q

Describe the concept of contiguity in reinforcement. How and why does it influence reinforcement?

A

Contiguity refers to the interval between a behaviour and its reinforcing consequence. Contiguity has a powerful effect on the rate of operant learning: in general, the shorter this interval, the faster learning occurs

One reason that immediate consequences produce better results is that a delay allows time for other behaviour to occur; that behaviour, and not the appropriate one, is then reinforced. However, learning can occur despite reinforcement delays if the delayed reinforcer is regularly preceded by a particular stimulus

33
Q

How does the size of a reinforcer affect the effectiveness of a reinforcer?

A

Other things being equal, a large reinforcer is more effective than a small one. The relation between reinforcer size, sometimes referred to as reinforcer magnitude, and learning is not, however, linear. In general, the more you increase the reinforcer size, the less benefit you get from the increase

34
Q

How do task characteristics affect the effectiveness of a reinforcer?

A

Certain qualities of the behaviour being reinforced affect the ease with which it can be strengthened. For instance, behaviour that depends on smooth muscles and glands is harder to modify through operant procedures than is behaviour that depends on skeletal muscles

35
Q

How does deprivation level affect the effectiveness of a reinforcer?

A

The effectiveness of food, water, and warmth as reinforcers varies with the extent to which an organism has been deprived of these things. In general, the greater the level of deprivation, the more effective the reinforcer

Deprivation is less important where secondary reinforcers are concerned

36
Q

How does previous learning history affect the effectiveness of a reinforcer?

A

Example: much of the difference between fast- and slow-learning schoolchildren disappears when both groups have similar learning histories.

37
Q

How do competing contingencies affect the effectiveness of a reinforcer?

A

The effects of reinforcing a behaviour will be very different if the behaviour also produces punishing consequences or if reinforcers are simultaneously available for other kinds of behaviour

38
Q

In operant training, the procedure of withholding the reinforcers that maintain a behaviour

A

Extinction

39
Q

What is the immediate effect of an extinction procedure?

A

An abrupt increase in the rate of the behaviour placed on extinction

40
Q

A sudden increase in the rate of behaviour during the early stages of extinction

A

Extinction burst

41
Q

What effects does operant extinction have on behavioural variability? On aggression?

A

Behavioural variability: the organism “tries something else,” often a variation of the previously reinforced behaviour. We can make use of this phenomenon during shaping: after repeatedly reinforcing an approximation of the desired behaviour, we can withhold reinforcement. This increases the variability of the behaviour, which makes it likely that a better approximation of the goal behaviour will appear. When it does, it can be reinforced

Aggression: extinction also often increases the frequency of emotional behaviour. For example, when lever pressing no longer produces food, rats may bite the lever or attack another animal if one is present

42
Q

How long does it normally take to extinguish a behavior?

A

One extinction session is often not enough, even if it lasts for several hours and involves hundreds or even thousands of unreinforced acts. The longer the interval between extinction sessions, the greater the recovery of the behaviour

43
Q

Describe spontaneous recovery with respect to operant extinction. What conditions facilitate spontaneous recovery?

A

After one extinction session, the rate of the previously reinforced behaviour declines and finally stabilizes at or near its pre-training level. Extinction appears to be complete; however, if the animal or person is later put back into the training situation, the extinguished behaviour occurs again. This reappearance of a previously extinguished behaviour is called spontaneous recovery

44
Q

The reappearance during extinction of a previously reinforced behaviour

A

Resurgence

Example: a pigeon is trained to peck a disk and then this behaviour is extinguished. Now some new behaviour, such as wing flapping, is reinforced. When this behaviour is then put on extinction, wing flapping declines, but the bird may begin to peck the disk again

45
Q

How can resurgence help to explain some instances of regression?

A

Regression is the tendency to return to more primitive, infantile modes of behaviour. When a behaviour no longer produces the consequences we like, we may revert to a form of behaviour that had been reinforced in similar situations in the past. The reversion may very well be unconscious; that is, the person probably cannot specify the learning history that produced it.

46
Q

What conditions are responsible for the rate of extinction?

A

The number of times the behaviour was reinforced before extinction, the effort the behaviour requires, and the size of the reinforcer used during training

47
Q

The author claims that reinforcement and extinction are parallel procedures, but that they do not have equal effects. Explain.

A

One nonreinforcement does not cancel out one reinforcement. Behaviour is usually acquired rapidly and extinguished slowly

48
Q

Describe Thorndike’s work in which he tried to separate the effects of reinforcement from those of practice

A

He tried to draw a four-inch line with his eyes closed over and over again, for a total of 3,000 attempts, yet there was no improvement. He also performed this experiment with students, who, without feedback, did not improve. When he allowed them to open their eyes after each attempt to see the results of their efforts, there was a marked improvement. He concluded that practice is important only insofar as it provides the opportunity for reinforcement.

49
Q

The theory of reinforcement that attributes a reinforcer’s effectiveness to the reduction of a drive

A

Hull’s Drive-reduction theory

50
Q

Theory of reinforcement that considers reinforcers to be behaviours rather than stimuli and that attributes a reinforcer’s effectiveness to its probability relative to other behaviours

A

Premack’s relative value theory

51
Q

What are the advantages and disadvantages of Premack’s relative value theory?

A

Advantages: it is strictly empirical; no hypothetical concepts, such as drive, are required. An event is reinforcing simply because it provides the opportunity to engage in a preferred behaviour

Disadvantages: secondary reinforcers are troublesome, because the theory does not explain why the word “yes,” for example, is reinforcing.
Also, low-probability behaviour will reinforce high-probability behaviour if the participant has been prevented from performing the low-probability behaviour for some time

52
Q

The observation that high-probability behaviour reinforces low-probability behaviour

A

Premack principle

Example: if a rat shows a stronger inclination to drink than to run in an exercise wheel, drinking can be used to reinforce running. To get a drink, the rat had to run; the result was that the time spent running increased. Drinking reinforced running
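
A tiny, purely illustrative Python sketch of the principle follows; the baseline figures are invented. Given the baseline probability of each behaviour, the higher-probability behaviour is the one predicted to reinforce a lower-probability behaviour.

    # Hypothetical baseline session: proportion of time spent on each behaviour.
    baseline = {"drinking": 0.30, "running": 0.10}

    def premack_reinforcers(baseline, target):
        """Behaviours with a higher baseline probability than the target;
        by the Premack principle, these should reinforce the target."""
        return [b for b, p in baseline.items() if p > baseline[target]]

    print(premack_reinforcers(baseline, "running"))   # ['drinking']
    print(premack_reinforcers(baseline, "drinking"))  # []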

53
Q

The theory of reinforcement that says a behaviour is reinforcing to the extent that the organism has been deprived, relative to its baseline frequency, of performing that behavior. Also called equilibrium theory

A

Response deprivation theory
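
One common way to express the idea is sketched below with invented numbers (an illustrative assumption, not a formula from the text): the contingent behaviour is predicted to act as a reinforcer when the schedule holds it below its baseline ratio to the instrumental behaviour.

    def deprives_contingent_behaviour(baseline_contingent, baseline_instrumental,
                                      allowed_contingent, required_instrumental):
        """True if the schedule allows less of the contingent behaviour, per unit of
        instrumental behaviour, than occurred at baseline, i.e. the organism is
        deprived of the contingent behaviour relative to its baseline level."""
        return (allowed_contingent / required_instrumental) < (
            baseline_contingent / baseline_instrumental)

    # Hypothetical baseline: 30 drinks and 10 wheel turns per session.
    # Schedule: 1 drink allowed per 2 wheel turns required.
    print(deprives_contingent_behaviour(30, 10, 1, 2))  # True: drinking should reinforce running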

54
Q

What are the advantages and disadvantages of response deprivation theory?

A

Disadvantages: it also has trouble explaining secondary reinforcers like the word yes

Advantages: works well enough for many reinforcers

55
Q

The author claims that negative reinforcement often starts out as escape responding and ends up as avoidance responding. Provide an original example illustrating this transition

A

Example: when a bell sounds, a cat placed in a room is blasted with cold air. At first the cat escapes by moving out of the path of the cold air; eventually, just hearing the bell makes the cat move before the cold air blows, so the escape response has become an avoidance response

56
Q

A form of negative reinforcement in which the subject first learns to escape, and then to avoid, an aversive

A

Escape-avoidance learning

57
Q

Explain why avoidance responding has been considered a puzzling phenomenon

A

Escaping from an aversive stimulus is reinforcing and not puzzling, but performing an act that merely avoids the aversive stimulus is puzzling: it seems to mean that something that did not happen serves as a reinforcer

58
Q

The view that avoidance and punishment involve two procedures: Pavlovian and operant learning

A

Two-process theory

59
Q

What is the evidence for and against two-process theory?

A

Problems: even when the signal for shock loses its aversiveness, the avoidance response persists.
Another problem has to do with the failure of avoidance behaviours to extinguish

60
Q

An escape-avoidance training procedure in which no stimulus regularly precedes the aversive stimulus. Also called unsignalled avoidance

A

Sidman avoidance procedure

There is no signal, such as a light or a tone, correlated with the impending shock

61
Q

The view that avoidance and punishment involve only one procedure: operant learning

A

One-process theory

Whereas two-process theorists say that the absence of shock cannot reinforce behaviour (something that does not happen cannot be a reinforcer), one-process theory says that something does happen: there is a reduction in overall exposure to shock, and this reduction is reinforcing.
It deals with the resistance of avoidance behaviour to extinction by noting that an animal or person can be made to stop performing an unnecessary avoidance behaviour if both the behaviour and its aversive consequence are prevented from occurring

62
Q

Describe Thorndike’s puzzle box apparatus and his use of it to study the behaviour of cats and other animals. Why did he study animal behavior? What were his findings? What conclusions did he draw from these findings?

A

Thorndike studied animal intelligence by studying animal learning. He would place a hungry cat in a puzzle box and put food in plain view but out of reach. The box had a door that could be opened by some simple act, such as pulling a wire loop or stepping on a treadle. The cat would begin by performing a number of ineffective acts, but eventually it would pull the loop or step on the treadle, the door would fall open, and the cat would make its way to freedom and food. With each succeeding trial, the animal made fewer ineffective movements until, after many trials, it would immediately pull the loop or step on the treadle and escape.

He concluded that a given behaviour typically has one of two kinds of consequences, or effects: a satisfying state of affairs or an annoying state of affairs. He later called this relationship between behaviour and its consequences the law of effect