Chapter 10 - Schedules of Reinforcement Flashcards

1
Q

A rule describing the delivery of reinforcers for a behaviour

A

Schedule of reinforcement

2
Q

The distinctive rate and pattern of responding associated with a particular reinforcement schedule

A

Schedule effects

3
Q

A reinforcement schedule in which a behaviour is reinforced each time it occurs

A

Continuous reinforcement, CRF

Example: a rat receives food every time it presses a lever

4
Q

Any of several reinforcement schedules in which a behaviour is sometimes reinforced

A

Intermittent schedule

Also called partial reinforcement

5
Q

A reinforcement schedule in which every nth performance of a behaviour is reinforced

A behaviour is reinforced when it has occurred a fixed number of times

A

Fixed ratio schedule, FR

Example: every third lever press is reinforced

6
Q

What response pattern does a fixed ratio schedule typically generate?

A

Animals on fixed ratio schedules perform at a high rate, often punctuated by short pauses after reinforcement

7
Q

A pause in responding following reinforcement; associated primarily with FI and FR schedules

A

Post-reinforcement pauses

8
Q

The rate at which a behaviour occurs once it has resumed following reinforcement

A

Run rate

Increasing the ratio of lever presses to reinforcers does not change how rapidly a rat presses once it has resumed lever pressing, but does increase the length of the breaks the rat takes after each reinforcement

9
Q

A reinforcement schedule in which, on average, every nth performance of a behaviour is reinforced

A

Variable ratio schedule, VR

Example: in a VR5 schedule, reinforcement may occur after 2 to 10 lever presses, but the overall average will be one reinforcement for every five presses

Most gambling games are based on variable reinforcement schedules in which the payoffs occur after a variable and unpredictable number of responses
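As an illustrative sketch (hypothetical Python, not from the text), the FR and VR contingencies can be written as per-response decision rules:

```python
import random

def fixed_ratio(n):
    """FR n: reinforce every nth response."""
    count = 0
    def respond():
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True   # reinforcer delivered
        return False
    return respond

def variable_ratio(n):
    """VR n: each response has a 1-in-n chance of reinforcement,
    so on average every nth response is reinforced."""
    def respond():
        return random.random() < 1.0 / n
    return respond

fr3 = fixed_ratio(3)
print([fr3() for _ in range(6)])   # [False, False, True, False, False, True]
```

On a VR 5 rule like the one above, roughly one response in five is reinforced over a long run, but the number of responses between reinforcers is unpredictable, which mirrors the payoff structure of gambling.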

10
Q

What response pattern does a variable ratio schedule typically generate?

A

Produces steady performance at run rates similar to comparable FR schedules. If post-reinforcement pauses occur, they usually appear less often and are of shorter duration than in a comparable FR schedule. Post-reinforcement pauses are strongly influenced by the size of the average ratio and by the lowest ratio

11
Q

A reinforcement schedule in which a behaviour is reinforced the first time it occurs following a specified interval since the last reinforcement

A

Fixed interval schedule, FI

Example: a pigeon on an FI 5” schedule will have food delivered into its tray the first time it pecks the disk; for the next five seconds, disk pecking produces no reinforcement. Then, at the end of the five-second interval, the very next disk peck is reinforced

Example: baking in the oven, waiting for a bus, studying
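A minimal sketch of the same idea for interval schedules (hypothetical Python; time `t` is in seconds, and the clock is assumed to start at the last reinforcement):

```python
import random

def fixed_interval(interval):
    """FI: the first response at least `interval` seconds after the
    last reinforcer is reinforced; earlier responses do nothing."""
    last = 0.0
    def respond(t):
        nonlocal last
        if t - last >= interval:
            last = t   # reinforcer delivered; the interval restarts
            return True
        return False
    return respond

def variable_interval(avg):
    """VI: like FI, but the required interval varies around `avg`."""
    last, req = 0.0, random.uniform(0, 2 * avg)
    def respond(t):
        nonlocal last, req
        if t - last >= req:
            last, req = t, random.uniform(0, 2 * avg)
            return True
        return False
    return respond

fi5 = fixed_interval(5)
print([fi5(t) for t in (1, 4, 5, 6, 11)])   # [False, False, True, False, True]
```

Note that responding during the interval (t = 1, 4, 6) is simply wasted effort, which is why FI schedules produce the pause-then-accelerate "scallop" pattern.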

12
Q

Identify the response pattern that a fixed interval schedule typically generates

A

Produces post-reinforcement pauses and a scallop-shaped cumulative record.

13
Q

A reinforcement schedule in which a behaviour is reinforced the first time it occurs following an interval since the last reinforcement, with the interval varying around a specified average

A

Variable interval schedule, VI

In a VI 5” schedule, the average interval between reinforced pecks is five seconds.

Example: human hunters lying in wait for game.

14
Q

Identify the response pattern that a variable interval schedule typically generates

A

Produces high, steady run rates: higher than on FI schedules, but not so high as on comparable FR and VR schedules

15
Q

A reinforcement schedule in which reinforcement is contingent on the continuous performance of a behaviour for a fixed period of time

A

Fixed duration schedule, FD

Example: a child is required to practise the piano for half an hour; at the end of practice, provided the child has practised the entire time, he receives a reinforcer such as a cookie

16
Q

A reinforcement schedule in which reinforcement is contingent on the continuous performance of a behaviour for a period of time, with the length of time varying around an average

A

Variable duration, VD

Example: for the child who is practising piano, any given session might end after 30 minutes, 55 minutes, 20 minutes, or 10 minutes. On average, the student will practise for half an hour before receiving the milk and cookies, but there is no telling when the reinforcers will appear

17
Q

A reinforcement schedule in which a behaviour is reinforced only if a specified period of time has elapsed since the last performance of that behaviour

A

Differential reinforcement of low rate, DRL

Example: a rat might receive food for pressing a lever, but only if five seconds have elapsed since the last lever press. The interval begins each time the behaviour is performed.

In a DRL 5” schedule, a pigeon that pecks a disk receives reinforcement only if five seconds have elapsed since the last disk peck
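The DRL contingency can be sketched the same way (hypothetical Python); the key difference from an interval schedule is that the clock restarts with every response, reinforced or not:

```python
def drl(interval):
    """DRL: a response is reinforced only if at least `interval`
    seconds have elapsed since the previous response."""
    last = None   # no prior response yet
    def respond(t):
        nonlocal last
        reinforced = last is not None and (t - last) >= interval
        last = t   # the timer resets on every response
        return reinforced
    return respond

drl5 = drl(5)
print([drl5(t) for t in (0, 3, 9, 11, 17)])   # [False, False, True, False, True]
```

Responding too soon (t = 3 and t = 11 above) resets the timer and forfeits reinforcement, which is why this schedule produces very low response rates.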

18
Q

Identify the response pattern that a differential-reinforcement-of-low-rate schedule typically generates

A

Produces extremely low rates of behaviour. Sometimes results in the performance of a series of behaviours that are quite irrelevant to reinforcement; this behaviour may be superstitious

19
Q

A form of differential reinforcement in which a behaviour is reinforced only if it occurs at least a specified number of times in a given period.

A

Differential reinforcement of high rate, DRH

Example: a pigeon might be required to peck a disk five times in a 10-second period. If it pecks fewer than five times during that period, it receives nothing

20
Q

Identify the response pattern that a differential reinforcement of high rate schedule typically generates

A

Can produce extremely high rates of behavior, higher than any other schedule

21
Q

A reinforcement schedule in which reinforcement is delivered independently of behaviour at fixed intervals

A

Fixed time schedule, FT

Example: in an FI 10” schedule, a pigeon may receive food after a 10-second interval, but only if it pecks a disk; whereas in an FT 10” schedule, the pigeon receives food every 10 seconds whether it pecks the disk or not

Not common outside the laboratory. Unemployment compensation and welfare payments come close to meeting the definition of fixed time schedules

22
Q

A reinforcement schedule in which reinforcement is delivered at varying intervals regardless of what the organism does

A

Variable time schedule, VT

The only difference between VT schedules and FT schedules is that in VT schedules the reinforcer is delivered at intervals that vary about some average

Have been used to establish superstitious behaviour. Example: periodically a sport fisherman gets lucky, and the use of a particular lure, kind of bait, or method of casting is coincidentally reinforced

23
Q

The procedure of gradually increasing the number of responses required for reinforcement

A

Stretching the ratio

Also referred to as thinning the schedule

With the procedure known as stretching the ratio, successive approximations of the desired behaviour are reinforced; it is essentially the same shaping process used to shape any new behaviour

The experimenter gradually shapes persistence in this way

Example: card sharks and pool hustlers sometimes let their competitors win frequently during the early stages of play and then gradually win more and more of the games

24
Q

Disruption of the pattern of responding due to stretching the ratio of reinforcement too abruptly or too far

A

Ratio strain

Example: workers who complain about being overworked and under paid and who shirk their responsibilities

25
Q

The density or frequency of a reinforcement schedule is a continuum. What extremes are on either end?

A

At one extreme we find continuous reinforcement, an FR schedule in which every single occurrence of a behaviour is reinforced. At the other extreme we find extinction, a schedule in which a behaviour is never reinforced

26
Q

The tendency of a behaviour to be more resistant to extinction following partial reinforcement than following continuous reinforcement.

A

Partial reinforcement effect, or PRE

In a rat lever-pressing study, the thinner the reinforcement schedule before extinction, the greater the number of lever presses during extinction

27
Q

Why does the author state that the partial-reinforcement effect is paradoxical?

A

It is paradoxical since the law of effect implies that the unreinforced lever presses that occur during an intermittent schedule should weaken the tendency to press, not make it stronger.

28
Q

The proposal that the PRE occurs because it is harder to discriminate between intermittent reinforcement and extinction than between continuous reinforcement and extinction

A

The discrimination hypothesis

Example: it takes longer to discriminate between extinction and an FR 30 schedule than it does to discriminate between extinction and an FR 1 schedule

29
Q

The proposal that the PRE occurs because non-reinforcement is frustrating and during intermittent reinforcement frustration becomes an S+ for responding

A

Frustration hypothesis

The thinner the reinforcement schedule during training, the higher the level of frustration when the rat finally receives food

30
Q

The proposal that the PRE occurs because the sequence of reinforced and non-reinforced behaviours during intermittent reinforcement becomes an S+ for responding during extinction

A

Sequential hypothesis

Attributes the PRE to differences in the sequence of cues during training
Extinction proceeds rapidly after continuous reinforcement because an important cue for performing is missing

The thinner the reinforcement schedule, the more resistant the rat will be to extinction, since a long stretch of non-reinforced lever pressing has become the cue for continued pressing. In other words, the rat performs in the absence of reinforcement because, in the past, long strings of non-reinforced presses have reliably preceded reinforcement

31
Q

The proposal that the PRE is due to differences in the definition of a behaviour during intermittent and continuous reinforcement

A

Response unit hypothesis

In an FR 2 schedule, where one press does nothing but two presses produce food, we should not think of this as press-failure, press-reward, but rather as press-press-reward. The unit of behaviour being reinforced is two lever presses.

When responses are defined in terms of the units required for reinforcement, the total number of responses during extinction declines as the reinforcement schedule gets thinner. Behaviour on intermittent reinforcement only seems to be more resistant to extinction because we have failed to take into account the response units required for reinforcement

32
Q

A complex reinforcement schedule in which two or more simple schedules alternate, with each schedule associated with a particular stimulus

A

Multiple schedule

Abbreviation: MULT in front of the different schedules

Example: a pigeon is reinforced for pecking on an FI 10” schedule when a red light is on, but on a VR 10 schedule when a yellow light is on. The two reinforcement schedules alternate, with the changes indicated by changes in the colour of the light

33
Q

A complex reinforcement schedule in which two or more simple schedules, neither associated with a particular stimulus, alternate

A

Mixed schedule

MIX FI 10” VR 10 schedule: disc pecking might be reinforced on an FI 10” schedule for 30 seconds and then on VR 10 schedule for 60 seconds, but there is no clear indication that the schedule has changed

34
Q

A complex reinforcement schedule that consists of a series of simple schedules, each of which is associated with a particular stimulus, with reinforcement delivered only on completion of the last schedule in the series

A

Chain schedule

35
Q

A complex reinforcement schedule that consists of a series of simple schedules, with reinforcement delivered only on completion of the last schedule in the series. The simple schedules are not associated with different stimuli

A

Tandem schedule

36
Q

A complex reinforcement schedule in which reinforcement is contingent on the behaviour of two or more individuals

A

Cooperative schedule

Two pigeons receive food for pecking a disk when the two of them have pecked a total of 20 times. One might peck the disk at the rate of 10 times a minute while the other pecks at 40 times a minute. As soon as the total number of pecks reaches 20, they each receive a few pieces of grain

37
Q

A complex reinforcement schedule in which two or more simple schedules are available at the same time

A

Concurrent schedule

A pigeon may have the option of pecking a red disk on a VR 50 schedule or pecking a yellow disk on a VR 20 schedule. The concurrent schedule involves a choice

38
Q

The principle that, given the opportunity to respond on two or more reinforcement schedules, the rate of responding on each schedule will match the reinforcement available on each schedule

A

Matching law

39
Q

Why is it a poor strategy to switch back-and-forth between two different ratio schedules of reinforcement? Or, why is it a good strategy to stay with one ratio schedule?

A

It makes sense to identify the more reinforcing schedule as quickly as possible and remain loyal to it. Switching back and forth between two ratio schedules is pointless; the task is simply to discriminate which schedule is denser

40
Q

Why is it a good strategy to switch between two different interval schedules of reinforcement?

A

There will be periods during which lever pressing is useless. Some of this time could be spent pressing the lever on the other schedule. It therefore makes sense for the animal to devote most of its effort to the more dense schedule but occasionally press the lever on the thinner schedule

41
Q

What is Herrnstein’s formula that predicts choice in a two-choice situation? Describe each of the terms in the equation, and explain the overall meaning of the equation

A

BA / (BA + BB) = rA / (rA + rB)

BA and BB represent two behaviours, behaviour A and behaviour B, and rA and rB represent the reinforcement rates for behaviours A and B, respectively.

This equation is merely a reformulation of the matching law
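As a sketch, the two-choice form of the matching law is straightforward to compute (hypothetical Python):

```python
def matching_share(r_a, r_b):
    """Predicted share of responding devoted to behaviour A:
    BA / (BA + BB) = rA / (rA + rB)."""
    return r_a / (r_a + r_b)

# If schedule A pays 40 reinforcers per hour and schedule B pays 20,
# the matching law predicts about two-thirds of responses go to A.
print(round(matching_share(40, 20), 3))   # 0.667
```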

42
Q

What is Herrnstein’s formula that predicts choice in a multiple-choice situation? Describe each of the terms in the equation, and explain the overall meaning of the equation

A

BA / (BA + BO) = rA / (rA + rO)

BA represents the particular behaviour we are studying, and BO represents all other behaviours; rA represents the reinforcers available for BA, and rO represents the reinforcers available for all other behaviours

This formula has less predictive value than the formula for the two-choice situation, because it is not possible to specify all the behaviours that may occur, nor all the reinforcers those acts may produce

Reminds us that behaviour is a function of the reinforcers available for any behaviour that might occur, not merely the reinforcers available for the behaviour that interests us at the moment

43
Q

Cite and recognize original examples of the matching law describing human behaviour

A

A farmer may devote most available farmland to a crop that produces a nice profit under typical weather conditions, while planting a smaller area with a less profitable crop that does well under adverse weather conditions.

When we spend more time at a high-paying job than at a low-paying one

When college students devote more time to a five-credit course than to a one-credit course, since a high grade in the former pays better than a high grade in the latter

44
Q

What type of reinforcement schedule maintains gambling behavior? Explain how and why “early wins” and “near misses” can lead to compulsive gambling. If you were the owner of a gambling enterprise, how could you use this knowledge to make more money?

A

The payoff in most games of chance resembles variable ratio schedules of reinforcement, and such schedules can produce high rates of behaviour that are highly resistant to change

Momentary variations in schedules that lead to early wins and near misses can lead to compulsive gambling because the person is reinforced early on

If I owned a gambling enterprise, I would put my machines on a variable ratio schedule, and have people win during their first few bets, and after that have them almost win every once in a while

45
Q

The use of reinforcement schedules, among other techniques, to study economic principles

A

Experimental or behavioural economics

46
Q

Describe the ways in which reinforcement schedules have been studied in experimental or behavioural economics

A

Economists know that when the price of a luxury item rises, the consumption of that item declines; but when the price of an essential item, such as food, rises, there is little change in consumption. The same phenomenon has been demonstrated in rats. Rats will work for psychoactive drugs, a luxury, but increases in the price of the drug (the number of lever presses required for a dose) usually result in decreased consumption; yet large increases in the price of food, an essential, do not lower consumption substantially

Psychiatric patients who earn tokens for performing various tasks and exchange them for cigarettes and other items were given a choice between activities for which tokens are available (e.g., doing laundry) and other activities for which reinforcers are available (watching TV). The distribution of tokens among patients resembled that found in the United States population: those in the top 20% held a total of 41% of all tokens, while those in the bottom 20% held only 7%.

47
Q

Explain how Goldberg and Cheney set up an experimental analogue for malingering. What implications does this animal study have for human behavior?

A

They tested the idea that operant behaviour associated with chronic pain may be maintained by reinforcement after the pain has ceased. Pairs of rats worked on a cooperative schedule; exposing one rat to a mild electric shock produced an abrupt reduction in the amount of work done by the rat in pain. These rats continued to press the lever, but at a much lower rate than during baseline.
After the shock was terminated, the rats continued to work at a lower pace, only gradually increasing their share of the workload. This is interesting because the slower rate of work reduced the amount of food both rats received. Although the partner rat could take up the slack, it necessarily took longer to reach the 50 lever presses required for reinforcement when one rat did little work

Suggests that there is good reason to believe that people may malinger if others are willing to press the lever more often to make up for someone who appears to be hurting. Malingering may occur even though everyone, including the malingerer, loses by it

48
Q

What criticisms have been made of research in reinforcement schedules?

A

Argue that the schedules of reinforcement studied in the laboratory are artificial constructions not found in the real world

Complain that schedules research generally produces trivial findings

Complain that reinforcement schedules reveal considerably more about rats and pigeons than they do about people.

49
Q

Why is examining the effects of schedules preferable to explaining behaviour in terms of personality characteristics and state of mind?

A

These kinds of explanations merely name the behavior to be explained. Identifying the kinds of reinforcement schedules that produce these behaviours is a considerable advance

50
Q

What other advantages accrue from research into schedules of reinforcement

A

When the goal is to discover rules that describe the way the environment affects behaviour, it is difficult if not impossible to discover such rules unless the experimenter simplifies the environment, as experimenters do in research into schedules of reinforcement

Allows us to answer questions that might be difficult to answer otherwise

Gives us a more scientific way of accounting for differences in behaviour

Studies with humans sometimes reveal patterns of behaviour different from those obtained with animals, but this may be only because human subjects often receive instructions about what they are to do, and instructions have a powerful effect on human behaviour

Provides a very good way of testing the effects of variables on behaviour

51
Q

How can reinforcement schedules be used as a baseline to study the effects of different independent variables on behavior?

A

Example: researchers trained rats to press a lever to get access to an exercise wheel and later administered cocaine in varying amounts 10 minutes prior to the test. The cocaine had no detectable influence on the pattern of behaviour until the researchers reached a dosage of 16 mg per kilogramme of body weight. At this level, the scalloped pattern of the FI schedule began to deteriorate. In much the same way, researchers used schedules as a basis for comparing the effects of alcohol and cocaine on human performance. Schedule performance can provide a baseline for evaluating the effects of toxins, diet, sleep deprivation, exercise, brain stimulation, and many other variables

52
Q

Provide and recognize original examples of learning in which no new behaviour is acquired

A

Example: an increase in the rate of behavior.
A pigeon that turns in counterclockwise circles at the rate of three or four a minute may learn to make 10 or 15 turns a minute

Example: a reduction in the rate of behaviour
A bird that turns counterclockwise 10 times a minute can learn to make one turn a minute

Example: a change in the pattern of performance as well as the rate
The cook learns to avoid opening the oven in the first few minutes but to check on the cookies more and more often during the last few minutes of baking time