Chapter 5: Operant Conditioning Flashcards

1
Q

Define operant conditioning.

A

The process by which an organism learns to make, or to refrain from making, certain responses in order to obtain or avoid outcomes (the outcomes depend on the responses).

2
Q

Define positive reinforcement, positive punishment, negative reinforcement and negative punishment.

A

Reinforcement: behaviour increases
Punishment: behaviour decreases
Positive: something is added
Negative: something is taken away

3
Q

Describe the R (Response) part of the S-R-O arc.

A

The response is a voluntary action that becomes reflex-like over time. If the normal motor program is blocked, the animal will generalize the movement and use other means to achieve the same end.

4
Q

What experiment showed generalizability of the response in OC?

A

Rats were trained to wade through a maze partially filled with water; when the maze was later fully flooded, the rats swam to the goal with no problem.

5
Q

What is the Law of Effect?

A

When an animal’s behavior is followed by a positive outcome, the likelihood of the animal performing that behavior again increases.

6
Q

What were the methodological problems with Thorndike’s Puzzle Box? (3)

A
  1. Trials have to be repeated over and over, with the animal and the device reset each time.
  2. The experimenter may unconsciously add a reward or punishment.
  3. The experimenter decides when the trial is complete (experimenter bias).
7
Q

What did B.F. Skinner say were problems with comparing animals across trials of Thorndike-like experiments? (4)

A
  1. What if one animal is just slower? Does that mean it’s worse?
  2. What counts as the worst performance?
  3. Time to R decreases with learning, and we get progressively worse at discriminating small time differences.
  4. How do you generate a prediction from latencies? (What if the animal is a masochist?)
8
Q

What are the advantages of The Skinner Box? (2)

A
  1. The experimenter does not have to chase the escaping animal
  2. Self-paced trials (the animal dictates its own rate of response)

9
Q

What was Skinner’s Box?

A
  1. Sd (discriminative stimulus): a light that signals the box is “on”
  2. R (response): rate of lever pressing
  3. O (outcomes): food delivery (reinforcement), shock through the floor wires (punishment)
10
Q

What is the progression of operant conditioning?

A
  1. Pre-training: low spontaneous rate of R (exploring stage)
  2. Training: the contingency is introduced (if S, then R → O)
  3. Acquisition: the animal discovers the contingency; the rate of R increases
  4. Extinction: the contingency is eliminated; the rate of R decreases
11
Q

What are the four characteristics of Operant Conditioning?

A
  1. The animal operates on the environment
  2. A stimulus evokes a response to produce an outcome
  3. The animal connects context, behavior, and outcome
  4. Operant conditioning is more flexible/powerful (than classical conditioning)
12
Q

What are the three characteristics of classical conditioning (CC)?

A
  1. The environment operates on the animal
  2. A stimulus evokes a response
  3. The animal learns that a CS predicts a US
13
Q

Describe shaping.

A

Demonstrated in B.F. Skinner’s pigeon-turning experiment. Shaping evokes a behaviour through successive approximations, which build up a complex R incrementally. Initially, the contingency is introduced for a simple behaviour; as the rate of R improves, the contingency is moved to a more complex version of R. This gradually builds a complex R that the animal would never spontaneously produce.

14
Q

Describe chaining.

A

Builds complex R sequences by linking together S, R, O conditions. For example, first train the animal to pick up an object; next, reward it for picking up the object and then throwing it. This allows a SERIES of behaviours (as opposed to shaping, which elaborates a single response).

15
Q

Describe the process of backwards chaining.

A

The step closest to the outcome is trained first; earlier, more complicated steps are then added working backwards (sometimes easier than forward chaining).

16
Q

Talk about the mine-sniffing giant pouched rats.

A

Giant pouched rats in Mozambique are better than dogs at sniffing out TNT; they are partially blind, so they depend more on smell and hearing.
They are trained to detect the smell of TNT.
Most take 1 year to train by shaping (some elite rats train in 8 months). Clickers are often used for training animals (conditioning: the clicker is paired with food); clickers are secondary reinforcers (initially they have no value, but the animal eventually learns that they predict primary reinforcers).
A clicker is better than feeding by hand (the hand is not instantaneous).

17
Q

What did Clark Hull propose as a motivational source for OC? What’s a criticism?

A

He proposed drive reduction theory: obtaining primary reinforcers (food, water, mates) reduces an innate drive and satiates us. Criticism: that is not true for all learning; we are not always motivated by obtaining something.

18
Q

How was internal vs. external motivation measured experimentally?

A

In an experiment featuring the stopwatch and watch-stop tasks, participants were put into two groups: those given a 200-yen reward for each correct trial, and those told that completing the entire session would earn them 2,000 yen in total. The non-performance-based reward group spent more time practicing the task during the free time they were given than the group rewarded after each trial. Furthermore, when in a second session both groups were told that they would not receive a performance-based reward, the same pattern was observed!

19
Q

What is negative reinforcement contrast?

A

If rats are first trained to respond in order to obtain sugared water, and the reinforcer is then switched to (less preferred) food pellets, the rats’ rate of responding plummets.

20
Q

What is positive reinforcement contrast?

A

If rats are first trained to respond in order to obtain food pellets, and the reinforcer is then switched to sugared water, the rats’ rate of responding doesn’t go up as much as we would expect; the improvement has to be huge to elicit a noticeable increase.

21
Q

What are the problems with doing experiments with punishment? (3)

A
  1. Punishment may not stop only the target behaviour, but instead produce a general stopping of all behaviour.
  2. Using punishment can lead to more variable behaviour, since the alternatives are not specified precisely.
  3. It can encourage cheating: the behaviour occurs only when a certain discriminative stimulus is absent (called circumvention).
22
Q

What is concurrent reinforcement and why is it bad for OC? (3)

A

It can undermine punishment: for example, being punished by the teacher but reinforced by classmates’ laughter, or getting attention from parents despite being punished. It can also produce other emotions that impair behaviour, such as aggression, and it produces generalized behaviour disruption.

23
Q

What are three ways to make punishment effective?

A
  1. The contingency should always be in effect.
  2. Initial intensity should be strong (experiments show that gradually increasing shock intensity was less effective than starting out strong).
  3. Don’t use it at all! (Instead, reinforce behaviour that prevents the unwanted behaviour; this is used for children with severe autism.)
24
Q

Describe the fixed ratio schedule.

A

Rule: Every X Rs produce 1 outcome.

Responding: Steady upward responding until reinforcement.
Post-reinforcement pause (flat line): a time-out from responding after each reward. The more Rs required per O, the longer the pause after each reward (often the animal is consuming the reward during the pause).

25
Q

Describe the Variable Ratio schedule.

A

Rule: Every X Rs produce 1 outcome, but X changes with each reinforcer.

Response: Behavior is constant, with a high rate of responding; much faster than on FR. Video games are one example.

26
Q

Describe the Fixed Interval Schedule.

A

Rule: After Y seconds, 1R produces 1O.

Response: Scallop-shaped responding: no responding at the beginning of the interval, then a rapid rate of response just before the interval expires (e.g., watching the clock before an appointment).

27
Q

Describe the Variable Interval Schedule.

A

Rule: After Y seconds, 1R produces 1O, but Y changes after each O.

Response: Behavior is steady, with a low rate of responding (e.g., checking email).
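
The four schedule rules above are mechanical enough to express as a short program. The sketch below is purely illustrative and not from the lecture: the class names, the respond(t) interface (t = the time of a response, in seconds), and the once-per-second usage at the end are all assumptions made for the example. Each schedule is modeled as a procedure that decides, response by response, whether an outcome is delivered.

import random

class FR:
    # Fixed ratio X: every X-th response produces one outcome.
    def __init__(self, x):
        self.x, self.count = x, 0

    def respond(self, t):
        self.count += 1
        if self.count == self.x:
            self.count = 0
            return True  # outcome delivered
        return False

class VR:
    # Variable ratio X: an outcome after a variable number of responses
    # with mean X; the requirement is resampled after each outcome.
    def __init__(self, x):
        self.x, self.count = x, 0
        self.required = random.randint(1, 2 * x - 1)

    def respond(self, t):
        self.count += 1
        if self.count >= self.required:
            self.count = 0
            self.required = random.randint(1, 2 * self.x - 1)
            return True
        return False

class FI:
    # Fixed interval Y: the first response at least Y seconds after the
    # previous outcome produces the next outcome.
    def __init__(self, y):
        self.y, self.ready_at = y, y

    def respond(self, t):
        if t >= self.ready_at:
            self.ready_at = t + self.y
            return True
        return False

class VI:
    # Variable interval Y: like FI, but the delay varies, with mean Y.
    def __init__(self, y):
        self.y = y
        self.ready_at = random.uniform(0, 2 * y)

    def respond(self, t):
        if t >= self.ready_at:
            self.ready_at = t + random.uniform(0, 2 * self.y)
            return True
        return False

# Illustrative usage: one lever press per second for a minute on FR 10.
box = FR(10)
print(sum(box.respond(t) for t in range(60)))  # prints 6

Note that the characteristic response patterns on the cards (post-reinforcement pauses, scallops) come from how animals time their responses, not from these delivery rules themselves.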

28
Q

What is a schedule?

A

A pattern of behavioural contingency (e.g., “if 10 Rs, then O” or “if 10 minutes pass and then 1 R, then O”).

29
Q

What is a concurrent reinforcement schedule?

A

When two or more schedules are presented at the same time (e.g., two levers, one on a VI 2 schedule and one on VI 4).
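
A quick worked example (assuming, purely for illustration, that the VI values are in minutes): the two levers then differ in the maximum rate of reinforcement they can deliver,

\[
r_{\text{VI 2}} = \frac{60 \text{ min/hr}}{2 \text{ min per O}} = 30 \text{ O/hr},
\qquad
r_{\text{VI 4}} = \frac{60 \text{ min/hr}}{4 \text{ min per O}} = 15 \text{ O/hr}.
\]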

30
Q

What is the matching law of choice to behavior?

A

Relative response rates on concurrent VI schedules tend to correspond to the relative rates of reinforcement provided by each schedule.
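
For reference, this is Herrnstein’s matching law, usually written as a relative-rate equation; the worked example below reuses the hypothetical VI 2 and VI 4 levers from the previous card (intervals assumed to be in minutes):

\[
\frac{B_1}{B_1 + B_2} = \frac{r_1}{r_1 + r_2},
\]

where B_i is the response rate on schedule i and r_i is the reinforcement rate that schedule provides. With r_1 = 30 O/hr (VI 2) and r_2 = 15 O/hr (VI 4), matching predicts 30/(30 + 15) = 2/3 of all responses on the VI 2 lever.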