Unit 3 - Ch. 5 Flashcards

Question 1

Q

Another name for negative reinforcement

Answer

A

escape training

Question 2

Q

Shaping

Answer

A

Shaping is the reinforcement of successively closer approximations of a desired behaviour.
In shaping, it is sometimes a good idea to back up– ie, to reinforce earlier approximations of the desired behaviour.

Question 3

Q

Discrete training

Answer

A

Performance of a behaviour defines the end of a trial.
Operant training procedure.
The training procedure Thorndike used in his famous experiment with cats is best described as a discrete trial.
The dependent variable is usually related to how long it takes a participant to reach the end, the number of errors before getting there, or the number of times a behaviour was performed within the time frame.

Question 4

Q

Positive reinforcement is also called

Answer

A

reward training

Question 5

Q

Response deprivation theory

Answer

A

A behaviour becomes reinforcing for an organism when the organism is prevented from engaging in that behaviour at its normal frequency.
Schoolchildren are eager to go to recess because they have been deprived of the opportunity to exercise.

Question 6

Q

Law of effect

Answer

A

The law of effect says that behaviour is a function of its consequences (behaviour changes in relation to how the consequences change)

Question 7

Q

Notation for reinforcement

Question 8

Q

Relative value theory

Answer

A

The reinforcement properties of an event depend on the extent to which the event provides access to high probability behaviour.
High probability behaviour can be used to reinforce low probability behaviour.
Limitation: low can be favoured if it’s what you’ve been deprived of.
Premack’s name is most logically associated with relative value theory.

Question 9

Q

John Nevin says reinforcement gives behaviour

Question 10

Q

Premack is associated with

Answer

A

Relative value theory

Question 11

Q

Chaining

Answer

A

A chaining procedure is a series of steps to reinforce a behaviour chain. The first step is called task analysis: you break down the task into its component elements, identifying each link in the chain.
Chaining is a useful procedure for shaping behaviour in laboratory animals, and it is important in shaping the behaviour of wildlife

Question 12

Q

Extinction procedure leads to:

Answer

A

increase in variability of behaviour, increase in irritability, short-term extinction burst increase in behaviour

Question 13

Q

Another word for operant

Answer

A

Instrumental.

Another word for operant is instrumental (the behaviour is instrumental in producing the consequences)

Question 14

Q

Resurgence

Answer

A

The reappearance of previously reinforced behaviour during extinction is called resurgence. (reintroduce some other thing that worked in the past- pecking if flapping on extinction)

Question 15

Q

Sidman avoidance procedure

Answer

A

The distinctive characteristic of the Sidman avoidance procedure is that the aversive is not signalled.

Question 16

Q

Connectionism

Answer

A

Thorndike speculated that reinforcement strengthened bonds between neurons, a view that many cognitive scientists have now embraced and called connectionism

Question 17

Q

Contingency square

Answer

A

A contingency square is a grid with strength of behavior (x) and consequence (y) (stimulus is presented or removed) axes.
Positive Reinforcement, Positive Punishment
Negative Reinforcement, Negative Punishment

Question 18

Q

3 essential features of reinforcement

Answer

A

behaviour must have a consequence; behaviour must increase in strength/occur more often; increase in strength must be a result of that consequence.

Question 19

Q

Tips for shaping behaviour

Answer

A

Reinforce small steps
Provide immediate reinforcement
Provide small reinforcers. (Too much food takes too long to eat)
Reinforce the best approximation available
Back up when necessary

Question 20

Q

Operant learning

Answer

A

Bevaviour operates on the environment.
Behaviour is strengthened or weakened by its consequences. The behaviour is typically instrumental in producing these consequences– so this type of learning is also called instrumental learning.

Question 21

Q

Reinforcement

Answer

A

Reinforcement is the procedure of providing consequences for a behaviour that increase or maintain the strength of that behaviour.

Question 22

Q

Escape training

Answer

A

Escape training is the reinforcement of a behaviour to end an aversive stimulus. For example, coming in out of the rain so you don’t get soaked.

Question 23

Q

Avoidance training

Answer

A

What reinforces your behaviour involves preventing or postponing an aversive stimulus. This might be not going out when you see or read that it’s about to rain.

Question 24

Q

Free operant procedure

Answer

A

A free operant procedure is associated with Skinner. The behaviour may be repeated any number of times, so there isn’t an “end” in the same way there is with a discrete trial procedure. For example, a participant may push the lever in one of Skinner’s boxes many times for food within a single session/experiment.

Question 25

Q

Compare and contrast operant and Pavlovian conditioning.

Answer

A

In operant conditioning, a stimulus (the reinforcing or punishing consequence) is contingent on a behaviour. It usually involves voluntary behaviour.

Pavlovian conditioning involves one stimulus (the US) that is contingent on another stimulus (the CS). It mostly involves involuntary/reflexive behaviour.

Though different Pavlovian and operant experiences often happen together. The distinction is tough when, as evidenced in the case of Albert and the rat: the fact that Albert reached for the rat just before the loud noise occurred means that operative learning was involved in addition to Pavlovian conditioning.

Question 26

Q

What are primary reinforcers?

Answer

A

Primary (unconditioned) reinforcers are those that are not dependent on their association with other reinforcers. Examples would be food, water, sexual stimulation, stimulation of the brain’s “pleasure centres”, relief from hot/cold, and certain drugs.

Question 27

Q

What are secondary reinforcers?

Answer

A

Secondary reinforcers depend on their association with other reinforcers. Examples include praise, recognition, smiles, and positive feedback. These reinforcers are conditioned.

Question 28

Q

What are the advantages of secondary reinforcers over primary reinforcers?

Answer

A

Primary reinforcers lose their reinforcing value quickly (if you’re full, hunger works less and less)
It’s easier to reinforce behaviour immediately with secondary reinforcers (ie clicker versus walking over with food)
Conditioned reinforcers are less disruptive (don’t take time)
Conditioned reinforcers can be used in many situations, including when subject isn’t hungry or thirsty

Question 29

Q

What are generalized reinforcers?

Answer

A

Generalized reinforcers are those conditioned reinforcers that have been paired with a number of primary reinforcers and can be used in a variety of situations (such as money).

Question 30

Q

What are the 2 types of chaining procedures?

Answer

A

Forward chaining is when you build each successive link in the chain as you reinforce always the furthest step in the chain. You’d start with reinforcing step 1, then when they did step 2 reinforce only when they reach that, and so on.

Backward chaining is when the training starts with the last link, and backs up to start with earlier and earlier links.

With both, if the next behaviour isn’t happening, you reinforce the closest approximation (called shaping) until they reach the full behaviour for each step.

Question 31

Q

What conditions affect the effectiveness of a reinforcer?

Answer

A

size (bigger better to a point)
Task characteristics (some things are just harder to reinforce)
Deprivation level (how hungry?)
Prior learning
Competing contingencies

Question 32

Q

Hull’s drive-reduction theory

Answer

A

Empty explanatory concept.
According to Hull, a reinforcer is something that reduces a drive. Drive’s are in effect primary reinforcers that reduce physiological needs. The problem is that there are reinforcers that don’t reduce physiological needs, but nonetheless work as reinforcers.

Question 33

Q

2-process Theory of Avoidance

Answer

A

Both Pavlovian and operant learning are involved in avoidance learning. The escape is negatively reinforced operant learning, but eventually Pavlovian conditioning comes in, as any trigger or sign that the negative reinforcer is coming becomes the CS for fear.

Question 34

Q

1-process Theory of Avoidance

Answer

A

One-process theory says that avoidance involves only operant learning. The reinforcer in avoidance is the reduction in exposure to the negative stimulus. Evidence for this is found in the fact that preventing both the avoidance behaviour and the consequences results in extinction of the avoidance behaviour.