Unit 3 - Ch. 5 Flashcards

1
Q

Another name for negative reinforcement

A

escape training

2
Q

Shaping

A

Shaping is the reinforcement of successively closer approximations of a desired behaviour.
In shaping, it is sometimes a good idea to back up, i.e. to reinforce earlier approximations of the desired behaviour.

3
Q

Discrete trial procedure

A

An operant training procedure in which performance of the behaviour defines the end of a trial.
The training procedure Thorndike used in his famous experiment with cats is best described as a discrete trial procedure.
The dependent variable is usually related to how long it takes a participant to reach the end, the number of errors before getting there, or the number of times a behaviour was performed within the time frame.

4
Q

Positive reinforcement is also called

A

reward training

5
Q

Response deprivation theory

A

A behaviour becomes reinforcing for an organism when the organism is prevented from engaging in that behaviour at its normal frequency.
Schoolchildren are eager to go to recess because they have been deprived of the opportunity to exercise.

6
Q

Law of effect

A

The law of effect says that behaviour is a function of its consequences: behaviour changes as its consequences change.

7
Q

Notation for reinforcement

A

B → S^R

8
Q

Relative value theory

A

The reinforcing properties of an event depend on the extent to which the event provides access to high-probability behaviour.
High-probability behaviour can be used to reinforce low-probability behaviour.
Limitation: a low-probability behaviour can become the preferred one if it is what the organism has been deprived of.
Premack’s name is most logically associated with relative value theory.

9
Q

John Nevin says reinforcement gives behaviour

A

momentum

10
Q

Premack is associated with

A

Relative value theory

11
Q

Chaining

A

A chaining procedure is a series of steps to reinforce a behaviour chain. The first step is called task analysis: you break down the task into its component elements, identifying each link in the chain.
Chaining is a useful procedure for shaping behaviour in laboratory animals, and it is important in shaping the behaviour of wildlife.

12
Q

Extinction procedure leads to:

A

An increase in the variability of behaviour, an increase in irritability, and a short-term increase in the behaviour itself (the extinction burst).

13
Q

Another word for operant

A

Instrumental.

The behaviour is instrumental in producing its consequences.

14
Q

Resurgence

A

The reappearance of previously reinforced behaviour during extinction is called resurgence. The organism returns to some other behaviour that worked in the past; for example, pecking reappears when flapping is put on extinction.

15
Q

Sidman avoidance procedure

A

The distinctive characteristic of the Sidman avoidance procedure is that the aversive stimulus is not signalled.

16
Q

Connectionism

A

Thorndike speculated that reinforcement strengthened bonds between neurons, a view that many cognitive scientists have now embraced and called connectionism.

17
Q

Contingency square

A

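A contingency square is a 2 × 2 grid. One axis (x) is the change in the strength of the behaviour (it increases or decreases); the other axis (y) is the consequence (a stimulus is presented or removed).

                      Behaviour increases       Behaviour decreases
Stimulus presented    Positive reinforcement    Positive punishment
Stimulus removed      Negative reinforcement    Negative punishment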
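
As a study aid, the four cells of the contingency square can be written as a small lookup. This is only an illustrative sketch: the function name `classify_contingency` and the string labels "presented", "removed", "increases" and "decreases" are assumptions made for the example, not terms from the course material.

```python
def classify_contingency(stimulus_change: str, behaviour_change: str) -> str:
    """Return the contingency-square cell for one combination.

    stimulus_change:  "presented" or "removed" (hypothetical labels)
    behaviour_change: "increases" or "decreases" (hypothetical labels)
    """
    square = {
        ("presented", "increases"): "positive reinforcement",
        ("presented", "decreases"): "positive punishment",
        ("removed", "increases"): "negative reinforcement",
        ("removed", "decreases"): "negative punishment",
    }
    # Any other combination of labels raises KeyError.
    return square[(stimulus_change, behaviour_change)]


if __name__ == "__main__":
    # A stimulus is removed and the behaviour becomes stronger.
    print(classify_contingency("removed", "increases"))
```

"Positive" and "negative" describe whether a stimulus is presented or removed, while "reinforcement" and "punishment" describe whether the behaviour gets stronger or weaker.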

18
Q

3 essential features of reinforcement

A

The behaviour must have a consequence; the behaviour must increase in strength (e.g. occur more often); and the increase in strength must be a result of that consequence.

19
Q

Tips for shaping behaviour

A
  1. Reinforce small steps
  2. Provide immediate reinforcement
  3. Provide small reinforcers. (Too much food takes too long to eat)
  4. Reinforce the best approximation available
  5. Back up when necessary
20
Q

Operant learning

A

Behaviour operates on the environment.
Behaviour is strengthened or weakened by its consequences. The behaviour is typically instrumental in producing these consequences, so this type of learning is also called instrumental learning.

21
Q

Reinforcement

A

Reinforcement is the procedure of providing consequences for a behaviour that increase or maintain the strength of that behaviour.

22
Q

Escape training

A

Escape training is the reinforcement of a behaviour that ends an aversive stimulus. For example, you come in out of the rain so that you stop getting soaked.

23
Q

Avoidance training

A

In avoidance training, the behaviour is reinforced by preventing or postponing an aversive stimulus. For example, you stay in when you see or read that it’s about to rain.

24
Q

Free operant procedure

A

A free operant procedure is associated with Skinner. The behaviour may be repeated any number of times, so there isn’t an “end” in the same way there is with a discrete trial procedure. For example, a participant may push the lever in one of Skinner’s boxes many times for food within a single session/experiment.

25
Q

Compare and contrast operant and Pavlovian conditioning.

A

In operant conditioning, a stimulus (the reinforcing or punishing consequence) is contingent on a behaviour. It usually involves voluntary behaviour.

Pavlovian conditioning involves one stimulus (the US) that is contingent on another stimulus (the CS). It mostly involves involuntary/reflexive behaviour.

Though different, Pavlovian and operant experiences often occur together, which can make the distinction hard to draw. In the case of Albert and the rat, the fact that Albert reached for the rat just before the loud noise occurred means that operant learning was involved in addition to Pavlovian conditioning.

26
Q

What are primary reinforcers?

A

Primary (unconditioned) reinforcers are those that are not dependent on their association with other reinforcers. Examples would be food, water, sexual stimulation, stimulation of the brain’s “pleasure centres”, relief from hot/cold, and certain drugs.

27
Q

What are secondary reinforcers?

A

Secondary reinforcers depend on their association with other reinforcers. Examples include praise, recognition, smiles, and positive feedback. These reinforcers are conditioned.

28
Q

What are the advantages of secondary reinforcers over primary reinforcers?

A
  1. Primary reinforcers lose their reinforcing value quickly (once you’re full, food works less and less well)
  2. It’s easier to reinforce behaviour immediately with secondary reinforcers (e.g. a clicker versus walking over with food)
  3. Conditioned reinforcers are less disruptive (they don’t take time)
  4. Conditioned reinforcers can be used in many situations, including when the subject isn’t hungry or thirsty
29
Q

What are generalized reinforcers?

A

Generalized reinforcers are those conditioned reinforcers that have been paired with a number of primary reinforcers and can be used in a variety of situations (such as money).

30
Q

What are the 2 types of chaining procedures?

A

Forward chaining builds the chain from the first link onward, always reinforcing the furthest step reached so far. You start by reinforcing step 1; once the learner goes on to step 2, you reinforce only when they reach that step, and so on.

Backward chaining starts training with the last link and then backs up, adding earlier and earlier links.

With both, if the next behaviour does not occur, you reinforce the closest approximation of it (this is shaping) until the learner performs the full behaviour for that step.

31
Q

What conditions affect the effectiveness of a reinforcer?

A
  1. Size (bigger is better, up to a point)
  2. Task characteristics (some behaviours are just harder to reinforce)
  3. Deprivation level (e.g. how hungry is the subject?)
  4. Prior learning
  5. Competing contingencies
32
Q

Hull’s drive-reduction theory

A

Empty explanatory concept.
According to Hull, a reinforcer is something that reduces a drive. Drives arise from physiological needs, so in effect the theory describes primary reinforcers, which reduce those needs. The problem is that there are reinforcers that don’t reduce physiological needs but nonetheless work as reinforcers.

33
Q

2-process Theory of Avoidance

A

Two-process theory says that both Pavlovian and operant learning are involved in avoidance learning. The escape response is operant behaviour maintained by negative reinforcement, but Pavlovian conditioning also comes in: any trigger or sign that the aversive stimulus is coming becomes a CS for fear.

34
Q

1-process Theory of Avoidance

A

One-process theory says that avoidance involves only operant learning. The reinforcer in avoidance is the reduction in exposure to the aversive stimulus. Evidence for this is found in the fact that preventing both the avoidance behaviour and the consequences results in extinction of the avoidance behaviour.