Instrumental Conditioning Flashcards

Question 1

Q

Instrumental Conditioning

Answer

A

If classical conditioning involves forming new reflexive responses, instrumental conditioning involves forming new voluntary behaviours that direct goal-centered actions.

Question 2

Q

Law of effect aka cats in a box experiment

Answer

A

At first and overtime, the frequency of irrelevant behaviour gradually decreased, while the cat’s tendency to perform correctly increased.
Considered this response as “stamped in” when target response is followed.
Random, irrelevant behaviours are “stamped out”.

Question 3

Q

Primary reinforcers

Answer

A

Reinforcer with intrinsic value such as food, water or a mate

Question 4

Q

Secondary Reinforcers

Answer

A

Only comes to be reinforced through previous learning.
EX. Money

Question 5

Q

Generalization and Discrimination

Answer

A

Asks when a contingency is valid
S+ is when the contingency is valid
S- is when the contingency is not valid
EX. pigeon pecks button when it lights green…food is dispensed. But with red light, not food is dispensed..,therefore the pigeon can discriminate colours as it gets in-enforced.

Question 6

Q

Reinforcer

Answer

A

Any stimulus that is presented after a response that impact the frequency that the response is performed.
Positive and negative reinforcers can be removed.

Question 7

Q

Reward training

Answer

A

Presentation of a positive reinforcer leads to increase in positive behaviour
EX. Give a dog a treat when they learn a new trick.

Question 8

Q

Punishment training

Answer

A

Presentation of a negative reinforcer will decrease negative behaviour.
EX. Zapping??

Question 9

Q

Escape training

Answer

A

Removal of a negative reinforcer will increase positive behaviour.
EX. No reed to wash dishes if all homework is done.

Question 10

Q

Omission Training

Answer

A

Removal of positive reinforcer will decrease negative behaviour.
EX. Staying in and not playing outside with friends.

Question 11

Q

Immediate and delayed consequences

Answer

A

Training is most effective when the consequence immediately follows the target behaviour rather than being delayed. This allows an organism to accurately associate the correct behaviour with the reinforcer.

Question 12

Q

Shaping

Answer

A

By successive approximation which can be used to brain complex response.
Broken down into gradual stages of behaviour and reinforcement, a sequence of improvement.
Shaping can drastically reduce acquisition time.

Question 13

Q

Chaining

Answer

A

Develop a sequence (chain) of responses to build even more complex behaviours. The reinforcement is the opportunity to perform the next response.
Rewards start at the final behaviour and move backward in training.

Question 14

Q

Continuous reinforcement (CRF)

Answer

A

A reinforcer follows every correct response made by the individual.

Question 15

Q

Partial reinforcement (PRF)

Answer

A

Where the reinforcer follows only some of the responses.
In cases where reinforcement is not continuous, there are two basic methods for determining when reinforcement will be delivered:
1. RATIO: Made based on the number of responses that are made…although not every response is reinforced, the more responses that are made, the more reinforcements will be obtained.
2. INTERVAL: Based on time elaspsed since the last reinforcer was delivered, not the number of responses made. Aka when the response is made.

Question 16

Q

Fixed

Answer

A

The number of responses required, or the time that must pass before reinforcement is available, is made always the same.

Question 17

Q

Variable

Answer

A

The number of responses (ratio variable) or the interval of time (interval ratio) before a reinforcer is delivered varies randomly.
More resistant to extinction than fixed schedule since individuals are less likely to have expectations about when reinforcement is coming, thus it takes longer to realize reinforcement is never coming again.

Question 18

Q

Fixed Ratio (FR)

Answer

A

reinforcement follows after a fixed number of responses.
EX. FR-5 schedule where a rat is reinforced after completing 5 lever presses.

Question 19

Q

Ratio strain

Answer

A

As the number of responses required for reward increases, the post-reinforcement pause tends to get longer. If the required responses continue to increase, the organism will eventually reach a break point and stop responding completely.
EX. a rat might press a lever 5,10,50 times before they are rewarded only 1 food pellet.

Question 20

Q

Variable Ratio (VR) schedule

Answer

A

reinforcement follows after a variable number of responses have been completed.
EX. Unlike a FR-5 trial where reward is presented after 5 trials, the VR-5 trial is reward on average after 5 trails are completed.
EX. 20 responses completed, reinforcement may follow after response 4,7,11,19.
Produces high, steady rates of responding without the reinforcement cause.

Question 21

Q

Fixed interval (FI)

Answer

A

The first correct response that occurs after a fixed interval of time is reinforced.
EX. FI-1 min schedule where rat is reinforced after at least 1 minute after the last reinforcement was delivered
FI scalloped

Question 22

Q

Variable Interval (VI)

Answer

A

Reinforcement follows the first correct response to occur after a variable interval of time has passed. The average time required characterizes a particular VI schedule.
EX. VI-5 min schedule, rat is reinforced for the first level press that occurs after an average interval of 5 minutes. However, the actual amount of time that must elaspe before reinforcement is avaliable will vary from trail to trail.
—– 30 minutes, rewarded 5 times, in 4,9,16,19,22, 30 minutes

Question 23

Q

Autoshaping

Answer

A

Learning without direct guidance.