Chapter 5 Flashcards
Instrumental Conditioning: Foundations
Define instrumental behaviour:
Behaviour that occurs because it was previously instrumental in producing certain consequences
Define instrumental conditioning
A form of learning in which behaviour is modified by
administering rewards and/or punishments
Behaviourist School of Thought (4):
– Skinner + Watson
– Human behaviour is shaped primarily by the environment
– Learning is a product of reinforcement and punishment
– We are born as blank slates
What is the “equation” for instrumental conditioning?
Voluntary response/behaviour (ex: biting one’s nails)
+
Consequence (ex: punishment)
=
Increase or decrease in the voluntary response (ex: no more nail biting)
What is the “equation” for classical conditioning:
stimulus + stimulus = conditioned reflexive response
In instrumental conditioning, voluntary responses are __
modified
Give the basic procedure of instrumental conditioning (Step 1, Step 2, and consequence):
Step 1: The organism ‘reacts or behaves’
Step 2: A behaviour modification technique is applied
Consequence: The reaction or behaviour either occurs more frequently or is reduced/stopped
Instrumental conditioning can be used to:
produce complex behaviours
Instrumental conditioning is a type of learning in which the __
consequences of behaviour tend to modify that behaviour in the future
Instrumental conditioning:
procedures developed to study instrumental behaviour
Instrumental conditioning rationale: behaviour that is rewarded or reinforced tends to be:
repeated
Instrumental conditioning rationale: behaviour that is ignored or punished is __
less likely to be repeated
Edward L. Thorndike
The first serious theoretical analysis
of instrumental conditioning
Thorndike studied instrumental conditioning using the
puzzle box
Thorndike’s Early Studies
Initially, a lot of behaviours are tried out
* Animal tracks outcomes of behaviours
– S -> R -> O
– In context (S), response (R) produces outcome (O)
* This knowledge guides future behaviours:
– Behaviours with positive outcomes increase
– Behaviours with negative outcomes decrease
Thorndike’s “Law of Effect”
If a response in the presence of a stimulus is followed by a satisfying event, association between
the stimulus (S) and the response (R) is strengthened
Conversely:
If a response is followed by an undesirable event, the
S-R association is weakened
Notes on Thorndike’s Law of effect: The resulting event is ___ of the association
not part
Notes on Thorndike’s Law of effect:The satisfying or annoying consequence serves to ____
Strengthen or weaken the S-R association
Define variables S-R-O:
S=stimuli
R=Response
O= outcome
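The Law of Effect above can be sketched as a simple strength-update rule. This is a hypothetical formalization for illustration only (Thorndike stated the law qualitatively); the function name, step size, and outcome labels are all assumptions:

```python
# Hypothetical numeric sketch of Thorndike's Law of Effect.
# Note: the outcome itself is NOT stored in the association; it only
# serves to strengthen or weaken the S-R bond.

def update_sr_strength(strength, outcome, step=0.1):
    """Return the new S-R association strength after an outcome."""
    if outcome == "satisfying":
        return strength + step            # S-R bond strengthened
    if outcome == "annoying":
        return max(0.0, strength - step)  # S-R bond weakened (floor at 0)
    return strength                       # neutral: no change

# Cat in a puzzle box (S) pulls a loop (R); escaping to food is satisfying,
# so the S-R bond grows and the response becomes more likely next time.
s_r = 0.5
s_r = update_sr_strength(s_r, "satisfying")
```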
What are some methodological problems with Thorndike’s puzzle boxes (5)?
(1) Trials have to be repeated over and over, resetting the animal and the device
(2) Cutoff: what counts as the worst performance?
(3) The main measure (escape latency) decreases with learning
(4) Hard to compare across animals and trials
(5) How do you generate a prediction from latencies?
what are the two types of procedures to study instrumental conditioning?
(1)Discrete-trial procedures (puzzle boxes + maze learning)
(2) Free operant procedures
What are the two types of discrete-trial procedures?
(1) Puzzle boxes
(2) Maze learning (T-Maze, 8-Arm Radial Maze)
What are the different types of mazes?
(1) Runway maze (aka straight-alley maze)
(2) T-maze
(3) 8-arm Radial Maze
8-arm Radial Maze
Often used for memory tasks
Arms are high off the ground and rats are hesitant to walk off; this makes learning more obvious when they DO walk off
Free operant procedures, in comparison to discrete-trial procedures, allow:
more dependent variables; we can look at the RATE of responding
The operant response is defined in terms of its:
effect on the environment
Different types of operant responses:
- Lever-press
- Chain pull
- Nose-poke
- Peck
What are the dependent variables in free operant procedures (3)?
(1) Response rate
(2) Total number of responses
(3) Latency to respond
B.F. Skinner is considered to be:
the leading authority on IC
B.F. Skinner was influenced by:
Thorndike
B.F. Skinner invented the “Skinner box” to:
test IC through shaping
Shaping reinforces:
any movement in the direction of the desired response
Shaping rewards:
gradual successive approximations
Shaping is __ than waiting for the response to occur and then reinforcing it:
quicker
Shaping is used effectively to condition humans and many types of animals
Shaping:
Shaping through successive approximation builds a complex R incrementally
Describe the steps in shaping (3):
- Initially, the contingency is introduced for a simple behaviour (R)
- As the rate of R improves, the contingency is moved to a more complex version of R
- Gradually, this builds a complex R that the animal would never “spontaneously” produce
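The shaping steps above can be sketched as a loop that tightens the reinforcement criterion step by step. All numbers here (criterion values, the variability range, the attempt budget) are illustrative assumptions, not parameters from the course material:

```python
import random

def shape(criteria, attempts_per_step=50, seed=0):
    """Reinforce successive approximations: walk through increasingly
    strict criteria, reinforcing any response that meets the current one.
    Returns the list of criteria the animal actually met."""
    rng = random.Random(seed)
    reached = []
    baseline = 0.0  # how close the animal's typical response is to the goal
    for criterion in criteria:
        for _ in range(attempts_per_step):
            # Shaping depends on inherent response variability:
            response = baseline + rng.uniform(0.0, 0.3)
            if response >= criterion:
                baseline = criterion  # reinforcement raises the baseline
                reached.append(criterion)
                break
        else:
            break  # criterion moved too fast; this step was never met
    return reached

# approach -> orient -> touch -> paw on lever -> full press
steps = [0.2, 0.4, 0.6, 0.8, 1.0]
shaped = shape(steps)
```

Note the failure mode built into the sketch: if a criterion jumps farther than the animal's response variability can reach, shaping stalls, matching the card below that shaping and chaining cannot move too fast.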
Chaining:
Chaining builds complex R
sequences by linking together
S–>R–>O conditions
Describe the chaining process (3):
- Initially, train the animal to pick up an object
- Next, reward it for picking it up and then throwing it
- Continue linking additional responses in the same way to build the full sequence
Chaining allows:
A series of behaviours
(as opposed to shaping, which simply elaborates on a single response)
Shaping and chaining can be used together to:
Train animals to complete incredibly complex behaviours
Shaping and chaining cannot:
move too fast
Shaping involves
combining familiar response
components into a new activity
Shaping depends on:
inherent response variability
How To Get a Rat To Lever Press: Shaping:
- Magazine/food port training (“food is available here!”)
- Shaping:
– Define the final response
– Identify the starting point of the behaviour
– Divide the progression from starting point to final point into a series of steps – a training plan
– Reinforce successive approximations of the final behavioural response (and non-reinforcement of earlier response forms)
In the Skinner box, the animal is:
free in the chamber, with no experimenter intervention -> free operant learning!
Positive reinforcement:
Press lever (R) -> Get food
Negative punishment:
Press lever (R) -> Food stops
Negative reinforcement:
Press lever (R) -> End shock
Positive punishment:
Press lever (R) -> Get shocked
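The four procedures above can be summarized in one lookup table keyed by what the response does to the stimulus and what kind of stimulus it is (the dictionary layout is just an illustrative summary):

```python
# The four instrumental-conditioning procedures.
# Key: (what the response makes the stimulus do, stimulus type)
# Value: (name of procedure, effect on the rate of the response)
PROCEDURES = {
    ("appear",    "appetitive"): ("positive reinforcement", "increases"),
    ("disappear", "appetitive"): ("negative punishment",    "decreases"),
    ("disappear", "aversive"):   ("negative reinforcement", "increases"),
    ("appear",    "aversive"):   ("positive punishment",    "decreases"),
}

# Press lever (R) -> get food: food appears and is appetitive.
name, effect = PROCEDURES[("appear", "appetitive")]
```

Reading the table: "positive/negative" names what happens to the stimulus (appears/disappears), while "reinforcement/punishment" names the effect on the response rate (increases/decreases).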
Describe IC in the Skinner box:
(1) Initially, the rat tries many things; eventually, it accidentally presses the lever, producing a positive effect
(2) Now it starts hanging around the lever and accidentally presses it again
(3) The rat has learned a contingency: if light on (S), pressing lever (R) -> food (O); it spends much of its day pressing and eating
Basic Pattern of IC: Pre-training:
Low spontaneous rate of R
Basic Pattern of IC: Training:
Contingency is introduced:
* If S, R -> O
Basic Pattern of IC: Acquisition:
– Animal discovers contingency
– Rate of R increases
Basic Pattern of IC: Extinction:
Contingency is eliminated: R -> __ (no outcome)
Rate of R decreases
Basic Pattern of IC: R has a __ initial rate:
R has a LOW initial rate; the animal must discover the contingency
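The pre-training / training / acquisition / extinction pattern can be sketched with a toy simulation (the trial counts, learning rate, and starting probability are all illustrative assumptions):

```python
def simulate(pre=20, training=60, extinction=60, lr=0.2):
    """Track response probability across the phases: low spontaneous
    rate -> rises once the contingency is introduced (acquisition) ->
    falls once the contingency is eliminated (extinction)."""
    p = 0.05  # low spontaneous rate of R before training
    rates = []
    for trial in range(pre + training + extinction):
        contingency_on = pre <= trial < pre + training
        target = 1.0 if contingency_on else 0.0
        p += lr * (target - p)  # simple error-correction update
        rates.append(p)
    return rates

rates = simulate()
# With the defaults, the rate climbs during trials 20-79 (training)
# and declines once the contingency is removed at trial 80.
```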
__ occurs in IC
generalization
Generalization:
Responding to other, similar
stimuli
Example of generalization:
Pigeons respond to different colours of disks; the less similar the colour, the lower the pecking rate
Discrimination:
learning to distinguish
between a stimulus that has been reinforced and others that may be similar
Instrumental Conditioning: Influencing Factors (5):
(1) ‘Quality’ of the outcome (appetitive stimulus / aversive stimulus)
(2) Relationship between the instrumental behaviour and the outcome (positive or negative contingency)
(3) Magnitude of reinforcement
(4) Immediacy of reinforcement
(5) Level of motivation
Influencing factor: What are the two different “quality of the outcome” factors:
(1) Appetitive stimulus: “pleasant” event or outcome in the context of instrumental conditioning
(2) Aversive stimulus:‘unpleasant’ event or outcome
in the context of instrumental conditioning
define appetitive stimulus:
‘pleasant’ event or outcome
in the context of instrumental conditioning
Define “aversive stimulus”:
‘unpleasant’ event or outcome
in the context of instrumental conditioning
Influencing factors: What are the two relationships between the instrumental behaviour and the outcome:
(1) Positive contingency
(2) Negative contingency
Define positive contingency:
The instrumental response
causes an outcome/stimulus to APPEAR
Define negative contingency:
The instrumental response
causes a stimulus to DISAPPEAR or be ELIMINATED
Influencing factors: 3. Magnitude of reinforcement: As magnitude increases (3):
(1) Acquisition of a response is faster
(2) Rate of responding is higher
(3) Resistance to extinction is greater
ex: people work harder for $30/hr than $10/hr
Influencing factors: 4. Immediacy of reinforcement: describe two points or “rules”
(1)If reinforcement is immediate, responses are
conditioned more effectively
(2)As a rule, the longer the delay in reinforcement,
the more slowly the response will be acquired
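The two delay rules above are often formalized with Mazur's hyperbolic discounting equation, V = A / (1 + kD): the longer the delay D, the smaller the effective value V of a reinforcer of size A. The k value below is an illustrative assumption:

```python
def discounted_value(amount, delay, k=0.5):
    """Effective value of a reinforcer of size `amount` delivered after
    `delay` (hyperbolic discounting). Longer delays shrink the effective
    value, which is one way to model why delayed reinforcement
    conditions responses more slowly."""
    return amount / (1 + k * delay)

immediate = discounted_value(10, 0)  # full value: 10.0
delayed = discounted_value(10, 4)    # same reinforcer, worth only 10/3
```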
Influencing factors: 5. Level of motivation: describe:
Higher motivation leads
to faster learning
Changes in instrumental behaviour are determined by:
The nature of the outcome, and whether or not the outcome is presented or eliminated
Define reinforcement:
Where the relationship between the response (R) and the outcome (O) INCREASES the probability of the response occurring
Define punishment :
Where the relationship between the response (R) and the outcome (O) DECREASES the probability of a response occurring
Reinforcement:
Anything that STRENGTHENS a response (or increases the probability that the response will occur)
What are primary reinforcers?
Primary reinforcers fulfill basic physical needs for survival
Primary reinforcers do not depend on :
learning