final Flashcards
conditioned inhibition
the CS announces no US
conditioned inhibition example
good response > click!; wrong response > no, thank you!
conditioned inhibition dog trainers
call this a “non-reward marker” — something signals nothing will happen (good or bad)
safety signals
can be used to calm down — when this is present, nothing happens
conditioned inhibition definition
procedure in which a CR elicited by a CS is inhibited when another concurrently trained CS signals the absence of a US
CS can be…
excitors or inhibitors — these constructs are crucial in our discussion of motivation (they can either slow things down or accelerate things)
conditioned excitation
CS being associated with a US
Method 1 (intuitive): differential inhibition or discriminative inhibition procedure
classical conditioning: simply pairing a CS with a US or not (as with habituation/dishabituation). Based on a pairing
differential/discriminative inhibition pairings
CSa <> US
CSx <> no US
CSa = clicker, CSx = note
differential/discriminative inhibition animal responds to
Animal responds only to CSa…in other words, it can discriminate between the two
differential/discriminative inhibition: two sounds
What if CSa is C (musical note “do”) and CSx is G (musical note “sol”)? Trying to discriminate between two sounds: condition one with food and the other without. If the animal salivates, it heard C, therefore not G — it discriminates between the two sounds.
Method 2 (could be competing): conditioned inhibition procedure
getting two reinforced stimuli to compete
conditioned inhibition procedure trials
US is presented (reinforced trials): CSa > US
US is omitted (non-reinforced trials): CSa + CSx > nothing
conditioned inhibition outcome
Subjects respond to CSa
Subjects do not respond to CSa + CSx
Subjects also do not respond to CSx (= conditioned inhibitor!)
CSx by itself signals no US (food)
conditioned inhibition CSx
CSx on its own never predicted the food.
conditioned inhibition designed by
Pavlov
information value in conditioning
CS provides information on the US
rescorla experiments
contingencies between CS (tones) and US (shock)
rescorla experiments groups
3 groups:
positive, zero, negative contingency
what is key in rescorla
probability of CS followed by US
rescorla positive contingency
if CS is followed by US in a relatively predictable way — excitation
rescorla zero contingency
completely unpredictable
rescorla negative contingency
inhibition — the reverse of positive contingency. If the US is not immediately preceded by the CS (almost like backward conditioning) — inhibition
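A minimal sketch of the contingency idea in code (the probability values are illustrative assumptions, not from the slides): compare P(US | CS) with P(US | no CS).

```python
# Hedged sketch: Rescorla's contingency as a comparison of two conditional
# probabilities. The probabilities passed below are made up for illustration.
def contingency(p_us_given_cs: float, p_us_given_no_cs: float) -> str:
    """Positive delta -> excitation; negative -> inhibition; zero -> no learning."""
    delta = p_us_given_cs - p_us_given_no_cs
    if delta > 0:
        return "positive contingency (excitation)"
    if delta < 0:
        return "negative contingency (inhibition)"
    return "zero contingency (no learning)"

print(contingency(0.8, 0.2))  # CS predicts US -> excitation
print(contingency(0.5, 0.5))  # US equally likely either way -> no learning
print(contingency(0.2, 0.8))  # US less likely after the CS -> inhibition
```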
appetitive conditioning
cues can announce something pleasurable
appetitive conditioning example
Coyotes associate the human scent or voice with food (litter on trails)
aversive conditioning
Cues can announce something not pleasurable
aversive conditioning example
Humans use human scent with hazing methods to scare the coyotes away
powerful appetitive USs
food, sex, drugs
aversive control, appetitive conditioning
Get coyotes afraid of what they are supposed to avoid. Coyotes predicted the presence of food from humans; food and humans became the same thing — this classical conditioning was the problem. It became operant when the coyotes actually acted on the human to obtain the food (an attack).
aversive conditioning, hazing program
A hazing program may not be successful with coyotes that have already formed that association. It could be useful in young coyotes, to begin conditioning them that humans are dangerous. You have to be consistent in these pairings.
main method of extinction
you stop pairing of CS-US
is extinction unlearning or forgetting?
Yes, but it is also learning a new response — learning what happens when the CS is followed by a US of one nature versus another.
extinction is actually
Responses are suppressed or not expressed
extinction is likely just..
a suppression of the response. It is not permanent; if the right conditions come back together, the response returns
renewal effect
CS-US pairing in context X
Extinction in context Y — now bringing the organism to a different environment on purpose to extinguish the response
CS in context X > renewed
renewal effect two associations
- CS — US
- CS — no US
extinguishing is…
context specific — extinction must be done in the context of acquisition
extinguishing is important in
therapy: e.g., a client might be able to handle the feared stimulus in the therapist’s office but not in the real environment
If you acquired a fear of explosions in war, being exposed just to the sound in a clinic may not help you
extinction is NOT
loss of information
compound conditioning/CS
The presentation of 2 or more CSs, either
- simultaneous
- serial
simultaneous compound CS
CSs presented at same time
serial compound CS
CSs presented in a sequence
compound conditioning example
Tone + light > US (food)
This will condition normally, as if one S
But, if later, the tone is tested alone…the CR won’t be as strong — the compound stimulus will be stronger
If you really want a good response to the tone, condition the tone alone
isolating one part of compound conditioning suggests that
cues CSs are competing
compound conditioning hints at
the importance of having salient cues
salient cues
Salient cues = more intense. The more intense cue is more likely to get conditioned and be noticed
importance of saliency related to what theory
mackintosh attentional theory of conditioning
CS competition — how to win the competition?
- Be more noticeable > overshadowing (because of saliency)
- Be there sooner > blocking (because of temporal priority) — when something happens first, in the sequence of conditioning
overshadowing
- More salient cues get more learning in compound conditioning, still less than if trained alone
- This brings us back to the idea of the information value of cues (a cognitive interpretation)
- If you really want to have a clear effect of one CS, try to isolate that completely and train it alone
Neural networks/information theory involved here in classical conditioning
compound conditioning example
CSa + CSb > US is compound conditioning, ie. competing CSs
when two CSs compete, if one is more salient it will…
overshadow the less salient ones
compound conditioning analogy
Picture book for kids. The book contains text and pictures, but the pictures will most likely overshadow the text
In other words, learning to read this way is not optimal
compound conditioning spatial learning
Similarly, in spatial navigation, landmark cues may overshadow more general spatial cues — most of the time when you navigate proximal cues influence you most. The most important cues are the most salient
compound potentiation
This is the opposite of overshadowing
In this case, there is more conditioning to a weak conditioned stimulus if it is combined with a more salient conditioned stimulus during conditioning
compound potentiation example
The conditioning of a weak odour may be strong if it is combined with a salient taste during conditioning
Eg. Gadbois’s aversion to even the smell of whisky, from his taste aversion to whisky
Taste and odour go together — they are intimately linked stimuli. Smell is usually the early CS that something good/bad is coming.
kamin’s blocking effect example scenario
Scenario: you visit your in-laws every Sunday. Your FIL offers an anise-based liqueur, and you drink it every time even though it makes you sick. One day, he offers you bagels with the anise.
Will you develop an aversion to bagels? No. Even if you get a bit sick, you will attribute your malaise to the anise not the bagels.
kamin’s blocking effect assigned variables
CSa: anise
CSb: bagels
kamin blocking
you were exposed to the aversive CS first, then the bagel (neutral) was combined with it. Only the aversive CS maintains the negative response.
kamin original experiment
The experimental group gets the pairing as described above
When bagels are presented without anise (Test phase) almost no CR
Less response (CR) in the experimental group than the control group
kamin what does prior conditioning do
prior conditioning with the anise has blocked conditioning to the bagels
Basically, the anise already predicted the sickness. Adding the bagels did nothing.
kamin what is the control group
the basic compound conditioning explained previously
original experiment control group
compound conditioning group
original experiment experimental group
pre-trained
table in slides
basics of classical conditioning slide 56
Is overshadowing different from blocking?
Yes — overshadowing occurs in only one conditioning phase (there is no separate pre-training phase, unlike blocking)
overshadowing application
See application mentioned by Boynton for chemotherapy: scapegoating
Give patients a salient new flavour just before chemo (eg. Ginger candy)
This new flavour will overshadow any other flavour eaten in recent meals (just before the chemo)
The ginger candy becomes the scapegoat…problem is…the patient may never be able to stomach ginger candies in the future
information provider
- The CS-US association is an information provider unless the informational value is blocked
- to be useful, a cue needs to be non-redundant (eg. Needs to be a non-redundant predictor)
- Redundant cues are typically not very informative
learning only occurs if…
the CS provides new information on the US
CSs may
compete with each other
a more informative CS will win over
a less informative CS
Two areas of experimental psych have produced solid quantitative models, even “laws”
- Psychophysics and the study of sensation and perception
- Learning theory and the study of conditioning and learning
The Rescorla-Wagner model
- the best-known mathematical model in experimental psych; it has been applied beyond learning theory
- Tries to explain blocking
- A lot about the unconditioned stimulus
The Mackintosh model
- A lot about conditioned stimulus
- Attentional theory (saliency)
The Pearce-Hall model (including Kaye & Pearce)
US and CS modulation, attention, negative transfer
Wagner’s original model (SOP) and AESOP
Priming (Attentional priming), affective dimension
Rescorla-Wagner model
We know that cues compete with each other for associative strength
We know and will reaffirm that contiguity (in time and space) is not the full story. Information produced by the cues needs to be:
Reliable
Non-redundant
Rescorla-Wagner model counterintuitive
the difference between phase 1 and phase 2. All experiments in learning theory have an acquisition phase and what comes after. In the acquisition phase, you do want the CS/US pairing to be tight and predictable. Later, if the CS always announces the US, your animal loses attention (redundancy). Then work with the saliency of the CS/US, or make the pairing less predictable. Uncertainty is what facilitates learning after a while (NOT initially)
error detection and error correction
- Prediction error: The difference between what is predicted, and what actually happens
- Error detection: Ability to detect errors or discrepancies between the predicted and actual occurrences (detect prediction errors)
- Error correction learning: Similar to trial-and-error learning. Learning to reduce the error or discrepancy between predicted and actual occurrences
error correction learning
- This type of learning applies to classical conditioning and many other types of learning, ie. skill and motor learning
- This includes prediction error and error detection
pavlovian classical experiment follows this scheme:
Bell — Food; Salivate (CR) — Salivate (UR)
Or
CS — US; CR — UR
when conditioning to one CS generalizes to another we have…
generalization
generalization graphic
Historically: two views on classical conditioning
SR learning
SS learning
SR learning
CS-UR ~ note that this form of learning is known to occur in some cases
SS learning
US » CS = stimulus substitution theory or “cognitive” theories of classical conditioning; this view was confirmed by Rescorla in 1978.
SR vs. SS learning graphic
Is it possible to respond to a CS (e.g., tone) that was never paired with the US (e.g., food)? (method)
One method can help answer this question: US devaluation
Is it possible to respond to a CS (e.g., tone) that was never paired with the US (e.g., food)? (forms of learning)
- Second order or higher order conditioning
- Sensory pre-conditioning
conclusion about US-CS?
association is essential!
figure from textbook?
???
is sign tracking resistant to outcome devaluation?
Sign tracking but not goal tracking is resistant to outcome devaluation
We will see later that cues paired with a US/reward can…
gain “incentive salience”
what happens if you get a temporal separation of the CS and US in classical conditioning
you will decrease sign tracking, but increase goal tracking.
second-order conditioning and sensory preconditioning graphics
conclusion, learning can occur…
without a US > pro-S-R learning
second order conditioning example
You get bitten (US) by a dog (CS) » Fear (CR)
Then later you associate the park (CS2) with dogs and get fearful (CR) entering the park
second order conditioning graphic
sensory preconditioning graphic
sensory preconditioning appears to be…
pro S-S learning (see below: the target CS is never paired with the US)
sensory preconditioning analogy
“Guilt by association” analogy
- Peter and John hang-out together. Peter is arrested for using cocaine. You assume John is using cocaine.
what happens with sensory preconditioning
Two neutral stimuli get associated: the target CS (tone) never gets paired with the US, yet comes to evoke the CR.
In sensory preconditioning, the two stimuli are paired before the light can even elicit a CR. By that logic, it is a case of S-S learning.
sensory preconditioning is therefore
pure S-S learning
generalization
concise def
responding to a new stimulus because it resembles the trained CS (the CR transfers to similar stimuli)
higher (second) order conditioning
concise def
a second CS (CS2), paired only with an established CS1 and never with the US, comes to elicit the CR
sensory preconditioning
concise def
two neutral stimuli are paired first; one is then conditioned to a US, and the other then elicits the CR despite never being paired with the US
methods in classical conditioning: motor responses
- Proboscis Extension Response (PER) conditioning to artificial odours in bees
- Eyeblink conditioning in rabbits
methods in classical conditioning: emotional responses
- Fear conditioning in rats
- Autoshaping in pigeons
methods in classical conditioning: motivational responses
- Appetitive conditioning in rats
- Taste aversion learning
eyeblink conditioning in rabbits associations
UR = eyeblink
CR = eyeblink
US = puff of air
CS = tone/light
fear conditioning/threat conditioning/conditioned emotional response/conditioned suppression associations
UR = fear (bar pressing is the baseline behaviour, from prior operant training)
CR = fear/suppression of bar pressing
US = shock
CS = tone/light/noise
conditioned fear
the fear response is often “freezing”.
the CS suppresses the bar pressing, therefore
The response (pressing the bar) is suppressed.
in conditioned fear, the suppression is
the dependent variable, i.e., a measure of conditioned fear » suppression ratio
suppression ratio
SR = responding during the CS / (responding during the CS + responding pre-CS)
When the CS does not change the bar pressing rate, the S.R
= 0.5. A complete suppression is 0.
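A minimal sketch of the computation (the response counts are hypothetical):

```python
# Hedged sketch of the suppression ratio; response counts are made up.
def suppression_ratio(presses_during_cs: float, presses_pre_cs: float) -> float:
    """SR = responding during CS / (responding during CS + responding pre-CS)."""
    return presses_during_cs / (presses_during_cs + presses_pre_cs)

print(suppression_ratio(20, 20))  # 0.5 -> the CS does not change bar pressing (no fear)
print(suppression_ratio(0, 20))   # 0.0 -> complete suppression (strong conditioned fear)
```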
fear conditioning is an important concept in…
CBT for fear and anxiety disorders, even some elements of PTSD.
conditioned emotions are important in…
motivational aspects of instrumental conditioning
autoshaping in pigeons: sign tracking
focus on the CS’s, or cues predicting the US.
autoshaping in pigeons: goal tracking
focus on the USs
autoshaping therefore is actually
sign tracking
autoshaping is a convenient way to train animals in
Skinner boxes. Alternative: successive approximations
signal-food conditioning protocol
- You light the “pecking key” for a few seconds, then the pellet magazine opens.
- After a number of pairings (45 or so), the pigeons will peck at the key (it announces the delivery of food, and this is sign tracking — the cue becomes associated with the food).
Note: the food delivery is not contingent upon the pecking. The pigeon does not have to peck the key.
appetitive conditioning in rats
- This is the “magazine approach procedure”: CS (any) - US (food)
- The approach is more likely to happen when the CS is presented. This is goal tracking as defined earlier.
taste aversion learning associations
UR = nausea
CR = nausea
US = drug injection (emetic drug or nausea/vomiting inducing)
CS = flavour S (saccharin)
taste aversion learning CS/US
CS-US can be separated in time by hours and still the conditioning will take place.
taste aversion clear and dramatic example
Clear and dramatic example in humans: observed with chemotherapy.
strength of conditioning factors
Time / timing
Novelty of CS and US
Intensity of CS and US
Pseudoconditioning
four main types of CS/US associations
- Delay conditioning
  - Short-delay
  - Long-delay
- Trace conditioning
- Simultaneous conditioning
- Backward conditioning
- Other factor: Trial spacing
types of conditioning graphic
conditioning works better if…
the CS occurs before the US.
the CS _____ the US
announces
does backward conditioning work well?
typically no
delay conditioning
The interval of time depends on the type of conditioning.
delay conditioning example
Example: Eyeblink (very short) vs taste aversion (very long) conditioning.
trace conditioning
Gap between CS and US = “trace interval”
is trace conditioning good
Fine procedure, but decreased efficiency with increased delay.
trace conditioning possible issues
- The memory of the CS.
- Inability for the animal to differentiate between the trace interval from the time between trials.
simultaneous conditioning
Full temporal overlap of the CS and US.
simultaneous conditioning validity
Usually not great results, but some exceptions.
backward conditioning
May be a signal of “no US” (i.e., a conditioned inhibitor).
backward conditioning may be associated with…
with relief in fear conditioning.
= “safety signal”
conditioning works best if…
trials are spaced-out over time (spaced trials); The ITI is crucial.
ISI?
interstimulus interval
ITI
intertrial interval
are massed trials effective
no
summary of conditioning
Short ISI’s
Long ITI’s
ratio between ITI and duration of CS graphic
novelty of the CS and US is…
very important: Pre-exposure to CS and US before conditioning can interfere with learning.
CS pre-exposure
“latent inhibition”. You can habituate to a potential CS with repeated exposures.
US pre-exposure
Randich & LoLordo: US pre-exposure will delay subsequent conditioning. You can habituate to a potential US with repeated exposures.
CS and US pre-exposure both have
real-life implications, e.g., for fear/phobia acquisition.
intensity of the CS/US: US
The CR will be strong if the US is strong. So the intensity of food or shocks, etc., will influence the magnitude of the response.
intensity of the CS/US: CS
Strong salient CS’s will positively influence learning as well. So a strong tone, or flash of light, etc., will influence the magnitude of the response.
what is best for intensity of CS/US?
With CS’s salient, but not scary (overpowering) is good.
Counterfeit (fake) conditioning: Pseudoconditioning and sensitization
: What if the CS elicits a response like the one that you are trying to condition?
(Example)
You use CS = light.
Response (UR) = blinking.
Natural blinking or conditioned blinking?
So what is going on? » impossible to tell
Counterfeit (fake) conditioning: Pseudoconditioning and sensitization : culprit 1
Sensitization
Counterfeit (fake) conditioning: Pseudoconditioning and sensitization: culprit 2
Pseudoconditioning
counterfeit/pseudo: sensitization example
Blinking for the camera. Initially a natural response, can get sensitized and anticipated.
counterfeit/pseudo: pseudoconditioning example
Mere exposure to the US!
“Increased responding that may occur to a stimulus whose presentations are intermixed with presentation of a US in the absence of the establishment of an association between the stimulus and the US”. (Domjan, 2015) Experiments need to control for both sensitization and pseudoconditioning.
Rescorla and Wagner model: If no CS or a novel CS is followed by a US
- The US will be unexpected
- You have a positive prediction error
- The larger the error, the greater the learning!
Rescorla and Wagner model: If a well learned CS is followed by the expected US
- The US is expected…
- There is no prediction error (because there is no discrepancy)
- There is no learning.
Rescorla and Wagner model: If there is a CS-US association established, but then no US occurs
- The absence of the US will be unexpected
- You have a negative prediction error
- There is a decrease in CS-US association
- This is extinction
What happens if you train a dog so that every time the CS is produced, the US follows (clicker training, phase 1)?
The theory is that technically the CS (clicker) itself becomes reinforcing. After a while the dog might not even need the treat; the click is reinforcing enough. But if the association between the US and CS is weakened, the clicker would not remain as good a predictor over time and would lose its reinforcing value.
Rescorla and Wagner ___ predict that the CS can be the ____ itself
do not, reward
rescorla and wagner…
do not think clicker training would work if the clicker is used in isolation
Basic assumptions of RW
- Each CS has an association weight, or strength of association, with the US.
- Cues (CS’s) compete for associative strength.
- This tells us that the information, or surprise value, of the stimuli involved is important
key element in RW
the element of surprise!
what is surprise RW?
- What is surprise? It is what is unexpected.
- Part of this may be about attention — how much attention are you giving to elements in that model
- Either you think it’s coming and it’s not, or vice versa, or it is a completely novel experience
premise of surprise RW
The US needs to be “surprising”. With time, the US, in any conditioning sessions, becomes less and less “surprising”.
why is surprise needed?
Why? Because the CS becomes a solid predictor of the US.
example of CS being solid predictor of US
clicker training in dogs
learning wont occur anymore when…
the CS predicts the US perfectly.
suprise clicker training example
In clicker training, your treat pouch should be full of a whole bunch of treats of different value (surprise element). Slight variation of the US is good — this is for appetitive learning.
This could apply to a person as well. If a CS announces a shock, the same principles apply, especially if it is always the same small shock. If you start varying the intensity of the shock, the person will start paying attention again.
If the CS/US link is highly predictable there is…
no learning; continued learning depends on not being exactly sure how things will unfold
what happens with repetition in learning
Repetitions get boring after a while; if you can always predict things, you start getting bored. That means, though, that you have already learned the task. But how do you keep a strong response/motivation? This is where unpredictability might help.
- You don’t want the response to start to extinguish
- But if you maintain the link between CS/US it shouldn’t extinguish
how do you avoid extinguishing the connection
maintain the link between CS/US
Does it matter if “learning does not occur anymore”?
you are maintaining the strength of the association… so maybe not.
Does that mean that you can drop the US?
- Then you create a negative prediction error (see above)
- There is a decrease in CS-US association…
- So a US is important (at least some of the time…)
what is important in surprise/learning
Uncertainty/intermittent reward is good for this
If you drop the US you get…
a negative prediction error, you are affecting the CS/US relationship
mathematical model, a.k.a
the neural network-like model:
mathematical model: the peak of learning is
the associative strength between the US and the CS.
how is associative strength developed
Based on strengthening the relationship between the CS and US. The more you do click/treat, the more the dog learns over time that the click predicts the treat.
learning ceiling is denoted by ____, it is
This learning ceiling effect is called lambda (λ); it is the asymptote of the learning curve.
what is the learning ceiling
When dog completely knows that click means treat
The asymptote is very much determined by the value of the US (whether it is good or bad)
If it were about shocks, a more intense shock probably gets you to a higher lambda.
what is really important in lambda
It is all about the US — that is the important part of this. The value of the US is very important.
the associative strength of the US and CS is V (y axis) =
predictive value
the asymptote is determined by
the value of the US
Change in associative strength (formula)
ΔV = αβ(λ − V)
α
CS = α (it comes first) = salience of the CS
β
US = β = salience of the US
V
V = associative strength = learning parameter = how well the CS predicts the US
λ
λ = ceiling effect (how much the animal is going to learn) — stands for the US
Δ
Δ = delta = change
(λ − V)
(λ − V) = surprisingness of the US = prediction error
ΔV = αβ(λ − V) is pretty much
the Rescorla Wagner model
(λ − V)
The difference between the US and what the CS predicts (λ − V) is called the prediction error.
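A hedged sketch of the update rule in code (the parameter values are illustrative assumptions, not from the slides). It also shows extinction, where λ drops to 0 once the US is omitted:

```python
# Hedged sketch of the Rescorla-Wagner update, dV = alpha * beta * (lambda - V).
# alpha = 0.3 and beta = 0.5 are assumed values for illustration.
def rw_update(v: float, alpha: float, beta: float, lam: float) -> float:
    """One trial: V moves toward lambda by a fraction of the prediction error."""
    return v + alpha * beta * (lam - v)

v = 0.0                    # no associative strength before conditioning
for trial in range(10):    # acquisition: CS -> US on every trial, lambda = 1.0
    v = rw_update(v, alpha=0.3, beta=0.5, lam=1.0)
    print(f"acquisition {trial + 1}: V = {v:.3f}")  # negatively accelerated curve toward lambda

for trial in range(10):    # extinction: the US is omitted, so lambda = 0.0 and V decays
    v = rw_update(v, alpha=0.3, beta=0.5, lam=0.0)
    print(f"extinction {trial + 1}: V = {v:.3f}")
```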
as trials of CS-US associations accumulate
the CS becomes a better and better predictor of the US.
the more pairings of the CS/US…
the easier it is for the animal to learn
with time, the prediction error becomes
smaller and smaller
classical conditioning =
adjusting (reducing) the prediction error
If you have a large (magnitude) US (β), lambda will be…
larger too (higher asymptote)
More likely to hit the lambda with…
a strong/salient US
So how can you get the blue line in clicker training? (larger magnitude)
- Use a strong US: e.g., in dog training, liver treats instead of regular kibble
- Likewise, it is easier to get nervous about the possibility of a shock when the shock is very painful
can lambda move?
Lambda can move (you can bypass the ceiling of 1.0), all depending on the strength of β
if the CS is really salient (α)…
the learning will be faster (i.e., it affects the rate of learning).
how can you get faster learning
get a salient CS (α)
salient CS example
e.g., in dog training, a stronger, louder, better defined (crispier) clicker sound.
strong CS, great US =
lot of learning and fast learning.
can boredom occur over time
With time it is possible things get boring; boredom can take place. It is technically possible that after a while these pairings get boring. Then you could replace the clicker with a whistle or something to add an element of surprise/novelty.
A strong US (liver treats), or β…
will impact the associative strength/asymptote or lambda (λ).
A salient CS (crisp click), or α…
will impact the rate of learning.
is it possible for other CSs to be present during conditioning?
yes
If 2 or 3 CS’s are present…
they all contribute to the conditioning (and may compete… as you may remember with blocking)
how did they add the sum of associative strengths to the formula?
This is why they added sigma (Σ) to the formula — the sum of the associative strengths — acknowledging that multiple CSs contribute to, and compete for, the total associative strength.
when might a number of associative strengths be at play
In the acquisition of a traumatic experience it might not always be clear what caused it, but rather an association of many things
compound conditioning is related to
the potentiation effect (see compound potentiation above)
formula with compound conditioning
ΔV = αβ(λ − ΣV)
what does sigma refer to
the sum of the associative strengths of all CSs present
the Kamin blocking experiment: phases
- Phase 1: The light alone is paired with the shock (pre-training)
- Phase 2: The tone competes with the light
- Phase 3: The tone loses (almost no CR to the tone at test)
why does the kamin experiment play out that way
Why? The light already accurately predicts the US (shock).
- The light already acquired the strength of the CS/US pairing
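A hedged sketch of blocking using the compound rule ΔVi = αi·β(λ − ΣV); the saliences, β, and trial counts are illustrative assumptions:

```python
# Hedged sketch of Kamin blocking under the Rescorla-Wagner compound rule.
alpha = {"light": 0.3, "tone": 0.3}   # assumed, equal saliences
beta, lam = 0.5, 1.0
V = {"light": 0.0, "tone": 0.0}

for _ in range(30):          # Phase 1: light alone -> shock; V_light climbs toward lambda
    V["light"] += alpha["light"] * beta * (lam - V["light"])

for _ in range(30):          # Phase 2: light + tone -> shock; both share one prediction error
    error = lam - (V["light"] + V["tone"])
    for cs in V:
        V[cs] += alpha[cs] * beta * error

print(V)  # V_light near 1.0, V_tone near 0.0: the pre-trained light blocks the tone
```

Raising lam in phase 2 (a bigger US) would make the shared error positive again — the unblocking idea covered below.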
Medical diagnosis of dietary (GI) intolerance example
Phase 1: Patient drinks milk » stomach ache
Phase 2: Patient drinks milk and has garlic » stomach ache
Phase 3: Could the garlic cause the stomach ache?
Diagnosis: “Unlikely, eat garlic, but avoid milk”
Think of this example: Medical diagnosis of dietary (GI) intolerance – implication (garlic/milk)
Implication (cognitive): Clearly the physician is thinking of a milk allergy here. But… if she/he had considered that both milk and garlic are common triggers for IBS (therefore a milk — and garlic — intolerance, not a milk allergy), then garlic would have been seen as a potential player.
- Have to test things one by one. If you start combining things you will never be able to isolate the cause — not just because of the metabolic response, but also the chance of classical conditioning
Suppression ratio
There is less learning with a high suppression ratio.
If a new stimulus is presented with an other that was previously conditioned…
then little or no conditioning will occur (resulting in a high suppression ratio).
Remember that you initially pair a tone or noise (CS) to a shock (US).
Then, the same noise AND a light are paired to the shock (US).
the outcome is…
that the response to the light is not well acquired (high suppression ratio).
what did the RW model address
The Rescorla-Wagner model explained blocking — a big victory. However, it does not explain a lot of other things.
how did the RW model explain blocking
By showing that the second stimulus does not acquire associative value in the second phase
how to unblock
modulate the US (saliency of US is the focus of the model)
UNBLOCKING =
Increasing the US (shock) in phase 2.
neural network explanation
???
what decreases during extinction
In other words, V, or the associative strength, will decrease during extinction
what happens to lambda in extinction
So lambda becomes 0 (there is no US anymore)
what happens when you reintroduce a US
There is very strong spontaneous recovery as soon as you reintroduce the US. What may actually work is to re-pair the clicker with the food (as a marker or keep-going signal). The fear is that if you never pair the clicker with the US again, it may change the meaning of the click — the meaning of the click is how it was associated with the US. What if sometimes you click and it is not followed by food, and sometimes you click and it is? The model does not address that.
Conditioned inhibition à la Pavlov:
L (CS) → Shock (US)
L (CS) + N (CS) → no US > the noise becomes an inhibitor
Protection from extinction
If you combine an inhibitor with an excitor during extinction, the inhibitor will protect the excitor from losing associative strength (V).
what does protection from extinction help to explain
This may explain relapse in exposure therapy for fear or phobias — you would try to extinguish the link
in exposure therapy for fear/phobias you would…
extinguish excitors. While doing so, you may have cues that become inhibitory.
initially during fear extinction (with an inhibitor present)…
it looks like you have no more fear. But if later you test the excitor by itself, then fear comes back.
why does that occur during fear extinction?
- Well remember the concept of surprise.
- The noise (inhibitory) predicts no US (no fear). But if you present the light again… fear is back!
- Surprise is the key for learning!
- In some cases you can have two excitors
what happens if you have two excitors?
- Then you have a very efficient extinction!
To get very efficient extinction trials…
combine the CS with other excitors (not inhibitors).
negative contingency
US is less probable in the presence of the CS
- This scenario leads to inhibitory conditioning of the CS
neutral contingency
US is equally probable in the absence or presence of the CS
- This leads to no learning
Anytime you have a CS or US that is conditioned
context is always processed at the same time (comparator theory)
Mackintosh and Turner (1971)
How the attentional theory of conditioning was demonstrated (attentional learning in classical conditioning)
mackintosh and turner groups
Group 1 (control): noise followed by shock; no phase 2; then light + noise followed by a much bigger shock; then test the light and see what happens
Group 2 (experimental): noise followed by shock; then light + noise followed by shock (phase 2); then light + noise followed by a much bigger shock; then test the light
what happened in mackintosh and turner experiment
- Based on Rescorla and Wagner, the groups should not have differed, but the control group (1) shows more conditioning (remember, with the SR, low scores indicate more conditioning)
- Group 1 (control) learns because of unblocking, i.e., the use of a larger shock.
- Group 2 (experimental): The LN - Shock trial interfered with the learning.
- Group 2, phase 2: Subjects learn that the light (L) is redundant, so they pay less attention to it!
what were mackintosh and turner trying to show
that we learn to ignore redundant predictors of a US, good or bad.
what do we learn to do with redundant predictors of a US
learn to ignore them!
So it may not be about the ineffectiveness of the ___ (Rescorla-Wagner), but rather the ineffectiveness of the __ (Mackintosh).
US, CS
so how well does the CS predict the US?
If the CS is a good predictor, then we pay attention to it. If not… we don’t
Attention paid to the CS
how well the CS predicts its consequences
___ is the predictor
CS
Mackintosh model is all about the…
attention we pay to the CS
the more ___ the ___, the faster the learning.
salient, CS
moral of the story
US and CS are important
Group dog clicking classes — how does it work if everything you hear in the room is click?
Think about it. How does the dog know which click is theirs? There must be a mechanism for that dog to understand which click is important to them. Attention is important!
problems with RW model?
- Does not explain the extinction of inhibition well
- Does not explain latent inhibition (pre-exposure to a CS) well
- May have some issues even with the blocking effect
- Miller, Barnet & Grahame identified 23 predictive failures of the model
who raised issues with blocking effect of RW?
Mackintosh raised some issues with how the US needs to be surprising, thinks attention is more important
mackintosh and turner main idea
The more a redundant stimulus is repeated, the more it is ignored — elemental, isolated stimuli acquire a kind of learned irrelevance
mackintosh and turner finding about CS?
a CS that is not redundant (i.e., still surprising) may be a good predictor
what is at the forefront of mackintosh and turner model?
CS
conclusion of mackintosh and turner?
we learn to ignore redundant predictors of a US
- So it may not be about the ineffectiveness of the US but rather the ineffectiveness of the CS
so how well does the CS predict the US?
If the CS is a good predictor, then we pay attention to it. If not…we don’t
cognitive: attention paid to the CS depends on…
how well the CS predicts its consequences
mackintosh model?
- Attention to the CS = alpha in the RW formula
- CS salience (alpha) is key, the more salient — the faster the learning
- Game is to pay attention to the CSs that are the best predictors of the US
what is latent inhibition the same as
CS pre-exposure effect
what is latent inhibition/CS pre-exposure effect?
if a stimulus is not surprising anymore, after a while you will stop paying attention
does the mackintosh model always work?
No — that’s the problem.
the pearce hall model…
changed the landscape of models for classical conditioning
pearce hall two groups?
Group 1 = tone and shock, tone and SHOCK
Group 2 = light and shock, tone and SHOCK
pearce hall results
We would expect group 1 to learn quickly, because the tone is present in phases 1 and 2, with a US of stronger value in phase 2. What we find is the opposite, due to negative transfer.
premise of pearce hall model?
Why pay attention to a well established CS? (Elements of mackintosh here)
Instead, we pay attention to unknown/unconditioned CSs — really coming back to the idea of surprise
main point of pearce hall
organisms attend (respond) to uncertain predictors
pearce hall perspective aligns well with…
modern attentional theory
what is alpha based on in pearce hall model?
As in Mackintosh, alpha is attention, but the value of alpha is based on how surprising the US is — important because alpha brings attention in
what matters in pearce hall model
both CS and US, could argue all elements of RW model are here
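A hedged, simplified sketch of that idea in code (the update rule and values are assumptions, not the full Pearce-Hall model): attention α on each trial equals the absolute prediction error of the preceding trial.

```python
# Hedged sketch of the Pearce-Hall idea: alpha_n = |lambda - V| from trial n-1.
# s (assumed CS intensity) scales learning; all values are illustrative.
s = 0.5
v, alpha = 0.0, 1.0        # full attention to a novel CS

for trial in range(10):    # continuous CS -> US pairings, lambda = 1.0
    lam = 1.0
    v = min(v + s * alpha * lam, lam)  # learning scaled by current attention (clamped at lambda)
    alpha = abs(lam - v)               # next trial's attention = how surprising the US just was
    print(f"trial {trial + 1}: V = {v:.3f}, alpha = {alpha:.3f}")
# alpha falls toward 0 as the CS becomes a perfect predictor: attention is withdrawn.
# Under partial (50%) pairing the error never settles at zero, so alpha stays high —
# consistent with the Kaye & Pearce result described next.
```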
kaye and pearce study
Step 1: Condition rats to light CS — initially orient to it (eg. Sign tracking). If they orient to it it means they are paying attention
Step 2. Three groups
Continuous
Partial
None
continuous group?
CS/US pairing on every trial (R&W in support of this)
partial group?
50% CS/US pairing
none group?
CS never paired with US
kaye and pearce study results
Partial pairing actually worked the best. Continuous pairing is fine but not great; no pairing does not work. Partial pairing is when the rats are really paying attention — in other words, a little uncertainty is not bad.
conclusion of pearce/hall and kaye/pearce
- How surprising the US is on preceding trial is crucial
- How surprising the US is, depends on the CS
- Little bit of uncertainty may actually maintain responses
- The value of alpha remains high
- US component (R&W), CS component (Mackintosh) — this pairing creates uncertainty and ends up being to your advantage
- Explains negative transfer
- Explains latent inhibition
The Pearce and Hall model, via the Kaye and Pearce experiment, reinforces that…
uncertainty is a good thing and both CS/US are important
pearce and hall model summary of negative transfer: two groups?
Group 1 = tone and shock, tone and SHOCK
Group 2 = light and shock, tone and SHOCK
pearce and hall model summary of negative transfer: phase 1
Phase 1: the CS becomes a perfect predictor of the US
- Problem: the US becomes unsurprising with the repetition, so with time, less and less attention is paid to the CS.
pearce and hall model summary of negative transfer: alpha value at beginning of phase 2
is very low
pearce and hall model summary of negative transfer: phase 2
initially little learning is possible
- Negative transfer occurs: you pay less and less attention to the CS
pearce and hall model summary of negative transfer: synthesis
The synthesis is, in a sense, to take all of the components together
The importance of the CS and the US comes together with Pearce and Hall
what are we paying attention to? according to pearce and hall
- Cues with high predictive value (mackintosh)
- Cues high in uncertainty (pearce-hall): organisms attend to uncertain predictors
what does RW suggest (main idea!)
Surprisingness of the US — high saliency of US gets you to pay a lot more attention to predictors
attention is important but depends on…
previous learning
two things we have learned…
You can condition attention AND motivation as well. Cognitive processes of attention and motivation can be conditioned; these are essential to learning.
modern version of short term memory?
working memory
who proposed working memory?
Werner Honig
what is working memory?
Memory that you use as you are doing a task. It could include rats in a Skinner box, or people being given a phone number. A form of short-term memory that lasts as long as the info is relevant (usually not very long).
how does sensory memory go to STM?
self-generated priming
how does LTM go to STM?
retrieval-generated priming
what does wagner do in his new model alone?
Brings back the surprise element here
Brings back primed component of STM
wagner priming (problem?)
If you are primed, in most memory tasks that primed element will be easier to retrieve from memory later. Here, however, a primed element is less surprising — therefore we have a bit of a problem.
two types of priming
self generated
retrieval generated
self generated priming?
via sensory memory (basic structure of the stimulus)
retrieval generated priming?
via LTM (actually from a complex retrieval process of info — already most likely in LTM)
surprise can be reduced in ___ ways
2
retrieval generated priming?
From long term memory via a retrieval cue
self-generated priming?
From sensory memory via a recent presentation
blocking =
retrieval generated priming
retrieval from LTM?
the US is primed by the CS
what is priming of the US with the CS?
blocking
priming can explain…
blocking
- because the surprisingness of the US is dampened
continuous learning =
in the moment, self-generated
long pause, later CS is presented =
long term memory, retrieval generated priming
priming of the CS
- RW: combined processing of the CS and US in STM is important
- Wagner thought that maybe he could look into latent inhibition (CS pre exposure effect again)
- With this model, exposure to the CS before conditioning should reduce the CS surprisingness.
wagner can explain…
habituation
wagner model what is habituation
habituation is simply a decrease in the “surprisingness” of the stimulus, with both types of priming
SOP
Standard Operating Procedures or Sometimes Opponent Process
what kind of model is SOP?
This is a connectionist model, very much a neural network model — parallel processing
who massively influenced SOP?
Donald Hebb
what was SOP originally called?
PDP or parallel distributed processing
what does PDP entail
- Processing in the brain is parallel, not serial (like the information processing models always suggested).
- Processing in the brain is distributed, not localized or modular (like emphasized in [clinical] neuropsychology and evolutionary psychology).
Those models (PDP) have been very successful in some areas of cognitive psychology and neuroscience…
- Learning and memory (including priming)
- Perception / pattern recognition
But (PDP) not so successful in other areas (mostly “higher” cognitive processes):
- Language
- Problem solving
what does SOP do that is new?
address time
SOP (3 features)
- Addresses the timing of the CS and US
- Addresses backward conditioning (US presented before the CS). Explains how you can get conditioned inhibition; timing is still important here for this effect.
- Compound conditioning: Leads to inhibition.
AESOP who proposed?
Wagner and Brandon
AESOP adds?
Wagner and Brandon added the emotional qualities of the stimuli, particularly the US.
what does AE mean in AESOP?
affective extension - emotional aspect
AE suggests that two nodes are activated by a CS
sensory node
affective node
result of AESOP
CS gets linked to both of these nodes
Potentially every time you have CS/US paring…
you are tapping into the sensory system, and likely the limbic system
AESOP is the only theory that says…
pairing can be both sensory and emotional
what is AESOP useful in discussing
It is particularly useful when discussing US’s that may have a strong emotional valence (positive or negative).
what can AESOP help to explain
how the emotional response may potentiate the sensory response
AESOP and trauma?
So this adds an other variable/factor to conditioning. Think about flashbacks in PTSD!
Physical stimuli and emotions being combined
Suddenly AESOP is much about the ___ or ____ and is less stuck in a “____” view of the US.
Suddenly this theory is much about the “sign” or “stimulus” and is less stuck in a “poverty of the stimulus” view of the US.
what is also responsible for poverty of stimulus?
Cognitive psych is also responsible for poverty of stimulus because it focuses on representation of the stimulus by itself. Here we are saying that none of that is terribly valid if you don’t take the emotional value into consideration.
what is the parallel processing dimension of AESOP?
simultaneous sensory and affective processing.
what does AESOP explain
Explains why conditioning works better (or sometimes worse) with strong emotional reactions — e.g., trauma
potentiation between sensory and affective suggests that..
limbic system is working with sensory areas of brain
CS in AESOP
They (Wagner & Brandon) were influenced by Pearce (from Pearce and Hall)
pearce came up with?
configural theory of learning
what is configural learning
The set, or configuration, is what is conditioned to the US
what is elemental learning?
Each CS conditioned independently to the US
pearce configural theory of learning?
configural theory of learning — means that if you have many CSs, Pearce saw them as typically being a whole. In other words, the compound is more than just its parts (vs. the elemental view)
the configurational perspective is ___ in nature
gestalt
configurational perspective gestalt
It means that the whole is more than the sum of its parts.
Example: Play a chord on your piano or guitar, say A minor.
- Means that configurations are meaningful and you can actually be conditioned by configurations
So compound stimuli, especially if in the same sensory modality, are…
more than the sum of the elements.
So just adding different CS’s just…
weakens the value of the single, elemental CS (the compound and the element are, in fact, literally not the same).
From a neural network theory point of view, and consistent with SOP, if you condition ABC as CS’s, but then just test AB, you would …
lose 33% of the conditioning (from R-W, remember that CS’s share a lambda).
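A worked version of that arithmetic (assuming the three CSs earn equal shares of the shared asymptote): V_A = V_B = V_C = λ/3, so testing AB yields V_A + V_B = 2λ/3 ≈ 67% of λ — a loss of about 33%.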
order of progression of theories
Wagner’s original theory ~ SOP ~ AESOP
AESOP explains
- Compound conditioning effects (Rescorla-Wagner)
- Attentional and priming effects (Mackintosh, Pearce-Hall)
- Time constraints in conditioning
- Sensory and emotional dimensions of conditioning
- Stimulus configuration, and even generalization
operant conditioning, focus on…
Focus on response-outcome learning
applications of operant conditioning
- Parenting
- Training (animals, humans, etc.)
- Teaching (behaviourist methods)
- Cognitive Behavioural Therapies (and others), behaviour modification
- Programming of gaming software, gambling games and machines (e.g., slot machines)
- Advertisement
- Any social situations (making compliments, ignoring people, etc)
- The tipping game!
Etc. (so many more examples)
differences between classical and operant conditioning
- The contingency between outcome and response is extremely clear in operant conditioning — that is why the animal learns
- In classical conditioning this is very fuzzy because it is often automatic/unconscious
similarities in classical and operant conditioning
- Both have a learning curve.
- Both have extinction.
thorndike theory
Thorndike: Reinforcement theory » law of effect.
guthrie theory
Contiguity theory » Contiguity is enough.
guthrie contiguity examples
(Why the dog doesn’t learn to hear the porcupine? Because of lack of contiguity in time and space)
Often dogs get hit by porcupine, may feel little bit of pain but there is an analgesic in the quills. Very quickly the animal doesn’t feel pain, the more they start moving the more the quills go in. Why is it that my dog is not learning the porcupine is a big thing from the initial hit? Because of continuity. They end up learning the most pain they experience that day is hours later at the vet clinic. That is what they learn to be afraid of.
stimulus sampling theory
stimulus elements are at the centre of this theory:
Learning curves show an improvement because…
the organism notices more stimulus elements.
for guthrie, a reinforcer is…
just a very salient stimulus — we now know it is more than that.
How close time and space associations are happening is the key for Guthrie
tolmans approach
One of the first cognitive theories (uses the term mental map or cognitive map).
for tolman behaviour is…
inherently flexible.
tolman rats main point
He showed that rats in mazes learn better about places than responses
Main point: They can learn on non-reinforced trials. It becomes obvious when the reward is provided. This is called latent learning
learning is not…
performance!
the role of the reinforcer is to…
provide motivation (an idea developed by Spence and Hull)
tolmans approach order of variables
SD > R > O
SD = discriminative stimulus
R = response
O = outcome
discriminative stimulus gives…
a response for a specific outcome (e.g., kibble in a Skinner box) — operant conditioning. Contingencies must be very explicit.
thorndike box SD/R/O values
SD = box
R = sequence of events to open
O = escape from the box (and food)
for thorndike and guthrie ____ associations
SR
for tolman ___ associations
SS
rewards thorndike idea?
they reinforce
rewards guthrie?
they are a very salient stimulus
rewards tolman/hull/spence?
they motivate
what do rewards actually do (consensus)
In fact they are/do all three — reinforce, salient stimuli and actually motivate.
two types of priming
positive and negative
positive priming
pre-exposure to a stimulus facilitates the retrieval of that stimulus
negative priming
Wagner SOP
the exposure to the stimulus gets in the way of learning
negative transfer
when any type of prior knowledge interferes with current knowledge (learning)
pearce-hall negative transfer
Pearce-Hall negative transfer: pairing a CS with a weak US slows down conditioning when the CS is subsequently paired with a strong US
classical view of dog bite situation
CS is dog, US is bite
AE-SOP view of dog bite situation
The CS is still the dog; now there are two USs — one purely sensory (the pain), one affective (the learned fear)
AESOP explains ____ very well
AESOP explains conditioned emotional responses, often based on fear conditioning, very well
contiguity
all about spatial and temporal (how close in space and time things are)
- CS co-occurs with the US: they are contiguous, or close together, in space and time
contingency
the CS predicts the US: the occurrence of the US is contingent on the prior occurrence of the CS.
two main methods of a skinner box
discrete trial method
free-operant method
discrete trial method
Stand-alone trials, and many of them, in a session. This is controlled by the experimenter.
- Eg. Gadbois lab — presenting dog with a problem. Accumulate trials
free-operant method
The animal or the human control the apparatus/computer.
- Typically used for skinner box — animal controls flow of experiment by giving specific responses to specific stimuli
free operant set up
A typical learning curve as response rate (responses per minute), showing acquisition and extinction (when you stop producing the kibble).
The cumulative recorder taking the cumulative responses.
As you learn they increase, as it extinguishes they decrease
SD → R → O
Able to identify SD as something that should provoke a response and outcome
S△ → R → no O
The other stimulus, the one you don’t want to respond to because it won’t provide an outcome
S-delta, or?
S-
Operant way of teaching discrimination (nose hold in position) example
Eg. You want a dog that can find a specific snake (e.g., ribbon snakes) in a park that is full of different kinds of snakes. The dogs need to tell the difference between ribbon and garter snakes. How do you teach a dog to discriminate just ribbon snakes? We rely on their nose. It is very simple: present the ribbon snake smell and encourage the dog for sniffing (give a treat or click). Then present the other stimulus (garter snake smell) and do not encourage the dog for sniffing (no treat). Do you train no response (just walk away) or a distinct response for a “no” (train them to sniff, then sit down because it is wrong)? The S+ or “yes” response would be a nose hold.
Contrasting approach: systematically reinforcing a response where they initially just show interest and ignore with the S-.
concept of discriminative stimulus
Light on » Press lever » get food
Light off » Press lever » no food
So after a while, the rat presses the lever only or mostly when the light is on.
what does it mean when an operant response is brought under stimulus control
a stimulus will set the occasion for the response.
- The light is a discriminative stimulus (also labelled S^D or S+): It is associated with reinforcement. The other stimulus is the S delta or S-.
occasion-setting in discriminative stimulus is…
operant
____ is necessary for all sorts of operant learning
stimulus control
shaping, autoshaping and ____ behaviours according to skinner
superstitious
superstitious behaviours example
Superstitious behaviours can occur if there is too much of a delay before giving the reward — e.g., because they have to hold their nose in position for 5 seconds. Smarter breeds (e.g., border collies) are more likely to develop superstitious behaviours, because they are trying to determine what you want.
The idea is that you need to extinguish the response, which may feel like you are going backwards. You have to get them to step back, do a normal entrance, then reinforce. How do you do this? Often by going backwards: get them to enter, put them right in front of the vial before the trial starts, and reshape just the nose hold — not the whole entrance. This helps reduce the possibility of reinforcing the wrong behaviour.
habit slips
In maze running, food encountered in the maze won’t stop the running to the goal (the end that is baited).
- If you put the reinforcement before the end goal, often the rat will just run by it and complete the task. That just means they have really learned the task and developed the habit of going through the full actions (almost the opposite of superstitious behaviours in some ways)
Habit slips, or strong discriminative stimulus » response associations, are not uncommon.
habit slip example
- Going to pick-up somebody at the airport but you take the Dartmouth Crossings exit because you go there often.
- These behaviours can be really hard to extinguish
the ______ of the response, does not matter if you focus on the goal
specificity
lashley showed…
that rats that learn to run a dry maze will immediately learn to swim it if it is flooded.
can all dogs trained in the lab transfer to the field
No — only about 50%. The dog knows the odours, but the change in context is enough to confuse them.
transfer of learning is a form of
generalization
specificity of response errors in change of context
might have to do with focus of control, or sign tracking/goal tracking
EDT
errorless discriminative training
EDT Terrace example
The dog learns to respond to the S+ (e.g., scent of lavender).
The S- (scent of oregano) is introduced early.
The S- is not presented in the first trial, and then is presented in weak form at first, and gradually strengthened: this is “fading” (fading-in).
attentional dimension of EDT
Manipulation of the saliency of the S-. Difficult to do if you don’t have the right equipment
EDT is all about
what you are getting the animal/human to pay attention to. Based on a simple premise: during learning you commit the least number of errors possible, and the outcome is that (often) the performance of the animal/human will show fewer errors than with typical discrimination training.
EDT example (long)
The S+ (the one you want the dog to respond to) is always shown at full strength. You present the two stimuli at the same time: the S+ (lavender) is always at full saliency, but not the oregano. Present a very weak, varying amount of the S- (oregano). Sometimes they will look towards the oregano, but ignore that; they typically go towards the more salient stimulus. The idea is that across trials you make the saliency of the S- stronger until the two are at the same level. This works because you indicate right away to the dog which stimulus is important and which to ignore. In some cases, they barely ever learn to respond to the S-, so by the end of the task they have responded only to the S+, even though the S- is at full strength.
why is EDT considered to be errorless
you diminish the amount of potential error that can be created
EDT tried to be used with what population
This got the attention of people working with autistic children. With a trial-and-error method, they often get very frustrated when a response is not reinforced. How do you teach autistic children without them making errors? Techniques were created that resembled EDT in some ways.
main advantages of EDT
Large reduction of errors compared to traditional discrimination learning
Fewer mistakes are possible during training.
No negative emotions during training (e.g., frustration, helplessness, etc.)
The training is potentially very fast
EDT conclusions
Mistakes are NOT necessary for learning!
(Though there is evidence from neural networks that error sometimes helps.)
It is likely an attentional mechanism: You learn (early!) to ignore the irrelevant stimulus and basically never respond to it (or at very low rate and only initially in the training)
Trial and error is NOT necessary, in principle you can learn without the error just by focusing on attentional processes right from the get go
main disadvantages of EDT
Modifications or reversals in training are difficult: e.g., if lavender (S+) becomes the S- and you try to train the S- (oregano) as the S+. Acquisition of the S+ will be delayed and/or impaired.
Example: Training a dog to discriminate (for field search and detection) a specific species of snake within the Thamnophis genus.
If you ever want that dog to switch to Garter snakes as the target species, it may be challenging.
___ of dogs won’t train through EDT easily
2/3
examples of how classical conditioning is involved in operant conditioning
Money — there is a delay, humans still understand this
The clicker in animal training or TAG teach (TAG = Teaching with Acoustical Guidance; e.g., used to train movement precision in dance or sports) — it marks the exact moment they did the exact right movement
the clicker is a ______ stimulus
classically conditioned discriminative
the clicker is a _____ for food (US)
pavlovian CS
why is the clicker used in operant conditioning
because it has acquired a reinforcing value in itself
where did the idea of using just the clicker come from
The situation arises first when trying to train orcas and dolphins in captivity — the idea is that you don't want to interfere with the animals during training. How do you tell them “yes, you got that right, continue”? The conditioned reinforcer was created: not giving a primary, but marking the behaviour with a secondary (the clicker or a whistle).
That in itself is literally telling you that the clicker does not need to be followed by the reinforcer, at least not every time.
history of conditioned reinforcer
When pigeons in the Skinner box were being shaped and giving the right response, the food magazine would drop food and make a click while doing so. Quite accidentally, researchers noticed that when the magazine emptied, pigeons would continue to respond for a long time — most likely they were responding to the click as a conditioned reinforcer.
what are conditioned reinforcers now used in
Used a lot with kids, e.g., stickers.
These are secondary reinforcers — they have been conditioned to come with a reinforcer later.
CS/US link may ____ over time if you stop pairing the click with food for animals
deteriorate
how to use the clicker two steps
- Pair the clicker sound with the primary reinforcer, e.g., food (with as short a delay between the two as possible).
- When that pairing is well established, the clicker can then be used on its own.
when there is both a clicker and food…
the clicker announces the food
what happens when you use the clicker in isolation
In this case, the clicker announces the possibility of food.
“… as long as the secondary reinforcer is occasionally followed by food, the behaviour is maintained”
Or the clicker is used as a marker (yes, this was the right answer, but continue what you are doing; e.g., in “chaining” — more on this below).
Could be called “keep going signal”
Becomes useful in a number of situations — solves the issue of distance, the need for spatial/temporal precision, etc.
why use the clicker
- Necessary because of delay in delivering the primary, usually because of distance between the trainer and the trainee.
- Spatial and/or temporal precision is necessary.
- You are training behavioural chains, and use the clicker as a “keep going” signal (similar arguments as per #1):
chaining
You use chaining to train complex sequences
chaining is common in
sports, music, dance, and other high motor skill learning — e.g., flying an airplane
backward chaining
Backward chaining: 3, then 2, then 1 (reverse order of actions).
Learn the whole sequence by starting at the end and go backwards. Could be useful in correction of superstitious behaviour in a dog for example.
chaining is often use to…
train new pilots in flight simulators
“Stimuli associated with primary reinforcement can strengthen behaviour because these stimuli acquire….
their own reinforcing value”
conditioned (secondary) reinforcers and motivation
If not that, then at least it is feedback that informs the organism it will soon get the primary (sometimes called a “keep going signal” in training circles).
still the idea that ____ is what gets ____ to happen
Idea that dopamine is what gets attention/learning to happen
negative contrast
Rats (and humans) are sensitive to the contrast between reinforcers of different values.
negative contrast three rat groups
Group 1: plain water as reinforcement — fine, especially if animal had been water deprived. But water is not a terribly good reinforcer. Food when hungry works better than water when thirsty.
Group 2: Sweetened water as reinforcement — glucose gives it more value
Group 3: Sweetened water then plain water — you will get fewer responses to plain water (negative contrast).
is reinforcement or punishment better
Science suggests that reinforcement is the way to go, but punishment does work (albeit with negative consequences) if you are just talking about performance.
The question is what supports reinforcement — it is ethics. It is important to make an ethical argument: what ethics tell us is that you will develop a better relationship with the human/animal if you use reinforcement as opposed to punishment.
punishment works to suppress a behaviour, but at a cost…
Emotional distress, frustration, toxic relationship, etc.
And not even necessary… other things work.
Concurrent reinforcement to punishment may still keep the behaviour:
Sam is misbehaving at school and gets punished by the teachers… but the attention he gets from his peers is more reinforcing.
When the contingencies are figured out, it may lead to cheating:
Still producing the “bad” behaviour when the punisher is not around.
best technique of reinforcement/punishment
DRA or differential reinforcement of alternative (other) behaviours
From an ethical perspective people say to always go with
positive reinforcement — that is better because it won't damage the relationship with the person. 98% of the time this is the ethical and scientific perspective
is timing important for…
conditioned reinforcers
timing is particularly important for
early training — kibble has to come immediately after the behaviour you want is produced
continuous reinforcement
every time the animal produces the behaviour it is reinforced.
if you want a behaviour to be easily extinguishable reinforce it _____
continuously.
- Behaviour that is reinforced every time will extinguish immediately when you stop reinforcing it. Typical example in parenting — you give a reward to your child for doing something; one day you don't, and they won't do the behaviour anymore.
continuous reinforcement sets up…
an expectation
When you initially train an animal (or human for that matter), you typically start with
a continuous reinforcement (CRF) schedule.
types of intermittent reinforcement
Fixed ratio
Variable ratio
Fixed interval
Variable interval
how are intermittent reinforcement schedules recorded
Those schedules are recorded by a cumulative recorder and the output is a cumulative record. It shows the pattern of response based on the reinforcement delivery (the schedule).
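A cumulative record is easy to picture in code. A minimal sketch (my illustration; the timestamps are made up): the record is just the running count of responses against time, with reinforcements as tick marks.

```python
# Toy cumulative record: running response count over time,
# with reinforced responses marked (as on a cumulative recorder).

response_times = [1.2, 2.9, 3.4, 5.0, 6.1, 8.7, 9.3]   # seconds (made up)
reinforcement_times = [3.4, 9.3]                        # subset of responses

for i, t in enumerate(response_times):
    mark = " <- reinforced" if t in reinforcement_times else ""
    print(f"t = {t:4.1f} s, cumulative responses = {i + 1}{mark}")
```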
ratio schedules
Ratio is based on a count of the number of occurrences or trials — each trial is every chance you would have to reinforce the organism. In a ratio schedule, you do not reinforce every trial — instead, you skip a few, then reinforce.
- Ratio schedules are based on the rate of response, i.e., a set number of responses (e.g., you reinforce the rat every 5 good responses; this would be an FR 5).
interval schedules
Interval is about the timing that elapses before the reinforcement, not the number of trials.
- Interval schedules are based on an interval of time between reinforcements, i.e., regardless of the number of correct bar presses (assuming some good responses occurred), the rat is reinforced at an interval of time (e.g., every 2 minutes; this would be an FI 2).
what kind of reinforcement were dog trainers taught to use
Dog trainers were taught to do continuous reinforcement, because ‘otherwise the dog would bark or get frustrated’ — Gadbois says if that is happening you are doing it wrong
what is important for intermittent reinforcement
With intermittent reinforcement, for every experiment, people were taught to go slowly. E.g., if you see any stress or aggravation in your rat, you are going too quickly.
Observing your organism is important — figure out what they like or don't. If you sense any frustration, stop, because something is wrong.
95% of the time when you start applying intermittent reinforcement and you get frustration it is because…
you are going too fast
study using intermittent reinforcment with dogs
A study said dogs on intermittent reinforcement got more stressed — but they went from 100% continuous reinforcement immediately to 60%. Imagine your dog gets constantly reinforced and then suddenly only 60% of the time, and they wonder why the dog is stressed. That's because you were going way too fast. The actual way to do it is to very slowly omit a few of the rewards, and with time increasingly omit more. It is that simple. If you start to see any aggression, reinforce a little more.
You are setting up the animal to understand that when they give a right answer, they get feedback that they did — but not always a reward. Eventually you will have pigeons that will give 3000 responses without reinforcement — that is how powerful intermittent reinforcement is when used right.
what is needed for the transition from continuous to intermittent reinforcement
need an adaptation period
fixed ratio
- FR 5 = every five answers you give a reward
- If you are coming from continuous reinforcement, start with an FR 2 for example; the ratio needs to be low early on.
- FR 5 is extremely predictable — they know when the reinforcement is coming. They will tap the lever five times quickly to get the reward; they will accelerate.
- They have an expectation and that is the issue with the fixed ratio — they learn they just have to produce five responses
variable ratio
- Variable ratio = VR 5 +/- 2
- You set the computer to come up with these numbers; what it means is that the animal knows it will get reinforced, but not after a predictable number of responses (here, on average every 5, ranging from 3 to 7).
fixed interval
- FI 10
- Unit of time
- Regardless of the number of responses they give, they will not be reinforced until the interval is over — e.g., 10 minutes
- Now they know if they wait a certain time they will get a reinforcement
- The problem is that they figure out that what they do doesn't really matter until the end of the interval
- Can correct this problem using variable interval
variable interval
- VI 10+/- 3 minutes
- They don't know exactly the interval of time after which they will start getting a reward — therefore they will work really hard in those conditions
- Will typically get you more consistent responding, and a lower chance of extinction
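To make the four schedules concrete, here is a minimal Python sketch (my illustration, not from the lecture). Each schedule is a small controller that answers one question — should this response be reinforced? — and the parameter values are made up.

```python
import random

# Toy implementations of the four intermittent schedules.

def fixed_ratio(n):                 # FR n: reinforce every nth response
    count = 0
    def check(_elapsed):
        nonlocal count
        count += 1
        if count >= n:
            count = 0
            return True
        return False
    return check

def variable_ratio(n, spread):      # VR n +/- spread: nth response on average
    count, target = 0, random.randint(n - spread, n + spread)
    def check(_elapsed):
        nonlocal count, target
        count += 1
        if count >= target:
            count, target = 0, random.randint(n - spread, n + spread)
            return True
        return False
    return check

def fixed_interval(t):              # FI t: first response after t time units
    last = 0.0
    def check(elapsed):
        nonlocal last
        if elapsed - last >= t:
            last = elapsed
            return True
        return False
    return check

def variable_interval(t, spread):   # VI t +/- spread: interval varies
    last, wait = 0.0, random.uniform(t - spread, t + spread)
    def check(elapsed):
        nonlocal last, wait
        if elapsed - last >= wait:
            last, wait = elapsed, random.uniform(t - spread, t + spread)
            return True
        return False
    return check

# Example: a VR 5 +/- 2 schedule over 20 responses.
schedule = variable_ratio(5, 2)
for i in range(20):
    if schedule(i):                 # elapsed time is unused by ratio schedules
        print(f"response {i}: reinforced")
```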
which methods of reinforcement are preferred
Anticipation and learning are connected; anticipation is stronger when you don't know when you are getting the reward — therefore the variable schedules (variable ratio and variable interval) are better.
psyches of dog trainers now are wrong
Most dogs don’t care about the treat because they are so into the task (working breeds). Whole point of doing intermittent reinforcement is that you train your brain that this is fun because you dont know when you are getting the reward — its a surprise! Very resistant to extinction — falls into category of habit learning
When you keep habit learning it becomes pretty much unextinguishable. Problematic when you do therapy on behaviours you are trying to get rid of that have become habits.
problems with fixed ratio or interval
they are predictable!
what is the benefit of variable ratio or variable interval?
By applying those, the rat or pigeon is unsure about when the reward is delivered and is more likely to respond eagerly and the behaviour acquired is much more resistant to extinction
real life is…
intermittent
what relationships are hard to snap out of
Relationships that are hard to snap out of are relationships that are unpredictable, they’re exciting — there is a lot of anticipation
That is because you are on a variable ratio — most social interactions are on intermittent reinforcement
Reinforcement matters in social behaviour, that means that all principles you know from learning theory applies to these situations.
fixed ratio (example and consequence)
Reward every xth response. Consequence: Produces a good amount of behaviour, but lots of post-reinforcement pauses. You know that after you are done that ratio, there is no need to rush.
variable ratio (example and consequence)
The ratio is harder to predict. Consequence: High and persistent rates of responding.
- Slot machines!
fixed interval (example and consequence)
Reward at a fixed (xth) interval of time. Consequence: Low response rate after each reward. Same problem as fixed ratio because it is relatively easy to predict.
variable interval (example and consequence)
Steady rates of behaviour because the outcome is difficult to predict in time.
Social media likes or messages
variable interval: social media
Social media companies have people trained in learning theory to control this: they control when they notify you, how they notify you, and when you stop responding they look at your rate of responses. They use these intermittent schedules of reinforcement, mostly variable ratio/interval.
Can be used for good and bad things!
Variable ratio/interval gives…
more persistent behaviour less chance of extinction
Ratio schedule
Strong correlation between the rate of reinforcement and the rate of behaviour.
Faster response rates (the “hurried-up” nature of the behaviours produced, e.g., lever-pressing).
Interval schedule
Reinforcement rates can be steady, but the rate of responding can vary immensely.
More pauses and wait times between responses.
compound schedules
Use of more than one schedule. In fact, outside of the laboratory, these are by far the most common schedules of reinforcement.
different schedule examples
Mixed/multiple/tandem/chained
matching law
When training an organism you want to see what they respond to best — test what works best. There is some evidence that some species will not deal well with variable intervals (as much as we said they are good if you go slow enough), some species will not work unless it is continuous reinforcement
- Skinner thought there were universal laws here but that is not the case
matching law example
Give two different schedules to a pigeon
A: pecking at disk A is reinforced on a VI 2 min schedule
B: pecking at disk B is reinforced on a VI 1 min schedule
- In this experiment you get more responding on the VI 1 min disk; this could change with time and will depend on how hungry the animal is. Eventually, they may switch to VI 2.
relative rate of responding to a particular choice (response alternative) will match (equal) the…
relative rate of reinforcement for that choice (response alternative).
what does matching law explain the relationship between
explains the relationship between payoff and choice
Idea is that animals will try to maximize reinforcement at a lower cost
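The matching law can be written as an equation: the share of responses on an option equals its share of the obtained reinforcement, B_A / (B_A + B_B) = R_A / (R_A + R_B). A quick computation for the pigeon example above (my numbers; assuming each schedule delivers at its programmed rate):

```python
# Matching law prediction for the two-disk pigeon example.

r_a = 1 / 2   # disk A: VI 2 min -> ~0.5 reinforcers per minute
r_b = 1 / 1   # disk B: VI 1 min -> ~1 reinforcer per minute

share_a = r_a / (r_a + r_b)
share_b = r_b / (r_a + r_b)
print(f"predicted share of pecks on A: {share_a:.0%}")  # ~33%
print(f"predicted share of pecks on B: {share_b:.0%}")  # ~67%
```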
what explains the matching law
- Animals are likely to try to maximize the rate of reinforcement: How much you get per unit of time / session.
- They try to improve the local rate of reinforcement, so they shift between the options (choices). This is called melioration
- Rate is not the same as strength, so both are factors (the same way speed and mass influence momentum in physics): This brings us to momentum
melioration
This depends on the animal's state — it will often simply switch to a better schedule when that fits its current metabolic needs better
what is reinforcement a test of
persistence rather than competence
delayed reinforcement
Sometimes reinforcement is delayed. How well animals take delayed reinforcement matters (if too long, organisms do not take it well). If it is explained to humans — e.g., working for two weeks and then getting paid a lot — humans are okay with that.
Impulsiveness vs self-control (impulse control)
impulsive people are more likely to want their goods right away
procrastination and working priorities
A lot of this has to do with your ability for delayed gratification. The idea is that you have to ease into it and develop a technique of finding small gratifications.
implications of reinforcement
- delayed reinforcement
- impulsiveness vs. self control
- procrastination and working priorities
Example of tasks testable with people and animals: Choice between a large delayed reward vs. a smaller immediate reward.
The general observation: Animals and humans are impulsive, i.e., prefer smaller immediate rewards.
example of delayed reinforcers
biweekly paycheque
delayed reinforcers are…
not as reinforcing as immediate ones.
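A standard way to quantify this is hyperbolic discounting (it is not named in the notes, so treat this as background): subjective value V = A / (1 + kD) falls off with delay D, and the discount rate k indexes impulsiveness. A minimal sketch with made-up numbers:

```python
# Hyperbolic discounting: why delayed reinforcers lose value.
# k is a made-up discount rate; higher k = more impulsive.

def discounted_value(amount, delay, k=0.1):
    """Subjective value of a reward of size `amount` after `delay` units."""
    return amount / (1 + k * delay)

# An impulsive chooser (high k) prefers $50 now over $100 in two weeks:
print(discounted_value(50, delay=0, k=0.5))    # 50.0
print(discounted_value(100, delay=14, k=0.5))  # 12.5
```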
If you are trying to intervene with things that have regular schedules it can be challenging — really hard to get people to adhere to things unless you have …
a pre-commitment strategy
what do pre-commitment strategies have applications for
studying, weight loss, cessation of smoking, snoozing the alarm in the morning, etc.
what are pre commitment strategies
Get people to announce what they are going to do (e.g., New Year's resolutions). You are dangling the possibility of shame if you don't go through with what you said you would.
Example: Dry January and other resolutions that are “publicly” announced.
what are factors in delayed gratification
temperament / personality, development, and psychopathologies
personality example
impulsivity
development example
Impulse control (especially in boys), etc.
psychopathologies example
ADHD, etc.
ADHD is complicated: idea that attention may be a problem. If you cannot focus attention, learning is not going to happen.
strategies for delayed gratification
- Make the immediate reward less “rewarding” (or even coming at a cost).
- Make abstention profitable: You get a reward if you don’t procrastinate.
- Make the associated delayed reward explicitly less appealing if the choice is to get rewarded immediately
Make the immediate reward less “rewarding” (or even coming at a cost).
Put a cost on engaging in the behaviour now. They still get a choice in the matter.
Make abstention profitable
Ironically, that profit can be delayed, it will still work, e.g., “If you finish your homework now, you can play video games for longer tomorrow”.
Appetitive reinforcement?
Make the associated delayed reward explicitly less appealing if the choice is to get rewarded immediately
Personality difference here, impact may depend on whether you can be patient.
what can help delayed grat
Much of this can work if you make “anticipation” exciting (as it should be). It tends to be more rewarding to some than others. “Waiting” does not have to be aversive.
Humans have working memory and prospective memory (planning) — ability to deal with things that haven’t happened yet.
making anticipation exciting depends on
- Working memory: Good working memory = reduced impulsiveness. Distraction from the appeal of the immediate reward is the strategy here. Remember that working memory conceptually overlaps with attention.
- Prospective memory and planning: To process delayed rewards, you need to be able to process the future (i.e., mental “time travel”)
what are Impulsiveness / sensation-seeking / risk-taking implicated in
basal ganglia/dopaminergic system
Personality traits that may modulate and influence behaviour in both humans and animals.
Many of these traits are associated with pathologies or risky behaviours:
Gambling
Addiction
Risk junkies (sky-diving, bungee jumping, risky sexual behaviours, etc)
ADHD
Borderline personality disorder — characterized by risky sexual behaviours
dopaminergic system dogs 1992 study
Dogs in 1992: a paper that showed a massive difference between breeds that have high vs. low dopamine.
High = Border Collie, Jack Russell, Australian Shepherd (more energetic)
Low = Great Dane, Saint Bernard, Newfoundland (more lazy)
The difference is activity level and impulsiveness
Can be applied to friends: you may have friends that prefer skydiving vs. friends that prefer lying on the couch
We now know through neuroimaging that there is a correlation between…
traits and the dopaminergic system — the idea of addictive personalities. We know this is a thing, but we don't know what to do about it.
We think the key to addictive personalities is sensitivity to reinforcement — responding to variable intervals, excitement, etc.
The idea that it is both dopamine quantity and sensitivity — unknown
day-to-day applications for managing delayed grat
- Give choices — with any organism, test your procedure, reinforcement and schedule — every individual may be different
- Help make better choices
- Be clear about the impact of the choices — explain consequences
clinical approaches for managing delayed grat
- Contingency management — directly from learning theory
- Incentive-based treatment
Behavioural economics and neuroeconomics (not huge on exam)
- These two fields have massively influenced research in behaviour and neuroscience, and have been influenced by those sciences as well. They go a little beyond basic cognitive psych and neuro — a weird interaction between rational and irrational behaviour
- We know that humans love to think we are cognitive and rational, good at following rules. When actually, we are unbelievably limbic in the way we make our decisions. What we do is limbic not cortical, based on emotions/raw motivations/reinforcement — applies to psych, bio, etc.
theories of reinforcement
- Drive reduction theory
- The Premack principle
- Behavioural regulation theory
- Selection by consequences
Interestingly, Skinnerian (radical behaviourism) theories are ______ in explaining reinforcement.
not very helpful
skinnerian reinforcment
- They avoid mentalistic concepts and hypothetical/theoretical constructs, e.g., “motivation” — yet they had to adopt one to explain how motivation works (drive theory). The Skinnerian arguments to explain reinforcement are circular arguments (tautologies).
- They do not have a theoretical framework to explain reinforcement, nor do they seem to mind.
Clark Hull and drive reduction theory
theory of attrition/deficit, missing something so you get it — reducing drive
Drive reduction theory:
Reinforcers reduce drives.
drive reduction theory example
Example: Treats reduce hunger (while training a dog; logically, this means that a hungry dog would learn better, faster, etc.).
drive reduction theory is a ____ theory
It is a deficit theory (you do or learn something because you need to reduce a drive, e.g., hunger, thirst, etc.).
Hull eventually realized the drive theory had limitations and started to talk about “incentives”.
In the section on motivation, we will cover the incentive theory that does a much better job at explaining motivation and its role in learning (and behaviour in general).
Neuro and psych have adopted incentive theory — works better than drive because you can classically condition motivation
premack principle aka
the differential probability principle
classical/traditional principle
Contingency between a behaviour and the reinforcement.
premack principle
The contingency is between two behaviours — the contrast between pressing a lever and getting fed; to him this is the essence of reinforcement. More probable responses will reinforce less probable responses
premack principle in clear terms
more probable responses will reinforce less probable responses
The question: What is the contingent behaviour? In the Skinner box, what are the two behaviours?
skinner box: premack
Skinner box:
Bar pressing
Eating
- But the rat would rather eat… In other words, reinforcement happens when the (instrumental) behaviour gives access to a preferred behaviour.
- The bar pressing gets you to eat which is the motivation
- Theory about contrasting two different processes
solution for reinforcement!
Do a preference test! See what is more reinforcing to the subject: Food, toy, play, etc.
Find the behaviour (instrumental act) that provides access to a more preferred behaviour.
premack 1959
play a pinball machine or have candy
Some kids chose candy (they spent more time eating candy than playing pinball).
With these kids, if candy is made contingent on playing pinball, pinball playing increases.
The reverse is true for kids that chose pinball over candy.
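A minimal sketch of the logic of a Premack-style preference test (my illustration; the baseline minutes are made up): whichever behaviour takes more baseline time is the more probable one, so making it contingent should reinforce the less probable one.

```python
# Premack principle: the more probable behaviour (more baseline time)
# can be used to reinforce the less probable one.

baseline_minutes = {"eating candy": 12, "playing pinball": 4}  # made-up data

preferred = max(baseline_minutes, key=baseline_minutes.get)
less_preferred = min(baseline_minutes, key=baseline_minutes.get)

print(f"Make '{preferred}' contingent on '{less_preferred}':")
print(f"-> '{less_preferred}' should increase (Premack principle)")
```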
preference is…
really important!
preference lab example
Dog owners bring the dog's preferred treats to the lab
Behavioural regulation theory (Timberlake et al): premise
All behaviours have a preferred level. If you restrict access to a behaviour, the organism will engage in another behaviour (or behaviours) to gain back access to it.
Behavioural regulation theory: staddon
Bliss point and the minimal distance model. How the organism distributes its behaviours to maximize reinforcement (the bliss point).
So all organisms will learn to maintain a preferred distribution of behaviours.
behavioural regulation theory main assumption
The main assumption of this model is that humans and animals will seek and attempt to maintain an optimal combination of activities (the bliss point).
Idea is that as organisms we try to meet our metabolic needs
Selection by consequences (Donahoe et al.): Premise
Learning is like natural selection.
selection by consequences, premise is like…
“weeding-out” (Bouton’s term) or “pruning” (my term) by reinforcement (or punishment…).
“ingredients” for natural selection
- You need a trait to select. Here it is a behaviour.
- You need a selective agent. Here it is reinforcement (via the teacher/trainer/computer).
selection by consequences mechanisms are similar to…
Similar to natural selection — reinforces the theory that learning is not just about acquiring information, but also pruning information
selection by consequences prerequisites
- variation
- fitness consequences
- mode of inheritance
- limited resources
variation
This is now variability in forms of behaviours, across situations, etc.
fitness consequences
This is now simply reinforcement contingencies
mode of inheritance
The learning process itself (operant conditioning).
limited resources
The constraints on the process, and deprivation (e.g., of food) is the analogy.
- E.g., if you are reinforcing with food, maybe you need hunger. It is sometimes suggested that some dogs skip breakfast, so that they will work harder (to increase the drive)
learning is…
ontogenetic (it happens during a lifetime, i.e., it is a developmental process)
learning happens to…
an individual
natural selection is…
phylogenetic (happens over long periods of time and across generations, i.e., it is an evolutionary process).
natural selection happens to…
species
selection by consequences elements found in connectionist theories
- Neural network models make similar assumptions (pruning processes are described).
- Genetic algorithms and genetic programming: They use explicitly biological terms such as “selection”, “mutation”, “crossover”, etc
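Here is a toy loop (my analogy, following the “ingredients” above; all names and numbers are made up) showing how reinforcement can act as a selective agent: behaviours vary, the reinforced one gains “fitness”, and the others are gradually pruned.

```python
import random

# Selection by consequences as a toy selection loop.

weights = {"sit": 1.0, "bark": 1.0, "spin": 1.0}   # initial variation
target = "sit"                                      # trainer reinforces this

for trial in range(200):
    # Emit a behaviour in proportion to current weights (variation).
    behaviours = list(weights)
    emitted = random.choices(behaviours, [weights[b] for b in behaviours])[0]
    if emitted == target:
        weights[emitted] *= 1.05   # reinforcement: "fitness consequence"
    else:
        weights[emitted] *= 0.98   # non-reinforcement: gradual pruning

print(weights)  # 'sit' should dominate after selection
```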
motivation is a…
hypothetical construct
potential issues with Structuralism, Functionalism, Gestaltism
Potential issues: Introspection, lack of experimental rigour.
radical behaviourism (skinnerian)
Potential issues: No hypothetical constructs or intervening variables tolerated (i.e., mediators, e.g., memory, emotions, motivation, the brain).
Operational behaviourism (modern academic behaviourism).
Potential issues: Still focussed on conditioning to explain all behaviours; and focus on behaviour (as opposed to its underlying processes).
Cognitivism: Focus on thought processes, awareness, consciousness, etc.
Potential issues: Poverty of the stimulus and representationalism.
Connectionism
- PDP models, a.k.a. neural network models
- A form of neural associationism
- Explains well perception, learning, memory, but not so well the higher cognitive processes.
What is motivation? (4)
- Hypothetical (theoretical) construct
- Need for intervening variables » Hull » Operational behaviourism (Tolman)
- So important that Woodworth thought psychology should be called “Motivology”.
- But why did some behaviourists (mostly Hull and Tolman) see the need for motivational concepts?
– The main reason: Explain how reinforcement (and punishment) work.
Where is the evidence for motivation?
behavioural
neurophysiological
behavioural evidence for motivation
- Behaviour is variable — some days you are horny, other days not — this changes how you feel
- Behaviour is persistent — why do we procrastinate? We can persist in good and bad behaviours regardless of the outcome
neurophysiological evidence for motivation
- The impact of physiological states relating to homeostasis (balance, a compensating system), allostasis (systems do not balance easily because there are many things going on in parallel — chaotic and paradoxical), metabolism
- Neurotransmitters: e.g., dopamine (one of the most important NT for motivation, that is why it has a direct impact on learning)
- Hormones: Mostly peptide and steroid hormones
How was motivation historically defined?
- ethology/biology
- psychology
Ethology/biology historical definition
drive as instincts — the hunting drive is not really a desire, it is more an incentive behaviour
psychology historical definition
- Drive theory: a response to deficits (it is a “deficit theory”, focus on -R) — deficit theory, works well with homeostasis, weak theory in some ways
- Incentive theory: focus on +R, intrinsic motivation, etc — much more based on positive reinforcement and classical conditioning
incentive theory motivation
- Incentive motivation: Motivation for instrumental behaviour created by anticipation of a positive reinforcer. Also called the rG-sG mechanism (from Hull & Spence)
- Incentive motivation is about how you acquire a reinforcer
- Incentive learning: A process by which organisms learn about the value of a specific reinforcer while they are in a particular motivational state
- Learning of how to be motivated, you can teach an animal to be motivated — requires not always making things too predictable or too easy
do you need reinforcement for learning?
Technically, no. Incidental learning (latent learning) and implicit learning don't require reinforcement
first demonstration of incidental learning
Tolman and latent learning
are reinforcers necessary for learning
Reinforcers are not necessary for learning but are there to motivate behaviour and give purpose.
Reinforcers affect motivation, not learning.
Learning is not performance…
but motivation is performance.
will learning always extinguish if not reinforced
Nope
Eventually extinction will happen in classical conditioning and operant learning
Habit learning: operant with a lot of repetition; everything gets automatized and the acquired habits will not get extinguished
Habit learning…
- Habit learning was hard to situate in learning theory
- Hull and Lashley (e.g., “Maze running habit”) used the term.
- Or Tolman’s “habit slip”.
- The modern use started with Hirsh (1974), then Mishkin et al. (1984).
concept of habit learning is very close to…
“skill learning” (both with a motor component) and both are seen overall as “implicit” or “non declarative” forms of learning
habit learning and skill learning both seem to be controlled by
same parts of brain — basal ganglia, dopamine, subcortical
Idea you can go beyond conditioning and get really strong learning that is not much at the mercy of external reinforcement because it has become intrinsically motivated
E.g., dogs in the field don't always need a treat — sometimes they are motivated enough just by finding a turtle
The more you do something, the more likely you are to develop intrinsic motivation
Difference between liking and wanting something
types of declarative (Explicit) learning
facts and events
types of nondeclarative (implicit) learning
skills and habits, priming, simple classical conditioning, nonassociative learning
Habit learning is:
Instrumental in nature, and very much associated with “motor learning” and the basal ganglia (as opposed to hippocampal-based learning).
- More subcortical than we would imagine
Hippocampal (contextualizer) learning
Context learning (spatial and temporal). Defined as “rapid learning”.
We also know from Gray that the hippocampus is a “comparator”.
The hippocampus gives context (the what, the when and the where).
Change in context can make you lose some learning or ability to adjust in a certain environment
Basal ganglia-based learning
Motor in nature. Defined as “slow learning”.
Now often defined within the realm of the “cortico-striatal loop” (two loops, one is motor, the other is motivational)
If a rat learns to run the maze but can also do it by swimming, it is not just basal ganglia learning (memorizing specific motor sequences), but also hippocampal. They both work.
The cortico-striatal loop (from Seger & Spiering, 2011)
- Not just cortical or subcortical, but both
-So motor-based learning (habit and skill learning) and motivation are linked in the brain.
-Both are heavily driven by the dopaminergic system.
-Remember: Both “motor” and “motivation” (even “emotion”) have the same etymological root.
–> Why you give stimulants like amphetamines or Ritalin to people with ADHD
All parts of the brain are kind of connected…
…and if you look at the etymology, “motor” and “motivation” go together
the evolution of the concept of motivation: Hull
drive theory
drive theory
- Hull, early work: Reinforcement » reduction of drive (e.g., hunger)
- Behaviour strength = D x H
- Behaviour strength = Drive (need) x Habit (learning)
- Interaction between drive and habit
- Behaviour strength = Need x Learning
Resistance to extinction depends on:
- Degree of food deprivation (how hungry you are): Drive or D
- How much prior reinforcement (reinforcement history) you got: Habit or H
Incentive theory: Hull take 2
- The concept of incentive motivation: Hull & Spence
- Behaviour strength = D x H x K
Behaviour strength = Drive x Habit x Incentive
Behaviour strength = Need x Learning x Incentive
They realized that K (incentive) explains forms of learning that drive alone cannot
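As a worked illustration of the multiplicative formula (my numbers): because behaviour strength is D x H x K, a zero on any factor — e.g., a fully sated animal, D = 0 — predicts no behaviour no matter how strong the habit.

```python
# Hull & Spence's equation, with illustrative values:
# behaviour strength = Drive x Habit x Incentive.

def behaviour_strength(drive, habit, incentive):
    return drive * habit * incentive   # D x H x K

print(behaviour_strength(drive=0.8, habit=0.9, incentive=0.7))  # 0.504: strong
print(behaviour_strength(drive=0.0, habit=0.9, incentive=0.7))  # 0.0: sated animal
```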
Incentive =
motivational effect of reward
Hull’s theory (even the update with Spence) is seen as
outdated today
Now we conceptualize motivation as
response to an anticipation of need
Tolman’s view
Tolman: He argued that animals learn expectancies
Food rewards are confirmatory (or not).
Food = catalyst (Rescorla, 1985).
for a strict, radical behaviourist
- the operational terms “anticipation” and “expectancy” are problematic… but only theoretically.
- So they prefer still the term “reinforcement”
- Clearly the reward value in a learning task is contingent on how easy it is to get it
- The way they see the reinforcer is as a confirmer (“yes, I was right” — like TAG teaching in dance)
Here we lose the radical behaviourists because
(anticipation is involved)
The physiological processes are also not a focus of radical behaviourism.
The processes are invisible to the naked eye… unless you have technology… and we do in neuroscience.
Although the process is classically conditioned…
Indeed, the motivational effects of reward come from classical conditioning (this is “incentive learning”).
This is what incentive learning really is
The motivational effect of reward comes from
classical conditioning
reinforcement affecting motivational state idea
Idea is simple: if you are hungry, you are learning about that state of being hungry. It is actually when you are hungry that you are open to the incentive value of the situation you are in. Only when you are hungry can you learn, because it sets you up for that anticipation. Anticipation itself becomes the reward (remember the clicker!)
Clicker announces food > clicker announces possibility of food > pigeon that is still pecking at same stimulus despite the fact there is no food
This occurs because there is still anticipation!
In a sense, motivational states are occasion setters
incentive learning
-You learn in the hungry state about the incentive value
-So you learn about an incentive, and then:
-You link this to the instrumental action.
-If you experience the +R in a non-hungry state, then you will have no interest in the +R (e.g., food).
-Motivational states are occasion setters! (Davidson, 1993, 1998, etc.) or facilitators (Rescorla, 1985)
-Great review of these ideas in Dickinson & Balleine, 1994
Interlude; what are occasion setters?
Safety signals are negative occasion setters:
“if this light is on, the shock won’t happen”
“if I have my pills, I won’t have a panic attack”
and even: “if I have my pill bottle – even if empty – I won’t have a panic attack”.
We can have modulation in conditioning
Facilitation and facilitators; occasion setting and occasion setter
occasion setters are…
environmental cues (CS’s) that “set the occasion” for conditioning to occur. They “facilitate” the conditioning.
Occasion setting (Holland) =
facilitation (Rescorla)
Occasion setters (Holland) =
facilitators (Rescorla) » motivational states
An o.s. is a
CS that confirms a CS-US pairing.
An o.s. is a
modulator of the conditioning between a CS and a US.
In itself, it becomes rewarding
So we have three ingredients in classical conditioning modulation:
- The CS
- The US
- The modulator (second CS) = occasion setter = facilitator » motivational states
what does the modulator add
This adds what we call “conditional relations” and “conditional control”.
It is not unlike basic discriminations in operant conditioning when we reinforce the S+ (the target stimulus or “good response”), and not the S- (the foil, distractor, or “wrong response”).
pavlovian discrimination
Two ingredients:
1. The target stimulus: The CS that is present on every trial in a CS-US pairing. It is the target because that is the one that the subject responds to.
2. The feature stimulus: The CS that is present only in trials indicating if the US occurs or not (depending on the procedure).
On top of operant discrimination, you can add something from a specific situation using classical conditioning
occasion setter facilitates…
a specific association between stimuli — a facilitator can be there or not, but its presence facilitates something
So back to incentive motivation and learning:
-Hunger triggers the foraging behaviour (system).
-Motivation “energizes” the action.
-But you must learn (“know”) that the action leads to the reinforcer.
-Occasion-setters (or facilitators) help in doing so!
-You must learn that the reinforcer has a positive effect on the motivational state.
-The motivational state increases the desirability of the reinforcer
“Learning will always extinguish if not reinforced”
Actually extended training makes behaviour less sensitive to its consequences » habit learning (e.g., Holland 2004).
It is not about the need, or the reward…
but about the anticipation of the reward…
acquired motivation
Motivation is not from drives/needs, but rather the anticipation of need…
Back to Tolman and latent learning…
1. Exploration of the maze. No reward.
2. Reward in the maze: They excel, with no previous reinforcement history!
So… rewards:
1. Reinforce behaviour
2. Motivate
early behaviourists believed in
gradual learning, not one-trial learning. But if a rat can do it just by exploring with no reinforcement, there is something else going on. Rewards reinforce and motivate behaviour
tolmans ultimate contribution
As Bouton (2016) says elegantly, “a reward at the end of the tunnel” goes a long way
Sometimes just the possibility of it is enough
Flaherty (1996) discusses these phenomena at length in “Incentive relativity”.
crespi (1942)
Negative and positive contrast effects
crespi study
An increase in reward (1 to 16 pellets) results in an increase in running speed and elation = positive contrast effect.
– Acquire the task, at shift time there is no change. The learning is stabilized.
A decrease in reward (256 to 16 pellets) results in a decrease in running speed and depression = negative contrast effect.
– Huge crash in running speed, they adjust their behaviour based on what they are reinforced in
1 pellet to 16 pellets
– Huge jump in their learning! Post-shift is huge
crespi study rationale
Those effects are caused by a change in the expected value of the reward. Not really the reward, but what you expect the reward to be. How much effort you are putting in might be modulated by the expected value of the reward.
No doubt this is paradoxical
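A speculative sketch of the “expected value” idea (my simplification, not Crespi's model): let running speed track a trial-by-trial estimate of the reward, updated by a simple learning rule. This reproduces the adjustment after a shift; the elation/depression over- and undershoot Crespi actually observed would need an additional contrast term.

```python
# Effort tracks the expected value of the reward, updated trial by trial.

def run(pellets_per_trial, learning_rate=0.3):
    expected = 0.0
    speeds = []
    for pellets in pellets_per_trial:
        speeds.append(expected)                           # effort ~ expectation
        expected += learning_rate * (pellets - expected)  # update after reward
    return speeds

trials = [1] * 20 + [16] * 20      # shift from 1 pellet to 16 pellets
speeds = run(trials)
print(f"pre-shift speed ~ {speeds[19]:.1f}, post-shift speed ~ {speeds[-1]:.1f}")
```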
A few words about paradoxical reward effects:
Rewards can sometimes weaken behaviours (in an instrumental context)
– E.g., you get tired of them
Non-rewards can sometimes strengthen behaviours (in an instrumental context)
– E.g., this alludes to incentive/intrinsic motivation
This includes the negative contrast effects mentioned above.
Rewards may not always be as magical as people think, they often and quickly become irrelevant
- Animal trainers seem to forget this — the importance to develop incentive motivation and that you do not need to always reward.
- Gen-X thought always rewarding was a great idea — everyone gets a trophy. Sounds good but it creates more problems than anything else, namely increasing a dependency on extrinsic motivation.
What about extinction and motivation?
Extinction has a lot to do with the magnitude of the reinforcement.
The bigger the reward, the bigger the extinction
The higher the magnitude of a reward, the faster the extinction. (Hulse, 1958; Wagner, 1961)… UNLESS you are dealing with HABIT LEARNING
over learning extinction effect
Many rewarded trials can increase the rate of extinction (as opposed to fewer rewarded trials)
what should you do with high value reinforcement
Maybe make it intermittent
example of intermittent reinforcement
Slot machines are very careful in giving very big payouts very rarely. The small payments keep you there; the inconsistent big prizes are the anticipation.
Can rewards have negative effects?
Extrinsic rewards can affect performance negatively.
“Punished-by-reward” principle in humans
“Punished-by-reward” principle in humans
People like this do not want to take responsibility and are also extremely sensitive to external rewards. When it is not rewarding anymore, they disconnect. This is the danger of using extrinsic motivation too much — the idea is to try to find ways of developing intrinsic motivation
can verbal rewards be good?
For some people these are more important than money
Expectations, overall, are important.
Partial Reinforcement Extinction Effect (PREE) and PERSISTENCE
- Rescorla, 1999 (and a few other studies) — said connection between US/CS was really important
He realized he might be wrong after Pearce/Hall and Wagner
-Partial reinforcement reduces the associative change produced by non- reinforcement.
Two groups of rats: CRF (continuous reinforcement) and PRF (partial reinforcement)
In acquisition, CRF (100% of trials are reinforced): better (faster, stronger acquisition)
-But: In extinction, PRF (e.g., 50% of trials are reinforced) » persistence
running speed (Effort for motivation) study
Running speed (effort for motivation) — in acquisition trials the continuous-reinforcement rats were doing better than the partial ones (the partial group shows a little less effort).
In extinction, the continuous group loses the behaviour much more quickly; running speed goes down dramatically. The partially reinforced rats see their effort go down over time too, but much more slowly.
working dogs hypoglycaemia struggle
Train dogs to recognize signs of anxiety in people with PTSD
Biomedical alert and the lack of maintenance training — e.g., hypoglycaemia in people with diabetes
These events are very rare, so it is hard to reinforce. Even when we have people who volunteer to produce those samples, well over half the time they forget. When they get hypo, they deal with the situation at hand, not with training the dog. The dogs then do not get reinforced. That is problematic — unless during the training they start using partial reinforcement. Real life for these dogs means they will not always get reinforced, but they need to keep detecting.
Performance collapse, in part because of rare events: Few chances for reinforcement… if the responses to the events are even reinforced when they occur!
Learned industriousness (Eisenberger, 1992) » this can transfer to other tasks.
Sequential theory (Capaldi)
How to explain PREE? Two theories:
1. Frustration theory (e.g., Amsel): Not well supported
2. Sequential theory: Not about frustration
1. Extinction = many non-rewarded trials (by definition!)
2. No problem! Especially if this matches acquisition done in PRF!
3. It is the memory of the acquisition phase that “sets the tone” for what to expect.
Set expectations early and your future behaviour is dependent on this. This is why always reinforcing is a bad idea. What happens during the acquisition phase is what matters.
the sequential theory, and small and large rewards: Capaldi and Capaldi (1970), Leonard (1969)
Sequences of:
non-reinforced trials (N)
reinforced trials with large rewards (R)
reinforced trials with small rewards (r).
Two acquisition types:
rNR
RNr
rNR is more resistant to extinction, as suggested by the sequential theory
This is because the memory of non-reward is followed by a large reward (so non-reward becomes a signal to keep responding), and because they expect less initially
is frustration theory very well supported?
no, not very well supported. Frustration does not seem necessary to justify motivation.
In fact, some cognitive factors seem at least as important:
- Memory
- Associative strength
How foraging works example
Important point: there is a lot of resistance from some behaviourists and some animal trainers to the idea that reinforcing all the time is not the best. But if you actually look at how animals get reinforced in the wild, reinforcing every time would collapse very quickly
Looking at success rate in predation: for most is very low
Yet they continue doing it
If they were right that you need to be rewarded every time you try to hunt, most lions after a few trials would quit
Says: hey, our brains are wired for not knowing and instead hoping. That is incentive/intrinsic motivation and the effect of anticipation and expectancy.