Week 5 Flashcards

1
Q

Classical vs Operant Conditioning

A
Classical	Conditioning	
• Ivan	Pavlov	
• Learning	via	association	
• (Lectures	2	+	3)	
Operant	Conditioning	
• B.F.	Skinner	
• Learning	via	reinforcement	
• (Lectures	4	+	5)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Theories of reinforcement

A
  • The role of response and reinforcement
  • Skinner
  • Drive reduction
  • Behaviour regulation (Premack)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Learning by doing?

A
• Thorndike	thought	that	a	
response	had	to	be	performed
and	a	consequence	experienced
for	operant	learning	to	occur	
• However,	Tolman	showed	that	
this	is	not	so	
– Responses:	e.g.	learning	without	
doing	(e.g.	rats	running	a	maze	vs	
rats	transported	through	maze,	
learned	the	same	way)	
– Consequences:	e.g.	rats	acquire	a	
cognitive	map	of	their	
environment	even	if	not	
rewarded.	So-called	latent	
learning	(Tolman	&	Honzik,	1930)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Expectations about the reinforcer

A

• Thorndike thought that learning involved the
pairing of Stimulus and Response – without
inclusion of the reinforcer
• Tolman argued that the reinforcer becomes part of
associative network (stimulus, response and
reinforcement)
– Animal develops expectation
• E.g. train monkey to perform action that leads to
banana, and then replace reward with lettuce ->
monkey surprised and frustrated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Experiment by DeWaal & Brosnan

A

Monkeys reject unequal pay

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Theories of reinforcement:

Skinner’s operational definition

A
  • Reinforcer increases rate of behaviour

* Punisher decreases rate of behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Theories of reinforcement:

Drive reduction theory

A

Hull & Spence (1940s)
• 1st theory of motivation
• If homeostasis is disrupted drive is observed
– Primary drives are innate (e.g. thirst, hunger)
• Drive = unpleasant state that the animal wants to
reduce
• Drive reduction of physiological needs = negative
reinforcer & major cause of learning
But… not all reinforcers reduce a biological
drive
• Secondary reinforcers (e.g. money)
– Money can only indirectly reduce drive
– Bridging (previous lecture)
• Novel Stimuli – e.g. Sensation seeking
– Some reinforcement comes from raised
stimulation
– wild rats, mice and shrews elect to run
in wheels
– Working to get access to a window
• Pleasure seeking
• e.g. Intra-cranial reinforcers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Olds & Milner (1954)

A

Electrical stimulation as positive reinforcer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Intra-cranial Reinforcers

Rise of the ‘ratbots’?

A
  • Talwar et al. (2002)

* Nature, 417, 37-38

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What if…

A
• What	if	stimuli aren’t	reinforcers		
– E.g.,	water	isn’t	a	reinforcer	
• But	behaviours are	reinforcers	
– E.g.,	DRINKING	is	a	reinforcer	
• Behavioural	homoeostasis	(not	food,	but	
eating	is	reinforcing)	
• “Bliss	point”	of	behaviour
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Theories of reinforcement:
Behaviour regulation
• Premack’s principle (1965)

A

– A high probability behaviour can reinforce a low
probability behaviour
▪ We have a hierarchy of behaviours arranged
according to response probabilities
(preferences)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The Premack Principle

A

More probable behaviours will reinforce less probable behaviours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Using Premack’s Principle

A
Brown,	Spencer,	&	Swift	
(2002)	saw	a	7-year-old	boy,	
who	refused	to	eat	all	but	a	
few	specific	foods	
• Low	probability	behaviour:	
eating	new	foods	
• High	probability	behaviour:	
eating	favourite	foods	
• At	meal-times,	parents	told	
him	if	he	ate	a	small	amount	
of	new	food,	he	could	have	his	
favourites	
• Boy	gradually	began	to	eat	his	
greens!
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Avoidance vs. Escape

A

• Remember that punishment ideally involves no
possibility of avoidance/escape
• But, what if the animal has some control?
• Escape learning–emit a response that
terminates an aversive consequence (negative
reinforcement)
• Avoidance learning – emit a response to
prevent the occurrence of an aversive
consequence altogether
– but how can this be examined in a lab?
Present CS (light dims) followed by US (shock)
Initial trials feature escape, then avoidance takes over

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Learning About Avoidance Learning

A

▪ Helps us to understand anxiety behaviours
▪ Phobias
▪ Often never encounter aversive event again
▪ How are phobias maintained?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to Reduce Avoidance Clinically

A

• In therapy: exposure training
– Flooding and response prevention
– Modelling of situation appropriate behaviour

17
Q

Learned helplessness

Seligman, 1967

A
▪ Effect	of	unavoidable	shock	
▪ Yoked	control	experiment	
▪ (1)	Escapable	shock:		Avoidance	learning	
▪ (2)	Inescapable	shock:		Yoked	to	the	first	
group’s	shocks	
▪ (3)	No	shock	
▪ Then	train	in	shuttle	
box	avoidance	task
18
Q

Learned helplessness: Effects

A
▪ Impairs	subsequent	learning	(in	more	
difficult	task)	
▪ Depression	
▪ Reduced	activity	
▪ Reduced	immune	responses	
▪ More	ulcers	(stress	related)
19
Q

Learned Helplessness

A

• Repeated exposure to punishment has long-term
effects
• Animals/people start to behave as if their
behaviour has no effect on what happens to them
• This is learned helplessness – they’ve learned that
they’re helpless

20
Q

How To Combat Learned Helplessness

A

Place the subject in a situation where it cannot fail,
so it learns it has some control
– An initial experience of control often
‘immunises’ against learned helplessness

21
Q

Learned helplessness- attributions

A

In humans our attributions can harm or protect us
– Internal vs external
• (because of me/not because of me)
– Stable vs unstable
• (a trait that I have/one off incident)
– Global vs specific
• (applies to all contexts/applies to this one context
• Depression promoting attributions:
– Internal, stable, global
• Depression reducing attributions:
– External, unstable, specific

22
Q

Learned Helplessness In Uni Students

A
• Learned	helplessness	is	
typically	worse	if:		
1. The	person	thinks	
everything	is	hopeless	
2. The	person	thinks	it’s	
their	fault	
3. The	person	sees	the	
helplessness	as	longterm	
• A	uni	student	who	has	
learned	helplessness	
might	think	that:	
1. I’m	no	good	at	any	of	
my	classes	(global)	
2. Because	I’m	just	not	
very	smart	(internal)	
3. And	I’ve	just	always	
been	that	way.	(stable)
23
Q

Behavioural Therapies

A
• Use	conditioning	principles	
• Aim	to	modify	situation	inappropriate	
behaviours	
1. Functional	Analysis	
2. Use	empirically	validated	approach	to	modify	
specific	behaviour/condition
24
Q

Functional analysis

A

• What is the problem behaviour, when does it
occur?
• Helps us to target our treatment
• A functional analysis tries to determine what
reinforcers are maintaining an undesirable
behaviour
• Involves monitoring the relationship between
stimuli, behaviour, and consequences.
• Is done on a case by case basis (N = 1)
• Divide complex behaviours into simple ones
(more manageable)
• Inappropriate behaviour same as ‘normal’
behaviour (non-medical model of pathology)
• Inappropriate behaviour: excess or deficit in a
certain situation

25
Q

Functional Analysis Example

A

• E.g., A mentally disabled 10 year old boy who had
self-injurious behaviour (SIB),
• Watson et al. (1999) performed a functional
analysis and found
– that the SIB happened most often as ‘escape behaviour’
– the boy did it in order to stop doing things he didn’t like.
• Thus, the best way to stop the SIB was to get his
teacher to switch to a fun task if he did an unliked
task without SIB.

26
Q

Functional Analysis - SORCK

A
• Kanfer	&	Phillips	(1970)	
argued	that	the	essential	
‘behavioural	equation’		for	
a	functional	analysis	
included:	
– ‘prior	Stimulation’	
– the	‘biological	state	of	the	
Organism’,	
– the	‘Response	repertoire’,		
– ‘Consequence	
– the	‘Contingency	
relationship’	(K)	
• S:	what	happened	before	
the	behaviour	
• O:	the	skills	and	state	of	the	
organism	at	the	time	
• R:	the	behaviour	
• C:	the	consequence	of	the	
behaviour	
• K:	the	effect	the	
consequences	of	the	
behaviour	have	on	future	
behaviour
27
Q

Operant Conditioning in Action

A

• Positive Parenting Programme (Triple P)
• Teaches parents to use operant conditioning
principles in child rearing
• Focuses on reinforcement of desired
behaviour (rather than punishment)
• Teaches other principles too – like chaining,
making consequences more effective etc.,

28
Q

Advice from Triple P

A
E.g.	Getting	children	ready	for	
school	in	the	morning
– Teach	children	the	behaviours	
they	need
– Set	a	routine	(an	order	for	doing	
them	in)
– Reward	desired	behaviour
– Avoid	nagging	or	hassling	them
– Turn	it	into	a	game
• Uses	principles	like	chaining	and	
reinforcement
29
Q

Behavioural Therapy: Throwing ‘thinking’

back in the mix

A
• CBT	is	a	combination	of	cognitive	
therapy	and	behavioural	therapy	–	
‘cognitive	behavioural	therapy’	
• focus	on	‘thinking	errors’	and	
‘core	beliefs’,	which	is	cognitive	
• but	other	techniques	are	based	
on	operant	conditioning	and	
classical	conditioning,		
• e.g.,	practice	exercises	and	
setting	homework	
• especially	in	the	treatment	of	
anxiety	(where	behavioural	
therapy	was	originally	quite	
successful)
30
Q

Use in comparative psychology

A

• Conditioning often is the basis of lean alternative explanations that
need to be ruled out to demonstrate animal smarts
• e.g. Ravens plan for the future (Kabadayi & Osvath, 2017, Science)
• “Here, we show that ravens plan for events …—tool-use and
bartering—with delays of up to 17 hours.”
• birds picked target from among distractors
• However, simpler explanations not ruled out
• Before E1, the 5 ravens were trained that a tool/ token leads to reward
and that distractors did not
• Unsurprisingly, they subsequently tended to select the functional items
and could then get a reward later
• Even on first trial, though there was no rational ground to predict that the
reward situation would emerge 17 hours later
• More about rich vs lean interpretations of animal behaviour in the
lecture on comparative cognition