Operant conditioning Flashcards

1
Q

Define non-associative learning

A

Learning that stimuli exist in the world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define associative learning

A

Learning associations between stimuli/events

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What did cats have to do in Thorndike’s puzzle box?

A

A box, with a flap that could be opened upon pressing of a pedal from the inside of the puzzle box. Cats were placed in puzzle boxes. Cats do not like being in confined spaces. So cats would scratch the inside of the box and would try to get out of the box, and eventually they would accidentally press the pedal and the door of the box would open and cats would get out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What did Thorndike find in relation to how long it took cats to get out of the puzzle box?

A

Thorndike plotted how long it would take cats to press the pedal and get out of the box and found that while initially, cats may take more than 3 minutes to escape, by trial 20, they only required a few seconds to press the pedal and get out of the box

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are three of Thorndike’s laws of learning?
1. Law of ef____ - behaviour that leads to a p____ outcome is m____ likely to occur in the f____
2. Law of ex____ - connections between r____ and o____ are strengthened by r____
3. Law of r____ - learning is m____ by an i____ s____

A
  1. Law of effect - behaviour that leads to a positive outcome is more likely to occur in the future
  2. Law of exercise - connections between responses and outcomes are strengthened by repetition
  3. Law of readiness - learning is motivated by an internal state
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is meant by the organism operates on the environment in operant conditioning?

A

Behaviour changes the environment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is meant by behaviour is instrumental in operant conditioning?
Obtains d____ effect. This type of learning facilitates g____-d____ learning

A

Obtains desired effect, this type of learning facilitates goal-directed learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define the following terms:
1. Reinforcer
2. Positive reinforcer
3. Negative reinforcer
4. Punishment
5. Omission

A
  1. Stimulus that increases the likelihood of the preceding behaviour to occur
  2. Stimulus (usually positive) produced by the behaviour that increases the likelihood of the preceding behaviour to occur
  3. Stimulus (usually negative) eliminated by the behaviour that increases the likelihood of the preceding behaviour to occur
  4. Negative stimulus that decreases the likelihood of the preceding behaviour to occur
  5. Elimination of positive reinforcer decreases the likelihood of preceding behaviour
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is meant by continuous compared to partial reinforcement?

A

Continuous reinforcement - each behavioural response is reinforced
Partial reinforcement - behaviour is reinforced only part of the time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are ratio schedules? Fixed compared to variable?

A

Ratio - reinforcement given after every nth response
Fixed - response requirement always constant (e.g., FR1, FR10)
Variable - response requirement varies around average (e.g. VR5)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are interval schedules? Fixed vs variable?

A

Interval - reinforcement given after certain amount of time
Fixed - reward intervals constant (FI10)
Variable - reward interval varies around mean time (e.g., VI10)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What was found in the experiment that used token-economies in substance abuse treatment?

A

Those rewarded half as likely to relapse compared to those who weren’t rewarded (after eight weeks, 80% still sober compared to 40%)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Compare respondent conditioning to operant conditioning

A

R - Learning associations between stimuli in the environment (Stimulus-Stimulus Learning)
O - Learning associations between actions and stimuli/outcomes (A-O learning)
R - Reinforcement not contingent on action/behaviour/response
O - Reinforcement contingent on action/behaviour/response
R - Behaviour is elicited in response to a stimulus (CS»respondent)
O - Behaviour is emitted/produced by the individual (operant)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe an experiment into classical and operant learning interacting

A

Rat placed in chamber with 2 compartments
Speaker delivers auditory stimulus (would-be CS)
Grid floor delivers mild footshock (US)
Barrier for escape/avoidance of footshock
Initially, rat escapes following US (i.e. footshock)
Eventually learns to escape following CS (i.e. auditory stimulus)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is meant by an empty organism?

A

Organism is black box – collection of associations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Summarise taste aversion learning in rats

A

When licked spout, received sweet water along with bright light and clicking noise
Separated rats into 2 groups – one received mild foot shock when licked spout, other group given x-ray upon liking spout
X-ray radiation results in illness later in time so no contiguity, compared to contiguity in foot shock group
Broke compound stimulus into two - A = bright, noisy water, B = sweetened water
The group that was given the x-ray and was made to feel ill long after presentation of the bright/noisy/sweet water, avoided the bottle with just the sweet water more of the time than it did the bottle that was paired with just the noise and the light. By contrast, the group that was given an electric shock upon licking the bright/noisy/sweet spout during testing, still went for the sweet water spout at test, but avoided the spout that upon licking it they would get the noise and the light. This suggested that contiguity was not necessary for learning to occur (a delayed illness to something we eat will still teach us to avoid that which made us ill). It also suggested that we do not make associations with everything, since the footshock group did not avoid the sweet water. It also suggested that some associations are easier established than others (evolutionary basis to learning what makes us ill)

17
Q

Describe Tolman’s latent learning effect (three different conditions of rats finding their way out of a maze)

A

One group always reinforced for finding way out of maze (food reward)
Other group never reinforced
Always reinforced group showed learning curve and less mistakes than non-reinforced
Ones who received reward on day 10 of training learned incredibly quickly, behaviour changed very quickly even did better than those who had always been reinforced
Shows reinforcement not necessary for learning, just speeds it up!