Learning Part 2: Operant Conditioning Flashcards

Question 1

Q

Operant Conditioning

Answer

A

Goal-directed behaviour

Operant conditioning is concerned with how environmental stimuli shape complex goal-directed behaviours.

Question 2

Q

Edward Thorndike

Answer

A

His experiments, conducted at the turn of the 20th century, paved the way for a behaviourist account of voluntary behaviour

He worked with different animals: e.g. chicks, cats and dogs

He wanted to find out whether animals use reasoning to solve problems

Famous for Thorndike’s puzzle box

Question 3

Q

Thorndike’s puzzle box

Answer

A

Thorndike’s puzzle box: a cat was placed inside a puzzle box and food was placed outside the box
Could the cat work out a mechanism to open the door of the box and obtain the food?

Results:
The cat learned by trial and error (and success): its first attempts were random, then it stumbled across the solution

Cats became faster on subsequent trials in the same puzzle box

Cats learn to associate a response with its rewarding consequence

Consequences shape behaviour: unsuccessful responses are gradually eliminated

The conclusion is that cats learn through simple stimulus-response (S-R) associations rather than through complex reasoning processes

Question 4

Q

Law of Effect

Answer

A

Responses followed by a satisfying state of affairs are strengthened and are more likely to occur again (rewards)

Responses followed by an annoying or unsatisfactory state of affairs are weakened and are less likely to occur again (punishment)
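
The Law of Effect can be illustrated with a toy simulation of the puzzle box. This is only a sketch under stated assumptions, not Thorndike's own formalism: the response names, the starting strengths, the `step` size, and the strength floor are all hypothetical values chosen for illustration.

```python
import random


def run_puzzle_box(trials=30, step=0.2, seed=0):
    """Toy Law of Effect model: response strengths change with their consequences."""
    rng = random.Random(seed)
    # Hypothetical response repertoire; only "pull_loop" opens the door.
    responses = ["scratch", "meow", "push_bars", "pull_loop"]
    strength = {r: 1.0 for r in responses}
    attempts_per_trial = []
    for _ in range(trials):
        attempts = 0
        while True:
            attempts += 1
            weights = [strength[r] for r in responses]
            choice = rng.choices(responses, weights=weights)[0]
            if choice == "pull_loop":
                # Satisfying state of affairs: the response is strengthened.
                strength[choice] += step
                break
            # Annoying state of affairs: the response is weakened
            # (assumed floor keeps every weight positive).
            strength[choice] = max(0.05, strength[choice] - step)
        attempts_per_trial.append(attempts)
    return strength, attempts_per_trial
```

In this model the successful response only ever gains strength and the unsuccessful ones only lose it, so the number of attempts per trial tends to fall across trials, which loosely mirrors the cats becoming faster in the same box.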

Question 5

Q

B.F. Skinner (1904-1990)

Answer

A

He was influenced by Thorndike’s work and described voluntary human behaviour using basic S-R associations, without resorting to mentalistic concepts

“Behaviour operates on the environment to generate consequences.”

Organisms learn which behaviours to emit in order to earn rewards or avoid punishments

Operant describes any active (voluntary) behaviour that is produced in order to generate consequences, or is instrumental in generating consequences

Essentially everyone is trying to gain something desired or avoid something unpleasant

Question 6

Q

B.F. Skinner (consequences shape behaviour)

Answer

A

Consequences shape behaviour: unsuccessful responses are gradually eliminated

Question 7

Q

Reinforcement:

Answer

A

Reinforcement occurs when the consequences of an action increase the likelihood of the action being repeated

Reinforcement increases or strengthens the occurrence of a behaviour in the future

Question 8

Q

Positive reinforcement +

Answer

A

Stimulus or event which, when presented as a consequence of a behaviour, increases the likelihood of that behaviour recurring in the future

Question 9

Q

Negative reinforcement -

Answer

A

Stimulus or event which, when reduced or terminated as a consequence of a behaviour, increases the likelihood that the behaviour will recur

Question 10

Q

Continuous reinforcement

Answer

A

Each response is reinforced

Question 11

Q

Partial reinforcement

Answer

A

Reinforcement is given only for some correct responses

Generates behaviour that persists longer: learners keep “testing” for a reward

Question 12

Q

Fixed ratio schedule

Answer

A

Rewarded after a fixed number of correct responses

High rate of responding

Faster responses yield quicker payoffs (“bursts”)
e.g. being paid for producing a specific number of items

Question 13

Q

Variable ratio schedule

Answer

A

Rewarded after a varying number of correct responses that centres on an average

High rate of responding: persistent responding

People and animals hope that the next response will bring a reward
e.g. gambling

Question 14

Q

Fixed interval schedule

Answer

A

Reinforcement for first correct response after a fixed time period

Flurry of responding right before a reward is due
e.g. test scheduled every four weeks

Question 15

Q

Variable interval schedule

Answer

A

Rewarded for the first correct response after a varying time period that centres on an average

Less predictable

Slow but steady pattern of responding (“testing”)
e.g. surprise quizzes
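
The four partial reinforcement schedules (fixed ratio, variable ratio, fixed interval, variable interval) can be compared side by side in a short sketch. The function names, the `respond(t)` interface, and the choice of uniform random draws around the average are assumptions made for illustration; they are not part of the original cards.

```python
import random


def fixed_ratio(n):
    """Reinforce every n-th correct response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(mean_n, rng):
    """Reinforce after a random number of responses that averages mean_n."""
    def draw():
        return rng.randint(1, 2 * mean_n - 1)  # assumed distribution

    count, target = 0, draw()

    def respond(t):
        nonlocal count, target
        count += 1
        if count >= target:
            count, target = 0, draw()
            return True
        return False

    return respond


def fixed_interval(period):
    """Reinforce the first response at least `period` after the last reward."""
    last = 0.0

    def respond(t):
        nonlocal last
        if t - last >= period:
            last = t
            return True
        return False

    return respond


def variable_interval(mean_period, rng):
    """Reinforce the first response after a random wait averaging mean_period."""
    def draw():
        return rng.uniform(0, 2 * mean_period)  # assumed distribution

    last, wait = 0.0, draw()

    def respond(t):
        nonlocal last, wait
        if t - last >= wait:
            last, wait = t, draw()
            return True
        return False

    return respond


def rewarded_times(schedule, response_times):
    """Return the response times at which the schedule delivered a reward."""
    return [t for t in response_times if schedule(t)]
```

For example, `rewarded_times(fixed_ratio(3), range(1, 10))` returns `[3, 6, 9]`, and `rewarded_times(fixed_interval(10), range(1, 26))` returns `[10, 20]`. On the ratio schedules, responding faster brings rewards sooner; on the interval schedules it does not, because the reward depends on elapsed time.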

Question 16

Q

Shaping

Answer

A

Learning more complex behaviours by reinforcing successive approximations to the desired behaviour:

Reinforce a high-frequency component of the desired response

Drop reinforcement, so that behaviour becomes more variable again

Await a response that is closer to the desired response, then reintroduce reinforcement

Keep cycling: closer approximations are achieved

Shaping can establish behaviour which is not in the animal’s natural repertoire

Question 17

Q

Extinction

Answer

A

Extinction occurs when reinforcement is withheld

It is not an immediate process; there is often a brief increase in responding first

Partially reinforced responses are harder to extinguish

Question 18

Q

Punishment

Answer

A

The use of aversive consequences to reduce undesirable behaviour

Any event which decreases the likelihood that ongoing behaviour will recur

Question 19

Q

Positive punishment +

Answer

A

Behaviour is followed by the presentation of an aversive stimulus

Stimulus is added to situation
e.g. electric shock

Question 20

Q

Negative punishment -

Answer

A

Behaviour is followed by withdrawal of rewarding stimulus

Stimulus is taken away
e.g. removal of toys
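
The positive/negative and reinforcement/punishment labels form a two-by-two grid: "positive" and "negative" describe whether a stimulus is added or removed, and "reinforcement" and "punishment" describe whether the behaviour becomes more or less likely. A minimal sketch of that grid, with a hypothetical function name and argument values:

```python
def classify_consequence(stimulus, effect):
    """Name the operant procedure.

    stimulus: "added" or "removed" (what happens after the behaviour)
    effect:   "increases" or "decreases" (what happens to the behaviour)
    """
    if stimulus not in ("added", "removed"):
        raise ValueError("stimulus must be 'added' or 'removed'")
    if effect not in ("increases", "decreases"):
        raise ValueError("effect must be 'increases' or 'decreases'")
    sign = "positive" if stimulus == "added" else "negative"
    kind = "reinforcement" if effect == "increases" else "punishment"
    return f"{sign} {kind}"
```

For example, an electric shock that follows a behaviour and makes it less likely is `classify_consequence("added", "decreases")`, which gives `"positive punishment"`; removing toys with the same effect gives `"negative punishment"`.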

Question 21

Q

Problems associated with punishment

Answer

A

Punishment is more effective when it is swift (no delay) and consistent (not just administered sometimes)

It is less effective than reinforcement because no desired behaviour is established

It does not cause long-term behaviour change; it only suppresses the behaviour

When threat of punishment is removed, the behaviour returns (e.g. speed cameras)

It produces negative feelings and does not promote new learning

It may even teach the recipient to use punishment towards others

It is useful if behaviour is dangerous and must be changed/suppressed quickly

Question 22

Q

Operant Conditioning: Children

Answer

A

Reinforce alternative behaviour that is incompatible with the undesirable behaviour (e.g. respond to normal voice only, not to screaming)

Identify the crucial reinforcer (maintaining the behaviour) and stop reinforcing the problem behaviour (extinction)

Reinforce the non-occurrence of the undesirable behaviour

Remove the opportunity for positive reinforcement

Use strongly reinforcing stimuli, but use variety (e.g. praise, privileges)

Reinforce immediately after the preferred behaviour

Start by reinforcing every time, then switch to intermittent reinforcement

Encourage self-reinforcement through pride and a sense of self-control

Question 23

Q

Martin Seligman (Learned Helplessness)

Answer

A

He investigated the effects of exposure to uncontrollable shock on escape/avoidance learning in dogs

1/3 of dogs exposed to unavoidable shock failed to learn to avoid or escape from an unpleasant or aversive stimulus

First phase: Classical Conditioning
- shock paired with light
Second phase: Operant Conditioning
- learn to jump to the other side of the box when the light is switched on

Question 24

Q

Basic Principles of Learned Helplessness

Answer

A

Learned helplessness might explain behaviour after abuse and in depression

When the traumatic event first occurs, it causes a heightened state of emotionality, which has been called “fear”

Fear continues until the subject learns that he can or cannot control the trauma

“If subject learns that he cannot control the traumatic event, fear decreases and is replaced with depression.” (Seligman, 1979)

(24 cards)