instrumental conditioning Flashcards by Isha Tom

Instrumental conditioning or operant conditioning

Learning a contingency between a behaviour and a consequence.
A key difference from classical conditioning is that the behaviours considered here are overt: the actor operates on the environment, and the behaviour leads to a reinforcer.

Law of effect

Behaviours with positive consequences are stamped in and performed more frequently.
Behaviours with negative consequences are stamped out and performed less frequently.

Reinforcer

Any stimulus presented after a response that changes how frequently the response is performed. Behaviours can be changed through either the presentation or the removal of a reinforcer.

Presentation of a positive reinforcer and removal of a negative reinforcer

Increase in behaviour

Presentation of a negative reinforcer and removal of a positive reinforcer

Decrease in behaviour

Reward training

Presents a positive reinforcer to encourage a behaviour

Punishment training

Presents a negative reinforcer to discourage a behaviour. This can be unethical, and the authority figure delivering the punishment may come to inflict fear.

Omission training

Removes a positive reinforcer to discourage a behaviour. A time-out is an example of this. Punishment and omission training both lead to a decrease in unwanted behaviours, but they use different methods.

Escape training

Removes a negative reinforcer to encourage a behaviour
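
The four training types on the cards above combine two choices: whether a reinforcer is presented or removed, and whether that reinforcer is positive or negative. A minimal sketch of that 2x2 table as a lookup follows; the names `TRAINING_TYPES` and `classify` are hypothetical and used only for illustration.

```python
# Sketch of the 2x2 table from the cards above.
# Key: (operation, reinforcer type) -> (training type, effect on behaviour)
TRAINING_TYPES = {
    ("present", "positive"): ("reward training", "increase"),
    ("remove", "negative"): ("escape training", "increase"),
    ("present", "negative"): ("punishment training", "decrease"),
    ("remove", "positive"): ("omission training", "decrease"),
}


def classify(operation, reinforcer):
    """Look up the training type and its effect on behaviour."""
    return TRAINING_TYPES[(operation, reinforcer)]


# A time-out removes a positive reinforcer.
print(classify("remove", "positive"))
```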

Acquisition

Learning a contingency between a response and its consequence. Acquisition is assessed through the response rate of a behaviour.

Cumulative graph for the response rate of a behaviour

Y axis = cumulative behaviour; x axis = time
Horizontal line = no response
Upward slope = a response has been made
The pattern of responding depends on the participant, the complexity of the behaviour and the type of behaviour used.
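
As a rough illustration of how such a graph is built, the sketch below turns a list of response times into (time, cumulative count) points. The function name `cumulative_record`, the sampling `step`, and the example response times are assumptions made for this example, not part of the deck.

```python
def cumulative_record(response_times, duration, step=1.0):
    """Return (time, cumulative responses) points sampled every `step` units."""
    times = sorted(response_times)
    points = []
    count = 0
    i = 0
    n_steps = int(duration / step)
    for k in range(n_steps + 1):
        t = k * step
        # Count every response that has occurred up to time t.
        while i < len(times) and times[i] <= t:
            count += 1
            i += 1
        points.append((t, count))
    return points


# Hypothetical session: a burst of responses around t=2..3, then one at t=8.
record = cumulative_record([2.0, 2.5, 3.0, 8.0], duration=10.0)
for t, total in record:
    print(t, total)
```

Between t=3 and t=7 the count does not change, which is the horizontal line on the graph; each new response raises the count, which is the upward slope.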

Autoshaping

A contingency is learned without any direct guidance. For example, a pigeon in a cage pecks the keyhole and gets a grain. This contingency is learned without any help.

Shaping through successive approximation

Used for behaviours that are too complex to be autoshaped. The behaviour is built up through gradual, smaller approximations, and rewards are presented for each closer approximation. Used by animal trainers.

Chaining

A technique used to develop a sequence of behaviours. Each behaviour is reinforced with the opportunity to perform the next behaviour in the sequence. Helpful for learning complex behaviour.

Shaping vs chaining

Shaping: reinforces a behaviour only if it is a closer approximation of the desired final behaviour than the behaviour last reinforced. Reinforcement is on the basis of improvement.
Chaining: reinforces a behaviour so long as it is performed in a defined order. The behaviours and their order are set prior to the training.
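
The two reinforcement rules can be contrasted in a small sketch. It assumes, purely for illustration, that a behaviour can be scored as a single number (for shaping) or named as a step in a list (for chaining); the function names and example values are hypothetical.

```python
def shaping_reinforces(attempt, last_reinforced, target):
    """Shaping: reinforce only if the attempt is closer to the target
    than the behaviour that was last reinforced."""
    return abs(target - attempt) < abs(target - last_reinforced)


def chaining_reinforces(performed, required_order):
    """Chaining: reinforce only if the behaviours were performed
    in the order defined before training."""
    return list(performed) == list(required_order)


# Shaping: the criterion tightens each time an attempt is reinforced.
target = 100
last = 20
for attempt in [30, 25, 60, 55, 90]:
    if shaping_reinforces(attempt, last, target):
        last = attempt
print(last)

# Chaining: same behaviours, but only the defined order is reinforced.
order = ["sit", "roll over", "bark"]
print(chaining_reinforces(["sit", "roll over", "bark"], order))
print(chaining_reinforces(["roll over", "sit", "bark"], order))
```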

Discriminative stimulus (SD/S+)

Indicates when a contingency is valid

S-delta (SΔ/S-)

Indicates when a contingency is invalid

Partial reinforcement

Reinforcement follows a ratio schedule (based on the number of responses) or an interval schedule (based on time). Both can be fixed or variable.

Four basic schedules of reinforcement

Fixed ratio (FR), variable ratio (VR)
Fixed interval (FI), variable interval (VI)

Fixed ratio

Delivers reinforcement after a set number of responses. It produces a pause-and-run pattern of responding and may lead to ratio strain.

Variable ratio

Delivers reinforcement after a set average number of responses. It can support a high response rate (a climbing slope).

Fixed interval

Delivers reinforcement after a set interval of time. Rarely seen outside of the lab

Variable interval

Delivers reinforcement after a set average amount of time. Produces a steady response rate (a straight line).
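
The four schedules above can be sketched as small decision rules that answer one question per response: is this response reinforced? This is an illustrative sketch only. The function names are hypothetical, the interval schedules are assumed to reinforce the first response made once the interval has elapsed, and the way the variable schedules draw their requirements (uniformly around the mean) is an assumption chosen for simplicity.

```python
import random


def fixed_ratio(n):
    """FR: reinforce every n-th response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count >= n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(mean_n, rng):
    """VR: reinforce after a number of responses that averages mean_n."""
    count = 0
    required = rng.randint(1, 2 * mean_n - 1)  # assumed distribution

    def respond(t):
        nonlocal count, required
        count += 1
        if count >= required:
            count = 0
            required = rng.randint(1, 2 * mean_n - 1)
            return True
        return False

    return respond


def fixed_interval(interval):
    """FI: reinforce the first response after `interval` time units."""
    last = 0.0

    def respond(t):
        nonlocal last
        if t - last >= interval:
            last = t
            return True
        return False

    return respond


def variable_interval(mean_interval, rng):
    """VI: reinforce the first response after a wait averaging mean_interval."""
    last = 0.0
    wait = rng.uniform(0, 2 * mean_interval)  # assumed distribution

    def respond(t):
        nonlocal last, wait
        if t - last >= wait:
            last = t
            wait = rng.uniform(0, 2 * mean_interval)
            return True
        return False

    return respond


# One response per time unit for 60 units under each schedule.
rng = random.Random(0)
schedules = {
    "FR 5": fixed_ratio(5),
    "VR 5": variable_ratio(5, rng),
    "FI 5": fixed_interval(5),
    "VI 5": variable_interval(5, rng),
}
for name, schedule in schedules.items():
    reinforcers = sum(schedule(t) for t in range(1, 61))
    print(name, reinforcers)
```

Note that the sketch only models when reinforcement is delivered. The response patterns named on the cards (pause and run, steady rate) come from how the learner reacts to each schedule, which is not simulated here.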

Robust learning

Behaviour learned under partial reinforcement is more robust than behaviour learned under continuous reinforcement: it is less susceptible to extinction. Variable schedules are more robust than fixed schedules.

Primary reinforcer

A reinforcer with intrinsic value, such as food, water, or a mate

Secondary reinforcer

A reinforcer that gains its value through previous learning and can be exchanged for a primary reinforcer. Example: money

Negative contrast

A response originally receiving a high reward is shifted to a lower reward, resulting in reduced responding

Positive contrast

A response originally receiving a low reward is shifted to a higher reward, resulting in increased responding

Overjustification effect

Promoting intrinsic motivation is important for the long-term adoption of a behaviour: if one relies on extrinsic rewards, motivation is lost when the rewards stop.

Mirror neurons

Most organisms generate involuntary motor responses roughly equivalent to that of any behaviour they observe

(30 cards)