Operant Conditioning Flashcards
What is operant conditioning?
Learning process where the strength of the behaviour is modified by the response to following that behaviour, with it being ellicted through through positive and/or negative modification [punishment and reward].
Give an example of positive and negative reinforcement
positive: driving safely to become no.1 driver.
negative: driving safely to avoid fines.
Define the four techniques of manipulating positive and negative stimuli to reinforce behaviours (4/4)
POSITIVE REINFORCEMENT: adding positive stimulus = to encourage a behaviour to happen again
AVERSIVE PUNISHMENT: adding negative stimulus = to punish a behaviour to stop it happening again
RESPONSE COST: removing a positive stimulus = to punish a behaviour to stop
NEGATIVE REINFORCEMENT: removing a negative stimulus = to encourage a behaviour to happen again
what is aversive punishment
the unpleasant stimulus is given after an undesired behaviour occurs with the aim of making the behaviour less likely
What is a response cost
when a positive stimulus is taken away
what is contiguity in operant conditioning
Contiguity refers to the timing during the learning phrase, or how ideas, memories and experiences are linked closely together in time.
what is contingency in operant conditioning
Contingency relationship between a response and a reinforcer, or a response and the punisher (dependent on each other).
what is Continuous Reinforcement
When reinforcement is given after each and every demonstration or performance of the desired behaviour.
Learning happens quickly when continuous reinforcement is used.
The drawback is that if reinforcement is stopped, the behaviour will quickly slow down and eventually stop.
what is Partial Reinforcement
Occurs when only some of the desired behaviours are reinforced.
Skinner experimented with four different experimental conditions and manipulated both the timing and frequency of when he rewarded his animals for correct behaviours.
define Interval Schedules and Ratio Schedules
Interval Schedules: reinforced based on time intervals that a behaviour is performed.
Ratio Schedules: reinforced based on the frequency (or number of times) of the behaviour being performed.
define fixed and variable intervals with examples
- reinforcement delivered at predictable time. e.g., paid every fortnight
- reinforcement is delivered at unpredictable time. e.g., fishing, dont know when they’ll bite but the longer the line is in the water the more likely of the bite.
define fixed and variable ratios with examples
- reinforcement delivered after a predictable number of responses. E.g., commisions, being paid for every x number of items made
- reinforcement is delivered after an unpredictable number of responses. E.g., gambling-reward after unpredictable number of presses
Describe how schedules of reinforcement affect learning, extinction, and performance.
continuous reinforcement: reward the behaviour continuously each time it is performed. learning is fast performance may become dependent on the reward being present. Extinction is fast.
Define behaviour modification in operant conditioning [5 steps]
the processing of modifying behaviours over the long-term using various motivational techniques and reinforcement strategies. Aims to replace problem behaviour with positive ones.
[5 steps]
1. monitor
2. negotiate a realistic goal
3. reinforcement schedule
4. start modification process and reward small improvements
5. gradually remove the reward
What is the placebo effect
Placebo: a treatment which could be medication, therapy or other intervention that appears to have the benefits of the medical intervention. expectation for it to work makes it work.