Operant Conditioning Flashcards
What is operant conditioning?
A form of learning due to the consequences of behaviour, through reinforcement and punishment
When does positive reinforcement occur?
Occurs when a behaviour is rewarded with something pleasant
This reinforcement will strengthen behaviour
What are the two types of positive reinforcement?
Primary and secondary reinforcers
What is a primary reinforcer?
Include things such as food and water which fulfil a basic need
What is a secondary reinforcer?
Refers to when the reward is something that is not a basic need but leads to a basic need
e.g. money to buy food
What is negative reinforcement?
Strengthens behaviour and occurs when the reward is the removal of something unpleasant
e.g. Skinner would run a small electrical current along the bottom of the Skinner box until the lever was pressed, which would turn the shock off
What is punishment?
Discourages behaviour rather than encouraging it and involves doing something unpleasant after undesirable behaviour has occured
Punishment can be either positive or negative
What is positive punishment?
When something undesired is given as a punishment for bad or unwanted behaviour
e.g. a driver receives a fine for speeding
What is negative punishment?
The removal of something nice as punishment to reduce bad behaviour
e.g. removing a teenagers phone
What is continuous reinforcement?
When behaviour is followed by a consequence
e.g. each time the rat presses the lever at the same time as the light switched on a pellet is delivered
What are the four types of schedules of reinforcement?
Interval
Interval-variable
Fixed ratio
Variable ratio
What is interval reinforcement?
There is a fixed time between rewards (e.g. 5 minutes)
This scheduel means more behaviours per reward as the time schedule increases
What is interval-variable reinforcement?
There is a varibale time between the rewards (or punishments)
Behaviours are usually steady because the timing of the consequences (e.g. a reward) is unknown
What is fixed ratio reinforcement?
Behaviours are rewarded on a ratio which is the same each time
This tends to generate a high number of responses up to the reward and fewer after the reward is given
What is variable ratio reinforcement?
Reward and punishment are given randomly
This is the most efficient because it isn’t known when the consequences will arrive so the behaviours will be tried continually