Operant Conditioning Flashcards
What is Operant Conditioning based on?
Learning through consequence or reinforcement.
What does Reinforcement mean?
When the desired behaviour is rewarded. This makes it more likely to be repeated.
What does Positive Reinforcement mean?
Rewards desired behaviour by adding something pleasant, e.g. money.
What does Negative Reinforcement mean?
Rewards desired behaviour by removing something unpleasant, e.g. taking away pain or distress.
What is Primary Reinforcement?
When the reward is something we want naturally.
What is Secondary Reinforcement?
A reward we have learned to value, e.g. money.
What does Punishment mean?
When an undesirable behaviour produces unpleasant consequences.
What does Positive Punishment mean?
Punishes undesirable behaviour by adding something unpleasant, e.g. a shock.
What does Negative Punishment mean?
Punishes undesirable behaviour by removing something pleasant, e.g. taking away the Xbox.
In order for conditioning to be effective, what must it be?
- Contingent
- Contiguous
What does Contingent mean?
There’s a clear link between the person’s behaviour and the consequence it produces.
What does Contiguous mean?
The consequence follows soon after the behaviour - if there’s a long delay, conditioning is weakened.
Punishment doesn’t help achieve the desired behaviour, it…
…just makes the undesired behaviour less likely.
What did Skinner carry out his research on?
Animals, famously rats.
What was “Skinner’s Box”?
A box that contained a lever, light and food dispenser.
How was “Skinner’s Box” used to condition the rats?
If the rat pressed the lever, the light came on and a food pellet rolled down the chute. At first the rat did it by accident, but it repeated the behaviour after receiving the food (reward) - positive reinforcement.
How was Skinner’s study Contingent and Contiguous?
Contingent = the light came on, alerting the rat to what it had done.
Contiguous = food was dispensed immediately.
What did Skinner do in his variation study to understand negative reinforcement?
Electrified the floor and made the lever turn off the current for 30 seconds - the rat learned to remove something painful by pressing the lever.
What did Skinner find in his variation study?
Rats learned to press the lever, but not as quickly as the rats that were positively reinforced.
What is a Continuous schedule?
Given behaviour produces reinforcement/punishment every time it occurs. Reinforcement/punishment is predictable.
What are Fixed Ratio schedules?
When a given behaviour produces reinforcement (or punishment) only after a certain number of responses have been performed. Reinforcement/punishment is predictable.
What are the features of Fixed Ratio?
- Reward turns up every time the set number of desired behaviours has been carried out, so it can be earned often.
- If you don’t do the behaviour, you get nothing. If you work fast, you get a lot.
- Learning is fast and extinction is moderate.
What are Variable Ratio schedules?
When a given behaviour produces reinforcement/punishment only after a changing number of responses have been performed. Reinforcement/punishment is unpredictable.
What are the features of Variable Ratio?
- Reward is dispensed unpredictably, after a changing number of behaviours, e.g. feeding the rat after one lever-press, then 5, then 3.
- Learning is fast and extinction is slow.
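How could the Continuous, Fixed Ratio and Variable Ratio schedules be sketched as rules?
Each schedule is a rule for deciding which responses (e.g. lever-presses) earn a reward. The Python below is only an illustrative sketch: the function names are made up for this card, and drawing the Variable Ratio requirement evenly from 1 to 2 x mean - 1 is an assumption chosen for simplicity, not part of Skinner's procedure.

```python
import random


def continuous(n_responses):
    """Continuous schedule: every response is reinforced."""
    return [True] * n_responses


def fixed_ratio(n_responses, ratio):
    """Fixed Ratio: every `ratio`-th response is reinforced (predictable)."""
    return [i % ratio == 0 for i in range(1, n_responses + 1)]


def variable_ratio(n_responses, mean_ratio, rng):
    """Variable Ratio: reinforce after a changing number of responses.

    Assumption: the number of responses required is redrawn after each
    reward, uniformly from 1 to 2 * mean_ratio - 1, so it averages
    mean_ratio but cannot be predicted.
    """
    rewards = []
    needed = rng.randint(1, 2 * mean_ratio - 1)
    count = 0
    for _ in range(n_responses):
        count += 1
        if count >= needed:
            rewards.append(True)
            count = 0
            needed = rng.randint(1, 2 * mean_ratio - 1)
        else:
            rewards.append(False)
    return rewards


if __name__ == "__main__":
    rng = random.Random()  # hypothetical choice; any random source works
    print(continuous(6))
    print(fixed_ratio(6, 3))
    print(variable_ratio(12, 3, rng))
```

In the Fixed Ratio output the rewards land at regular positions, whereas the Variable Ratio output changes from run to run, which is what makes that schedule unpredictable.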