Operant Conditioning Flashcards
(46 cards)
Describe how operant conditioning was developed
Operant conditioning is the second learning theory to be developed.
It was largely developed by Burrhus Frederic Skinner. His research which developed the theory was based upon the principles of earlier research
carried out by Thorndike.
Thorndike (1911) developed a puzzle box with a single exit that could only be opened by a system of levers. A hungry cat was placed in the box and food was
placed just outside. The cat moved around the box, trying to get out because of the food. When
moving around the cat accidently pressed the lever and gradually learned by trial and error how to open the box. Then the cat could get the
food, which acted as the reward. This sort of learning, in which the solution is hit upon by trial and error, and a reward is given so that the behaviour is repeated is called ‘trial- and-error’ learning.
Thorndike called this the law of effect - if the effect of the learning (to get out of the box) is good (obtains food) then the behaviour is learnt and repeated. Learning takes place because of what
happens after the action.
Recall key aspects of Skinner’s research and the famous Skinner’s box
Skinner placed the rats into a Skinner box were he able to manipulate the rat’s behaviour through reinforcements. Skinner focused on manipulating the rat’s behaviour when they were hungry. Therefore, the box was used to deliver positive and negative reinforcements associated with eating.
As the rats spent time in their environments, they started to learn different aspects about it and how they were punished and rewarded.
•If the rats carried out a desired behaviour, then they would be rewarded with food. For example,
the rats had to learn that each time they saw a red light and pressed the lever they were rewarded with a food pellet. Thus, the rats learnt over time an association with touching a lever
and receiving a food reward. This increased the likelihood of them pressing the lever.
•Skinner also used an electric shock generator wired to the grid on the floor of the cage to deter the rats from carrying out certain behaviours. This electric shock was used as a punishment. Skinner found if he punished a behaviour in this way the rat became less likely to repeat it.
From these findings Skinner developed the theory of OPERANT CONDITIONING.
Skinner believed that behaviour is
shaped by its consequences. He proposed the ABC model of operant conditioning. What is the model?
A = Antecedent The situation or stimulus that triggers a behaviour. B= Behaviour The response made by the organism. C= Consequences What happens because of the behaviour.
Define reinforcements
consequences from the environment that increase the probability of a behaviour being repeated. Reinforcers can be positive or negative.
Define punishers
consequences from the environment that decrease the likelihood of a behaviour being repeated.
Define primary reinforcers
These are rewards that are focused on meeting a basic need.
(Shelter, food, drink, warmth,
pleasure)
Define secondary reinforcers
these are rewards that can satisfy a basic need, but it is not in itself a basic need.
(money, house points, chocolate)
Define positive reinforcements
something good is given in response to the behaviour eg. giving someone chocolate
Define negative reinforcements
something bad is taken away in response to the behaviour eg. Given a electric shock until a lever is pressed
Mike’s morning alarm goes off at 5:00AM. He hates this noise and reaches over and hits the
STOP button which stops the alarm noise. He does this every morning when his alarm goes off.
Turning off your morning alarm is an example of negative reinforcement. Explain why this is negative reinforcement. (3 marks)
Negative reinforcement encourages the behaviour by taking something unpleasant away when the desired behaviour is performed. When Mike hits the stop button on his alarm (desired behaviour) the alarm noise which is the unpleasant stimulus is taken away. This negative reinforcement encourages Mike to repeat the behaviour of pressing the STOP button on his alarm.
Define positive punishment
Something bad/undesired is given and the unwanted behaviour is stopped
Define negative punishment
Something good/desired is taken away and the unwanted behaviour is stopped.
Why are punishments is not usually recommended as an effective method to manage behaviour?
A difficulty with punishment is that it does not promote or give information about the desired behaviour it only stops undesirable behaviour.
Does Operant Conditioning take account of individual differences?
Skinner suggested that positive and negative reinforcements occur in an individual’s environment and because everyone’s environment is different, this theory accounts for INDIVIDUAL DIFFERENCES.
Explain how the issue and debate- Determinism links to Operant Conditioning
Skinner also suggested that individuals are not actually in control of their environments and upbringing. Therefore, Skinner proposed that the concept of freewill was an ‘illusion’: all behaviour is a consequence of learning from the environment which an individual does not have control over.
-Also links to social control as operant conditioning is based on deterministic principles which believe all behaviour can be shaped by environmental forces essentially proposing that human behaviour can be manipulated and therefore subject to social control.
Define token economy
A treatment method that provides secondary reinforcement for desirable behaviour that can be saved up or exchange for primary reinforcer.
Describe the application of operant conditioning using token economies
Token economies are used in a variety of settings such as schools and prisons, but the main aim is to encourage desired behaviour through a system of reward, and reduce undesired behaviour through withdrawal of rewards or privileges (punishment).
The tokens used in such economies are secondary reinforcers that can be exchanged for primary reinforcers. The tokens are therefore given in response
for showing desired behaviour. The more tokens received the better the reward.
Give the strengths of methodology used in evidence to support Operant Conditioning
The theory of Operant Conditioning is backed by scientific research. The characteristics of science are clearly demonstrated throughout.
•Empiricism; only directly observable behaviour was studied by Skinner. The data gathered in the research was quantitative and this provides objective statistical evidence to confirm how behaviour is learnt. Furthermore, Skinner’s research was carried out in laboratory settings with a high level of controls. For example, the
use of the Skinner box in particular allowed for the careful manipulation of reward and punishments to be given in a controlled environment to manipulate behaviour which in turn allowed Skinner to establish cause and effect links between reinforcers and behaviour. Therefore, the accuracy of Skinner’s data is high which increases the internal validity of the
research underpinning the theory, thus providing it with credibility.
•Falsifiable: The concepts of the theory, such as reinforcement and punishment are clearly defined allowing testable predictions to be made.
•Reliable: The Skinner box allowed for standardised procedures which enabled Skinner to replicate his research and test whether behaviour is learnt through positive and negative
reinforcers repeatedly with many different rats and pigeons. This confirmed the consistency of
his findings.
Give a strengths of applications to support Operant Conditioning
OC has many practical applications in society. It can provide a concrete explanation for how a variety of different behaviours are learnt, such as addiction to drugs and therefore how behaviour can be changed through reinforcements. Therefore, OC is useful in the education and the criminal justice system and in treatments for dysfunctional behaviours. For example,
operant conditioning is used widely in schools in the form of ‘token economy’ systems. Paul and Lentz (1977) showed the benefits of token economy systems for the treatment of patients with schizophrenia and found only 11% needed drug treatment compared to 100% in the control group.
Give a weakness of the methodology of operant conditioning
•Generalisability: Skinner’s research used rats and pigeons as participants which lowers the generalisability of the research to humans. Animals have different everyday experiences from humans and they do not have the cognitive abilities of humans e.g. language, therefore their learning experiences will be different. Human learning is much more complex and includes many different cognitive and social factors. Thus, rats and pigeons
are not fully representative of the target population of human beings.
•Ecological validity of the underpinning research can be argued to be low because most organisms learn in far more complex environments than a Skinner box. This limits the credibility
of operant conditioning as an accurate explanation of human behaviour.
•Ethics: It could be argued that Skinner caused unnecessary suffering with his use of electric shocks on his animals. However, the costs to the animals may well be outweighed by the many
applications of this research that have made human life easier. E.g., training of guide dogs
Describe a weakness of Operant Conditioning along with an Alternate explanation
The validity of operant conditioning as an accurate explanation of human behaviour is lowered as it can be seen as reductionist. This is because it focuses only on positive and negative reinforcers to explain human behavior and ignores other possible factors involved in behaviour such as biological and cognitive influences. Social Learning Theory provides an alternative explanation of human behavior, including the cognitive elements.
Outline the process of learning in a Skinner box
In the Skinner’s box the rats environment was manipulated through reinforcements. The box included positive and negative reinforcements associated with eating. As the rats spent time in their environments they started to learn different aspects about it and how they were punished and rewarded. If the rats were able to carried out a desired behaviour then they would be rewarded with food. Skinner also used the electric shocks generator to deter the rats from carrying out certain behaviours. This was used as a punishment.
List The two main types of response
behaviours skinner proposed occurred in operant conditioning.
Reinforcers
Punishments
Nisha is a primary school teacher. All the children in her class are well behaved except Nicholas. He always seems to be talking when he should be listening and spends much of his
day doing things that distract, upset or the children around him. Nisha makes sure she watches him carefully all the time and she tells him off every time she catches him being naughty. But his behaviour seems to be getting
worse, not better.
Explain why the punishment given to Nicholas by his teacher is not promoting good behaviour (3marks)
Nisha is rewarding Nicholas’ behaviour by giving him attention each time she tells him off. This attention could be explained as a positive reinforcement
- since Nicholas maybe enjoying the attention. This is a difficulty since Nisha thinks that she is managing the behaviour however she is actually reinforcing it.