Learning Theories Flashcards
Classical conditioning
Learning by association, occurs when a neutral stimulus is repeatedly paired with an unconditioned stimulus, the neutral stimulus eventually produces the same response as the unconditioned stimulus. Discovered by Pavlov
Unconditioned stimulus (UCS)
A stimulus that produces a response without any learning talking place
Unconditioned response (UCR)
An unlearned response to an unconditioned stimulus
Neutral stimulus (NS)
A stimulus that does not produce the target response, it becomes a conditioned stimulus after being paired with the UCS
Conditioned stimulus (CS)
A stimulus that only produces the target response after it has been paired with the UCS
Conditioned response (CR)
The response elicited by the CS (the UCR)
Extinction
When the CS and UCS have not been paired for a while the CR is extinguished and the CS no longer elicits it. Shows that our learning is flexible
Spontaneous recovery
An extinct response which activates again without new pairings with the UCS. Generally these responses are weaker than the initial response
Stimulus generalisation
When an individual that has acquired a CR to one stimulus begins to respond to similar stimuli in the same way
Process of classical conditioning
UCS produces UCR, NS does not produce this response
UCS and NS are paired
NS now produces the same response as the UCS
Now called the CS and CR.
Classical conditioning - strengths
Pavlov (1927)
Watson and Rayner (1920)
Application to aversion therapy - an uncomfortable response is paired with an unwanted behaviour to prevent the occurrence of this behaviour
Classical conditioning - weaknesses
Rescorla (1968)
Incomplete explanation of learning - can only explain a limited range of behaviours (simple reflex responses). For example, could explain how we come to fear dogs but not the maintenance of this fear over time
Rescorla (1968)
Found that contingency (extent to which the NS reliably predicts the UCS) is more important than contingency (how close in time they occur) in the pairing of the NS and UCS. Opposite to what Pavlov thought
Pavlov (1927) - aim
To explain the role of conditioned reflexes in the eating behaviour of dogs
Exploring how salivation becomes associated with new stimuli apparently unrelated to food and the properties of this association
Pavlov (1927) - procedure
Lab study, saliva measured by no. drops or volume in a cannula, took place in a soundproof chamber
Measured baseline salivation in response to NS (metronome), then paired NS and UCS (food) around 20 times varying the presentation before and after the UCS. Variations used to look at extinction, the CS was presented several times without the UCS producing extinction
Pavlov (1927) - findings
The NS did not initially elicit a salivation response, whereas the UCS did immediately. After forward pairings the NS did elicit the response but not when paired backwards and it only became associated if the dog was alert and undistracted. Extinction could be seen and there was also spontaneous recovery
Pavlov (1927) - conclusion
A link is likely to be made between a UCS and an NS that occurs just before the UCS.
Pavlov (1927) - strengths
High internal validity - controlled extraneous variables (soundproof room, collection of saliva in a cannula and carefully tested NS)
Application to eating problems - Jansen et al (2003)
Pavlov (1927) - weaknesses
Lack of generalisability to humans - humans have a larger cerebral cortex compared to dogs, enabling conscious choice
Jansen et al (2003)
Overweight children have acquired very strong associations between cues that predict the arrival of food and the salivation response
Operant conditioning
Learning that occurs when a behaviour is followed by reinforcement or punishment that will increase or decrease the chance of the behaviour occurring again.
Skinner (1948)
Conducted research on animals kept in a chamber. It contained a supply of food pellets that could be released as reinforcement when the animal learnt to do something (pull a lever). Some also contained electric floors that could be used as punishment.
Reinforcement
A behaviour is said to be reinforced when an event that takes place after it increases the probability of the behaviour being repeated. This can be positive reinforcement or negative reinforcement. It could also be primary or secondary
Positive reinforcement
When something nice is introduced to the individual following a behaviour, that increases the probability of the behaviour being repeated
Negative reinforcement
When something unpleasant is removed from the individual following a behaviour, increasing the probability of the behaviour being repeated
Primary reinforcement
When the thing acting as the reinforcer has biological significance, such as food
Secondary reinforcement
When the thing acting as the reinforcer has become associated with something that has biological significance, for example money, which can be used to buy food
Punishment
When an event following a behaviour makes the repetition of the behaviour less likely. It can be positive or negative.
Positive punishment
When something unpleasant is introduced to the individual following a behaviour, decreasing the chance of the behaviour being repeated
Negative punishment
When something nice is removed from the individual following a behaviour, decreasing the probability of that behaviour being repeated
Operant conditioning - strengths
Chase et al (2015)
Skinner (1948)
Application to education and childcare - used in school e.g. detentions or gold stars
Chase et al (2015)
Revealed brain systems that relate to reinforcement in humans using brain scans
Operant conditioning - weaknesses
Incomplete explanation of learning - can only explain how existing behaviours are strengthened or weakened and not how they originated.
Schedules of reinforcement
A plan of how often and when reinforcement will be provided.
Fixed ratio reinforcement schedule
A reinforcement is given after a specified number of behaviours
Variable ratio reinforcement schedule
A reinforcer is given after an unpredictable number of behaviours that vary around a mean value
Fixed interval reinforcement schedule
The time between reinforcers is kept constant
Variable interval reinforcement schedule
The time between reinforcers is varied around a mean length of time
Behaviour modification
A form of therapy in which reinforcement is systematically used to increase the number of instances of desired behaviour
Behaviour shaping
Shaping is used to encourage complex behaviours by reinforcing successive approximations of the behaviour.
Schedules of reinforcement - strengths
Latham and Dossett (1978)
Application to treating inappropriate behaviour - lovaas therapy involves intensive reinforcement using shaping, to normalise behaviour in children with ASD
Latham and Dossett (1978)
Found that mountain beaver trappers responded to variable ratio pay (4 dollars if they correctly guess the colour of a marble) than fixed ratio pay (1 dollar per animal)
Schedules of reinforcement - weaknesses
Does not account for intrinsic motivation, therefore not a complete explanation.
Social learning theory (SLT)
Learning behaviours through the observation and imitation of role models