Instrumental / Operant Conditioning Flashcards
What is E. Thorndike’s Law of Effect?
Responses followed by positive consequences become more likely to be repeated, while responses followed by unpleasant consequences become less likely.
What is the difference between classical and instrumental / operant conditioning?
Classical conditioning focuses on the relationship between two stimuli. Instrumental conditioning concerns the probability of a response, depending on the consequences.
What is B.F. Skinner’s version of the Law of Effect?
When a response is followed by a reinforcer, the strength of the response increases. When a response is followed by a punisher, the strength of the response decreases.
What are successive approximations?
It is the gradual refinement of what counts as success, starting from a broad criterion and progressively narrowing it. For example, to train a rat to push a lever, we first reward it for going near the lever, then only for touching the lever, and finally only for pushing it.
What is reinforcement? What is the difference between positive and negative?
Reinforcement increases the likelihood of a behaviour, and can be done through negative or positive means.
Positive reinforcement encourages a behaviour by adding something pleasant (a reward) after it.
Negative reinforcement encourages a behaviour by removing something unpleasant (an aversive stimulus) after it. It is not the same as punishment.
What is punishment? What is the difference between negative and positive?
Punishment decreases the likelihood of a usually unwanted behaviour, and can be either negative or positive.
Positive punishment discourages bad behaviour by adding something unpleasant.
Negative punishment discourages bad behaviour by removing something good.
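The four terms on the two cards above follow one rule: "positive" or "negative" says whether a stimulus is added or removed, and "reinforcement" or "punishment" says whether the behaviour becomes more or less likely. A minimal sketch of that rule, where the function name `classify` and its string arguments are made up for illustration:

```python
def classify(stimulus_change: str, behaviour_change: str) -> str:
    """Name the operant procedure from two observations.

    stimulus_change:  "added" or "removed" (what follows the response)
    behaviour_change: "increases" or "decreases" (effect on the response)
    """
    # Hypothetical helper: the argument values are illustrative labels only.
    sign = {"added": "positive", "removed": "negative"}[stimulus_change]
    kind = {"increases": "reinforcement", "decreases": "punishment"}[behaviour_change]
    return f"{sign} {kind}"


# Taking away something unpleasant so a behaviour becomes more frequent:
print(classify("removed", "increases"))  # negative reinforcement
```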
What is a token reinforcer and how does it differ from primary reinforcers?
Primary reinforcers / punishers are inherently reinforcing (food) or punishing (pain). Token reinforcers like money can be exchanged for primary ones.
Explain the difference between continuous and partial reinforcement, and which is better for certain situations.
Continuous reinforcement is rewarding the wanted behaviour every time, whilst partial reinforcement is only rewarding occasionally. Continuous reinforcement is most useful in conditioning someone quickly and efficiently. However, partial reinforcement is a lot less susceptible to extinction.
What are the schedules of reinforcement?
Fixed-ratio: delivering reward after a fixed number of responses.
Variable-ratio: delivering reward after a varying number of responses that averages out to a predetermined mean.
Fixed-interval: delivering reward for the first response made after a fixed period of time has passed, regardless of the number of responses made during that period.
Variable-interval: delivering reward for the first response made after a varying period of time that averages out to a predetermined mean.
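The four schedules above can be sketched as small decision rules that answer "is this response rewarded?". This is only an illustrative sketch: the function names are invented here, time is in arbitrary units, and drawing the variable requirement from a uniform distribution is an assumption chosen so that it averages to the given mean.

```python
import random


def fixed_ratio(n):
    """Reward every n-th response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(mean, rng):
    """Reward after a varying number of responses averaging `mean`."""
    # Assumption: requirement drawn uniformly from 1..2*mean-1 (average = mean).
    count = 0
    required = rng.randint(1, 2 * mean - 1)

    def respond(t):
        nonlocal count, required
        count += 1
        if count >= required:
            count = 0
            required = rng.randint(1, 2 * mean - 1)
            return True
        return False

    return respond


def fixed_interval(period):
    """Reward the first response made at least `period` after the last reward."""
    last = 0.0

    def respond(t):
        nonlocal last
        if t - last >= period:
            last = t
            return True
        return False

    return respond


def variable_interval(mean, rng):
    """Reward the first response after a varying wait averaging `mean`."""
    # Assumption: wait drawn uniformly from 0..2*mean (average = mean).
    available_at = rng.uniform(0, 2 * mean)

    def respond(t):
        nonlocal available_at
        if t >= available_at:
            available_at = t + rng.uniform(0, 2 * mean)
            return True
        return False

    return respond


# One response per time unit, t = 1..12.
fr = fixed_ratio(3)
fi = fixed_interval(4)
print([t for t in range(1, 13) if fr(t)])  # rewarded at responses 3, 6, 9, 12
print([t for t in range(1, 13) if fi(t)])  # rewarded at times 4, 8, 12
```

On the ratio schedules, responding faster earns rewards faster. On the interval schedules, extra responses before the interval has elapsed earn nothing.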
Why is partial reinforcement more resistant to extinction?
When a subject undergoes partial reinforcement, they learn that not every response is rewarded, so a run of unrewarded responses does not immediately signal that rewards have stopped. This builds persistence in the face of failures or the absence of reinforcement.
How does the use of “time-out” fit into operant conditioning, and how can it be abused?
“Time-out” is a form of negative punishment: it aims to reduce a bad behaviour by temporarily removing access to positive reinforcers. However, because it is so convenient, it can be abused: a child may start acting out to earn a “time-out” in order to avoid something in class.
What is the Premack Principle?
It is the principle that higher probability behaviours can reinforce lower probability behaviours. For example, by only allowing a child play-time after they have done their homework, they are more likely to do their homework.
What is stimulus control?
It is when a behaviour is triggered by the presence or absence of a stimulus. The stimulus provides cues and signals about our environment, influencing how we respond. For example, we speak differently to a lecturer than to a friend.
What is the problem with using punishment as an operant?
It generalises extremely well, meaning that its effects often spread to other aspects of the subject’s life. For example, a student who is told that their question is stupid will be less likely to ask more questions, and other students who witness the scene may become scared to ask questions too.
What are some constraints of operant conditioning?
Animal instincts may interfere with operant conditioning. Even after being conditioned, animals may find their behaviour drifting back towards their instincts.
There may also be clashes between instinct and the operant response. For example, a rat threatened with punishment will usually freeze or flee rather than push a lever to escape, because lever-pushing is not as intuitive for it.