Instrumental Conditioning Flashcards
Instrumental Conditioning
the learning of a contingency between a behaviour and its consequences
Thorndike
Studied cats in a puzzle box. He focused on overt behaviour rather than on unmeasurable mental processes. He predicted that a cat would escape immediately on later trials once it had discovered the solution by chance, but there was never a distinct “aha!” moment; improvement was gradual.
Stamping In
Random behaviours that are followed by favourable consequences are performed more frequently
Stamping Out
Random behaviours with negative or no consequences are performed less frequently
Law of Effect
behaviours with positive consequences are stamped in, and those with negative consequences are stamped out. This leads to refinement of behaviour and the learning of a contingency between good behaviours and rewards.
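The gradual improvement that stamping in and stamping out produce can be illustrated with a toy simulation. This is only a sketch: the behaviour names, the multiplicative update rule, and the values of `stamp_in` and `stamp_out` are assumptions made for illustration, not Thorndike's own model or data.

```python
import random

def simulate_puzzle_box(trials=100, seed=0, stamp_in=1.2, stamp_out=0.9):
    """Toy model of stamping in and stamping out (all numbers are illustrative)."""
    rng = random.Random(seed)
    # Four hypothetical behaviours, equally likely at first; only one opens the box.
    weights = {"pull_rope": 1.0, "scratch_wall": 1.0, "meow": 1.0, "push_door": 1.0}
    correct = "pull_rope"
    history = []  # probability of the correct behaviour before each trial
    for _ in range(trials):
        history.append(weights[correct] / sum(weights.values()))
        names = list(weights)
        choice = rng.choices(names, weights=[weights[n] for n in names])[0]
        if choice == correct:
            weights[choice] *= stamp_in   # favourable consequence: stamped in
        else:
            weights[choice] *= stamp_out  # no favourable consequence: stamped out
    return history

history = simulate_puzzle_box()
print(f"start: {history[0]:.2f}, end: {history[-1]:.2f}")
```

Under these assumptions, the probability of the correct behaviour creeps upward trial by trial rather than jumping to certainty, which matches the absence of a distinct “aha!” moment.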
Four Consequences
Presentation of a positive reinforcer,
Removal of a positive reinforcer,
Presentation of a negative reinforcer,
Removal of a negative reinforcer
Reinforcer
any stimulus which, when presented after a response, leads to a change in the rate of that response.
Reward Training/Positive Reinforcement
involves the presentation of a positive reinforcer following a response. Increases the frequency of the behaviour.
Punishment Training/Positive Punishment
involves the presentation of a negative reinforcer to decrease undesired behaviour. Can raise ethical concerns in the real world, and the authority figure delivering the punishment can become a conditioned stimulus (CS).
Omission Training/Negative Punishment
involves the removal of a positive reinforcer in order to decrease undesired behaviour. Leads to avoidance.
Punishment vs. Omission
Both lead to a decrease in undesired behaviour, but each does so by different means. The removal of a positive reinforcer IS NOT the same as the presentation of a negative reinforcer. Punishment has a more specific meaning in instrumental conditioning than in everyday life.
Escape Training/Negative Reinforcement
involves the removal of a negative reinforcer to increase desired behaviour. The negative reinforcer is presented continuously, and the subject is motivated to respond in order to have it removed.
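The four training types above form a 2x2 grid: whether the reinforcer is presented or removed, and whether it is positive or negative. A minimal lookup sketch follows; the names `TRAINING_TYPES` and `classify` and the string keys are hypothetical and chosen only for illustration.

```python
# Keys: (what happens to the reinforcer, kind of reinforcer).
# Values: (training type, effect on the frequency of the behaviour).
TRAINING_TYPES = {
    ("present", "positive"): ("Reward Training (Positive Reinforcement)", "increases"),
    ("present", "negative"): ("Punishment Training (Positive Punishment)", "decreases"),
    ("remove", "positive"): ("Omission Training (Negative Punishment)", "decreases"),
    ("remove", "negative"): ("Escape Training (Negative Reinforcement)", "increases"),
}

def classify(operation, reinforcer):
    """Return the training type and its effect for one cell of the 2x2 grid."""
    name, effect = TRAINING_TYPES[(operation, reinforcer)]
    return f"{name}: {effect} the behaviour"

for key in TRAINING_TYPES:
    print(key, "->", classify(*key))
```

Reading the grid this way makes the Punishment vs. Omission distinction explicit: both cells decrease behaviour, but they differ in both the operation and the kind of reinforcer.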
Consequence Timing
any type of instrumental conditioning works best when the consequence is presented immediately after the response.
Responding Rate
depends on the subject, the complexity of the behaviour, and the type of reinforcement used.
Autoshaping
Over time, a contingency is learned as the behaviour-consequence pattern is repeated. The subject isn’t explicitly taught anything; it eventually learns the contingency on its own. Rewards follow spontaneously from the correct action.