Chapter 6: operant conditioning Flashcards
operant conditioning
Operant conditioning is a type of associative learning process through which the strength of a behavior is modified by reinforcement or punishment.
reinforcement
Reinforcement is defined as a consequence that follows an operant response that increase (or attempts to increase) the likelihood of that response occurring in the future.
positive reinforcement
Something positive which is received when the desired behavior is performed.
Give an example of positive reinforcement.
Giving a dog a treat when they act well-behaved or perform a trick.
negative reinforcement
Negative reinforcement is the encouragement of certain behaviors by removing or avoiding a negative outcome or stimuli.
Something negative is avoided after the desired behavior is performed which increases the probability of the desirable behavior being performed and repeated.
Give an example of negative reinforcement.
Doing the homework to avoid detention.
primary reinforcer
an event that is innately reinforced often by satisfying a biological need
One that satisfies a basic need such as getting food.
conditioned\ secondary reinforcer
Also is known as a secondary reinforcer an event that gains reinforcing power through its link with the primary reinforcer
One that enables you to access a primary reinforcement such as getting money.
reinforcement schedule
a pattern that defines how often the desired response will be reinforced
Define ‘punishment’.
Causing some kind of physical or mental distress by giving a negative consequence and so decreasing the probability of the undesirable behavior being repeated.
Define ‘positive’ punishment.
adding an aversive consequence after an undesired behavior is performed to decrease future responses.
Give an example of positive punishment.
Getting hit by a ruler at school for being naughty.
adding more chores to the list when your child neglects their responsibilities.
Define ‘negative’ punishment.
Getting something taken away or being deprived of something as punishment.
Give an example of negative punishment.
Going to be without tea for being naughty.
Losing access to a toy, being grounded, and losing reward tokens
Give an example of primary punishment.
Being deprived of food.
Give an example of secondary punishment.
Being deprived of pocket money.
Define ‘shaping’ in terms of ‘successive approximations’
Learning a new skill through different stages, being rewarded each time.
Example: Learning how to use a knife and fork by using stages such as going from being fed, to a spoon, to a knife and fork.
List the 3 principles of operant conditioning.
1) Generalisation
2) Discrimination
3) Extinction
Define ‘generalization’ in terms of operant conditioning.
The the behaviour is generalised to similar things to the reinforced behaviour.
Define ‘discrimination’ in terms of operant conditioning.
Distinguishing between responses that may be similar to the reinforced behaviour.
Define ‘extinction’ in terms of operant conditioning.
extinction refers to the process of no longer providing the reinforcement that has been maintaining a behavior.
List the 5 schedules of reinforcement.
1) Continuous
2) Fixed ratio
3) Variable ratio
4) Fixed interval
5) Variable interval
Define a ‘continuous’ schedule of reinforcement.
Where the desired behaviour is reinforced every time it occurs.
Give an example of continuous schedule of reinforcement.
Getting a raise at work after every successful project.
State the response rate and extinction rate of a continuous schedule of reinforcement.
Response rate = Slow
Extinction rate = Fast
Define a ‘fixed ratio’ schedule of reinforcement.
When every 5th, 10th, or any such regular desired behaviour is reinforced.
Give an example of a fixed ratio schedule of reinforcement.
A mom asking her child to clean their room 5 times before they are punished.
State the response rate and extinction rate of fixed ratio schedule of reinforcement.
Response rate = Fast
Extinction rate = Medium
Define a ‘variable ratio’ schedule of reinforcement.
Where the number of necessary desired behaviours are constantly altered.
Give an example of a variable ratio schedule of reinforcement.
When at a casino a slot machine it provides different winnings each time.
State the response rate and extinction rate of variable ratio schedule of reinforcement.
Response rate = Fast
Extinction rate = Slow
Define a ‘fixed interval schedule of reinforcement.
Reinforcement made once every fixed number of minutes so long as there has been at least one desirable behaviour performed during that time.
Give an example of a fixed interval schedule of reinforcement.
Receiving a treat after every hour of revision.
State the response rate and extinction rate of the fixed interval schedule of reinforcement.
Response rate = Medium
Extinction rate = Medium
Define a ‘variable interval schedule of reinforcement.
Reinforcement is made at different time periods.
Give an example of a variable interval schedule of reinforcement.
Self-employed people getting paid at different times in the month after completing different length tasks.
State the response rate and extinction rate of fixed interval schedule of reinforcement.
Response rate = Fast
Extinction rate = Slow
How does operant conditioning compare to classical conditioning in the responses that are learnt?
Classical = The responses already occur naturally to an animal, it’s only the stimuli that can be manipulated to elicits these responses
Operant = New behaviours are created in animals in response to a consequence