Lecture 4 - Operant Conditioning Flashcards
What is the process of shaping in operant conditioning? Give an example.
Shaping is the process of reinforcing successive approximations of the target behaviour, i.e. any behaviour that increasingly resembles it. E.g. a pigeon shaped to do a counter-clockwise turn.
What was Thorndike's early experiment on operant conditioning?
Cats were placed in puzzle boxes and had to find their way out by pulling a string, standing on a platform and turning a latch on the door. Results indicated that the cats got quicker at escaping with experience.
What type of behaviour did Skinner find when random behaviours were reinforced? What are some real-life examples of this?
Skinner found superstitious behaviours: the animals repeated whatever they happened to be doing when the reinforcer arrived, as if that behaviour were producing it. E.g. lucky charms, repeatedly pressing the button to close lift doors.
What are chaining and backward chaining? Give an example.
Chaining is the process of reinforcing smaller behaviours that lead up to a larger behaviour. Backward chaining is when training begins with the final behaviour and works back towards the first behaviour. An example of chaining is toilet training: letting mummy know they need to pee is reinforced first. An example of backward chaining is taking off a sweater: begin by pulling the sweater off the head, then pulling the sweater off the head and shoulders.
What is Skinner's three-term contingency that makes up operant conditioning?
- The discriminative stimulus (cue for behaviour)
- The operant response (the behaviour)
- The outcome (reinforcer or punisher that follows)
If a behaviour is reinforced or punished, will it predict the future behaviour?
If it is reinforced, it will: a reinforced behaviour is more likely to occur again.
What is an example of positive reinforcement?
Something is given in order to promote the continuation of the behaviour, e.g. a sticker.
What is an example of positive punishment?
Something is given to discourage the behaviour from occurring, e.g. a smack.
What is an example of negative reinforcement?
Something unpleasant is taken away to promote the continuation of the behaviour, e.g. being let off homework.
What is an example of negative punishment?
Something desirable is taken away to discourage the continuation of a behaviour, e.g. time out (as in time is taken away). Having an Xbox taken away is probably a better example.
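The four cards above follow one rule: "positive/negative" says whether something is given or taken away, and "reinforcement/punishment" says whether the behaviour is being promoted or discouraged. A minimal sketch of that rule as a lookup; the function name `classify` and its argument values are made up for illustration and are not from the lecture.

```python
def classify(stimulus_change, behaviour_effect):
    """Name the operant procedure from two facts about the outcome.

    stimulus_change:  "given" or "taken away"     (hypothetical labels)
    behaviour_effect: "promoted" or "discouraged" (hypothetical labels)
    """
    sign = {"given": "positive", "taken away": "negative"}[stimulus_change]
    kind = {"promoted": "reinforcement", "discouraged": "punishment"}[behaviour_effect]
    return f"{sign} {kind}"


# The flashcard examples, expressed with the same two facts:
examples = {
    "sticker": classify("given", "promoted"),
    "smack": classify("given", "discouraged"),
    "let off homework": classify("taken away", "promoted"),
    "Xbox taken away": classify("taken away", "discouraged"),
}
```

Here `examples["sticker"]` is "positive reinforcement" and `examples["Xbox taken away"]` is "negative punishment".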
Which is more effective, reinforcement or punishment?
Reinforcement.
How do you punish effectively?
- No escape
- As intense as possible
- Continuous schedule
- No delay
- Over a short period of time
- No subsequent reinforcement
What are some consequences of punishment?
Modelling bad behaviours like aggression/violence
Fear
Learned helplessness
What is a fixed ratio schedule of reinforcement?
A reinforcer is given after every nth response
What is a variable ratio schedule of reinforcement?
A reinforcer is given after approximately every nth response (the exact number varies around an average of n)
What is a fixed interval schedule of reinforcement?
A reinforcer is given for the first response made after a fixed amount of time has passed
What is a variable interval schedule of reinforcement?
A reinforcer is given for the first response made after an amount of time that varies around an average
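The four schedules differ only in the rule that decides whether a given response earns a reinforcer. A rough sketch of those rules follows. The function names, the uniform random spread used for the variable schedules, and the convention that time starts at 0 are all assumptions made for illustration, not part of the lecture.

```python
import random


def fixed_ratio(n):
    """FR: reinforce every nth response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(n, rng):
    """VR: reinforce after a number of responses that averages n.

    Assumption: the requirement is drawn uniformly from 1..2n-1.
    """
    count = 0
    required = rng.randint(1, 2 * n - 1)

    def respond(t):
        nonlocal count, required
        count += 1
        if count >= required:
            count = 0
            required = rng.randint(1, 2 * n - 1)
            return True
        return False

    return respond


def fixed_interval(seconds):
    """FI: reinforce the first response once `seconds` have passed
    since the last reinforcer (time starts at 0)."""
    last = 0.0

    def respond(t):
        nonlocal last
        if t - last >= seconds:
            last = t
            return True
        return False

    return respond


def variable_interval(seconds, rng):
    """VI: like FI, but the wait varies around an average of `seconds`.

    Assumption: each wait is drawn uniformly from 0..2*seconds.
    """
    last = 0.0
    wait = rng.uniform(0, 2 * seconds)

    def respond(t):
        nonlocal last, wait
        if t - last >= wait:
            last = t
            wait = rng.uniform(0, 2 * seconds)
            return True
        return False

    return respond


def run(schedule, times):
    """Feed one response per time point; return which ones were reinforced."""
    return [schedule(t) for t in times]


# One response per second for 30 seconds on each schedule.
rng = random.Random(0)
seconds = range(1, 31)
fr = run(fixed_ratio(3), seconds)
vr = run(variable_ratio(3, rng), seconds)
fi = run(fixed_interval(10), seconds)
vi = run(variable_interval(10, rng), seconds)
```

On the ratio schedules, responding faster earns reinforcers faster. On the interval schedules, extra responses before the time is up earn nothing, which fits the scalloped FI pattern on the graph card.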
Draw what the 4 schedules of reinforcement would look like on a graph.
VR (straight line so continuous behaviour, quickest learning)
FR (stepped learning, second quickest)
VI (straight line, continuous behaviour, 3rd quickest)
FI (scalloped, as behaviour picks up when the reinforcer is estimated to be due; slowest learning)
What is the most effective schedule of reinforcement? Why?
Variable ratio, because the organism consistently performs the behaviour in the hope that it will be rewarded, e.g. gambling, nagging
What is the most effective schedule of punishment?
Continuous: the organism knows punishment WILL occur
What three other variables (other than schedules) affect operant conditioning?
Drive - how much the organism wants to achieve the goal (e.g. a starving animal has a strong drive to get food)
Size - the bigger the reward, the better (i.e. the more quickly the behaviour is learnt)
Delay - if there is a delay, it is hard to know which behaviour is being reinforced/punished
What is the law of diminishing returns?
As the level of reinforcement increases, the response level begins to drop off once further reinforcement is no longer very desirable
In the trade-off between long-term punishment and short-term reward, what do people prefer? Give a real-life example.
People prefer the short-term reward (e.g. chocolate) even though it brings a long-term punishment (e.g. getting fat)
What is stimulus control? Give an example.
Stimulus control is when a stimulus or cue tells you what to do in a given situation. The presence of the stimulus makes the behaviour happen; the absence of the stimulus prevents the behaviour. E.g. traffic lights: green = go