PSY1002 SEMESTER 2 - WEEK 8 Flashcards
in operant conditioning, what does action refer to
more general than classical conditioning “responses”
pressing levels, opening doors, pushing buttons
under stimulus control so actions can be responses to specific situations
how does operant differ to classical conditioning regarding UCS
UCS relies on behaviours of animal. only ocurs if levers pressed
whats Thorndikes law of effect
responses that create typically pleasant outcome in particular situation more likely to occur again in similar situation. response producing typically unpleasant outcome less likely to occur again
outline Skinner box paradigm
stimuli (lights, speaker)
response (lever)
appetitive stimuli- rewards
aversive stimuli- punishments
define positive reinforcement
behaviour strengthened through adding pleasant stimuli
define negative reinforcement
behaviour strengthened through removing unpleasant stimuli
define positive punishment
behaviour weakened through adding unpleasant stimuli
define negative punishment
behaviour weakened through removing pleasant stimuli
evaluate reinforcements
increase behaviour, more beneficial, result in long-term behavioural changes, creating pos rel with person giving reinf
evaluate punishments
decrease behaviour, less beneficial than reinforcement, temporary behaviour changes based on coercion, create neg rel
how can we prevent rapid extinctions once rewards using in reinforcement stops
use differing reinforcement schedule
what are partial reinforcement schedule (classical conditioning)
responses sometimes reinforced, sometimes not. slower initial learning, but more extinction resistance as reinforcement not appearing after each behaviour so learner take longer to determine lack of rewards. extinction is slower
what are partial reinforcement schedules decided on
if reinforcement presented on time elapsed inbetween reinforcement basis (interval) or on basis of number of responses that has been engaged in (ratio)
whether reinforcement occur on regular (fixed) or unpredictable (variable) schedules
describe interval schedules
reinforcements presented on basis of time elapsing inbetween each reinforcement
describe ratio schedules
reinforcement presented on basis of responses occurs inbetween each reinforcement
what is a fixed ratio reinforcement schedule
behaviour is reinforced after specific number of responses eg: giving child screen after reading 5 pages of book
explain fixed ratio
continuous reinforcement is “FR1”. every 5 pages “FR5”
brief pause in response after each reinforcement
probability of reinforcement increases with successive response
“stair-step” pattern
what is a variable ratio reinforcement schedule
behaviour is reinforced after average and unpredictable number of responses eg: payoff from slot machine
explain variable ratio
responses reinforced after random number of response (VR), attribute number to VR (average of response, eg: VR15)
rate of responding for VR fast
no post reinforcement pauses
consistent response rates over time
what is a fixed interval reinforcement schedule
behaviour is reinforced for first response after specific amount of time has passed eg: receiving monthly salary for work
explain fixed interval
first response, then certain amount of time, then reinforcement
produce characteristic response pattern
post-reinforcement pause, then slow response rate the high response rate toward end of interval
“scallop” patterns
what is a variable interval reinforcement schedule
behaviour is reinforced for first response after an average and unpredictable time eg: check emails periodically
explain variable interval
response reinforced following randomly determined amount of time
constant and no post reinforcement pause
most common for operant research as produces steady predictable performance
define shaping
process of guiding behaviour to desired outcomes using intermediate stages, and reinforvce individual steps. takes time, but allow learning more complex sequences
when can shaping be used for reinforcement
socially important behaviour = toilet training, verbal responses, academic skills
reinforcers must quickly followed desired responses
what are secondary reinforcer
conditioned secondary reinforcer reduce delay between behaviour and delivery of primary reinforcer
eg: previously NS becomes reinforcer after paired with a primary reinforcer
define primary reinforcer
stimuli that are naturally preferred
define secondary reinforcer
neutral event that has become associated with primary reinforcers through classical conditioning
outline pigeon study for effect of reward magnitude on learning (Rose, 2009)
condition A= peck 1 key for small reward, different key is big reward
condition B= peck correct key for reward and incorrect cause lights to go out
condition C= peck specific colour for small reward, peck different colour for no reward
learnt to associate colours with small/large rewards, or no lights
big rewards cause faster learning but eventually small catch up
describe the role of dopamine in conditionings
increased activity in dopaminergic neurons as response to reward
consistent pairing of CS and UCS causes responses occuring at CS not CR
no reward show decreased activity where reward should have occurred