Instrumental Learning Flashcards
1
Q
EARLY WORK
A
- animal psychologists studying instrumental learning before Pavlov:
1. Small (rats in Hampton Court mazes) - limitation: mazes not well suited to studying the learning process itself
2. Thorndike (cats in puzzle boxes) - better suited to studying the learning process
- cats learned to escape; escape got faster over trials
2
Q
INSTRUMENTAL CONDITIONING
A
- Law of Effect aka. if reward follows an animal's response -> association between stimulus and response = strengthened (S-R learning)
- concept following naturally from Thorndike's analysis = S-R reflex
3
Q
PROCEDURES
A
POSITIVE REINFORCEMENT
PUNISHMENT
NEGATIVE REINFORCEMENT
OMISSION TRAINING
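The four procedures form a 2 x 2 grid: the outcome is appetitive or aversive, and the response either produces it or prevents it. A minimal sketch of that grid follows; the function name `classify_procedure` and its argument names are hypothetical, chosen only for illustration.

```python
def classify_procedure(outcome: str, response_produces_outcome: bool) -> tuple[str, str]:
    """Return (procedure name, effect on responding) for one cell of the 2 x 2 grid."""
    grid = {
        # response produces an appetitive outcome -> responding goes up
        ("appetitive", True): ("positive reinforcement", "increases"),
        # response produces an aversive outcome -> responding goes down
        ("aversive", True): ("punishment", "decreases"),
        # response prevents or stops an aversive outcome -> responding goes up
        ("aversive", False): ("negative reinforcement", "increases"),
        # response cancels an appetitive outcome -> responding goes down
        ("appetitive", False): ("omission training", "decreases"),
    }
    return grid[(outcome, response_produces_outcome)]


name, effect = classify_procedure("aversive", False)
```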
4
Q
POSITIVE REINFORCEMENT
A
- R -> appetitive aka. ^ R
- reward follows response
THORNDIKE - animals repeat actions that lead to a satisfying state of affairs
HULL - drive reduction aka. animal works for food if hungry; redefined "satisfying state of affairs" as drive reduction
5
Q
PUNISHMENT
A
- R -> aversive aka. less R
- reduces responding
6
Q
NEGATIVE REINFORCEMENT
A
- R -> no aversive aka. ^ R
- response prevents or stops an aversive stimulus that would otherwise have occurred
7
Q
OMISSION TRAINING
A
- R -> no appetitive aka. less R
- response cancels reward that would normally occur = omission schedule
- eventually leads to response reduction
8
Q
SCHEDULES OF REINFORCEMENT
A
- extinction applies to instrumental conditioning too aka. stop giving reinforcers -> response stops
- BUT stable conditioned responding can still be maintained when only some of the responses subjects emit are reinforced
- reinforcement schedule = rule for deciding which responses to reinforce
- different schedules -> different, highly predictable response patterns; instantly recognisable on a cumulative record
9
Q
SIMPLE SCHEDULES & EFFECTS
A
CONTINUOUS REINFORCEMENT
FIXED RATIO
VARIABLE RATIO
FIXED INTERVAL
VARIABLE INTERVAL
10
Q
CONTINUOUS REINFORCEMENT
A
- CRF
- reinforces every response
11
Q
FIXED RATIO
A
- FR
- reinforce every nth response
- pause after each reinforcer, followed by fast responding
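The FR rule can be sketched as a simple response counter. This is an illustrative sketch only; the name `fixed_ratio` is hypothetical. It models the reinforcement rule, not the animal's pause-then-run behaviour.

```python
def fixed_ratio(n: int):
    """Build an FR-n schedule: call respond() once per response."""
    count = 0

    def respond() -> bool:
        nonlocal count
        count += 1
        if count == n:   # nth response since the last reinforcer
            count = 0    # restart the count after reinforcing
            return True
        return False

    return respond


fr5 = fixed_ratio(5)
outcomes = [fr5() for _ in range(10)]  # reinforced on the 5th and 10th responses
```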
12
Q
VARIABLE RATIO
A
- VR
- reinforce every nth response on average
- continuous fast responding
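A VR rule can be sketched by redrawing the response requirement after every reinforcer. The sketch below assumes requirements are drawn uniformly from 1 to 2n - 1 (so the mean is n); that distribution is an assumption for illustration, and the name `variable_ratio` is hypothetical.

```python
import random


def variable_ratio(mean_n: int, rng: random.Random):
    """Build a VR schedule whose response requirement averages mean_n."""

    def draw_requirement() -> int:
        # Assumed distribution: uniform on 1 .. 2n-1, which has mean n.
        return rng.randint(1, 2 * mean_n - 1)

    required = draw_requirement()
    count = 0

    def respond() -> bool:
        nonlocal required, count
        count += 1
        if count >= required:
            count = 0
            required = draw_requirement()  # unpredictable next requirement
            return True
        return False

    return respond


vr10 = variable_ratio(10, random.Random(0))
responses = 10_000
reinforcers = sum(vr10() for _ in range(responses))
mean_ratio = responses / reinforcers  # should come out close to 10
```

Because the next requirement cannot be predicted, there is no point at which pausing pays off, which fits the steady fast responding noted above.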
13
Q
FIXED INTERVAL
A
- FI
- reinforce first response after time (t) elapsed since last reinforcer
- pause after each reinforcement followed by gradually ^ response rate
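The FI rule depends on elapsed time rather than response count. A minimal sketch, assuming time is given in seconds and the clock starts at 0 when the session begins; the name `fixed_interval` is hypothetical.

```python
def fixed_interval(t: float):
    """Build an FI-t schedule: call respond(now) with the time of each response."""
    last_reinforcer = 0.0  # assumption: interval timing starts at session start

    def respond(now: float) -> bool:
        nonlocal last_reinforcer
        if now - last_reinforcer >= t:  # first response after t has elapsed
            last_reinforcer = now       # interval restarts from this reinforcer
            return True
        return False                    # responses made too early earn nothing

    return respond


fi30 = fixed_interval(30.0)
response_times = [5, 20, 31, 40, 59, 61, 62]
outcomes = [fi30(time) for time in response_times]
```

Only the responses at 31 s and 61 s are reinforced here; the response at 59 s comes 28 s after the previous reinforcer, so it is too early.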
14
Q
VARIABLE INTERVAL
A
- VI
- same as FI: reinforce first response after a time period has elapsed
- BUT time period varies around a mean (t)
- continuous moderate response rate
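A VI rule can be sketched like FI, but with the required wait redrawn after each reinforcer. The sketch assumes waits are drawn from an exponential distribution with the given mean; that choice, and the name `variable_interval`, are assumptions for illustration.

```python
import random


def variable_interval(mean_t: float, rng: random.Random):
    """Build a VI schedule whose required wait averages mean_t seconds."""
    last_reinforcer = 0.0
    wait = rng.expovariate(1.0 / mean_t)  # assumed exponential waits, mean mean_t

    def respond(now: float) -> bool:
        nonlocal last_reinforcer, wait
        if now - last_reinforcer >= wait:
            last_reinforcer = now
            wait = rng.expovariate(1.0 / mean_t)  # next wait is unpredictable
            return True
        return False

    return respond


vi30 = variable_interval(30.0, random.Random(1))
duration = 100_000  # simulated session: one response every second
reinforcers = sum(vi30(float(second)) for second in range(1, duration + 1))
mean_interval = duration / reinforcers  # close to 30 s, plus a small response delay
```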
15
Q
RATIO SCHEDULES
A
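- reinforcement depends on number of responses
- ratio of 1 = continuous reinforcement
- ratio other than 1 = partial/intermittent reinforcement
- fixed ratio schedule e.g. FR10 = reinforce every 10th response
- variable ratio schedule e.g. VR10 = reinforce every 10th response on average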
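The FR/VR shorthand can be read mechanically. The helper below is a hypothetical sketch (the name `describe_ratio_schedule` is not from any library) that turns a code such as FR10 into a plain description and flags a ratio of 1 as continuous reinforcement.

```python
import re


def describe_ratio_schedule(code: str) -> str:
    """Describe a ratio schedule code such as 'FR10' or 'VR10'."""
    match = re.fullmatch(r"(FR|VR)(\d+)", code)
    if match is None or int(match.group(2)) < 1:
        raise ValueError(f"not a ratio schedule code: {code!r}")
    kind, n = match.group(1), int(match.group(2))
    if n == 1:
        # a ratio of 1 means every response is reinforced
        return "continuous reinforcement: every response reinforced"
    if kind == "FR":
        return f"partial reinforcement: exactly 1 in every {n} responses reinforced"
    return f"partial reinforcement: on average 1 in every {n} responses reinforced"


fr10 = describe_ratio_schedule("FR10")
vr10 = describe_ratio_schedule("VR10")
```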