Chapters 5-9 Flashcards

1
Q

What did Thorndike test for?

A

Do animals possess intelligence?

The study of consequence

  • did Cat & Puzzle Box experiments
    • the cat leanred from the consequences of its actions
    • THE LAW OF EFFECT:
      • any action has a consequence (positive or negative)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The Law of Effect

4 Key Elements:

A

Any action has a consequece (can be positive or negative)

4 Key elements (cause and effect chain of events):

  • have environment
  • have behavior
  • change in environment after the behavior
  • change in ithe behavior after the change in environment
    • DEMONSTRATES LEARNING HAS OCCURRED

_ex:_ puzzle box, push lever, door opens –> cat more likely to push lever

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What did Skinner say about behavior?

A
  • behavior is either strengthened** or _weakened _** by its consequences
  • behavior operates in the environment–> behavior effects the environment

Said that there are 4 TYPES OF EXPERIENCES:

  • 2 THAT STRENGTHEN BEHAVIOR
    • reinforcement
  • 2 THAT WEAKEN BEHAVIOR
    • punishment
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

types of operant learning

A

2 that strengthen behavior: REINFORCEMENT

2 that weaken behavior: PUNISHMENT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is reinforcement?

A

an increase in the srength of a behavior due to its consequences

​to qualify as reinforcement:

  • behavior must have a consequence
  • behavior must increase in strength
  • the increase must be a result of the consequence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

positive reinforcement

A

a behavior causes appearnece or increase in intensity of a stimulus

stimulus: positive reinforcer (something animal seeks out)

“reward learning”

add” something to increase behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

negative reinforcement

A

behavior strengthened by the removal/ decrease in intensity of the stimulus

stimulus: _negative reinforcer _(usually something we want ot avoid)

“escape learning”

“escape avoidance learning”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how do we measure the strength of a behavior?

A

frequency

or

probability

of the behavior occurring

  • or any other feature of the behavior (as long as its reinforced)*
    (ex: duration, form, intensitym latency)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is behavioral momentum?

A

behavior persists even AFTER punishment and other reinforcers

its hard to erase the learning

(“learning keeps building up”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

kinds of reinforcers

A
  • primary: innately effective; don’t have to learn to like them (food, water, social)

vs.

  • _secondary: _a result of learning; conditioned reinforcer (applause, money)
    • have no vsalue on their own, we give them value
  • generalized reinforcers: paired w/ different reinforcers to be used in differnt situations
    • natural: events that psontaneously follow from a behavior
    • contrived: events provided by someone in order to modify behavior; do not occur naturally; manipulation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is satiation?

(“say-she-ay-shun”)

A

when a reinforcer loses its ability to be effective due to changes in the environment, or the value of the reinforcer

ex: secondary reinforcers satiate more slowly because they tend to be weaker and aren’t as disruptive; provide instant gratification (money)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

operant conditioning

A

behavior causes an effect (event contingent on behavior)

the behavior is voluntary/flexible (can be manipulated)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

classical conditioning

A

events connected to behavior

(event 1 is contingent on event 2)

behavior is reflexive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is contingency?

A

X and Y occur together or not at all

the amount of correlation between behavior and its consequence

(how reliably the reinforcer follows behavior)

*in terms of reinforcement: *increase the likelihood of behavior happening again

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is contiguity?

A

the time gap between behavior and its consequence

shorter gap = faster learning

after a delay between the behavior and the consequence, you may inadvertently reinforce other behaviors

*ex: *press lever, then cat chases tail, then door opens–> cat thinks chasing tail will open door

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

characteristics of reinforcers

A

size: larger is better

qualitative differences (individual differences can determine effectiveness)

ex: ice cream not a good reinforcer for someone who is lactose intolerant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

behavior characteristics

A

some behaviors are easier to learn than others

what we’re trying to teach influences how quickly & easily it’s learned

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what are motivation operations?

A

anything that changes the effectiveness of a consequence

2 types

  • establishing operations: **increase effectiveness
  • abolishing operations: decrease effectiveness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

neuromechanics of reinforcement

A

Olds & Milner–1950s

ESB (electrical stimulation of the brain)

shock rats brain

looked at reward pathway (pathway the deals w/ reinforcement; limbic system)

found that the reward pathway is dopamine rich w/ endorphins

stimulating dopamine receptors triggers a reward

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

THEORIES OF REINFORCEMENT:

positive

Drive reduction theory

A

drive: motivational states

reinforcers: events that reduce drives

  • pros: works with primary reinforcers*
  • cons: not as well for secodary reinforcers or ones that are hard to classify*
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

THEORIES OF REINFORCEMENT:

positive

Relative value theory

A

reinforcers aren’t things–>they’re BEHAVIORS

ex: the reinforcer isn’t the food, it’s EATING the food

behaviors have different relative values (“rahter be doing X or Y?”)

something with a higher relative value will reinforce better that somwthing with a low relative value

**comparing differnet behaviors and how much you’d rahter be doing them**

pros: no need for internal “drives”

cons: doesn’t consider secondary reinforcers; sometimes low probability behavior will still be reinforcing under normal conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

THEORIES OF REINFORCEMENT:

positive

Response Deprivation Theory

A

compare behaviors to themselves, **not to each other

_baseline: _ amount of time spent engaging in behavior under normal condtions

when does a behaviors become a reinforcer?

  • when the behavior is held below baseline value of how much you want to do something
  • cons: *still issues w/ praise
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

THEORIES OF REINFORCEMENT:

negative

Two-Process Theory

A

**2 processes occurring **

BOTH operant and Pavlovian conditoning occur

escape from an aversive stimulus to learn

when CS loses its aversiveness, avoidance persists; extinction fails to occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

THEORIES OF REINFORCEMENT

negative

One-process Theory

A

keep operant conditioning (because it can explain everything)

the reduction in the shock is reinforcing

stop avoidance behavior by forcing it to stop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Shaping

A

a type of learning

reinforce simple behaviors close to what you want/the desired behavior

used to shape behavior that won’t occur spontaneously

behaviors vary: useful behaviors get selected & rewarded

(like natural selection; select for traits)

*rat & basketball video example*

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

how to shape

A
  • don’t require too much at one time
  • provide immediate reinforcement/rewards (latency=bad–> can reinf. wrong behavior)
  • give small rewards
  • reinforce the closet approximation of the end behavior
  • back up when necessary
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

what is chaining?

forward and backward?

A

teaching individual to perform a behavior chain in order

behavior chain: **a series of connected actions

**forward chaining: **reinforce FIRST actions, then SECOND, etc

**backward chaining: **reinforce LAST action first, then second to last, etc

** the LAST ACTION in the chain is the most important**

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

steps in chaining:

A

same process sas shaping BUT reward individual after successfully completing EACH step

analyze chain–what are its parts?

break it into pieces

29
Q

schedules of reinforcement

A

variation in reinforcement contengencies that follow a specifc rule (specific pattern)

fixed vs. variable

ratio vs. interval

ratios: # of behaviors to the # of reinforcements;

ratio increase=more # of behaviors you have to perform to get reinforcement

_interval: _ how much time has gone by between when you got the 1st reward and the 2nd reward; still have to perform behavior to get reward; (time elapsed between a reinforcement and the next behavior)

30
Q

fixed ratio schedule (FR)

A

behavior reinforced when it has occurred a fixed # of times

produces post-reinforcement pauses: pauses b/t behavior after reinforcement given

31
Q

variable ratio schedule (VR)

A

reinforcement given based on an average ratio of behaviors to reward

produce steady performance: pauses are rare/short

*never know when the reward is coming*

main difference from fixed ration schedule: the pauses (no pauses in variable ratio schedule because you don’t know when reward is coming, so you keep doing behavior to get reward)

32
Q

fixed interval schedule (FI)

A

behavior reinforce ONLY if it occurs after a particular, constant interval

after a certain time interval, the next behavior is reiforced; any behavior in the meantime doesn’t count/doesn’t get rewarded

33
Q

variable interval schedule (VI)

A

behavior is reinforced after a interval, BUT the interval varies around a particular time

after a time interval, the next behavior in reinforced

produces steady run rates (but not as steady as variable ratio schedules)

34
Q

extinction

A

have a behavior that has been reinforced, but STOP reinforcing it for good

“FR infinity” (fixed ratio for forever)

ex: pigeon flaps wings=reward–> pigeon flaps wings=>no more rewards

35
Q

extinction bursts

A

at first, behavior will suddenly increase

(pigeon flaps wings more!)

because they can’t figure out why they are no longer recieving rewards

36
Q

resurgence

A

previously reinforced behaviors reappears from childhood

  • variabliy increases after extinction*
    ex: adult throws temper tantrum
37
Q

spontaneous recovery

A

extinct bahevior reoccurs after time outside of the operant chamber

38
Q

less common schedule types

A

duration (fixed or variable)

noncontingent schedules

progressive schedule (stretching ratio)

ratio strain

breaking poit (behavior stops completely)

compound schedules (more than one shcedule combined)

39
Q

partial reinforcement effect

A

behavior reinforced on a schedule is MORE RESISTANT to extinction than continuously reinforced behavior

40
Q

thinner reinforcement effect

A

bigger ratio/longer interval # = thinner ratio

something less reinforcing overall

41
Q

discrimination hypothesis

A

thinner schedule is MORE DIFFICULT to distinguish between the schedule and the extinction process

42
Q

frustration hypothesis

A

not getting a reward is frustrating

(because reinforcers reinforce behavior AND **feeling **too)

thinner schedule = greater frustration

43
Q

sequential hypothesis

A

not getting reinforced every time is another cue to keep going because eventually the behavior will be reinforced

same as frustration hypothesis, BUT the cue to continue the behavior for reinforcemnt is from the environment (not frustration)

44
Q

response-unit hypothesis

A

combination of behavior is ONE response unit

rewards happen after every unit

BUT unit are defined differently (grouped) (*box thingy*)

eliminate partial reinforcement effect

45
Q

matching law

A

how one performs a behavior is directly related to how they get rewarded for it

thicker reinforcement schedule = rewarded more often

relative frequency of reinforcement directly related to the relative frequency of the reinforcement available

learn both ratio schedules and choose the richer one to be rewarded more often

only switch between the 2 to see if you’re about to get rewarded

46
Q

what is punishment

A

the decrease in a behavir because of it consequences (opposite of reinforcement)

Thorndike concluded that we don;t learn from our failures

47
Q

positive punishment

A

behavior weakened with the appearnace or increase in intensity of an aversive stimulus

ADD something to the environment

ex: yelling, arrest, ticket, water squirt

48
Q

negative punishment

A

behavior weakened with the REMOVAL or REDUCED intensity of a pleasurable stimulus

take something away from the environment

ex: time-out

49
Q

VARIABLES AFFECTING PUNISHMENT:

contingency

A

how dependent punishment is on behavior

more contingency = faster bahvior changes

higher contingency = faster extinction/fewer changes overall

50
Q

VARIABLES AFFECTING PUNISHMENT:

contiguity

A

interval between behavior and punishing consequence

longer interval = less effective punishment

51
Q

VARIABLES AFFECTING PUNISHMENT:

punisher intensity

A

more intense punisher = more reduction in behavior

52
Q

VARIABLES AFFECTING PUNISHMENT:

introductory level

(level of intensity)

A

2 options for levels of intensity:

  • start with WEAK, aversive stimulus and gradually increase it until the behavior stops
    • can allow people to learn to toleratestimuli better (not good)
  • start with STRONG, aversive stimulus and build up if you need to
    • larger punishment may be necessary to learn assn and stop behavior much fatser
    • hard to determine the level that’s “just right”
53
Q

VARIABLES AFFECTING PUNISHMENT:

previous reinforcement

A

how strongly a behavior was reifnorces before it was punished

the more well-learned/reinforced a behavior is, the harder it is to punish it

54
Q

VARIABLES AFFECTING PUNISHMENT:

alternative sources of reinforcement

A

if the behavior is the ONLY way to get reinforcement then punishment won’t work very well

if there’s another way to get reinforcement (food, water, attention), then punishment is very effective

(a way to get around previous reinforcement)

55
Q

VARIABLES AFFECTING PUNISHMENT:

motivating operations

A

punishment is MORE effective if the REWARD is not as good

(ex: wheat vs. candy)

quality of previous reward vs. punishment

56
Q

VARIABLES AFFECTING PUNISHMENT:

qualitive features (of the punishment itself)

A

some punisher are just BETTER than OTHERS

depends on the individual anf what they like

57
Q

THEORIES OF PUNISHMENT:

TWO process theory

A

both operant and classical conditioning influence performance

58
Q

THEORIES OF PUNISHMENT:

ONE process theory

A

only involves operant

low probability behavior should punish high probability behavior

“the way to go interms of punishment”

59
Q

PROs of punishment

A

reinforcing

FAST

can permanently change behavior

60
Q

CONs of punishment

A

has to be consistent; must punish every instance of behavior

can create extinction bursts

can be avoided; try to escape–> can lead to negative reinforcement

physical punishment

creates aggression toward the punisher (when escape is impossible)

displace aggression toward innocent others

imitation of the punisher

suppressed behavior if escape is impossible

61
Q

ALTERNATIVES to PUNISHMENT:

response prevention

A

change the environment so that behavior cannot occur in the first place

62
Q

ALTERNATIVES to PUNISHMENT:

extinction

A

must remove all reinforcers

63
Q

ALTERNATIVES to PUNISHMENT:

differential reinforcement

A

3 types of differential reinforcement

  • DRA (differential reinforcement to alternative behavior)
    • specifically reinforce something else
  • DRI (differential reinforcement of incompatible behavior)
    • reinforce specific incompatible behavior (cant do 2 behaviors at 1 time)
  • DRL (differential reinforcement at a low rate)
    • reinforce a lower rate of behavior
    • reinforce someone for doing it less than they would normally do it
64
Q

INFLUENCES OF OPERANT TREATMENTS:

home environment

A

development of secure attachment

neglect in orphanages or in general damages attachment formation

(environment if orphanages is non-responsive, so operant procedures not in effect; lose contengency)

crying–>not being picked up = learn that “no one is there”

in an ideal environment –> needs will be met

  • associative learning: cry = help*
  • ex: learning to speak via encouragement*
65
Q

INFLUENCES OF OPERANT TREATMENTS:

school environment

A

reinforcement works to encourage **good behavior/performance in school **

move away frompunishment–>reinforce good behavir and ignore bad behavior

_DRL_ (children should learn how to earn attention)

reinforcement is the basis for internet-based learning

66
Q

INFLUENCES OF OPERANT TREATMENTS:

clinic environment

A

self injurious behavior:

  • before reinforcement, children were restrained
  • use punishment as a treatment to decrease behavior
    • ​alternatives to punishment: DRI; reiforcement can promote positive behavior

Delusions (false beliefs): can get worse with positive reinforcement; reinforce doubts in those false beliefs & provide alternative options

Transient paralysis: short term paralysis of limb

  • neuroplasticity: brain recovers by rewiring itself so that other parts of barin can work affected limb
  • contrain-induced movement therapy (remove learning of pain; shaping makes it work faster)
67
Q

INFLUENCES OF OPERANT TREATMENTS:

work environment

A

improve worker performance

improve productivity with performance feedback

rewarding safety practices = fewer work accidents

68
Q

INFLUENCES OF OPERANT TREATMENTS:

zoo environment

A

improve the vetinary care of animals with operant conditioning

helps with humane treatmets of animals

allow animals to “earn” food through operant reinforfcementlike they would in their natural habitat; reduces boredom and reinstates wilderness conditions

shape and reinforce behavior to care for animals