Task 6 - Instrumental Conditioning Flashcards

1
Q

Give a definition of instrumental conditioning.

A

Instr. / Operant Conditioning is the process whereby an individual learns to (not) do sth. based on the outcome of an action.

Sd -> R -> O; If O=positive, Sd-R-Association is strengthened.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What research lead to the discovery of operant conditioning?

A

Thorndike’s Puzzle box

-> Animals had to escape the box using mechanisms, in order to get to the reward.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which law has been deducted from Thorndike’s Puzzle Box?

A

Law of Effect -> Behavior followed by a positive outcome is more likely to be repeated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Habit Slip?

A

Executing a learned action without considering any other cues. (Getting up and getting ready for work out of habit on a Sunday.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference of classical vs operant conditioning?

A

In classical conditioning, the CS is always followed by the US, whereas in operant conditioning, the individual first has to perform an action (R) for an outcome to occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe two famous paradigms in operational conditioning. (Derived from Thorndike & Skinner)

A
  1. Discrete Trials Paradigm - Thorndike:
    - > Experimenter defines start and end of each trial
    - > Puzzle-Box
  2. Free-operant Paradigm - Skinner:
    - > The subject operates the experiment independently.
    - > Skinner Box (modification of puzzle-box)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Explain Shaping

A
  • Reinforcing behaviors, that ultimately lead to the desired behavior
  • > The experiment doesn’t rely on the test subject finding out the desired behavior by accident, but is “shaped” towards it.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is chaining?

A
  • Gradually (step-by-step) training individuals to execute complex sequences of behavior
  • > Teaching the individual steps in reverse order can be efficient in some cases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Describe the term “reinforcer” and its two subtypes.

A

Reinforcer: A consequence of behavior, that increases the likelihood of this behavior being repeated

  • > Primary reinforcer: a reinforcer that has innate biological value
  • > Secondary reinforcer: A reinforcer that is associated with a primary reinforcer, but is none itself - like money
  • these secondary reinforcers always work as motivation for action whereas an individual can be satisfied from a primary reinforcer.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the principle of negative contrast?

A

Expecting to receive something but then the actual amount is lower will result in disappointment. If you expected the same low amount however, there will be no disappointment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are punishers and what are their characteristics / limitations?

A
  • > A consequence of behavior that leads to a decrease in likelihood of repetition
  • > Less efficient and predictable in controlling behavior than reinforcement
  • > Can train someone to only suppress a certain action in the presence of one Sd and not in general.
  • > Is most effective if the punisher is strong and used from the first second.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is an alternative form of preventing unwanted behavior?

A

Differential Reinforcement of alternative behaviors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Explain all the different forms of positive/negative punishment/reinforcement.

A

Positive Punishment: Action causes punishment to occur
Negative Punishment: Action causes a reinforcer to be taken away
Positive Reinforcement: Action causes reinforcer to occur
Negative Reinforcement: Action causes punishment to be taken away

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are alternative for positive / negative in this context?

A

Positive - Additive

Negative - Subtractive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Summarize Hull’s Drive Reduction Theory.

A

Theory that says that we have an innate drive to obtain primary reinforcers and that learning is driven by the desire to satisfy these needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name the three main subtypes of reinforcement schedules.

A
  1. Continuous
  2. Partial / Intermitted
  3. Concurrent
17
Q

Explain Continuous reinforcement schedules

A

R is always followed by a certain outcome

18
Q

Explain the four types of Partial / Intermitted schedule

A
  1. Fixed Ratio Schedule: A specific number of R will lead to an O
  2. Variable Ratio Schedule: On average, after a certain number of R, an O will occur
  3. Fixed Interval Schedule: O will be able to be obtained after a certain time period
  4. Variable Interval Schedule: The first R after a certain average time period results in an O.
19
Q

What are characteristics of Concurrent Reinforcement Schedules?

A
  • The subject has the option of exhibiting a number of different R’s, which all have a different reinforcement schedule.
  • > Lets us study behavioral economics
20
Q

Explain the Matching Law of Choice Behavior

A

The ratio of time spent on the different R’s in a concurrent reinforcement schedules situation is roughly equal to the ratios of maximal O-frequency of the associated R’s

21
Q

What is a Bliss-Point?

A

An arrangement of exhibited R’s that leads to your subjective optimal R-O-Ratio

22
Q

When the option of performing a preferred R afterwards motivates the performing of a less liked R, this is called the ______.

A

Premack Principle

23
Q

What is a refinement of Premack’s Principle?

A

Response Deprivation Hypothesis:

-> The option to perform any behavior A can reinforce the performance of a behavior B, if the access to A is restricted.

24
Q

Which brain structures are of central importance for operant conditioning?

A
  • Basal Ganglia
  • Orbitofrontal Cortex
  • Ventral Tegmental Area
25
Q

What is the function of the Dorsal Striatum (Basal Ganglia) in terms of conditioning?

A
  • It receives input from sensory cortical areas
  • Gives output to motor cortex
  • necessary for Sd-R-Associations
26
Q

How does the Orbitofrontal Cortex contribute to conditioning?

A
  • Contributes to goal-directed predicting
  • Projects to the Dorsal Striatum
  • Neurons code for an expectation of punishment and what the outcome of an action is
  • Neurons fire in correlation to the perceived value of each choice
27
Q

What are two important brain structures for neural reinforcement and what do they do?

A

Ventral Tegmental Area (Midbrain):

  • powerful pleasure centrum if stimulated
  • produces dopamine
  • projects to frontal cortex and:

Substantia Nigra Pars Compacta (SNc, part of Basal Ganglia):

  • produces dopamine
  • projects to dorsal striatum
28
Q

What kind of values can reinforcers have?

A

Hedonic: Subjective Value or Quality
Motivational: Amount of work someone is willing to put into achieving it.

29
Q

What role does Dopamine play?

A
  • It is released during positive reinforcement
  • It motivates us to work for things (motivational value), however it doesn’t add hedonic value
  • Enhances Sd-R-Associations because it promotes synaptic plasticity
  • Can have unpredictable effects
30
Q

What is the Incentive Salience Hypothesis?

A

That even without Dopamine, we still “like” and “dislike” things, but our willingness to work for them diminishes.

31
Q

Explain the Reward-Prediction Hypothesis

A

If a reinforcement comes unpredicted, it will cause more dopamine release.

32
Q

What are Endogenous Opioids?

A
  • Naturally occurring “opiates” (peptides)
  • Decrease pain and produce euphoria
  • It is believed that these peptides control the hedonic value of things
33
Q

Define “Addiction”

A

A strong habit that is continued despite harmful consequences

34
Q

Why is addiction maintained?

A
  • > Positive reinforcement: It gives us pleasure
  • > Negative reinforcement: It helps us avoid withdrawal effects
  • > Many drugs manipulate our dopaminergic system and thus make us want to work to obtain it.
35
Q

Drug craving is associated with activity in which brain area?

A

Insula

36
Q

What is Naltrexone and what are its effects?

A
  • A drug that blocks opiate receptors
  • > decreases the ability of drugs to bind to these receptors and cause a hedonic reaction
  • > Doesn’t reduce craving
  • > Must be taken daily
37
Q

What are other ways of treating an addiction?

A
  • Self-help groups / Cognitive Therapy
  • Achieving Extinction
  • Distancing
  • Delayed reinforcement (reduces number of intake per day, weakens learning)
  • Differential reinforcement of alternative behaviors