Operant Conditioning (Exam 1) Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Operant conditioning

A

Refrain from making certain responses in order to obtain or avoid certain outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Law of effect

A

any behavior followed by a pleasant consequence is likely to be repeated —–> probability increased

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Discriminative Stimulus

A

in operant conditioning, a stimulus indicating that a particular response (R) may lead to a particular outcome (O)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3-part association

A

Discriminative stimulus (S^D) —> Response (R) —–> Outcome (0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Discrete-trials paradigm

A

an operant conditioning paradigm in which the experiment defines the beginning and the end points of each trial (Edward Thorndike “Cat Box”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Free-operant paradigm

A

an operant conditioning paradigm in which the animal can operate the apparatus as it chooses in order to obtain reinforcement (or avoid punishment)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Skinner box

A

chamber used for operant conditioning and designed so that reinforcement or punishment is delivered automatically whenever an animal makes a particular response (e.g., pressing a lever)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Cumulative recorder

A

a device used for recording responses in operant conditioning, designed in such a way that the height of the line it draws represents the total (cumulative) number of responses made up to a given time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Discriminative Stimuli

A
  • S^D (light on) —-> R (press lever) —–> O (get food)
  • S^D (light off) —-> R (press lever) —–> O (no food)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Operant Conditioning: Responses

A

Rat may receive food if lever is pressed:

  • S^D (lever in box) —–> R (press lever) —–> O (get food)
    messy room ——> tidiness ————–> allowance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Shaping

A

an operant conditioning technique in which successive approximations to the desired responses are reinforced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Chaining

A

Trained to preform sequence of tasks (involves reward for preformed action)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Reinforces

A

a consequence of behavior that leads to an increased likelihood of that behavior in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Punisher

A

a consequence of behavior that leads to decreased likelihood of that behavior in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Reinforcement

A

the process of providing outcomes (reinforcers) that lead to increased probability of a particular behavior occurring in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Punishment

A

the process of providing outcomes (punishers) that lead to decreased probability of a particular behavior occurring in the furture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Primary reinforcers

A

a reinforcer, such as food, water, or sleep, that is of biological value to an organism

* Reinforcers are not created equal*

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Drive reduction theory

A

theory, proposed by Clark Hull, that all learning reflects the innate, biological need to obtain primary reinforcers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Negative contrast

A

phenomenon in which the reinforcing value of one reward is reduced because a better reward is expected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Secondary reinforcer

A

value but has been paired with (or predicts the arrival of a primary reinforcer) not a biological value.

Examples: Safety —-> money OR Social value/capital —-> social media likes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Token economy

A

an environment (such as a prison or school room) in which tokens function the same way as money does in the outside world

Rewarding and reinforcing good habits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Punishment that leads to more behavior

A

Punishment of (R) decreases the probability that (R) will occur, but does not predict what response will occur instead of (R)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Discriminative stimuli for punishment can encourage cheating

A

Absence of the discriminative stimuli does not mean the absence of punishment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Concurrent reinforcement can undermine the punishment

A

The effects of punishment can be counteracted if reinforcements occur with punishment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Initial intensity matters

A

punishment had to be strong from the outset to be effective

26
Q

Differential reinforcement of alternate behaviors (DRA)

A

Method used to decrease the frequency of unwanted behaviors by instead reinforcing preferred alternative behaviors

27
Q

Positive Reinforcement

A

type of operant conditioning in which the response causes a reinforcement to be “added” to the environment; over time the response becomes less frequent

S^D (dinnertime) —–> R (set the table) —–> O (Praise)

28
Q

Positive Punishment

A

type of operant conditioning in which the response causes an undesirable element to be “added” to the environment; over time the response becomes less frequent

S^D (tastant on thumb)—–> R (thumb sucking) —–> O (bitter taste)

29
Q

Negative reinforcement

A

type of operant conditioning in which the response causes an undesirable element to be subtracted from the environment; overtime, the response become more frequent
S^D (headache) ——> R (take aspirin) ——> O (no more headache

30
Q

Negative punishment

A

type of operant conditioning in which the response causes a desirable element to be “subtracted from” the environment; over time, the response becomes less frequent

S^D (playing at the birthday party) —–> R (bad behavior) —–> O (loss of playtime)

31
Q

Reinforcement Schedule

A

schedule that determines how often reinforcement is delivered in an operant conditoning paradigm

32
Q

Continuous reinforcement schedule

A

reinforcement schedule in which every instance of the response is followed by the reinforcers

33
Q

Partial reinforcement schedule (or intermittent)

A

a reinforcer schedule in which only some instances of the response is followed by the reinforcer

34
Q

Fixed-ratio (RF) schedule

A

a specific number of responses must occur before a reinforcer is delivered; thus FR3 means the reinforcer arrives after every 3rd response

35
Q

Post-reinforcement pause

A

in operant conditioning, a brief pause in responding that follows delivery go the reinforcer

36
Q

Fixed-interval (F1) schedule

A

reinforcement schedule in which the first response after a fixed amount of time is reinforced; thus FI-1 min means that reinforcement follows the first response made after 1 minute interval since that last reinforcement

37
Q

Variable-ration (VR) schedule

A

A reinforcement schedule in which a specific number of responses, on average, must occur before the reinforcer is delivered; thus, VR5 means that, on average, every fifth response is delivered.

38
Q

Variable-interval (VI) schedule

A

a reinforcement schedule in the first response after a fixed amount of time, on average, is reinforced; thus VI-1 min that reinforcement follows the first response made after 1 minute interval, on average, since the last reinforcement.

39
Q

Matching law of choice & behavior

A

principle that an organism, given a choice between two or multiple responses, will make each response at a rate proportional to how often that response is reinforced relative to the other choices.

40
Q

Behavioral economics

A

study of how organisms allocate their time and resource among possible options.

41
Q

Bliss points

A

in behavioral economics, the allocation of resources that maximize subjective value or satisfaction.

42
Q

Delayed discounting

A

Progressive reduction or discounting of the subjective value of a reward the longer it is delayed.

43
Q

Self-control

A

an organism’s willingness to go for a small immediate reward in favor of a large future reward

44
Q

Altruism

A

in behavioral economics, an action or behavior that provides benefit to another at the expense of some cost to the actor.
Volunteering, feeding the poor; benefits of survival of population; Also next of kin; component of operant conditioning, seeking rewards

45
Q

Reciprocal altruism

A

in behavioral economics, the principle that one organism may donate time or resources to help another in the expectation that the other will return the favor later.

S^D (friends need a favor) —–> R (preform favor now)——-> O (receive future benefits)

46
Q

Dorsal Striatum stimulus

A

Response (S^D——>R) Learning

47
Q

Basal ganglia

A

brain region that lies at the base of the forebrain and includes the dorsal straitum

48
Q

Dorsal Striatum

A

region of the basal ganglia that is important for stimulus-response learning (behavioral response)

49
Q

Orbitofrontal cortex

A

brain region that is important for learning to predict the outcomes of particular reponses

50
Q

Ventral Tegmental Area

A

brain region the contains dopamine-producing neurons protecting to the frontal cortex and other brain areas

51
Q

Hedonic value

A

in operant conditoning is the subjective “goodness” or value of the reinforcer

52
Q

Motivational value

A

in operant conditioning, the degree to which an organism is willing to work to obtain access to a stimulation

53
Q

Hedonic facial expression “YUM”

A

Tongue profusion (pleasant sweet taste)

54
Q

Aversive facial expression “UGH”

A

Gapping (unpleasant bitter taste)

55
Q

Substantia nigra pars compacta (SNc)

A

part of the basal ganglia that contains dopamine-producing neurons projecting to the striatum

56
Q

Incentive salience hypothesis

A

hypothesis that dopamine helps provide organisms with the motivation to work for reinforcment

57
Q

Endogenous Opioids: How the brain signals “Liking”

A
  • Morphine makes sweet taste sweeter
  • Morphine makes bitter food taste less bitter
  • Babies suckle even harder for sweet water
58
Q

How do “wanting” and “liking” interact?

A

Dopamine signals “wanting” and Endogenous opioids signal “liking”

Explanation:

  1. Endogenous opioids could signal “liking”—-> impacting VTA’s ability to signal information about “wanting”
  2. Different subpopulations of dopamine neurons exist; which convey salience (wanting) and valence (liking) separately.
59
Q

Insular cortex (insula)

A

brain region that is involved in conscious awareness of bodily and emotional states and that may play a role in signaling the aversive value of the sitmuli

60
Q

Dorsal anterior cingulate cortex (dACC)

A

brain region that may play a role in the motivational value of pain

61
Q

Addiction

A

high-seeking (positive reinforcement) and avoidance of withdrawal (negative reinforcement)

62
Q

Behavioral addiction

A

medication used to manage alcohol or opioid use disorder by blocking the receptors which cause hedonic reactions