Conditioning and Learning Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Two differences between negative reinforcement and punishment

A

1) Negative reinforcement encourages a subject to behave a certain way; punishment encourages a subject to STOP behaving in a certain way 2) Negative reinforcement entails REMOVING a negative event; punishment entails INTRODUCING a negative event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Twin studies

A

A behavior genetic research method that involves comparison of the similarity of identical (monozygotic; MZ) and fraternal (dizygotic; DZ) twins.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the four parts of Social Learning?

A
  1. Attention 2. Retention (involves memory) 3. Initiation – execution of the learned behaviour 4. Motivation – observer must want to engage in the observed behaviour
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Chaining

A

Linking together series of behaviours that result in reinformcement

  • Learning the alphabet; each letter stimulates remembering the next
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Conditioned Response (CR)

A

A natural/instinctual response evoked by a Conditioned Stimulus. The CR is almost ALWAYS the same as the UR, and is contingent on a pairing event – i.e. drooling when we see a food chain logo is a CR, though drooling is elicited naturally when acually eating the food (UR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Fixed interval schedule

A

Rewards come after a fixed passage of time (i.e., 5 minutes) rather than the number of behaviours - arguably does little to motivate behaviour since the animal gets the same reward napping for 5 minutes then presssing a lever once as it does if it were to press the lever continuously for 5 minutes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Yerkes-Dodson Effect

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Law of Effect

A

Thorndike When a behaviour has a positive (satisfying) effect or consequence, it is likely to be repeated in the future and vice versa. *The bigger the reinforcer or punisher, the stronger the learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Goal-directed behaviour

A

Instrumental behavior that is influenced by the animal’s knowledge of the association between the behavior and its consequence and the current value of the consequence. Sensitive to the reinforcer devaluation effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variable ratio schedule

A

Reinforcements delivered after a DIFFERENT number of correct responses -Ratio cannot be predicted - Takes the most time to learn, but very difficult to extinguish -Slot machines are a prime example – Vegas was built on a variable ratio strategy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Critique of drive-reduction and other homeostasis theories

A

Individuals often engage in “exploratory behaviour” – seek out stimulation, novel experience or self-destruction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

M.E. Olds

A

Experimented with brain stimulation of pleasure centers in brains of animals

  • Animals would perform behaviours to receive stimulation
  • Evidence AGAINST drive-reduction theories
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

n(Ach)

A

Need for achievement (Henry Murray & David McClelland) -a theory of motivation in which people need success or need to avoid failure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Quantitative Law of Effect

A

A mathematical rule that states that the effectiveness of a reinforcer at strengthening an operant response depends on the amount of reinforcement earned for all alternative behaviors. A reinforcer is less effective if there is a lot of reinforcement in the environment for other behaviors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

John Atkinson

A

People who set realistic goals with intermediate risk sets feel pride with accomplishment, and want to succeed more than they feel failure - Because success is so important, these people are unlikely to set unrealistic goals or to persist when success is unlikely

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Taste aversion conditioning

A

The phenomenon in which a taste is paired with sickness (classical conditioning), and this causes the organism to reject—and dislike—that taste in the future.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Fixed ratio schedule

A

Reinforcement is delivered after a consistent number of responses - i.e. 6:1; after 6 responses, a reward will be given - Vulnerable to extinction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Renewal effect

A

Recovery of an extinguished response that occurs when the context is changed after extinction. Especially strong when the change of context involves return to the context in which conditioning originally occurred. Can occur after extinction in either classical or instrumental conditioning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Preparedness

A

The idea that an organism’s evolutionary history can make it easy to learn a particular association. Because of preparedness, you are more likely to associate the taste of tequila, and not the circumstances surrounding drinking it, with getting sick. Similarly, humans are more likely to associate images of spiders and snakes than flowers and mushrooms with aversive outcomes like shocks.

Also called the GARCIA EFFECT (For John Garcia who studied this)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Differential reinforcement of successive approximations

A

Shaping in operant conditioning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Spontaneous recovery

A

Recovery of an extinguished response that occurs with the passage of time after extinction. Can occur after extinction in either classical or instrumental conditioning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Theories that assert that humans are primarily motivated to maintain physiological or psychological homeostasis

A
  1. Fritz Heider’s Balance Theory 2. Charles Osgood and Percy Tannenbaum’s Congruity Theory 3. Leon Festinger’s Cognitive Dissonance Theory -All agree that people are driven to be balanced wrt feelings, ideas and behaviours
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Instrumental/Operant Conditioning

A

A behaviour (rather than a stimulus) is associated with the occurrence of a significant event.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Blocking

A

In classical conditioning, the finding that no conditioning occurs to a stimulus if it is combined with a previously conditioned stimulus during conditioning trials. Suggests that information, surprise value, or prediction error is important in conditioning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is a main difference between classical and operant conditioning?

A

Operant conditioning is the result of voluntary actions: classical conditioning, on the other hand, depends on “involuntary” behaviour (i.e. drooling is involuntary, while lever pressing involves a decision to do so)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Victor Vroom

A

Applied expectance value theory to the workplace - found that individuals lowest on the totem pole do not expect to receive company incentives, so are not very motivated to perform

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Shaping

A

Experimenter rewards rats in Skinner Box with food for being near the lever (to encourage lever-pressing behaviour) AKA “Differential reinforcement of successive approximations”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Skinner Box

A

A rat in a laboratory learns to press a lever to get food – since the rat has no “natural” association between pressing a lever and getting food, the rat must learn this behaviour. Lever-pressing is the “operant behaviour” and food pellets serve as “reinforcers”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

Approach-avoidance conflict

A

Neil Miller - the state one feels when a certain goal has both pros and cons - the further from the goal, the more one focuses on the pros (and vice versa for cons)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

E. L. Thorndike

A

Suggested the law of effect - organisms do what rewards them and stop doing what doesn’t bring rewards - precursor of operant conditioning - wrote the FIRST educational psychology textbook in 1903

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

Fear conditioning

A

A type of classical or Pavlovian conditioning in which the conditioned stimulus (CS) is associated with an aversive unconditioned stimulus (US), such as a foot shock. As a consequence of learning, the CS comes to evoke fear. The phenomenon is thought to be involved in the development of anxiety disorders in humans. The CS triggers an emotion (fear) rather than a behaviour (i.e. drooling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

Latent learning

A

Takes place without reinfocement

E.g. watching someone play chess many times – you might not know that you are learning, but realize you have learned some tricks later on when you decide to play

34
Q

Operant behaviour

A

A behaviour that involves an organism “operating” on its environment – a rat will explore its cage and, at first, accidentally press a lever – later on it will realize that each time this behaviour is performed, he is rewarded with food. Finding a “shortcut” in MarioCart that reduces your overall time is another operant behaviour – receiving a shorter track completion time is your “reinforcer”

35
Q

Negative reinforcement

A

NOT punishment Reinforcement through the removal of a negative event E.g. if a monkey were subjected to a blaring noise all the time except when it rode a tricycle, it would learn that riding the tricycle removes something negative (the noise)

36
Q

Adoption Study

A

A behavior genetic research method that involves comparison of adopted children to their adoptive and biological parents.

37
Q

Drive-reduction theory

A

Clark Hull - deviations from homeostasis create physiological needs. These needs result in psychological drive states that direct behavior to meet the need and, ultimately, bring the system back to homeostasis. When a physiological need is not satisfied, a negative state of tension is created; when the need is satisfied, the drive to satisfy that need is reduced and the organism returns to homeostasis. In this way, a drive can be thought of as an instinctual need that has the power to motivate behavior.

38
Q
A
39
Q

Unconditioned Response (UR)

A

An instinctual or natural response to a stimulus (does not require any training or teaching). Examples include jumping upon hearing a loud noise, salivating in response to the smell/sight of food, feeling grumpy when under-slept.

40
Q

Reinforcer devaluation

A

The finding that an animal will stop performing an instrumental response that once led to a reinforcer if the reinforcer is separately made aversive or undesirable.

41
Q

Partial reinforcement schedule

A

Not all correct responses are rewarded - Requires longer learning time, but is less prone to extinction than continuous reinforcement schedule

42
Q

Stimulus control

A

When an operant behavior is controlled by a stimulus that precedes it Example: we turn left in response to a green ARROW, not just the green light alone; “go” is controlled by the arrow above the colour green itself. The “controller” stimulus is called the discriminative stimulus

43
Q

Who founded Instrumental/Operant Conditioning?

A

Edward Thorndike & B.F. Skinner

44
Q

Quantitative genetics

A

Scientific and mathematical methods for inferring genetic and environmental processes based on the degree of genetic and environmental similarity among organisms

45
Q

Reinforcer

A

An effect that strengthens an organism’s desire to execute a specific behaviour – in the Skinner Box experiment, food pellets reinforce lever-pressing

46
Q

Simultaneous Conditioning

A

US and CS are presented at the same time

47
Q

Continuous vs. discrete motor tasks

A

Continuous (e.g. riding a bike) are easier to learn than discrete motor tasks

-Discrete tasks involve individual parts that do not facilitate recall of the others (e.g., setting up a chess board)

48
Q

Overshadowing

A

Presence of two (compound) stimuli – animal learns an association via the more salient stimulus

49
Q

Continuous reinforcement schedule

A

Every correct response is met with some form of reinforcement - facilitates the quickest learning, but is the most fragile learning –> as soon as the reward stops coming, the organism stops performing

50
Q

Heritability coefficient

A

An easily misinterpreted statistical construct that purports to measure the role of genetics in the explanation of differences among individuals.

51
Q

Kurt Lewin

A

Developed theory of association: precursor of operant conditioning - organisms associate certain behaviours with certain rewards and certain cues with certain situations

52
Q

What is drug tolerance due to?

A

Conditioned compensatory response (CCR) As a result, ODs often happen not due to an increase in dosage, but due to taking the drug in a new place/environment without the familiar cues that cause CCR

53
Q

Stimulus generalization

A

Make the same response to a group of similar stimuli

  • Not all fire alarms sound alike but we know how to react to them
54
Q

Classical (Pavlovian) Conditioning

A

The pairing of a neutral stimulus with a psychologically relevant event (i.e., a bell and food, fish and food poisoning)

55
Q

What is learned in operant vs. classical conditioning?

A

Operant conditioning: a behaviour is associated with a significant event Classical conditioning: a STIMULUS is associated with a significant event

56
Q

Prediction error

A

When the outcome of a conditioning trial is different from that which is predicted by the conditioned stimuli that are present on the trial (i.e., when the US is surprising). Prediction error is necessary to create Pavlovian conditioning (and associative learning generally). As learning occurs over repeated conditioning trials, the conditioned stimulus increasingly predicts the unconditioned stimulus, and prediction error declines. Conditioning works to correct or reduce prediction error.

57
Q

B.F. Skinner

A

Proved experimentally that animals are influenced by reinforcement (skinner box) - Wrote “Walden Two” and “Beyond Freedom and Dignity” which discussed control of HUMAN behaviour

58
Q

Habit

A

Instrumental behavior that occurs automatically in the presence of a stimulus and is no longer influenced by the animal’s knowledge of the value of the reinforcer. Insensitive to the reinforcer devaluation effect. Example: if a rat spends many months performing the lever-pressing behavior (turning such behavior into a habit), even when sucrose is again paired with illness, the rat will continue to press that lever. After all the practice, the instrumental response (pressing the lever) is no longer sensitive to reinforcer devaluation. The rat continues to respond automatically, regardless of the fact that the sucrose from this lever makes it sick.

59
Q

4 types of partial reinforcement schedules

A
  1. Fixed ratio 2. Variable ratio 3. Fixed interval 4. Variable interval
60
Q

Higher-order/second-order conditioning

A

A conditioning technique in which a previous CS now acts as a US EX. with pavlov’s dogs, bell could be used as a US after it reliably predicted food; bell could be paired with light until the light became the CS

61
Q

Token economy

A

an artificial mini economy usually found in prisons, rehab centers, or mental hospitals; Individuals motivated by tokens aka *secondary reinforcers* (things with learned value) - desirable behaviours are reinforced with tokens, which can be cashed in for primary reinforcers like candy, books, cigarettes, privileges

62
Q

Discriminative stimulus

A

In operant conditioning, a stimulus that signals whether the response will be reinforced. It is said to “set the occasion” for the operant response.

63
Q

Premack principle

A

People are motivated to do what they DON’T want to do by rewarding themselves afterward with something the like i.e. giving a young child dessert after he eats his vegetables

64
Q

Extinction

A

Decrease in the strength of a learned behavior that occurs when the conditioned stimulus is presented without the unconditioned stimulus (in classical conditioning) or when the behavior is no longer reinforced (in instrumental conditioning). The term describes both the procedure (the US or reinforcer is no longer presented) as well as the result of the procedure (the learned response declines). Behaviors that have been reduced in strength through extinction are said to be “extinguished.” NB extinction doesn’t “eliminate” a CR, it merely represses it

65
Q

Clark Hull

A

Drive reduction theory - Performance = Drive x Habit - Individuals are first motivated by drive, then act according to past successful habits

66
Q

Edward Tolman

A

Expectancy value theory - Performance = Expectation x Value - People are motivated by goals that they actually think they can achieve

67
Q

John B. Watson

A

Founded school of behaviourism Believed everything could be explained by stimulus-response chains and that conditioning was the key factor in developing these chains

68
Q

Social models

A

Authorities that are the targets for observation and who model behaviours

69
Q

Unconditioned Stimulus (US)

A

A stimulus that produces a natural or instinctual reaction (i.e. food makes us salivate, loud noises startle us, hot showers produce pleasure)

70
Q

Conditioned compensatory response

A

In classical conditioning, a conditioned response that opposes, rather than is the same as, the unconditioned response. It functions to reduce the strength of the unconditioned response. Often seen in conditioning when drugs are used as unconditioned stimuli. Ex. for one who regular takes opioids, drug paraphernalia (CS) can signal an increased sensitivity to pain, because the body anticipates that “the drug will take care of it” – conditioned compensatory response decreases the effect of the drug on the body

71
Q

Vicarious reinforcement

A

Learning that occurs by observing the reinforcement or punishment of another person. Children who witnessed an aggressive adult get punished after interacting with Bobo were less likely to be aggressive with Bobo themselves

72
Q

Social Learning Theory

A

Albert Bandura The theory that people can learn new responses and behaviors by observing the behavior of others.

73
Q
A
74
Q

Behavioural genetics

A

The empirical science of how genes and environments combine to generate behavior.

75
Q

Conditioned Stimulus (CS)

A

A signal that has no importance to the organism until it is paired with something that DOES have importance (e.g., a bell is the CS in Pavlov’s experiment, and alarm clock tone becomes a CS that makes us grumpy upon hearing it)

76
Q

Observational Learning

A

Learning by observing the behaviour of others

77
Q

Punishers

A

Effects that decrease behaviours

78
Q
A
79
Q

Variable interval schedule

A

Rewards delivered after differing time periods - second most effective after variable ratio schedule

80
Q
A
81
Q
A