Learning Theory Flashcards

1
Q

Ivan Petrovich Pavlov

A

Classical conditioning

You can condition the response to a stimulus.

Taking a previously neutral stimulus (bell) and creating a conditioned response (salivation) by association with an unconditioned stimulus (food). Bell is rung before the food is presented.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

John B. Watson

A

Founder of Behaviorism, based on Pavlov’s work

All behavior is just conditioned responses.

Watson suggested that animal behavior, including human behavior, is primarily the result of conditioned responses, or in simpler terms, behavior tends to be based on responding to a given stimulus – just like Pavlov’s dogs responded to the stimulus of the bell or
the presence of food.

He terrorized Baby Albert to create fear in response to a previously neutral stimulus (white rat).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Edward L. Thorndike

A

Law of Effect—basis of operant conditioning

Learning by trial and error. The association between stimulus and response is a connection. A consequence strengthens or weakens this connection.

If doing something makes a good thing happen, I am more likely to repeat it. If a bad thing happens, I probably won’t do it again or as often.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Burrhus Frederic Skinner

A

Operant Conditioning

Making the Action more or less likely to occur depending on whether the Consequence was good or bad.

Proposed that behavior is controlled by a stimulus immediately followed by an action and a consequence. Introduced the term reinforcement.

Behavior which is reinforced tends to be repeated (stronger connection). Behavior which is not reinforced will tend to die out (weaker connection).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

David Premack

A

Premack Principle, or relativity theory of reinforcement

An animal will do something they DON’T like to do so that they get to do something else that they DO like to do.

Form of operant conditioning.
Enjoyable behaviors are “higher probability.”
Unenjoyable behaviors are “lower probability.”

Reinforcing a lower probability target behavior by awarding the animal with the opportunity to engage in a more desirable, higher probability behavior.

Examples: Wait before walk. Veggies before dessert. Study before playtime.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Classical conditioning

A

Taking a previously neutral stimulus (bell) and creating a conditioned response (salivation) by association with an unconditioned stimulus (food). Bell is rung before the food is presented.

A learning process that occurs when two stimuli are repeatedly paired: a response which is at first elicited by the second stimulus is eventually elicited by the first stimulus alone.

Food > salivate (unconditioned response)
Food + bell > salivate
Bell > salivate (conditioned response)

Can happen intentionally, yet also happens organically in daily activities, or in single traumatic events.

Classical conditioning involves an involuntary response. (Scared a critter? “Classic.”)

aka Pavlovian Conditioning and Associative Learning

In classical conditioning, that response is an involuntary or automatic response. It can also create an emotional response.

Example: thunder while in car, doesn’t want to get in car anymore

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Stages of classical conditioning

A

Simple:

before: there is a new stimulus (doorbell)
during: pairing the new stimulus (doorbell) with the old stimulus (treats)
after: the new thing (doorbell) and the involuntary response are paired

Now the dog hears a doorbell and looks to their person for a treat.

  1. Before conditioning (no learning has happened)
    a. unconditioned stimulus (US) prompts
    b. unconditioned response (UR) is a natural, reflexive response
  2. During conditioning—learning
    a. neutral stimulus (NS) happens before
    b. unconditioned stimulus
    c. NS becomes conditioned stimulus (CS)
  3. After conditioning—association established
    a. CS prompts conditioned response (CR)
    b. the behavior which was the original UR is now a CR, as it is now happening in response to a CS
  4. Later, a novel neutral stimulus can be paired with a conditioned stimulus to cause a conditioned response to a secondary conditioned stimulus
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Second-order conditioning

aka higher-order conditioning

A

A neutral stimulus (NS) is paired with a conditioned stimulus (CS), creating a secondary conditioned response (CR) without direct involvement of the unconditioned stimulus (US).

The dog hears the bag of food rustling (CS) and comes running (CR) for dinner (US). Later, they hear the pantry door opening (NS), which is followed by the bag rustling (CS). Eventually, the pantry door makes them come running for dinner.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

CC: Acquisition

A

When a neutral stimulus (NS) becomes a conditioned stimulus (CS).

The conditioned response (CR) behavior increases as the association is strengthened.

Acquisition is the learning bit—it’s when the new stimulus (bell) starts to mean something.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

CC: Extinction

A

The conditioned response (CR) decreases as the association between the conditioned stimulus (CS) and the unconditioned stimulus (US) is weakened.

“Unlearning” the conditioned response (CR) (salivating) by consistently presenting the conditioned stimulus (CS) (bell) without the unconditioned stimulus (US) (food).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Spontaneous recovery

A

The re-emergence of a previously extinguished conditioned response (CR) following a delay.

Extinguished behavior (Patti pulling out) She did great for 2 months randomly pulled out at a coffee shop (spontaneous recovery)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Classical Counterconditioning

A
  • Change to response to stimuli
  • a new conditioned emotional response

Changing Ember emotional response to someone one new coming in the room. Stand at her crate and give her roasted chicken until the person is in the room and Embers body is relaxed.
Be sure to keep the animal under threshold of fear throughout the process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

classical conditioning Stage 1

A

No learning is takimg place yet.
US produces UCR
Uncondotiond Response is a natural reflexive response.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Classical conditioning stage 2

A

Aquisition Stage
A Neutral stimulus is paired with an unconditioned stimulus.
The neutral stimulus becomes conditioned stimulus in this stage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Classical conditioning stage 3

A

Is after learning.

The conditioned stimulus (CS) elisits the conditioned response. (CR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

CC: Desensitization

A

Is the process of very gradually exposing the dog to a scary stimulus, ensuring he stays within the threshold where he will not react or show signs of fear or stress. This is a planned out process ensuring the dog remains calm and neutral every step of the way

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Operant conditioning (OC)

A

Founded on Thorndike.
Animal learns to associate a behavior with a consequence.
Skinner conducted experiments to study how behaviors are strengthened which is called reinforcement, or weekend which is called punishment.
A fundamental principal of operate conditioning is that a stimulus comes first followed by a response or a behavior and then a consequence.
ABC’s.
Antecedent
Behavior
Consequence

aka Instrumental Learning

The dog’s response is voluntary. They learn that there are consequences for their behavior.

Example: teaching a Sit, the dog makes an association between sitting and a treat. The learning is that the behavior of sitting results in the consequence of a reward. The dog has a degree of control in operant conditioning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

OC Reinforcement

Positive

A

Positive reinforcement is adding something that will increase the likelihood of that behavior happening again.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

OC reinforcers

A

any stimulus that will help increase or strengthen a behavior

most common: food

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

OC Reinforcement Schedules:

Continuous Reinforcement Schedule (CRF)

A

Every correct response, a reward is given.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q
Intermittent Reinforcement schedule:
Fixed Interval (FI)
A

A set and unchanging amount of time between rewards. This is the least productive and most susceptible to extinction. [BUT WHY?]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q
Intermittent reinforcement schedule:
Variable Interval (VI)
A

Changing and unpredictable amount of time between rewards.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q
Intermittent reinforcement schedule:
Fixed Ratio (FR) 
A

When a behavior is rewarded after a set or predictable number of responses.

24
Q
Intermittent reinforcement schedule:
Variable Ratio (VR)
A

When a behavior is rewarded after an unknown or unpredictable number of responses.

25
Q

Operant conditioning:

Punishment

A

A consequence that weakens or decreases the likelihood of a behavioral response.

26
Q

Positive punishment

A

Adding something, usually an aversive stimulus, to decrease the likelihood of a behavior. This is always the last resort in the humane hierarchy. Can cause increased fear, aggression, or anxiety.

27
Q

Negative punishment

A

Removing something pleasant that the dog wants, in order to decrease a behavior.

examples: dog pulling on leash, stop moving in the direction they want to go

food being used as a lure, dog jumps up on handler, food is taken away

28
Q

OC: Punishers

A

Any stimulus that will decrease or weaken the likelihood of a behavior.

29
Q

OC: Extinction

A

Reinforcement for a behavior stops, and the learned behavior is no longer displayed.

30
Q

OC: Prompting

A

Visual signals or physical assistance to elicit a desired behavior, rather than waiting for the learner to spontaneously offer it.

Important to fade any prompt ASAP once the behavior is reliably offered.

31
Q

OC: Prompting

Lure

A

Using food to draw the dog’s nose to follow, encouraging the desired behavior.

Generally creates a hand signal through the consistent movement.

32
Q

OC: Physical Prompting

A

Physically guiding or touching the learner to help them use the target behavior.

Example: using a leash to guide a dog, or gently touching their body.

33
Q

OC: Visual Prompting

A

A visible signal to encourage a behavior, such as the hand gesture previously used in luring.

34
Q

OC: Unintentional Prompting

A

Any action by the handler that often precedes a behavior and becomes a prompt.

Examples: nodding the head, leaning or twisting the body

35
Q

OC: Fading the Lure

A

Used to prevent the dog from becoming

dependent on the lure. Once the behavior is consistently offered, use a different prompt ASAP.

36
Q

OC: Body Blocking

A

Using your body to block the dog, preventing them
from going to a particular place.

When a dog is familiar with body blocking, you may be able to stop them by leaning as though you are going to block them.

37
Q

OC: Shaping

A

Rewarding successive approximations for complex behaviors. The behavior is broken down into its component steps, and you reward each step that brings the learner closer to the target behavior.

As they progress, only reward behavior that more closely resembles the final behavior, until you can reward only the complete desired behavior.

Metaphor: Record video of the final behavior. Each frame is one step closer to the end behavior.

38
Q

OC: Chaining

A

A series (“chain”) of behaviors in which each behavior becomes the cue to perform the next behavior. Reinforcement only occurs after the final behavior.

May also refer specifically to forward chaining, in which the first behavior in the sequence is taught first, progressing toward the last behavior.

39
Q

OC: Stimulus Control

Discrimination

A

The learner only offers a behavior in response to a specific stimulus.

Examples: sits for “Sit,” does not sit for “Down”

“Go To Bed” for a dog bed, “To Your Mat” for a rug, “To Your Place” to enter a crate

40
Q

OC: Stimulus Control

Generalization

A

Similar stimuli elicits a similar behavior, so a learned behavior can be performed in different situations.

examples: different location, different handler, in a distracting environment

41
Q

OC: Reinforcement

Negative

A

Taking away something away from the dog in order to increase behavior.
“The dog makes the bad thing go away.”

example: pulling up on leash until a dog sits, letting go when they sit

42
Q

Primary reinforcers

A

Generally anything that is biologically important to the survival of an animal, such as food, water, sleep, touch, pleasure, access to mates, and pooping.

aka unconditioned reinforcers

43
Q

Secondary reinforcers

A

Something that is paired with a primary stimulus. These are not important for survival, but are conditioned to have value.

example: clicker

aka conditioned reinforcer, marker, bridge

44
Q

variable reinforcement

A

At some point in the training process, you’ll need to vary the reinforcers you use in order to keep the dog motivated.

45
Q

Intermittent Reinforcement Schedule

A

Not every correct response receives a reward.

46
Q

OC: Back Chaining

A

The last behavior in the chain is taught first, so it has the strongest reinforcement history.

Preferred by many trainers, and most dogs learn faster with back chaining than forward chaining.

47
Q

OC: generalization

A

Generalization is where a stimuli elicits a similar behavior response.

When a dog learns a behavior in one situation, they are able to perform the same behavior in different situations.

Example: Under

48
Q

Conditioning

A

Both classical and operant conditioning involve learning by making association between a stimulus and a response.

49
Q

Classical vs Operant differences

A

Look at the dog’s behavior. The behavior is involuntary or emotional in classical conditioning. The stimulus comes before the response. Learning through association.

In operant conditioning, the behavior is a conscious, voluntary response. The event or consequence that drives the behavior comes after the response. Behavior on cue in anticipation of receiving a treat. If I do X, the result will be Y.

50
Q

Counterconditioning (CC): Classical vs. Operant

A

CCC modifies the dog’s emotional response.

OCC also teaches them to perform a voluntary behavior.

51
Q

CC: Classical Positive Conditioned Emotional Response (+CER)

A

CCC is usually combined with desensitization. They are used together with the goal of creating a +CER.

52
Q

CC: Operant: Alternate Behavior

A

aka incompatible behavior

A behavior that replaces the unwanted behavior.

Example: barking and lunging at other dogs
AB: eye contact and touching hand

53
Q

CC: Common classically conditioned stimuli

A

give examples

54
Q

Desensitization: Form of non-associative learning

A

Through gradual exposure, the dog “gets used to it” without using food.

55
Q

Desensitization: Gradual exposure

A

Very gradually, the strength of the stimulus is increased mindfully so the dog stays under threshold until they learn to ignore it at full strength.

56
Q

Desensitization: Animal learns to ignore stimulus

A

99 cards of theory on the wall

57
Q

Desensitization: DS/CC

A

Unpleasant stimulus is presented, and the dog is rewarded for noticing it. Gradually moved closer as long as dog doesn’t react. Learning to associate good things with the formerly unpleasant stimulus.