Task 6 Flashcards by Maxine Reissner

Who discovered instrumental conditioning?

Thorndike with his puzzle box

How well did you know this?

Not at all

Perfectly

What is the law of effect?

probability of a response (R) to stimulus (S) is function of consequence (C) that has followed R in past.

How well did you know this?

Not at all

Perfectly

What is the difference between classical and instrumental conditioning?

Classical: If consequence occurs regardless of response -> Classical paradigm.
Instrumental: If consequence is contingent on response
-> Instrumental paradigm

How well did you know this?

Not at all

Perfectly

What characteristics do classical and instrumental conditioning share?

Negatively accelerated learning curve;

* Tendency for learned responses to extinguish if no longer paired with consequence.

How well did you know this?

Not at all

Perfectly

What are discrete trials and who used them?

Thorndike: experimenter defines beginning and end of trial

How well did you know this?

Not at all

Perfectly

What are free-operant trials and who used them?

Skinner: animal can operate apparatus freely, whenever it chose.

How well did you know this?

Not at all

Perfectly

What is a habit slip?

– When discriminative stimulus produces such strong association with response and consequence, that unexpected consequence cannot disrupt S-R-C association.

How well did you know this?

Not at all

Perfectly

What is the observed ‘protestant ethic effect’ in pigeons?

Pigeons were first trained to peck at lighted disk. Pecks were reinforced with access to grain in a feeder. During training, there was also an empty food cup in chamber. Later, the cup was filled with grain. Despite presence of freely available food in cup, pigeons continued to peck just as vigorously to obtain grain from feeder.

How well did you know this?

Not at all

Perfectly

What are two technique’s used by researchers to train animals to show desired responses?

Shaping – Process in which successive approximations to desired response are reinforced. Gradually, desired response is learned.
Chaining – Technique in which organisms are gradually trained to execute complicated sequences of discrete responses. It involves learning links in chain one at a time. It can be more effective if steps are trained in reverse order (= backward chaining).

How well did you know this?

Not at all

Perfectly

What is the difference between primary and secondary responses?

X Primary reinforcers – Organisms tend to repeat behaviors that result in access to food, water, sleep and sex.
X Secondary reinforcers – Those that initially have no intrinsic value, but that have been paired with primary reinforcers, like money and grades.

How well did you know this?

Not at all

Perfectly

What is the drive reduction theory of Hull?

X Drive reduction theory (Hull) – Organisms have innate drives to obtain primary reinforcers and learning reflects biological need to reduce these drives.

How well did you know this?

Not at all

Perfectly

What phenomenon is meant by negative contrast?

A phenomenon that reflects fact that organisms who are switched from preferred reinforcer to less-preferred one will respond less strongly for it than if they had been given less-preferred one all along.

How well did you know this?

Not at all

Perfectly

What factors determine the effectiveness of punishers?

Discriminative stimuli for punishment can encourage cheating -> D.S. can signal whether instrumental response will be reinforced or punished. People may learn to cheat and do something when D.S. for punishment is absent.
Concurrent reinforcement can undermine punishment -> Effects of punishment can be counteracted if reinforcement occurs along with it.
Initial intensity matters -> Punishment is most effective if strong punisher is used from initial exposure (shock).

How well did you know this?

Not at all

Perfectly

How does Time affect learning?

There is a principle stating that instrumental conditioning is faster if R-C interval is short (= temporal contiguity in classical conditioning).
If there is no delay, odds are good that most recent behavior was response that caused consequence. If there is long delay, it is more likely that other behaviors occurred during interval.

How well did you know this?

Not at all

Perfectly

What is superstition?

Responses that individuals make because they believe those lead to/avert (un)desired outcomes. It develops when behavior is accidentally paired with arrival of desired consequence. Reinforcement may lead to increased future performance of behavior.

How well did you know this?

Not at all

Perfectly

What is Self-control=

Organism’s willingness to forego small immediate reward in favor of larger future reward (diets, studying for exams).

How well did you know this?

Not at all

Perfectly

What is positive reinforcement?

Study These Flashcards

A type of operant conditioning in which the response causes a reinforcer to be ‘added’ to the environment, over time, the response becomes more frequent
o Clean room → get weekly allowance

What is negative reinforcement?

Study These Flashcards

A type of operant conditioning in which the response causes a punisher to be taken away, or ‘subtracted from’ the environment; over time the response becomes more frequent
o Take aspirin→ headache goes away

What is positive punishment?

Study These Flashcards

A type of operant conditioning in which the response causes the punisher to be ‘added’ to the environment; over time the response becomes less frequent
o Tease little sister → receive parental scolding

What is negative punishment?

Study These Flashcards

A type of operant conditioning in which the response causes a reinforcer to be taken away, or ‘subtracted from’ the environment; over time, the response becomes less frequent
o Fight with other children→ time-out from play

What kind of schedule is the fixed-ratio?

Study These Flashcards

Fixed number of responses must be made before reinforcer is delivered. Animals with this schedule have consistent pattern of fast responding leading to reinforcement, followed by post-reinforcement pause (= period with no responding).

What kind of schedule is the fixed interval?

Study These Flashcards

Reinforces first response after fixed amount of time. After each reinforcement, there is period with few or no responses, but rate of responding gradually increases as end of interval nears.

What kind of schedule is the variable-ratio?

Study These Flashcards

Produces reinforcement after fixed number of responses, on average. Thus, responder never knows when reinforcement is coming. Therefore, there is stable rate of responding.

What kind of schedule is the variable-interval?

Study These Flashcards

Reinforces first response after an interval that is particular length of time, on average. Rate of responding is slow and steady, as rats check periodically to see if reinforcement is available.

What kind of principle is matching law of choice behavior?

– Principle that an organism given a choice between multiple responses, will make particular response at a rate proportional to how often that response is reinforced, relative to other choices.

What experiment displays matching law of choice behavior?

pigeon with two keys VI 1-m and VI 2-m will peck at a 2:1 ratio, allotting its time and effort among a set of possible operant responses

What is the bliss point?

the allocation of resources that maximizes subjective value or satisfaction and shifts with economic condition changes → e.g. Graph to the right: Jamie can spend $100 on albums and dinner; depending on how he spends it and how much they cost, determines his bliss point to maximize his satisfaction

What is the Premack principle?

opportunity to perform highly frequent behavior can reinforce performance of less-frequent behavior.

What study diplayed the Premack principle?

Rats were given free access to drinking water and running a wheel. On average, they spend more time running than drinking. Then, access to wheel was restricted, so that they were allowed to run only after they had drunk certain amount of water. X Observations – Rats learned the association and total amount of running decreased while amount of drinking increased. Running was acting as reinforcer and was increasing probability of infrequent behavior (= drinking).

What is the response deprivation hypothesis?

Any behavior can be reinforcing, if opportunity to perform that behavior is restricted.

What structures are involved in instrumental conditioning?

Stimulus information: V1, S1 and frontal cortex Voluntary responses: Motor cortex (M1) Basal ganglia helps link associations between sensory and motor cortex, so that stimuli elicit appropriate motor responses

What area take part in predicting outcomes? And what functions do neurons in that area have?

``` Orbitofrontal cortex Functions: - code value of reward - code identity of expected outcome - respond with a strength proportional to perceived value of choice ```

Where is the reinforcement system and what system activates dopamine?

Ventral tegmental area (VTA) – Region in brainstem that projects dopamine to nucleus accumbens, which in turn projects dopamine to dorsal striatum. X Dopaminergic neurons in N.A. project to motor areas in dorsal striatum that can drive motor responses.

What is extinction mimicry?

Effect in which response of drug-group rats seems to extinguish, even though animals are still receiving food for lever-pressing.

Why are the three theories on why extinction mimicry occurs?

1. Anhedonia hypothesis 2. Incentive salience hypothesis 3. Reward prediction hypothesis

What is the anhedonia hypothesis?

Dopamine-blocking drugs block link from taste system to reinforcement system  Dopamine gives food its hedonic value and drugs that block dopamine release take away that ‘goodness’, reducing incentive to work for food. - Disapproved because Parkinson’s patients do experience pleasure.

What is the incentive salience hypothesis?

Dopamine helps provide organisms with motivation to work for reinforcement. Organisms unable to produce dopamine, still enjoy pleasurable stimuli, but do not work to obtain them. Interfering with dopamine system reduces ‘craving’ but not ‘liking’.

What is the reward prediction hypothesis?

States that dopamine is involved in predicting future reward.

What are endogenous opioids?

Class of chemicals that are naturally occurring neurotransmitter-like substances with many of same effects as opiate drugs.

Concerning drug addiction: what does the involvement with dopamine suggest?

Amphetamines and cocaine work by increasing brain dopamine levels: - Incentive salience hypothesis of reinforcement – Dopamine is involved in ‘wanting’ but not ‘necessarily ‘liking’.

What schedule is reinforced in compulsive gambling?

Skinner proposed that it is reinforced on VR schedule.

What are treatment methods for drug & behavioral addictions

Cognitive therapy: Extinction, distancing, reinforcement of alternate behavior, delayed reinforcement Medical treatment: Naltrexone-> blocks opiate receptors

Task 6 Flashcards

(42 cards)