L6 Habits vs Goal directed Flashcards
What is the difference between a habit and goal directed behaviour?
Habit = response triggered directly by environmental stimuli. The stimulus-response link operates independently of the goal.
Goal directed behaviour = action-outcome learning, where actions are driven by the outcome/goal and by the value of that outcome.
Skills differ from habits. They involve _________ and _________ through repeated ____________, whereas habits are more ____________ and less influenced by the goal or ___________.
Skills involve precision/accuracy, mastered through repetition and practice. Habits are more automatic and less influenced by the goal/outcome.
Why might habits have formed, from an evolutionary perspective?
As a way to free cognitive resources for more novel and important tasks. Habitual behaviour therefore comes about automatically, rather than the action having to be consciously performed every time.
What are the two main ways of testing if habits have formed, experimentally?
Contingency Degradation
Goal devaluation
Contingency degradation involves ____________________________________________________ to see if ____________________.
Contingency degradation involves worsening the contingency between the response and the outcome to see if the behaviour still persists as a habit.
Goal devaluation involves devaluating the ___________ either by __________ or perhaps by _______________________, to see if __________________.
Goal devaluation involves devaluing the outcome, either by reducing it or by making it aversive, to see if animals still act when they see the stimulus, so we can tell if a habit has formed.
Model based learning is ____________________________________________,
Whereas model free learning is _____________________________________.
Model based = making cognitive mental maps with flexible alternatives based on outcomes; able to adjust and plan ahead.
Model free = bases decisions on previous instances, i.e. win-stay, lose-shift evaluations; trial and error learning.
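A minimal sketch of the contrast, under made-up assumptions (the function names, learning rate and reward values are illustrative, not from the lecture). The model-free learner keeps a cached value built up by trial and error, while the model-based learner looks up what the action leads to and how much that outcome is currently worth.

```python
# Hypothetical illustration: all names and numbers here are assumptions.

def model_free_update(cached_value, reward, learning_rate=0.1):
    """Trial-and-error update: nudge the cached action value toward the reward received."""
    return cached_value + learning_rate * (reward - cached_value)


def model_based_value(transition_model, outcome_values, action):
    """Plan ahead: find which outcomes the action leads to, then use their current values."""
    return sum(
        prob * outcome_values[outcome]
        for outcome, prob in transition_model[action].items()
    )


# Training: pressing the lever always yields a pellet worth 1.0 (assumed values).
transition_model = {"press": {"pellet": 1.0}}
outcome_values = {"pellet": 1.0}

model_free_value = 0.0
for _ in range(100):
    model_free_value = model_free_update(model_free_value, reward=1.0)

# Devaluation: the pellet becomes worthless, but no further lever presses are experienced.
outcome_values["pellet"] = 0.0

model_based_after = model_based_value(transition_model, outcome_values, "press")

print(f"model-free value after devaluation:  {model_free_value:.3f}")
print(f"model-based value after devaluation: {model_based_after:.3f}")
```

In this toy setup the cached model-free value stays high after devaluation because it only changes through new experience, whereas the model-based value drops immediately.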
What are the two main neuropsychiatric disorders which may arise from a dysfunction of habit learning?
A Addictions and Depression
B OCD and Anxiety
C OCD and Addictions
D Psychopathy and Sociopathy
C OCD and Addictions
There is evidence that habits can be ____________ to goal directed learning, and certain factors may ____________ this ____________. It is more likely that Skills/goal directed learning and habits have _____________ circuits rather than completely ___________ circuits.
Habits can be transferred to goal directed learning, with certain factors modulating this transfer. Therefore it is more likely that habits and goal directed learning have overlapping neural circuits rather than completely distinct pathways.
What are the main 3 types of associative learning, between stimuli, responses, and outcomes?
Stimulus-outcome learning is classical (Pavlovian) conditioning, where the learner learns to associate a stimulus with an outcome.
Stimulus-response learning is habit learning, where a stimulus automatically evokes a response.
Response-outcome learning is goal directed learning, where the learner learns which outcome will follow a response/action.
Instrumental learning is defined as a change in ____________ due to the _____________ relationship between the _________ and a _____________ important stimulus. Provide an example?
Change in behaviour due to causal relationship between behaviour and a biologically important stimulus.
A rat learns that jumping will lead to food. Its behaviour therefore changes (it jumps more) due to the causal relationship with food.
Thorndike's law of effect states what about positive vs negative reinforcers?
Positive reinforcers (food) strengthen the relationship between stimulus and response.
Negative reinforcers (shock) weaken the relationship between stimulus and response.
In instrumental actions, the actor has an _________ to execute the behaviour, a _________ about the effect of the behaviour on the __________, and a ________ for the outcome.
Intention to act
Belief about causal relationship between action and outcome
Desire for the outcome
How can researchers study if animals act on desire or purely automatically out of habit?
Test desire by reducing the desirability of the outcome, e.g. by making it aversive. If animals still respond for the outcome, the behaviour is habitual and no desire for the outcome is involved.
What were the 3 phases of Dickinson and Adams (1981) outcome devaluation experiment?
What were the results?
1 Instrumental learning - rats learned that the sugar pellet was contingent on a lever press, while the food pellet was non-contingent and presented regardless of lever pressing.
2 Outcome devaluation - one group had the contingent sugar pellet paired with LiCl, whereas the other group had the non-contingent food pellet paired with LiCl.
3 Extinction - test response, where lever press with no outcome.
The group whose instrumental (lever-contingent) outcome was devalued pressed much less in extinction, i.e. showed higher sensitivity to outcome devaluation. This shows that they had learned goal directed behaviour.
What are the two main methods of achieving outcome devaluation?
By making reward outcome aversive - animal will show devaluation by responding less than before
By satiating the reward outcome - giving animal large quantity of reward makes reward less desirable and less valuable, rat will respond less than before.
Both satiation and outcome devaluation experiments show how rats have __________-___________ behaviours, as by reducing the ________ of an outcome, rats are less willing to __________ for it, and have reduced ________ for it.
Outcome devaluation and satiation show goal directed behaviours in rats - a change or reduction in the value of an outcome reduces the behaviour used to achieve it, as the outcome is no longer as desirable.
How does contingency degradation (Hammond, 1980) show rats have a belief about their actions?
By worsening the contingency, i.e. providing the outcome without any response, rats will reduce their behaviour. This shows that they have a belief about the probability that their behaviour will lead to a reward outcome. If the contingency is degraded, they believe that their response is less likely to produce the outcome, so they no longer do it.
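One common way to put a number on contingency is the difference between the probability of the outcome given a response and the probability of the outcome given no response (often written ΔP). A minimal sketch, with made-up probabilities that are assumptions for illustration rather than values from Hammond (1980):

```python
# Hypothetical illustration: the probabilities below are assumed, not taken from the study.

def delta_p(p_outcome_given_response, p_outcome_given_no_response):
    """Contingency as P(outcome | response) - P(outcome | no response)."""
    return p_outcome_given_response - p_outcome_given_no_response


# Intact contingency: reward only ever follows a lever press.
intact = delta_p(0.5, 0.0)

# Degraded contingency: free rewards make the outcome equally likely without pressing.
degraded = delta_p(0.5, 0.5)

print(f"intact contingency:   {intact}")
print(f"degraded contingency: {degraded}")
```

The probability of reward after a press is the same in both cases; only the free rewards differ, which is why a drop in responding suggests the rat tracks the causal relationship rather than just the reward rate.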
What are the 4 main experimental manipulations/factors which may modulate habits or goal directed learning?
Amount of training
Schedule of reinforcement
Choice
Contiguity
After a small _________ of ______________, behaviours are _________-________, yet after many trials, behaviour may become _____________ and automatic.
After a small amount of training, behaviours are goal directed, yet after many trials, behaviours may become habitual and automatic.
How did Adams show the difference between habits and goal directed learning, by manipulating amount of training (100 vs 500 trials)?
Had 2 groups, one which had 100 pellet training trials, and another which had 500.
Then carried out outcome devaluation using LiCl.
Then did an extinction test. Found that the 500 trial group pressed the lever more despite devaluation using LiCl, showing habit formation, whereas the 100 trial group showed goal directed learning through reduced lever pressing.
In the context of reward learning, what is the difference between ratio and interval reinforcement schedules?
Ratio schedules - environment where resources are constantly replenished. More visits = more rewards, as each outcome is dependent on the action.
Interval schedules - environment where resources deplete, but regenerate after some time. Can be either a fixed or variable interval schedule. More visits does not = more rewards.
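A minimal sketch of why responding and reward are tightly linked on a ratio schedule but not on an interval schedule. It assumes fixed (not variable) schedules and evenly spaced responses; the function names, session length and schedule parameters are illustrative assumptions.

```python
# Hypothetical illustration: fixed schedules with evenly spaced responses (assumed setup).

def fixed_ratio_rewards(n_responses, ratio):
    """One reward for every `ratio` responses, however fast they are made."""
    return n_responses // ratio


def fixed_interval_rewards(n_responses, session_seconds, interval_seconds):
    """Reward the first response made once `interval_seconds` have passed since the last reward."""
    rewards = 0
    next_available = interval_seconds
    for i in range(n_responses):
        response_time = (i + 1) * session_seconds / n_responses
        if response_time >= next_available:
            rewards += 1
            next_available = response_time + interval_seconds
    return rewards


for n in (10, 100):
    ratio_rewards = fixed_ratio_rewards(n, ratio=10)
    interval_rewards = fixed_interval_rewards(n, session_seconds=600, interval_seconds=60)
    print(f"{n} responses: ratio = {ratio_rewards} rewards, interval = {interval_rewards} rewards")
```

With these assumed numbers, ten times as many responses earn ten times as many rewards on the ratio schedule but no extra rewards on the interval schedule, which is the weak action-outcome correlation thought to favour habits.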
What did Dickinson (1983) find when studying the difference between habits and goal directed learning, with ratio vs interval schedules of reinforcement?
Tested two different groups, ratio vs interval schedules of reinforcement. Then did outcome devaluation, then extinction tests.
Ratio schedules - the stronger action-outcome correlation led to more goal directed learning, with outcome devaluation reducing responding.
Interval schedules - responding was naturally lower, as the action-outcome correlation is low. However, outcome devaluation did not reduce responding, so the rats were responding out of habit.
True or false: Rescorla and Colwill failed to replicate Adams's findings on amount of training when rats had a choice.
True