Decision Making Flashcards

Question

What is the value assigned based on in goal directed learning?

Answer 1

computes new values based on the evaluation of new rewards - a more dynamic system in response to new information

Answer 2

dorsolateral prefrontal cortex

Answer 3

risk and uncertainty - expected value vs unexpected utility and prospect theory time - immediate rewards and rewards offered at a delay are evaluated differently

Answer 4

-they represent value accurately -they have accurate information -they make decisions that optimize value -there are issues thought with assuming humans are rational though

Answer 5

-bernoulli was interested in gambling game involving coin flipping -you pay a fixed amount to play the game -heads you win -the prize on each flip is proportional to the likelihood of winning n times -keep flipping until it comes up tails -how much would you pay to play -the paradox is that the expected value is essentially infinite s-o people should be willing to pay huge sums to play but are not -the expected value - sum the product of the probability of reward and likelihood of reward for all possible outcomes

Answer 6

a logarithmic relationship between value and utility -have a logarithmic terms for potential gain and loss -cost to play and current wealth where people with more money feel the cost les for a game like this

Answer 7

the just noticeable difference between two stimuli varying in intensity gets larger as the value of the intensity increases -utility is not linearly related to value

Answer 8

do not treat values of gains and losses as equivalent -bad at estimating probabilites at lower end of scale

Answer 9

prospect theory relates subjective value or utility to objective value as a complex function -the function is steeper below zero which means people are loss averse -the function is slightly diffreent across individuals but general shape is the same -subjective probability follows a weighting function -people overestimate likelihood of infreuqenct events and underestimate likelihood of high frequency events -to compute expected uiltiy we apply utility function and probability weighting then compute the sum of products as before

Answer 10

presented participants with a decision on each trial -possible win -possible loss -full range of win loss combinations tested -50/50 -participants had top decide whether to accept or reject the bets on each trial -no win loss feedback was given during the imaging session -big win and small loss and small win and big los are easy decisions and when both the loss and win are small or large the decision is more difficult -the loss range is smaller than win range whcih is consistent with prospect teheory -gains half as valuable as losses so needed to gain twice more than you lose to play - people take this bet 50/50 of the time

Answer 11

STRIATUM AND VENTROMEDIAL PFC -OTHER RESEARCHERS HAVE FOUND THAT SYSTEMS INVOLVED IN NEGATUVE EMOTIONS LIKE FEAR ARE MORE STRINGLY ASSOCIATED WITH POTENTIAL LOSSES FOUND ESTIMATED LOSS AVERSION NAD ACTIVITY IN VENTRAL STRIATUM TO BE STRONGLY CORREALTED

Answer 12

people prefer the larger sum in the future but the smaller sum when it is immediately available -there is a premium in rewards that are available immediately discount rewards that are available at a delay

Answer 13

gave people choices that varied in delay (days to weeks to month) and value (5 dollars vs 40 dollars) -participants told would get one choice at end of experiemtn

Answer 14

striatum and medial pfc - dopaminergic system

Answer 15

-visual cortex and pre supplementary motor areas -the lateral pfc and ofc are thought to be related to goal direction evaluation -the posterior parietal region - aka dorsal attentional network is involved in accumulating evidence for response

Answer 16

the areas associated with goal direction value were more actuve during difficult decisions than easy decisions

Answer 17

evaluation involves comparison between expected and observed outcomes -outcome evaluation is negative compared to expected utility -expected utility or predicted reward is important - a pair of comfortable socks is better than nothing but it is less than what is hoped for

Answer 18

-recordings n the vta during classical conditioning (which is different from habit learning because a reward is paired with a stimulus not with a repsonse or a stimulus associated response) -learning is about the pairing of a conditional stimulus with a primary reward which is a squirt of juice -learning is also paired with expected value and the vta represents this

Answer 19

this is an unexpected reward the vta responds when the reward is presented

Answer 20

-the reward signal is seen when the cs is presented there is no reward signal to the reward

Answer 21

if the cs is presented and the reward is withheld get response to cs but get a dip in dopaminergic activity when the reward should have occurred

Answer 22

update the representation valuation and action selection processes

Answer 23

differences between expected and observed outcomes -temporal dofference leanring function

Answer 24

reward prediction error (temporal difference) = (actual reward at this time + (discount factor * predicted reward in the future) - predicted reward at this time IF GET NO CS AND GET A REWARD = ACTUAL REWARD IS POSITIVE PREDICTED REWARD AT THIS TIME IS ZERO AND PREDCITED REWARD IN FUTURE IS ZERO SO GET POSITIVE TEMPORAL DIFFERENCE IF SEE JUST THE CS - ACTUAL REWARD IS ZERO PREDICTED AWARD IN THE FUTURE IS POSIITVE AND PREDICTED REWARD AT THIS TIME IS ZERO SO POSITIVE OVERALL IF SEE CS AND REWARD - ACTUAL REWARD IS POSITIVE AND PREDCITED REWARD AT THIS TIME IS POSITIVE NEGATIVE AND PRECITED REWARD IN FUTURE IS POSITIVE IF SEE CS AND NO REWARD - PREDCITED REWARD IN FUTURE IS ZERO AND PREDCITED REWARD AT THIS TIME IS NEGATIVE AND ACTUAL REWRAD AT THIS TIME IS ZERO - GET NEGTATIVE VALUE

Answer 25

DA NEURONS IN THE FRONTAL CORTEX THE MIDBRIAN AND BAAL GANGLIA

Answer 26

MEDIAL PFC AND MIDBRAIN -LOSS AND GAIN FUNCTION -TEMPORAL DISCOUNTING

Answer 27

LATERAL PFC

Answer 28

REWARD LEARNING ID DRIVEN BY PREDICTION ERROR WHCIH IS DRUVEN BY SURPRISE RELATIVE TO WHAT WAS PREDICTED Y THE VALUATION SYSTEM

Answer 29

adolescence (the social capacity to engage in these behaviors also changes so there is a social component in addition to the behavioral)

Answer 30

-facilitates independence -encourages exploration away from caregivers -increased rates of sexual promiscuity -high incidence of addiction -reckless driving

Answer 31

the pfc continues to develop throughout adolesence -this is in contrast to other brain regions which approach their adult state as measured by relative proportion of grey to white matter by late childhood

Answer 32

decision making impulse control - can have go and no go stimulus to measure impulsivity in the lab future planning goal directed behavior appreciation of future outcomes

Answer 33

reward regions are hyper excoyable in adolescents

Answer 34

in laboraory studies adolesents to no differ from adults in -knowledge of the dange of risky behavior -ability to perceive risks -feelings of invulnerability -logical reasoning abilities -psychscoial maturity is declined in adolsecnts compared to adults but intelletucal ability is the same

Answer 35

participants children - 7-11 adolescents - 13-17 adults - 23-29 simple right and left perceptula decision -difrenet pirate accessoires implicityly associated with different amounts pf reward -some of the medial pfc and the NAcc have a u shaped developemntal function in repsonse to reward where it peaks in adolescents - this is in response to feedback about the value of the reward not the response to the cue

Answer 36

the loss of aversion might be weaker because the gain function is so tuned up - adolescents have a steeper gain function than adults -large and medium rewards gave greater response enad small reward is negatuve response -

Answer 37

the desire to continue in a gambling task

Answer 38

the slot machine paradigm includes two different kinds of loss trials -near misses XXY and full misses XYZ -adolescents show a stronger response to reward in the striatum than either adults or children -reward sensitivity n adolescents is often observed under conditions where adults and children show no striatal activity response to near misses are also much stringer in adolecsents

Answer 39

-this is important for learning new skills or adapting to a new environment or contingencies

Answer 40

adult decreases other teen increases the more teens you add

Answer 41

stoplight is 3 seconds and miss stoplight and can get no delay or get into a crash and get a 6 second delay

Answer 42

presence of friend led to riskies decisions and worse outcomes for teens than in young adults and adults -friend was in the control room

Answer 43

the lateral pfc increases over development irrespectuve of peers -adolescents are less likely to activate this region at all during the task compared to young adults or adults

Answer 44

yes for adolecents only the activity was greater in the presence of peers than when alone

Answer 45

teens and adults are equally stressed however better self control in adults whereas stress has a greater effect on teens self control abilities -unde high stress adolescents are less likely too undergo successful inhibition and are more impulsive and the successful inhibition data is 1-impulsivity datar

Answer 46

less engages in teens under stress for lataeral pfc than adults whcih is more engaged under stress

Answer 47

pfc cortex debelopment - teends are more vulnerbale to negatuve influences like peer pressure -limbic circuitry development - heightened capapcity for change

Answer 48

many mental health conditions have onset in adolesence and can be due to imbalance between subcortical and midline regions and the lateral pfc

Answer 49

-both reward driven pavlovian habit learning and goal directed valuation systems

Answer 50

risky decision in making teens -reward systems is u shaped function (peaks in adolescence) and pfc is linear with developemnt with age

Decision Making Flashcards

(75 cards)