4.1 Value Equation Flashcards

1
Q

Temporal motivation theory

A

Integrates four theories: picoeconomics, expectancy theory, cumulative prospect theory, need theory

A (mathematical) model of motivation and decision making.
Temporal means time is important
Balances between precision and parsimony (simplicity)

V (value) = (R * pS)/(D * Imp + 1)
R = reward (things we like)
pS = probability of success (how likely we are to get reward
D = delay (time until we get reward)
Imp = impulsiveness (importance of delay)

We make choices based on what has the most value to us, but the value is affected by each parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Reward (R)

A

Can be positive (benefit) or negative (cost)
The value is relative to other values
Also subjective from person to person

Right now, V = R
ex grad = 1000+, studying = 10-

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Probability of Success (pS)

A

It should matter how certain or uncertain we believe the reward to be!

From 0.0 to 1.0, as it’s a probability
Also subjective from person to person

Right now, V = R*pS
A reward we know for sure the we won’t get has a value of 0
A reward we know for sure we will get has full R value
When R is negative, low pS is GOOD

THINK
What is the range of V for : R > 0?
What is the range of V for : R < 0?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Delay (D)

A

It should matter how close or distant in time we believe the reward to be

Time: 0 or more
Subjective
More delay -> more temporal discounting (+ve reward decreases and -ve reward increases as the delay increases)
Value converges to 0 as delay approaches infinity

Right now, V = (R*pS)/D + 1
We add the +1 to remove divide by 0 error

THINK
What is the range of V for : R > 0?
What is the range of V for : R < 0?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Impulsiveness (Imp)

A

The weight we give to delays (our relation to time and reward)
How much temporal discounting is actually happening in your mind
More or less fixed for a person or animal
More impulsiveness = more temporal discounting

Now we get the original equation

THINK
What is the range of V for : R > 0?
What is the range of V for : R < 0?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

+1

A

To not divide by 0

We get the full reward when D reaches 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Value curves

A
Review curves
\+ve rewards
R = +100
pS = 1
Delay from large to small as that's how time moves forward

-ve rewards
R = -100
pS = 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Making a Choice

A

-3 tiered system when making choices and actions
-A choice has multiple actions
Each action can have multiple rewards (for project example: turning it in, graduating, having to do work)
-Each reward has a set of parameters and a value

The total value of an action is adding all the rewards together

The action chosen for the choice is the one with the highest total value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly