L10 - POMDP Flashcards
1
Q
b(s)
A
Belief State - The probability that we are in a given state s
2
Q
belief MDP
A
An MDP whose states are beliefs: (b, a, z) to b'
Starting from belief b, taking action a and receiving observation z produces the updated belief b' (b prime).
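The (b, a, z) to b' step can be sketched in code. This is a minimal illustration, assuming a transition model T[s][a][s2] = P(s2 | s, a) and an observation model O[s2][a][z] = P(z | s2, a); these table layouts, the function name belief_update, and the two-state example are assumptions made for illustration and are not from the lecture.

```python
def belief_update(b, a, z, T, O):
    """Return the new belief b' after taking action a and observing z.

    b: dict mapping state -> probability (the current belief)
    T: T[s][a][s2] = P(s2 | s, a)   (assumed layout)
    O: O[s2][a][z] = P(z | s2, a)   (assumed layout)
    """
    states = list(b)
    unnormalized = {}
    for s2 in states:
        # Predict: probability of landing in s2 given the old belief.
        predicted = sum(T[s][a][s2] * b[s] for s in states)
        # Correct: weight by how likely observation z is in s2.
        unnormalized[s2] = O[s2][a][z] * predicted
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("observation has zero probability under this belief")
    # Normalize so the new belief sums to 1.
    return {s2: p / total for s2, p in unnormalized.items()}


# Hypothetical two-state example: "listen" leaves the state unchanged
# and reports the true state correctly 85% of the time.
T = {
    "left": {"listen": {"left": 1.0, "right": 0.0}},
    "right": {"listen": {"left": 0.0, "right": 1.0}},
}
O = {
    "left": {"listen": {"hear_left": 0.85, "hear_right": 0.15}},
    "right": {"listen": {"hear_left": 0.15, "hear_right": 0.85}},
}
b = {"left": 0.5, "right": 0.5}
b_next = belief_update(b, "listen", "hear_left", T, O)
```

Starting from a uniform belief, hearing "hear_left" shifts most of the probability mass onto "left", and the result is again a distribution over states, which is why beliefs can serve as the states of the belief MDP.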
3
Q
How are rewards structured in a POMDP?
A
Reward is derived as a feature of the observation.