Dynamic optimization Flashcards
1
Q
Bellman’s equation
A
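V_t(x) = \max_{\pi_t \in A(x)} \{ R_t(x, \pi_t) + V_{t+1}(R(x, \pi_t)) \}
Here A(x) is the set of admissible actions in state x, R_t(x, \pi_t) is the reward at stage t, and the argument of V_{t+1} is the successor state reached from x under action \pi_t.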
Replace max with min when the objective is a cost to be minimized rather than a reward to be maximized.
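The recursion can be solved backward in time when the horizon and the state set are finite. The following is a minimal Python sketch under those assumptions, not a definitive implementation. The names `backward_induction`, `reward`, `transition`, `actions`, and `terminal_value` are hypothetical and do not come from the card. The toy problem (a stock of 0 to 2 units, consumed over two stages) is likewise invented for illustration.

```python
from typing import Callable, Dict, Hashable, Iterable, List, Tuple

State = Hashable
Action = Hashable


def backward_induction(
    states: Iterable[State],
    actions: Callable[[State], Iterable[Action]],      # A(x), assumed non-empty
    reward: Callable[[int, State, Action], float],     # R_t(x, pi_t)
    transition: Callable[[State, Action], State],      # successor state of (x, pi_t)
    horizon: int,
    terminal_value: Callable[[State], float] = lambda x: 0.0,
    maximize: bool = True,
) -> Tuple[List[Dict[State, float]], List[Dict[State, Action]]]:
    """Compute V_t(x) = opt_a { reward(t, x, a) + V_{t+1}(transition(x, a)) }.

    Assumes every successor state is itself a member of `states`.
    """
    pick = max if maximize else min
    states = list(states)
    # V[t][x] is the value function; V[horizon] holds the terminal values.
    V: List[Dict[State, float]] = [dict() for _ in range(horizon + 1)]
    policy: List[Dict[State, Action]] = [dict() for _ in range(horizon)]
    for x in states:
        V[horizon][x] = terminal_value(x)
    # Step backward from the last decision stage to the first.
    for t in range(horizon - 1, -1, -1):
        for x in states:
            candidates = [
                (reward(t, x, a) + V[t + 1][transition(x, a)], a)
                for a in actions(x)
            ]
            best_value, best_action = pick(candidates, key=lambda c: c[0])
            V[t][x] = best_value
            policy[t][x] = best_action
    return V, policy


# Toy problem (illustrative assumption): consuming a units yields PAYOFF[a].
PAYOFF = [0.0, 3.0, 4.0]

V, policy = backward_induction(
    states=range(3),
    actions=lambda x: range(x + 1),        # consume between 0 and x units
    reward=lambda t, x, a: PAYOFF[a],
    transition=lambda x, a: x - a,         # remaining stock
    horizon=2,
)
print(V[0][2], policy[0][2])
```

Passing `maximize=False` switches the same recursion to the min form. The sketch enumerates every state at every stage, so it is only practical for small, discrete state sets.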