formulas Flashcards
what is enough data for PAC learning?
m >= (1/eps)*(ln(1/delta) + ln|H|)
m = num of samples
eps = average classification error
delta = probability of existence of a wrong hypothesis consistent with all examples. The risk that you are willing to take that just fits the data for the wrong reasons
hoefdings formula:
m >= -ln((delta/2) / (2E^2))
what is the vapnik chervonenkis dimension?
the max number of classes that a learning algo can learn to separate (shatter)
what does bellman equation do?
relate the value function to itself via the problem dynamics
formula of Q-value
Q(s,a) = r(s,a) + gammaV(delta(s,a))
delta: state * action -> state’
what is the expected value of hte duration d in state i?
1/(1-a_ii)