Week 5 Flashcards
MAP (abbreviation)
Maximum A Posteriori method
What does the MAP method output?
It finds only a single value of a parameter that maximizes the posterior, so it forgets the uncertainty in the posterior.
What is the Laplace approximation?
When you don’t have a formula for the posterior, the Laplace approximation finds a different distribution that tries to approximate the posterior.
In what ways should the approximation agree with the posterior in Laplace approximation?
The maximum likelihood, w^ needs to be the same, so this fixes the choice of the mean for the approximation. Plus the area around w^ needs to behave the same in both functions.
What property does the multivariate Gaussian of a Laplace approximation need to have in order to satisfy agreement of the area around w^?
It needs to have the same Hessian of w^ as the actual posterior.
What two things of the posterior do you need in order to construct the laplace approximation for a particular posterior?
The point of maximization and the 2nd derivatives.
T/F:
The gamma distribution is a continuous distribution
True
Over what is the density of the gamma distribution defined?
It is continuous, so on positive real numbers.
Over what is a Gaussian distribution defined?
Over positive and negative reals.
Over what is the beta distribution defined?
between 0 and 1.
In Bayesian statistics, what is the distribution called that you need to have when you want to compute the probability that a specific new datapoint has a certain value?
The Bayesian predictive distribution.
What happens in Monte Carlo sampling?
Draw many random independent samples of w, compute P(t.new|w,…) for each w and take the average of these.
How can you write the expected value of a discrete random variable?
as a summation of all possible values with their probabilities.
What is the output of the Metropolis-Hastings algorithm?
a sequence of random samples, w1, …, wNs, with Ns= the number that we determine beforehand.
What requirement of samples do you drop when using the MH algorithm?
The requirement that samples are independent.