Bayesian optimization Flashcards

Question 1

Q

Name some methods for hyperparameter searching

Answer

A

Grid search
Random search
Manual tuning
Bayesian optimization

Question 2

Q

What is a proxy model

Answer

A

A model that is inexpensive to evaluate and approximates the true model.

Question 3

Q

Why would we use a proxy model?

Answer

A

To guide the search for hyperparameters, by minimizing the cheap proxy model instead of the expensive real model

Question 4

Q

What can we use for a proxy model?

Answer

A

Gaussian proccess

Question 5

Q

At what points do we want to evaluate the real model given the proxy model?

Answer

A

At points where the proxy model’s mean is low (Explotation), and the std. is high ( Exploration)

Question 6

Q

What is an acquisition function?

Answer

A

A function that quantifies how good it is to evaluate new points, trading off exploration vs explotation

Question 7

Q

Name som acquisition functions

Answer

A

Probability of improvement
Expectation of improvement
GP Lower confidence bound

Question 8

Q

What is the problem with the probability of improvement acquisition function?

Answer

A

It doesn’t focus on how much we improve, meaning the reward might be very small, and we might get stuck “exploring” very close to the current best.

Question 9

Q

How can we solve the problem of getting “stuck” he probability of improvement acquisition function?

Answer

A

Introduce a slack variable, so

a(x) = p(g(x) < g(x_best) - slack))

Question 10

Q

What are the limitations of bayesian optimization using GP?

Answer

A

Getting the function model (covariance function…) wrong might give bad results
Limited by the number of dimensions and the number of evaluations of the true function

Question 11

Q

Which covariance kernels should usually be tried first?

Answer

A

A sufficiently flexible one like Matern. (Not Gaussian).

Question 12

Q

Maximising the marginal likelihood might fail for hyperparameter optimization, especially in the early stages where we have few datapoints, what can we do instead?

Answer

A

Integrate out hyperparameters using Markov Chain Monte Carlo.

Question 13

Q

What is the problem with scaling Bayesian optimization to high dimensions?

Answer

A

Optimizing the aquisition function is hard for high dimensions and might require many evaluations of the true model to reach a good minima.

Bayesian optimization Flashcards

(13 cards)