Chapter 19 How to Implement Bayesian Optimization Flashcards
What’s Bayesian Optimization?
P 179
Bayesian Optimization is an approach that uses Bayes Theorem to direct the search in order to find the minimum or maximum of an objective function.
What’s an objective function?
External, P 178
The function that it is desired to maximize or minimize.
P 178: Objective Function. Function that takes a sample and returns a cost.
What’s global function optimization?
P 178
Global function optimization, or function optimization for short, involves finding the minimum or maximum of an objective function.
What are the definitions of Samples, Search space and cost?
P 178
Samples. One example from the domain, represented as a vector.
Search Space: Extent of the domain from which samples can be drawn.
Cost. Numeric score for a sample calculated via the objective function.
The objective function is often easy to specify but can be computationally challenging to calculate or result in a noisy calculation of cost over time. True/False
P 178
True
The form (shape) of the objective function is unknown and is often highly nonlinear, and highly multi-dimensional defined by the number of input variables. True/False
P 178
True
What does the below statement mean?
“The objective function is also probably non-convex.”
P 178
This means that local extrema may or may not be the global extrema (e.g. could be misleading and result in premature convergence), hence the name of the task as global rather than local optimization.
Nonconvex functions arethose functions that have many minimum points. Local and global minimum points.
Little is known about the objective function, and as such, it is often referred to as a____ function and the search process as ____. Further, the objective function is sometimes called an ____ given the ability to only give answers.
P 178
black box, black box optimization, oracle
Is the object function easy to specifiy?
P 178
The objective function is often easy to specify but can be computationally challenging to calculate or result in a noisy calculation of cost over time.
Define:
Algorithm Training.
Algorithm Tuning.
Predictive Modeling.
P 178
Optimization of model parameters.
Optimization of model hyperparameters.
Optimization of data, data preparation, and algorithm selection.
A directed approach to global optimization that uses probability (Bayes Theorem) is called ____.
P 178
Bayesian Optimization
For what kind of objective function is Bayes optimization most useful?
P 179
It is an approach that is most useful for objective functions that are complex, noisy, and/or expensive to evaluate.
Bayesian optimization is a powerful strategy for finding the extrema of objective functions that are expensive to evaluate. It is particularly useful when these evaluations are costly, when one does not have access to derivatives, or when the problem at hand is non-convex.
Given the Bayes optimization formula below, why is the posterior called a surrogate function for the objective function? P 179
P(f|D) = P(D|f) × P(f)
The posterior represents everything we know about the objective function. It is an approximation of the objective function and can be used to estimate the cost of different candidate samples that we may want to evaluate. In this way, the posterior probability is a surrogate objective function.
An important hyper-parameter in the GP (Gaussian Process) model is the
____.
P 184
from sklearn.gaussian_process import GaussianProcessRegressor
kernel
Kernel methods use kernels (or basis functions) to map the input data into a different space. After this mapping, simple models can be trained on the new feature space, instead of the input space, which can result in an increase in the performance of the models.
What is the acquisition function?
P 180
Acquisition Function: Technique by which the posterior is used to select the next sample from the search space.
Web: Bayesian optimization is a sample-efficient approach to global optimization that relies on theoretically motivated value heuristics (acquisition functions) to guide its search process.