Hyperparameter Optimization Flashcards

1
Q

(True or false) Hyperparameter optimization (HPO) is a black-box optimization problem.

A

True

2
Q

R package that provides different tuning approaches

A

mlr3tuning

3
Q

The relevant parameter set can be constructed / specified using the ___ command.

A

ps()

4
Q

All the details for the tuning procedure need to be combined in a ____ .

A

Tuning instance

5
Q

It specifies the method that is used for optimizing the tuning instance.

A

Tuner

6
Q

It defines the number of (equidistant) values per configurable parameter that are to be tried during the tuning procedure.

A

Resolution

7
Q

To avoid overfitting, tuning itself should be performed during the ___.

A

Training procedure

8
Q

A mixture of learner and tuner

A

Autotuner

9
Q

(True or false) "irace" returns the elite configurations and can be used for tuning.

A

True

10
Q

Finding a good hyperparameter configuration for the problem at hand.

A

Hyperparameter optimization

11
Q

Which are the two main types of baseline optimizers?

A

Grid search and random search

12
Q

Baseline in which a number of values is selected for each parameter and all possible combinations are evaluated.

A

Grid search

13
Q

Baseline in which each parameter is sampled uniformly at random.

A

Random search

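The two baselines from cards 12 and 13 can be sketched in a few lines of Python (a language-agnostic illustration, not the mlr3tuning API; the `objective` function is a hypothetical stand-in for a real validation-error estimate):

```python
import itertools
import random

# Hypothetical objective over two hyperparameters (lower is better);
# stands in for estimating validation error of a trained model.
def objective(lr, depth):
    return (lr - 0.1) ** 2 + (depth - 5) ** 2

# Grid search: pick a few values per parameter, evaluate every combination.
def grid_search(lr_values, depth_values):
    return min(
        ((objective(lr, d), (lr, d))
         for lr, d in itertools.product(lr_values, depth_values)),
        key=lambda t: t[0],
    )

# Random search: sample each parameter uniformly at random from its range.
def random_search(n_trials, rng):
    best = (float("inf"), None)
    for _ in range(n_trials):
        lr = rng.uniform(0.001, 1.0)
        depth = rng.randint(1, 10)
        best = min(best, (objective(lr, depth), (lr, depth)))
    return best

score, config = grid_search([0.01, 0.1, 1.0], [3, 5, 7])
print(score, config)  # the grid happens to contain the optimum (0.1, 5)
```

Note that grid search evaluates exponentially many combinations as the number of parameters grows, while random search spends its full trial budget wherever the user points it.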
14
Q

Evaluating novel configurations (very) different from previous ones.

A

Exploration

15
Q

Two types of stochastic approaches for hyperparameter optimization.

A

Simulated Annealing and CMA-ES

16
Q

Two types of model-based approaches for hyperparameter optimization.

A

iterated F-race (irace) and Hyperband.

17
Q

Trying to improve existing configurations by evaluating similar ones.

A

Exploitation

18
Q

True or false? In the context of Bayesian optimization, the "expected improvement" acquisition function trades off exploration and exploitation of the search space. It is therefore a suitable method in situations where functions are expensive to evaluate, e.g. hyperparameter configurations in large-scale problems.

A

True
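The trade-off can be made concrete with the closed-form expected improvement for minimization (a pure-Python sketch; in practice `mu` and `sigma` would come from a surrogate model such as a Gaussian process):

```python
import math

def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)

def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2)))

def expected_improvement(mu, sigma, best):
    """EI for minimization: the surrogate predicts mean mu and std sigma
    at a candidate configuration; best is the best value observed so far."""
    if sigma == 0.0:
        return max(best - mu, 0.0)
    z = (best - mu) / sigma
    return (best - mu) * norm_cdf(z) + sigma * norm_pdf(z)

# A candidate with a worse mean but high uncertainty (exploration) can
# score about as well as one with a better mean and low uncertainty
# (exploitation).
print(expected_improvement(mu=0.30, sigma=0.20, best=0.25))
print(expected_improvement(mu=0.20, sigma=0.01, best=0.25))
```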

19
Q

Another name for Bayesian Optimization

A

Sequential Model-Based Optimization (SMBO)

20
Q

A fast-to-evaluate model of the performance function, based on already evaluated configurations, with an uncertainty estimate.

A

Surrogate model

21
Q

Optimization that evaluates many configurations with lower budgets and terminates bad configurations early; this speeds up HPO and thus allows better optimization with a limited overall budget.

A

Multifidelity optimization

22
Q

Type of multifidelity optimization that starts all configurations with a certain fraction of the budget, then discards the worse-performing half of the configurations, doubles the budget, and repeats until the final budget is reached.

A

Successive Halving (SH)
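The procedure from card 22 can be sketched as follows (a minimal Python illustration with a hypothetical evaluator whose loss shrinks with budget; real SH would train models on growing data or epoch budgets):

```python
def successive_halving(configs, evaluate, min_budget, max_budget, eta=2):
    """Start all configs at min_budget; each stage keeps the best 1/eta of
    the configurations and multiplies the budget by eta, until max_budget
    is reached or a single configuration remains."""
    budget = min_budget
    active = list(configs)
    while budget <= max_budget and len(active) > 1:
        # Evaluate every active configuration at the current budget.
        scores = sorted((evaluate(c, budget), c) for c in active)
        keep = max(1, len(active) // eta)
        active = [c for _, c in scores[:keep]]  # lower score = better
        budget *= eta
    return active[0]

# Hypothetical evaluator: loss shrinks with budget; config 3 is truly best.
def evaluate(config, budget):
    return abs(config - 3) + 1.0 / budget

best = successive_halving(range(8), evaluate, min_budget=1, max_budget=8)
print(best)  # 3
```

With the default eta=2 this reproduces the card's description: half of the configurations are dropped and the budget is doubled at every stage.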

23
Q

The three most important parameters of successive halving.

A

Budget factor, final budget, and total used budget.

24
Q

This algorithm tries to alleviate the problem of how well the initial training phase of SH captures final performance by introducing multiple brackets.

A

Hyperband

25
Q

(True or false) The smallest allocated budget increases with each consecutive bracket in the Hyperband algorithm.

A

True
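The bracket schedule behind this card can be computed directly from Hyperband's standard formulas (a sketch; `R` is the maximum budget per configuration and `eta` the halving rate):

```python
import math

def hyperband_brackets(R, eta=3):
    """Return (n_configurations, initial_budget) per bracket, from the
    most aggressive bracket (many configs, tiny budget) to the least."""
    # Largest s with eta**s <= R.
    s_max = 0
    while eta ** (s_max + 1) <= R:
        s_max += 1
    brackets = []
    for s in range(s_max, -1, -1):
        n = math.ceil((s_max + 1) * eta ** s / (s + 1))  # configurations
        r = R // eta ** s                                # starting budget
        brackets.append((n, r))
    return brackets

for n, r in hyperband_brackets(R=81, eta=3):
    print(n, r)  # starting budget grows bracket by bracket: 1, 3, 9, 27, 81
```

Each bracket then runs successive halving from its own starting budget, so later brackets hedge against SH's early, low-budget eliminations being misleading.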
26
Q

Another advanced tool for (multifidelity) algorithm configuration which uses statistical tests to identify and stop bad configurations. Its central concept is "racing".

A

irace
27
Q

(True or false) In Successive Halving, the budget allocated to the runs is halved after each iteration.

A

False. After each stage, the budget is multiplied by η and the number of active configurations is divided by η.
28
Q

True or false? Hyperparameter optimization can be understood as a special case of the CASH (Combined Algorithm Selection and Hyperparameter optimization) problem.

A

True