Probabilistic algorithms Flashcards
approximation algorithm
an algorithm for an optimization problem that runs in polynomial time and returns a solution with a provable guarantee on its distance from the optimum
randomized algorithm
an algorithm that consults random numbers during execution, so its behavior (and possibly its output) can differ between runs on the same input; see Probabilistic Algorithm
stochastic
randomly determined; having a random probability distribution or pattern that may be analyzed statistically but may not be predicted precisely
Probability density function
a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given point in the sample space (the set of possible values taken by the random variable) can be interpreted as providing a relative likelihood that the value of the random variable would equal that sample
pivot
the element chosen to partition the array in quicksort: elements smaller than the pivot go to one side, larger ones to the other; RandQSORT chooses it uniformly at random
Probabilistic Algorithm paradigm
for each input make a random choice and hope that it is good.
Non-Deterministic execution of algorithm
for the same input, different sequences of operations are possible
vs. deterministic execution: for the same input, the same sequence of operations is always executed
Probabilistic Algorithm
A Probabilistic (or Randomized) Algorithm is an algorithm which employs a degree of randomness as part of its logic
-> during execution, it makes random choices based on generated random numbers
-> the behavior (output) can vary if the algorithm is run multiple times on the same input
=> probabilistic algorithms are often simpler and faster than their deterministic counterparts
=> however, in the worst execution case, the algorithm may be very slow
positive vs negative probability
??
probabilistic algorithm
+ gain in execution time
- possible loss of correctness of the answer
Las Vegas algorithms
a randomized algorithm that
- always gives correct/optimal results
- the execution time is variable
- example: RandQSORT
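The RandQSORT example can be sketched in a few lines of Python (a minimal illustration, not the lecture's exact pseudocode; the function name is mine):

```python
import random

def rand_qsort(a):
    """Las Vegas randomized quicksort: the output is always correctly
    sorted; only the running time depends on the random pivot choices."""
    if len(a) <= 1:
        return list(a)
    pivot = random.choice(a)                      # random pivot choice
    return (rand_qsort([x for x in a if x < pivot])
            + [x for x in a if x == pivot]
            + rand_qsort([x for x in a if x > pivot]))
```

Whatever pivots are drawn, the result is identical; a run of unlucky pivots only makes it slower (worst case O(n^2), expected O(n log n)).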
Monte Carlo algorithms
a randomized algorithm that
- the running time is deterministic
- the output may be incorrect with a certain probability
- example: Check matrices equality
- for decision problems (Yes/No): one- or two-sided error
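The "check matrices equality" example is the classic Freivalds' algorithm; a minimal Python sketch (function and parameter names are mine):

```python
import random

def freivalds(A, B, C, trials=10):
    """Monte Carlo equality check A*B == C in O(n^2) time per trial.
    One-sided error: a "No" answer is always correct; a "Yes" answer
    is wrong with probability at most 2**-trials."""
    n = len(A)
    for _ in range(trials):
        r = [random.randint(0, 1) for _ in range(n)]   # random 0/1 vector
        Br = [sum(B[i][j] * r[j] for j in range(n)) for i in range(n)]
        ABr = [sum(A[i][j] * Br[j] for j in range(n)) for i in range(n)]
        Cr = [sum(C[i][j] * r[j] for j in range(n)) for i in range(n)]
        if ABr != Cr:
            return False      # witness found: definitely A*B != C
    return True               # probably equal
```

Note the deterministic running time (fixed number of trials) with a possibly incorrect "Yes", exactly the Monte Carlo pattern.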
one-sided error
Monte Carlo algorithm in which one of the two answers (Yes or No) is always correct; errors can occur only on the other side
two-sided error
Monte Carlo algorithm: a non-zero probability to err for both outputs
E[X]
Expected value of a discrete random variable X: E[X] = Σ_x x · P(X = x)
The expected value is the probability-weighted average of the possible values of X; it is often also written as E(X)
sl. 15 lecture_01
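The discrete E[X] formula as a one-line helper (an illustrative sketch; the function name is mine):

```python
def expected_value(dist):
    """E[X] = sum of x * P(X = x) over a discrete distribution, given
    as a list of (value, probability) pairs."""
    return sum(x * p for x, p in dist)

# Example: a fair six-sided die has E[X] = (1 + 2 + ... + 6) / 6 = 3.5
die = [(k, 1 / 6) for k in range(1, 7)]
```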
Random number generation
- the kernel of Monte Carlo simulation
- the heart of many standard statistical methods (bootstrap, Bayesian analysis)
- cryptography
bisection method
an interval-halving method: keep an interval that is guaranteed to contain the point sought (e.g. a sign change of the function) and repeatedly halve it, keeping the half that still contains it
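A minimal sketch of bisection in its root-finding form (the function name and tolerance are mine; in optimization the same idea is applied to the derivative of the loss):

```python
def bisect(f, a, b, tol=1e-8):
    """Bisection: f(a) and f(b) must have opposite signs, so a root is
    bracketed; the interval is halved until it is shorter than tol."""
    fa = f(a)
    while b - a > tol:
        m = (a + b) / 2
        fm = f(m)
        if fa * fm <= 0:
            b = m                 # the root lies in [a, m]
        else:
            a, fa = m, fm         # the root lies in [m, b]
    return (a + b) / 2
```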
successive quadratic estimation method
a line-search method that fits a parabola through three trial points and takes the minimum of that parabola as the next estimate of the minimizer
Powell’s algorithm
a derivative-free minimization method that performs successive line searches along a maintained set of directions, updating the direction set as the search proceeds
inflection point
An inflection point is a point on a curve at which the sign of the curvature (i.e., the concavity) changes. Inflection points may be stationary points, but are not local maxima or local minima.
http://mathworld.wolfram.com/InflectionPoint.html
first order condition
The first order condition means that the local maximum or local minimum of some continuous function within some range (not including the end points) must occur where the derivative of that function is equal to 0.
-> at the highest and lowest points of a curve, the tangent to the curve at such points is horizontal
-> the slope of the curve is zero
http://users.etown.edu/p/pauls/ec309/lectures/lec04_unconst.html
second order condition
SOC for a local max is that f''(x) < 0
SOC for a local min is that f''(x) > 0
loss function
a function that maps an event or values of one or more variables onto a real number intuitively representing some “cost” associated with the event.
An optimization problem seeks to minimize a loss function
Hessian matrix?
useful for characterizing the shape of L and for distinguishing, among the zeros of the gradient g(d), the minimum points, the maximum points and the inflection points
why an inverse matrix?
we can multiply by an Inverse, which achieves the same thing as “dividing by a number”
remember: with Matrices the order of multiplication matters. AB is almost never equal to BA.
https://www.mathsisfun.com/algebra/matrix-inverse.html
Jacobian matrix
is the matrix of all first-order partial derivatives of a vector-valued function. When the matrix is square, both the matrix and its determinant are referred to as the Jacobian in the literature
The Jacobian matrix is important because if the function f is differentiable at a point x (this is a slightly stronger condition than merely requiring that all partial derivatives exist there), then the Jacobian matrix defines a linear map ℝ^n → ℝ^m, which is the best (pointwise) linear approximation of the function f near the point x. This linear map is thus the generalization of the usual notion of derivative, and is called the derivative or the differential of f at x.
Differenzierbarkeit / differentiable
In mathematics, differentiability is the property of a function of being locally approximable around a point, in a unique way, by a linear map.
TSP
find a Hamiltonian cycle with minimal path length in G
TSP extension
- Asymmetric TSP
- optimum configuration for the ATSP
- time dependent TSP
- Vehicle Routing Problem
- probabilistic VRP
Branch & Bound
an algorithm design paradigm for discrete and combinatorial optimization problems, as well as mathematical optimization. A branch-and-bound algorithm consists of a systematic enumeration of candidate solutions by means of state space search: the set of candidate solutions is thought of as forming a rooted tree with the full set at the root. The algorithm explores branches of this tree, which represent subsets of the solution set. Before enumerating the candidate solutions of a branch, the branch is checked against upper and lower estimated bounds on the optimal solution, and is discarded if it cannot produce a better solution than the best one found so far by the algorithm.
Branch & Bound TSP
in B & B we try to reduce the search space by pruning branches whose bound is worse than the best solution found so far
linear programming
LP, also called linear optimization
a method to achieve the best outcome (e.g. maximum profit or lowest cost) of a mathematical model whose requirements are represented by linear constraints
convex set
a convex set is a subset of an affine space that is closed under convex combinations.
in a Euclidean space, a convex region is a region where, for every pair of points within the region, every point on the straight line segment that joins the pair of points is also within the region
Convex polytope
a special case of a polytope, having the additional property that it is also a convex set of points in the n-dimensional space R^n
Steepest Descent algorithm
an iterative first-order method: from the current point, step in the direction of the negative gradient (the direction of steepest descent), d_{k+1} = d_k - a_k * g(d_k)
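The update rule can be sketched for the noise-free case (a minimal illustration with a constant gain; names are mine):

```python
def steepest_descent(grad, d0, a=0.1, iters=100):
    """Minimize L by stepping against its (noise-free) gradient g:
    d_{k+1} = d_k - a * g(d_k), with a constant gain a."""
    d = list(d0)
    for _ in range(iters):
        g = grad(d)
        d = [di - a * gi for di, gi in zip(d, g)]
    return d

# Example: L(d) = d1^2 + d2^2 has g(d) = (2*d1, 2*d2) and minimum (0, 0)
```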
Newton-Raphson
an iterative second-order method that uses gradient and Hessian (in one dimension: first and second derivative): d_{k+1} = d_k - H(d_k)^(-1) * g(d_k); converges very fast near the optimum
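The one-dimensional Newton step, as a sketch (names are mine; g and h are the first and second derivative of the loss):

```python
def newton_raphson(g, h, d0, iters=20):
    """One-dimensional Newton step for optimization:
    d_{k+1} = d_k - g(d_k) / h(d_k)."""
    d = d0
    for _ in range(iters):
        d = d - g(d) / h(d)
    return d

# Example: L(d) = (d - 3)^2, g(d) = 2*(d - 3), h(d) = 2 -> minimum at d = 3,
# reached in a single step because L is quadratic.
```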
Localized Random Search
a direct random search that samples a new candidate near the current point (current point plus a random perturbation) and moves there only if the loss decreases
Direct Random Search
gradient-free search methods that use only (possibly noisy) measurements of the loss function L, e.g. blind random search and localized random search
convex
convex describes a surface that curves outward, or is thicker in the middle than on the edges.
https://writingexplained.org/concave-vs-convex-difference
concave
Concave is an adjective that describes a surface that curves inward, or is thinner in the middle than on the edges.
hypercube
a hypercube is an n-dimensional analogue of a square (n = 2) and a cube (n = 3)
multivariate
involving two or more variable quantities.
multivariate normal distribution
is a generalization of the one-dimensional (univariate) normal distribution to higher dimensions
One definition is that a random vector is said to be k-variate normally distributed if every linear combination of its k components has a univariate normal distribution
Enhanced Localized Random Search
localized random search extended with a bias term: directions of recently successful perturbations are reused to bias subsequent perturbations
Simple Random (“Blind”) Search
sample candidate solutions from a fixed distribution (e.g. uniformly) over the whole search domain, independently of all previous samples, and keep the best candidate found
Direct methods for stochastic search
e.g. simple random (“blind”) search, localized random search, enhanced localized random search, pattern search
Pattern search
evaluate a fixed pattern of trial points around the current point (e.g. along the coordinate directions); move to an improving trial point, and shrink the pattern’s step size when no trial point improves
Random search with noise-free measurements
random search under the assumption that L can be measured exactly: a candidate replaces the current point iff its measured loss is truly smaller, so the best-so-far loss never increases
slack variable
a slack variable is a variable that is added to an inequality constraint to transform it into an equality
Introducing a slack variable replaces an inequality constraint with an equality constraint and a non-negativity constraint
By introducing the slack variable y >= 0, the inequality Ax <= b can be converted to the equation Ax + y = b
n-simplex
is a generalization of the notion of a triangle to arbitrary dimensions
Basic Pattern Search Algorithm
move through the search space by evaluating a fixed pattern of probe points around the current iterate; halve the step size whenever no probe point improves the loss
Compass search
a pattern search whose probe points lie along the 2n coordinate directions ±e_i from the current point
Nonlinear Simplex (Nelder-Mead) Algorithm
- derivative-free optimization algorithm -> useful when derivatives are unavailable or expensive to compute
- recommended for solving optimization problems with noisy objective functions
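The reflection/expansion/contraction/shrink steps can be sketched for a 2-D objective (a compact illustration with the conventional coefficient values; all names are mine):

```python
def nelder_mead(f, simplex, iters=200):
    """Minimal 2-D Nelder-Mead sketch (reflection 1.0, expansion 2.0,
    contraction 0.5, shrink 0.5); `simplex` is a list of 3 start points."""
    alpha, gamma, rho, sigma = 1.0, 2.0, 0.5, 0.5
    pts = [list(p) for p in simplex]
    for _ in range(iters):
        pts.sort(key=f)                         # best first, worst last
        best, worst = pts[0], pts[-1]
        # centroid of all vertices except the worst
        c = [sum(p[i] for p in pts[:-1]) / (len(pts) - 1) for i in range(2)]
        refl = [c[i] + alpha * (c[i] - worst[i]) for i in range(2)]
        if f(refl) < f(best):                   # try to expand further
            expd = [c[i] + gamma * (refl[i] - c[i]) for i in range(2)]
            pts[-1] = expd if f(expd) < f(refl) else refl
        elif f(refl) < f(pts[-2]):              # plain reflection helps
            pts[-1] = refl
        else:                                   # contract toward the worst
            contr = [c[i] + rho * (worst[i] - c[i]) for i in range(2)]
            if f(contr) < f(worst):
                pts[-1] = contr
            else:                               # shrink toward the best
                pts = [best] + [[best[i] + sigma * (p[i] - best[i])
                                 for i in range(2)] for p in pts[1:]]
    pts.sort(key=f)
    return pts[0]
```

Only values of f are used, never derivatives, which is why the method also tolerates noisy objectives.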
convex hull
a polygon which is the smallest perimeter fence enclosing a set of N points
Constraints
limitation of the allowed values of the degrees of freedom of the system
e.g. boundaries (localisation constraints)
e.g. deadlines for event occurrence (temporal constraints)
hard & soft constraints
penalty function
a solution which violates a constraint is not rejected, but “punished”
heuristics
is any approach to problem solving, learning, or discovery that employs a practical method not guaranteed to be optimal or perfect, but sufficient for the immediate goals.
Heuristic optimization algorithms
Two natural ways to design a heuristic algorithm:
- Construction heuristics
- Markovian improvement heuristics
configuration
one complete assignment of the decision variables, i.e. a single point of the search space (for TSP: one complete tour)
construction heuristics vs. improvement heuristics
construction heuristics build a solution from scratch, element by element; improvement heuristics start from a complete solution and repeatedly modify it with local moves to decrease the loss
Insertion heuristic
the new node is inserted somewhere inside the sequence (not appended!)
- How to choose the new node?
- Where is the optimum place to insert it?
Construction heuristics for TSP
e.g. Nearest Neighbor Heuristic
Best insertion heuristic
insert the next (e.g. randomly chosen) node between those two consecutive nodes of the current tour where the increase in total length is minimal; repeat until all nodes are inserted
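A sketch of best insertion on Euclidean coordinates (names are mine; the node order is randomized as in the card above):

```python
import math
import random

def best_insertion_tour(coords):
    """Best-insertion construction heuristic: take the nodes in random
    order and insert each one at the position in the current cycle
    where the increase in total tour length is minimal."""
    def d(i, j):
        return math.dist(coords[i], coords[j])
    order = list(range(len(coords)))
    random.shuffle(order)
    tour = order[:2]                          # start from a 2-node cycle
    for node in order[2:]:
        # length increase when inserting between tour[k] and its successor
        def cost(k):
            succ = tour[(k + 1) % len(tour)]
            return d(tour[k], node) + d(node, succ) - d(tour[k], succ)
        best_k = min(range(len(tour)), key=cost)
        tour.insert(best_k + 1, node)
    return tour
```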
Nearest Neighbor Heuristic
choose a random start node; then repeatedly choose the unvisited node closest to the last added node and append it, until all nodes are visited (the new node is appended!)
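The nearest-neighbor rule as a sketch (names are mine; for simplicity the start node is fixed at 0 instead of random):

```python
import math

def nearest_neighbor_tour(coords):
    """Nearest Neighbor construction heuristic: start at node 0 and
    repeatedly append the closest not-yet-visited node."""
    unvisited = set(range(1, len(coords)))
    tour = [0]
    while unvisited:
        last = tour[-1]
        nxt = min(unvisited, key=lambda j: math.dist(coords[last], coords[j]))
        unvisited.remove(nxt)
        tour.append(nxt)                 # the new node is appended
    return tour                          # the cycle closes back at node 0
```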
Best-Best insertion / cheapest insertion heuristic
among all nodes not yet in the tour, insert the one whose cheapest insertion position causes the smallest increase in tour length
Worst-best insertion / farthest insertion algorithm
among all nodes not yet in the tour, pick the one whose cheapest insertion cost is largest (the "farthest" node) and insert it at its best position
nearest insertion heuristic
insert the node that is closest to some node already in the tour, at its best position
Shortest edge heuristic
sort all edges by length and repeatedly add the shortest edge that gives no node degree three and closes no premature subcycle
Ruin & Recreate
ruin: destroy a part of the current solution (e.g. remove a subset of its elements); recreate: rebuild the destroyed part with a construction heuristic, keeping the result if it improves
Ruin & Recreate for TSP
remove a set of nodes from the tour (e.g. a random subset, or all nodes within some radius) and reinsert them, e.g. with best insertion
energy landscape
The values of the loss function over the search space form the energy landscape of the system
improvement heuristics
start from a complete configuration and repeatedly apply local moves that (on the whole) decrease the loss
Markov improvement heuristic
an improvement heuristic in which the next configuration depends only on the current one (memoryless): propose a random move from the current configuration and accept or reject it based on the resulting change of the loss
local search approach
explore the neighborhood of the current configuration (all configurations reachable by one move) and step to a neighbor; iterate until no improving neighbor exists (a local minimum)
Swap move algorithm TSP
pick two nodes of the tour at random and exchange their positions in the sequence
Translation move algorithm
remove a node from the sequence and reinsert it at another position
Inversion move algorithm
reverse the order of the nodes between two randomly chosen positions of the sequence (the classical 2-opt move)
Improvement heuristics for TSP
Swap move algorithm TSP
Translation move algorithm
Inversion move algorithm
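The inversion move as an improvement heuristic (a 2-opt sketch; names are mine, and for brevity all improving inversions are tried exhaustively instead of sampled at random):

```python
import math

def tour_length(tour, coords):
    """Total length of the closed tour over the given coordinates."""
    return sum(math.dist(coords[tour[i]], coords[tour[(i + 1) % len(tour)]])
               for i in range(len(tour)))

def two_opt(tour, coords):
    """Improvement heuristic built on the inversion move (2-opt):
    reverse a segment of the tour whenever that shortens the total
    length, until no improving inversion remains (a local minimum)."""
    tour = list(tour)
    improved = True
    while improved:
        improved = False
        for i in range(1, len(tour) - 1):
            for j in range(i + 1, len(tour)):
                cand = tour[:i] + tour[i:j][::-1] + tour[j:]
                if tour_length(cand, coords) + 1e-12 < tour_length(tour, coords):
                    tour, improved = cand, True
    return tour
```

Reversing a segment removes two edges and replaces them by two others, which is exactly how crossing edges get untangled.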
Greedy algorithm for TSP
an improvement heuristic that accepts a proposed move only if it does not increase the tour length; it stops in the first local minimum reached
Simulated Annealing
an improvement heuristic that also accepts worsening moves, with probability exp(-delta/T); the temperature T is gradually lowered (annealing schedule), so the search can escape local minima early on and freezes later
Threshold Accepting (TA) is considered a “first approximation” of SA
much faster than SA (no need to generate a random number uniformly on (0,1) for each acceptance test)
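The SA acceptance rule and cooling schedule, sketched for a generic neighbor function (names, the geometric cooling and the constants are mine):

```python
import math
import random

def simulated_annealing(loss, neighbor, d0, T0=1.0, cooling=0.99, iters=2000):
    """Simulated Annealing sketch: a worsening move (delta > 0) is
    accepted with probability exp(-delta / T); the temperature T is
    lowered geometrically, so the search gradually freezes."""
    d = best = d0
    T = T0
    for _ in range(iters):
        cand = neighbor(d)
        delta = loss(cand) - loss(d)
        if delta <= 0 or random.random() < math.exp(-delta / T):
            d = cand                     # move (possibly uphill)
        if loss(d) < loss(best):
            best = d                     # remember the best-so-far
        T *= cooling
    return best
```

For TSP, `neighbor` would apply one of the swap/translation/inversion moves above.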
i.i.d.
independent and identically distributed
a sequence or other collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent
Noise: independent, identically distributed (i.i.d.) measurement errors
Steepest Descent with Noisy/Noise-Free Gradient
_09
Random Search Algorithms with Noisy Loss Function
the basic implementation of random search assumes perfect (noise-free) values of L
Statistical and Engineering Convergence Conditions
_09
Stochastic Approximation
_09 ???
stochastic gradient
_09 sl. 31
Stochastic Gradient Measurement and Algorithm (SGA)
_09
Gradient-Free Algorithms
_09
Finite-Difference Algorithm
_09
FDSA algorithm
_09
gain value?
_09
Simultaneous Perturbation Stochastic Approximation Algorithm (SPSA)
_09
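The core SPSA step can be sketched as follows (a simplified illustration with constant gains instead of the decaying gain sequences a_k, c_k; all names are mine):

```python
import random

def spsa(loss, d0, a=0.1, c=0.1, iters=500):
    """SPSA sketch: all components of d are perturbed SIMULTANEOUSLY by
    a random Bernoulli ±1 vector Delta, and the whole gradient is
    estimated from only two loss measurements:
        g_hat[i] = (L(d + c*Delta) - L(d - c*Delta)) / (2 * c * Delta[i])"""
    d = list(d0)
    n = len(d)
    for _ in range(iters):
        delta = [random.choice([-1.0, 1.0]) for _ in range(n)]
        plus = [di + c * Di for di, Di in zip(d, delta)]
        minus = [di - c * Di for di, Di in zip(d, delta)]
        diff = loss(plus) - loss(minus)              # two measurements only
        g_hat = [diff / (2 * c * Di) for Di in delta]
        d = [di - a * gi for di, gi in zip(d, g_hat)]
    return d
```

Unlike a finite-difference scheme (2n measurements per step), the cost per iteration is independent of the dimension n.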
gain sequence
the sequence of gain values a_k used in the update of d̂_{k+1}, as the factor multiplying Y_k(d̂_k)
d*
d* ∈ D* -> a global optimal solution
D* is the solution set, the set of all (globally optimal) solutions of the optimization problem
d̂ ∈ D (problem space) is a solution candidate
The loss function L()
the function L : D -> R to be minimized; it assigns a real-valued cost to each solution candidate d
The gradient g(d)
The gradient g(d) of L(d) is the vector of first order derivatives
SP
Simultaneous Perturbation