Mueller 21/22 Flashcards

1
Q

Give a brief outline of the SIMP method

A

Introduces a material density design variable for each element, ranging from 0 to 1, where 0 is void (no material) and 1 is solid. Calculates the derivative of the objective function (e.g. weight) w.r.t. each element's density, then reduces the density of the elements with the lowest derivative.
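A minimal sketch of one such density update, assuming the per-element sensitivities dF/d(rho) are already available (the random values below are placeholders, purely for illustration):

```python
import numpy as np

rho = np.ones(10)                 # element densities: 1 = solid, ~0 = void
dF_drho = np.random.rand(10)      # placeholder sensitivities dF/d(rho)

# Remove material where the objective is least sensitive to density
lowest = np.argsort(dF_drho)[:3]  # elements with the smallest derivatives
rho[lowest] -= 0.1
rho = np.clip(rho, 1e-3, 1.0)     # keep densities in (0, 1]
print(rho)
```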

2
Q

What is an Adjoint Solution?

A

One which directly expresses the sensitivities of a single cost function w.r.t. many design variables. (Think of the car which showed where to push in/push out for better aerodynamic performance.)

3
Q

State the Optimality conditions for optimization

A
If F'(x*) = 0 (stationary point)
and
F''(x*) > 0 (positive curvature),

then x* is a local minimum, i.e. F(x) > F(x*) for all x near x*.
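For a quick check (an illustrative example, not from the card): F(x) = (x − 2)² has F'(x) = 2(x − 2) = 0 at x* = 2 and F''(x*) = 2 > 0, so x* = 2 is a minimum.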

4
Q

State the steps for the Bisection Method

A

For an interval [a, b] which brackets the minimum:

  • Evaluate F on either side of the midpoint xm = (a + b)/2
  • If the value to the left of xm is lower, the min lies in the left half: set b = xm; otherwise set a = xm
  • Repeat until the bracket width (b − a) falls below the desired tolerance eta (see the sketch below)
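A minimal gradient-free sketch of these steps, assuming F is unimodal on [a, b] (the probe offset and test function are illustrative choices, not from the card):

```python
def bisection_min(F, a, b, eta=1e-6):
    """Halve a bracket [a, b] containing the min of a unimodal F."""
    while b - a > eta:
        m = 0.5 * (a + b)
        d = 1e-3 * (b - a)        # small probe offset around the midpoint
        if F(m - d) < F(m + d):
            b = m + d             # min lies in the left half
        else:
            a = m - d             # min lies in the right half
    return 0.5 * (a + b)

# Example: minimum of (x - 2)^2 on [0, 5]
print(bisection_min(lambda x: (x - 2.0)**2, 0.0, 5.0))   # ~2.0
```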

5
Q

List the key properties of the Bisection Method

A
  • User-defined interval which must contain the min
  • Will find a min, but not necessarily the global one
  • Convergence is slow; depends on the widths of the initial and desired brackets
  • N = [log(xb − xa) − log(eta)] / log(2)
  • Gradient free
6
Q

Describe the Secant Method and list the key steps

A

A gradient-based optimization technique which linearly interpolates between the points of a bracket (although extrapolation can be used under certain conditions), using values of x and F'(x).

  • Set x1 = a and x2 = b (the bracket)
  • Compute F'1 = F'(x1) and F'2 = F'(x2)
  • Set k = 2
  • Use the linear interpolation formula to find xk+1: xk+1 = xk − F'k · (xk − xk-1) / (F'k − F'k-1)
  • Compute F'k+1 = F'(xk+1)
  • Set k = k + 1
  • Repeat until convergence (see the sketch below)
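A minimal sketch of these steps using the chronological update from card 9 (the function name and test problem are illustrative, not from the card):

```python
import math

def secant_min(dF, x1, x2, tol=1e-8, max_iter=50):
    """Drive F'(x) to zero by secant (linear interpolation) updates."""
    g1, g2 = dF(x1), dF(x2)
    for _ in range(max_iter):
        # Linear interpolation for the zero of F'
        x3 = x2 - g2 * (x2 - x1) / (g2 - g1)
        if abs(x3 - x2) < tol:
            return x3
        x1, g1 = x2, g2          # chronological update (card 9)
        x2, g2 = x3, dF(x3)
    return x2

# Example: F(x) = exp(x) - 2x, so F'(x) = exp(x) - 2, min at ln 2
print(secant_min(lambda x: math.exp(x) - 2.0, 0.0, 2.0))   # ~0.6931
```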
7
Q

State the conditions for extrapolation for the Secant Method

A

If x1 < x2 & F'1 < F'2 < 0, or x1 > x2 & F'1 > F'2 > 0 (i.e. both gradients have the same sign, so the zero of F' lies outside the bracket)

8
Q

State the key properties of the Secant Method

A
  • Needs gradients, but only first derivatives
  • May converge to a max
  • Faster than the Bisection Method
  • Flexibility in choosing xk
  • Can be generalised to multivariate problems
9
Q

State the three methods of choosing xk in the Secant method and briefly describe them

A

Chronological - the oldest point is discarded: xk becomes xk-1 and xk+1 becomes xk. Simple and quick.

Smallest gradient - the two points with the smallest |F'(x)| are used as the bracketing points, as they are, in theory, closest to the min. Faster.

Bracketed - the above two may find a max; by choosing two points whose F'(x) have opposite signs as the brackets, we make sure we find a min. Slower, but no risk of finding a max.

10
Q

List the key differences in origin of the Bisection, Secant, and Newton Methods

A

Bisection - direct evaluation of function values at points, converging the bracket onto the smallest values.

Secant - uses linear interpolation to find the zero of the gradient, i.e. the min.

Newton - uses a Taylor expansion to approximate the zero of the gradient, using first and second derivatives.

11
Q

What is the process for safeguarding the Bisection Method? Explain.

A

From the initial point, follow the negative gradient direction in steps of size s until the function starts to increase. This selects an interval containing a min, and if the function is unimodal there, it contains only one min.

12
Q

What is the process for safeguarding Newton’s Method? Explain.

A

If F''(x) < 0 then we will tend to a max.
If F''(x) = 0 then we will divide by zero.
If either of the above is true, use the steepest-descent step deltax = −F'(xk) instead.

To ensure the next step s = alpha·deltax keeps xk+1 = xk + s within the interval [a, b]:
If deltax < 0, alpha = min{1, (a − xk)/deltax}
If deltax > 0, alpha = min{1, (b − xk)/deltax}
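A minimal sketch of the safeguarded iteration (the function name and test problem are illustrative, not from the card):

```python
def safeguarded_newton(dF, d2F, x, a, b, tol=1e-8, max_iter=50):
    """Newton on F' with the two safeguards described above."""
    for _ in range(max_iter):
        g, h = dF(x), d2F(x)
        if h > 0:
            dx = -g / h      # standard Newton step towards F'(x) = 0
        else:
            dx = -g          # F'' <= 0: fall back to steepest descent
        if dx == 0:
            return x         # stationary point reached
        # Scale alpha so that x + alpha*dx stays inside [a, b]
        if dx < 0:
            alpha = min(1.0, (a - x) / dx)
        else:
            alpha = min(1.0, (b - x) / dx)
        x = x + alpha * dx
        if abs(alpha * dx) < tol:
            return x
    return x

# Example: F(x) = (x - 2)^2 on [0, 5], so F' = 2(x - 2), F'' = 2
print(safeguarded_newton(lambda x: 2.0*(x - 2.0), lambda x: 2.0, 4.5, 0.0, 5.0))
```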

13
Q

What is the process for safeguarding the Secant Method? Explain.

A

Use a bracketed interval. This ensures a local max is not found.

14
Q

State a Taylor Expansion for two variables

A

F(x + dx, y + dy) ≈ F + p^t g + 0.5 p^t H p

where H is the Hessian,
g = gradient vector = [Fx Fy]^t,
p = step vector = [dx dy]^t
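A quick numerical check of this expansion, using an assumed quadratic test function (for which the expansion is exact); the names F, g, H, p follow the card:

```python
import numpy as np

F = lambda v: v[0]**2 + v[0]*v[1] + v[1]**2             # quadratic test function
g = lambda v: np.array([2*v[0] + v[1], v[0] + 2*v[1]])  # [Fx Fy]^t
H = np.array([[2.0, 1.0],
              [1.0, 2.0]])                              # Hessian (constant here)

v = np.array([1.0, 2.0])
p = np.array([0.05, -0.02])                             # step vector [dx dy]^t
model = F(v) + p @ g(v) + 0.5 * p @ H @ p
print(F(v + p), model)    # identical, since F is exactly quadratic
```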

15
Q

State the multivariate optimality conditions

A

For F(x + dx, y + dy) ≈ F + p^t g + 0.5 p^t H p:

g = 0, so no descent direction remains (any p with p^t g < 0 would reduce F)
H is positive definite, so every step p increases F

16
Q

State the three Wolfe Conditions and what they mean.

A

p^t gk <= −eta0 · mag(p) · mag(gk)

This is stronger than the p^t g < 0 condition, as it ensures a minimum angle away from the contour (a genuine descent direction) is achieved.

F(xk + s·pk) − F(xk) <= eta1 · s · p^t gk

This ensures the step size is not too big (sufficient decrease).

F(xk + s·pk) − F(xk) >= (1 − eta2) · s · p^t gk

This ensures the step size isn't too small.
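A minimal sketch that checks all three conditions for a given step; the function name wolfe_conditions and the test problem are illustrative, not a library API:

```python
import numpy as np

def wolfe_conditions(F, gradF, x, p, s, eta0=0.1, eta1=1e-4, eta2=0.9):
    """Check the three conditions above for a step of size s along p."""
    g = gradF(x)
    slope = p @ g                                        # p^t gk
    angle_ok = slope <= -eta0 * np.linalg.norm(p) * np.linalg.norm(g)
    decrease = F(x + s * p) - F(x)
    not_too_big   = decrease <= eta1 * s * slope         # sufficient decrease
    not_too_small = decrease >= (1.0 - eta2) * s * slope
    return angle_ok and not_too_big and not_too_small

# Steepest descent on F(x) = x^t x from x = (1, 1)
F = lambda x: x @ x
gradF = lambda x: 2.0 * x
x = np.array([1.0, 1.0])
print(wolfe_conditions(F, gradF, x, -gradF(x), s=0.95))  # True
```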

17
Q

What is an Augmented Lagrangian?

A

It is a penalty function which approximates the first-order constrained optimality criteria using estimates of the Lagrange multipliers. It then adds a small penalty term to correct for the error in those multiplier estimates.
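As a sketch, a common equality-constrained form (the symbols c(x) for the constraints, lambda for the multiplier estimates, and mu for the penalty weight are conventional choices, not from the card):

```latex
L_A(x, \lambda, \mu) = F(x) + \lambda^{T} c(x) + \tfrac{\mu}{2}\, c(x)^{T} c(x)
```

The lambda^T c term carries the first-order optimality information; the quadratic term is the small corrective penalty.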

18
Q

How do projected gradient methods work?

A

By removing the component of the gradient that is perpendicular to the constraint surface, so that steps stay along the constraint. However, they need the gradient of the constraint.
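A minimal sketch of the projection for a single linear constraint (the vectors are illustrative):

```python
import numpy as np

g = np.array([1.0, 2.0])     # objective gradient
a = np.array([1.0, 1.0])     # constraint gradient (normal to the constraint)

n = a / np.linalg.norm(a)
g_proj = g - (g @ n) * n     # strip the component perpendicular to the constraint
print(g_proj)                # [-0.5  0.5]: a step along the constraint surface
```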

19
Q

What are interior point methods good for?

A

Inequality-constrained problems, as they can explore the feasible area well.

20
Q

What are projected gradient methods good for?

A

Constrained problems with linear constraints.

21
Q

What are the principles in the derivation of SQP?

A

Perform a Taylor expansion to approximate the objective function with a quadratic model, then include an approximated second derivative of the constraints within the Hessian. This improves stability and convergence: even though the constraint approximation is still linear, the Hessian includes the constraints' second derivatives, so it can handle non-linear constraints better.
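As a sketch, the quadratic subproblem solved at each iterate xk then typically looks like this (B_k denoting the approximated Hessian of the Lagrangian is an assumption of notation, not from the card):

```latex
\min_{p} \; \nabla F(x_k)^{T} p + \tfrac{1}{2}\, p^{T} B_k\, p
\qquad \text{s.t.} \quad c(x_k) + \nabla c(x_k)^{T} p = 0
```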

22
Q

When should you use SQP?

A

When you’ve got non-linear constraints and you already have a design which is close to the minimum. SQP is expensive when far from the min, so if you’re establishing a first design, it might not be the best method.

23
Q

Describe the finite difference method and when to use it with regard to calculation of derivatives

A

The finite difference method calculates the function at two locations:

For forward difference: x and x + del_x
For central difference: x + del_x and x − del_x

It then subtracts the first value from the second and divides by the interval between the two points:

FD: F'(x) ≈ [F(x + del_x) − F(x)] / del_x
CD: F'(x) ≈ [F(x + del_x) − F(x − del_x)] / (2 del_x)

FD is first-order accurate, CD is second-order accurate. Good for use with 50 or so variables, but the cost scales with the number of design variables, so it is not appropriate for 100+; it can get very expensive. It also suffers from the finite precision of the computer: too small a change in x will blow the solution up through cancellation error.
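A minimal sketch of both differences on a known derivative (the test function and step size are illustrative):

```python
import numpy as np

def fd(F, x, h=1e-6):
    """Forward difference: first-order accurate in h."""
    return (F(x + h) - F(x)) / h

def cd(F, x, h=1e-6):
    """Central difference: second-order accurate in h."""
    return (F(x + h) - F(x - h)) / (2.0 * h)

x, exact = 1.0, np.cos(1.0)          # d/dx sin(x) = cos(x)
print(abs(fd(np.sin, x) - exact))    # error ~1e-7
print(abs(cd(np.sin, x) - exact))    # error ~1e-11
```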

24
Q

What is the principle behind complex variable derivative calculation?

A

By implementing an imaginary perturbation, a Taylor expansion eliminates the subtraction that occurs in FD methods: F'(x) ≈ Im[F(x + i·del_x)] / del_x. This eliminates the cancellation error that occurs as del_x approaches zero, so we get a more accurate calculation. It's slightly more expensive but has increased precision.
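A minimal sketch, assuming NumPy (the tiny step h = 1e-200 is safe precisely because no subtraction occurs):

```python
import numpy as np

def cs(F, x, h=1e-200):
    """Complex-step derivative: F'(x) ~ Im[F(x + i*h)] / h, no subtraction."""
    return np.imag(F(x + 1j * h)) / h

# Accurate to machine precision even with an absurdly small step
print(abs(cs(np.sin, 1.0) - np.cos(1.0)))   # ~1e-16
```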

25
Q

What is a compiler, why use it?

A

A compiler is a programming tool which reads an entire program before translating it into assembly code. This allows a degree of optimization, as the whole program is analysed, as opposed to a line-by-line interpretation.

26
Q

What is source transformation AD?

A

Where the source code is modified to include the derivatives of the components of a function. Requires compilable code, and the tools are only available for a small number of coding languages, such as Fortran and C.

27
Q

What is operator overload AD?

A

Where custom data types are created and their respective operators are programmed using the basic rules of calculus. The method is usable in languages which support operator overloading, such as Python, C++, or MATLAB. Less efficient than S-T AD but available in more popular languages.
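A minimal sketch of the idea with a hypothetical Dual type (only + and * implemented; not a real AD library):

```python
class Dual:
    """Minimal forward-mode AD type (illustrative only)."""
    def __init__(self, val, der=0.0):
        self.val, self.der = val, der

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.der + other.der)
    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # Product rule: (uv)' = u'v + uv'
        return Dual(self.val * other.val,
                    self.der * other.val + self.val * other.der)
    __rmul__ = __mul__

# d/dx of F(x) = x*x + 3*x at x = 2 is 2x + 3 = 7
x = Dual(2.0, 1.0)       # seed the input's derivative with 1
F = x * x + 3 * x
print(F.val, F.der)      # 10.0 7.0
```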

28
Q

What is the most efficient way to deal with linear operators in AD?

A

Call the primal function twice, the second time with the original variables replaced by their derivatives: differentiating a linear function yields exactly the same coefficients multiplied by the derivatives of the variables, so the derivative code has the same form as the primal. This is cheaper than building and running a full differentiated operation.
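A minimal sketch, assuming the primal routine is purely linear (the matrix A and vectors are illustrative):

```python
import numpy as np

def primal(x):
    """A purely linear operator: y = A x."""
    A = np.array([[2.0, 0.0],
                  [1.0, 3.0]])
    return A @ x

x  = np.array([1.0, 2.0])
dx = np.array([0.1, 0.0])   # derivative (seed) of the input
y  = primal(x)              # primal evaluation
dy = primal(dx)             # same routine reused: dy = A dx, exactly
print(y, dy)                # [2. 7.] [0.2 0.1]
```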

29
Q

What are exterior penalty functions good for?

A

Problems in which representation of the entire design space is desired, for example when using genetic algorithms.

30
Q

In genetic algorithms, what is the biased roulette wheel and tournament selection?

A

They are methods of selection. The biased roulette wheel selects like a roulette wheel in which each chromosome's slot width is proportional to its fitness, making fitter chromosomes more likely to be selected. Tournament selection is where a few chromosomes, maybe two or three, are randomly compared and the fittest is selected for reproduction.
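A minimal sketch of both selection methods, with toy chromosomes and fitness (all names are illustrative):

```python
import random

def tournament_select(population, fitness, k=3):
    """Tournament: compare k randomly chosen chromosomes, keep the fittest."""
    return max(random.sample(population, k), key=fitness)

def roulette_select(population, fitness):
    """Biased roulette wheel: selection probability proportional to fitness."""
    weights = [fitness(c) for c in population]
    return random.choices(population, weights=weights, k=1)[0]

pop = [0.1, 0.5, 0.9, 0.3]   # toy 'chromosomes'
fit = lambda c: c            # toy fitness: the value itself
print(tournament_select(pop, fit), roulette_select(pop, fit))
```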