Chapter 13 - Nonlinear BOOK Flashcards
Formally define the non linear programming problem
We use something like the standard form here.
We want to find x = (x1, x2, …, xn) such that we
max f(x)
subject to
gi(x) <= bi, for i=1,..,m
and
x>= 0
So, we do a couple of changes relative to the linear programming problem:
1) We refer to the objective function as f(x), indicating that we are essentially using any type of function here. This stands in contrast with the linear programming objective function, that is explicitly given as a sum of products, making it linear.
2) The same argument/point with the constraints.
Why is non linear programming a particularily large subject?
It is large because there are a huge variety of classes of functions we can look at. We have some methods that work very well on some problems, and not at all on others etc.
Define a convex function of a single variable
A function of a single variable is said to be convex if the following applies:
For each pair of values, x’ and x’’, such that x’ < x’’, we have that:
f[λx’’ + (1 - λ)x’] <= λf(x’’) + (1-λ)f(x’)
.. for all values of lamda between 0 and 1.
We furhter say that the function is strictly convex if we can replace ‘<=’ with ‘<’.
So, what is this saying?
The LHS takes the function value of the sum of lambda times the x’’ and (1-lambda) times x’. In this case, lambda is sort of like a weight. If lambda is 0.25, it means that we are at 25% of the distance between x’ and x’’. So, when LHS take this as the function parameter/argument, the argument becomes the x-value of 25% between x’ and x’’. Therefore, the f(x) gives the y-value corresponding to this specific point.
The RHS does the same, but with y-value. RHS follow straight line, LHS follow the non-linear path.
IF the non-linear path is below the straight line at all value sof lambda and any pair of x’ and x’’, then we have a convex function.
IT IS ALSO WORTH NOTING that if we flip the inequality from <= to >=, we basically do the concavity test instead of convexity.
What is the formal mathematical definition/requirement of convexity?
A function is convex if and only if the second derivative is always greater than 0.
NB: This is single variable.
What is the formal mathematical definition/requirement of convexity when the function is of many variables?
We say the function is convex if its Hessian is positive semi-definite for all possible values of x1, x2, …, xn
What is the “negative” property of convex/concave functions?
If we have a convex function f(x1,x2,..,xn), then g= -f(x1,x2,…,xn) is CONCAVE and vice versa.
What is the sum property of convex/concave functions?
A sum of convex functions is convex. A sum of concave functions is also concave.
Define a convex set
Say we have a convex function. Then the collection of points lying ABOVE and ON the graph of the convex function, is said to be a convex set.
We can also get a convex set from a concave function:
If we have a concave function, then the collection of points lying BELOW and ON the graph of the concave function, is said to be a convex set.
What is an important property of convex sets?
For any collection of convex sets, the collection of points lying in ALL (/both) sets are convex in both, i.e. the intersection of the convex sets, is a convex set in itself.
the image illustrate this property. Notice how the third graph(s) is the intersection set.
Formally define the convex set
A convex set is a collection of points such that, for each pair of points in the set, a straight line connecting the pair of points is also in the collection.
Will the solution for non linear programming problems be CPF solutions?
Not necessarily. There exists constraints and objectiuve functions that will give this, but generally they do not.
If we have a linear objective function and non-linear constraint(s), what do we get graphiucally?
We get a straight line as the objectiuve function. This line will be pushed as far out the feasible region as possible. the feasible region will be given as some curves instead of lines.
What is required for a problem to be considered convex?
Depends on whether it is maximization or minimizaiton.
If MAX: We need a concave objective function f(x), and every gi(x)<=bi must be convex.
if MIN, we the objective function is also convex.
Why do we require gi(x)<= bi to be convex?
Because the region defined by being below the function gi(x) will be convex. Thus, the feasible region becomes convex.
Why do we need convex problems?
The property of convex problems is that they guarantee that any local optimum is actually a global optimum.