Integer programming Flashcards

1
Q

Difference between the feasible region of LP and IP problems?

A

LP has a convex feasible region.

IP has a non-convex feasible region.

2
Q

Can we round off the LP relaxation to get the solution, or at least something good enough?

A

No, generally not. In many cases the optimum of the LP relaxation is extremely far away from the IP optimal solution, and the rounded point may not even be feasible.

3
Q

Can we have non-linear integer programming?

A

Yes, but this typically creates extremely difficult problems to solve.

It is therefore common to approximate non-linear functions using piecewise linear functions, with integer variables as support.

4
Q

Why do we need integers?

A

Two reasons primarily:
1) Many quantities are naturally integer, for instance the number of units, machines or people.
2) Binary variables can be used for logical purposes to extend our modelling abilities.

5
Q

General rule for when we must use integers rather than continuous values and rounding off?

A

If the variable value is small (roughly less than 10), we typically have to use integers, since rounding then changes the value by a relatively large amount.

6
Q

Elaborate on the usual suspects of what we can use integer variables for in terms of modelling.

A

Yes/no decisions

Modelling fixed costs

Description of non-convex areas

Approximation of non-linear functions (piecewise linear)

Definition of logical restrictions

When only a discrete set of values is allowed (say 0, 4, 7, 22).

7
Q

CASE

We consider a production machine that has a variable cost associated with each unit of production. However, starting up the machinery also costs money. How can we model this?

A

This is a classic fixed-cost problem.

We call the fixed cost “f”.

Logically, the fixed cost should be 0 if the number of units is 0. If the number of units is greater than 0, the fixed cost should be applied in its entirety:

z = f + ca xa, if xa > 0
z = 0, if xa = 0

To express this, we introduce a binary variable y. The interpretation of this variable is the yes/no decision “start up the machinery”. The cost is then written z = f y + ca xa, together with the linking constraint

xa <= My

where the constant M must be chosen large enough (an upper bound on xa).
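
A minimal sketch of this fixed-charge logic in plain Python, with made-up numbers (f = 100, ca = 5, capacity M = 50): it checks that the formulation z = f y + ca xa together with xa <= My reproduces the intended cost for every production level.

# Hypothetical data: fixed start-up cost, unit cost and capacity (big M).
f, ca, M = 100.0, 5.0, 50

def model_cost(xa, y):
    # Cost according to the MIP formulation z = f*y + ca*xa.
    assert 0 <= xa <= M * y, "linking constraint xa <= M*y violated"
    return f * y + ca * xa

def intended_cost(xa):
    # The logical definition: pay the fixed cost only if anything is produced.
    return 0.0 if xa == 0 else f + ca * xa

for xa in range(0, M + 1):
    y = 0 if xa == 0 else 1          # the natural choice of y for a given xa
    assert model_cost(xa, y) == intended_cost(xa)
print("formulation matches the intended fixed-cost logic")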

8
Q

CASE

Suppose we have a problem where the feasible points are defined as lying in either area 1 or area 2 (or in both). The areas are defined as constraints. For instance, area 1:

x2 <= 3
x1 + x2 <= 4
x1, x2 >= 0

area 2:

-x1 + x2 <= 0
3x1 - x2 <= 8
x1, x2 >= 0

Express the feasible area mathematically

A

First off, we cannot simply impose all the constraints at once to create the new feasible region: that would give the intersection of the areas (an AND case). What we have is an OR case.

So, we introduce 2 new binary variables.

y1: 1 if the point is in area 1, 0 otherwise
y2: 1 if the point is in area 2, 0 otherwise

So, the general idea is that either the constraints of the first area must apply, or the constraints of the second area must apply. Both can apply, but this is not a requirement we need to enforce; it is enough to enforce one of them.

x2 <= 3 + M(1-y1)
x1 + x2 <= 4 + M(1-y1)
-x1 + x2 <= M(1-y2)
3x1 - x2 <= 8 + M(1-y2)

Finally, enforce that at least one of them is always active:
y1 + y2 >= 1

Now, it is up to the objective function to decide which region to pick. It will naturally pick the area that gives the better value. Our only requirement is that the point the model ultimately finds as optimal must lie in at least one of the areas.
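
A minimal plain-Python sketch of this disjunctive big-M formulation, using an assumed M = 1000: it checks whether a point satisfies the constraints for a given choice of (y1, y2).

# Check a point against the either-or (disjunctive) big-M formulation above.
# Hypothetical big-M value; it only needs to exceed any possible violation.
M = 1000.0

def area1_ok(x1, x2, y1):
    return (x2 <= 3 + M * (1 - y1)
            and x1 + x2 <= 4 + M * (1 - y1)
            and x1 >= 0 and x2 >= 0)

def area2_ok(x1, x2, y2):
    return (-x1 + x2 <= M * (1 - y2)
            and 3 * x1 - x2 <= 8 + M * (1 - y2)
            and x1 >= 0 and x2 >= 0)

def feasible(x1, x2, y1, y2):
    return area1_ok(x1, x2, y1) and area2_ok(x1, x2, y2) and y1 + y2 >= 1

# (2.5, 2) lies in area 2 but not in area 1, so it is feasible with y1=0, y2=1 ...
print(feasible(2.5, 2, y1=0, y2=1))   # True
# ... but claiming it lies in area 1 (y1=1) makes it infeasible.
print(feasible(2.5, 2, y1=1, y2=0))   # False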

9
Q

What is the general model of a knapsack problem

A

for the binary variant:

max z = ∑cj xj

s.t.

∑aj xj <= b

xj in {0,1} for j=1,…,n

Then we can extend with multiple resource constraints:

max z = ∑cj xj
s.t.

∑aij xj <= bi, where this is a sum over j, giving us a constraint per “i”.

xj in {0,1}
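
For intuition, a brute-force solver for the binary variant in plain Python (illustration only: complete enumeration is exponential and only works for tiny, made-up instances like the one below).

from itertools import product

def knapsack_bruteforce(c, a, b):
    # Binary knapsack by complete enumeration: max sum(cj xj) s.t. sum(aj xj) <= b.
    best_val, best_x = float("-inf"), None
    for x in product([0, 1], repeat=len(c)):
        if sum(aj * xj for aj, xj in zip(a, x)) <= b:
            val = sum(cj * xj for cj, xj in zip(c, x))
            if val > best_val:
                best_val, best_x = val, x
    return best_val, best_x

# Hypothetical instance: values c, resource use a, capacity b.
print(knapsack_bruteforce(c=[10, 13, 7, 4], a=[5, 6, 4, 3], b=10))   # (20, (0, 1, 1, 0))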

10
Q

Give an example of a binary knapsack problem

A

the housing budget problem

Say we want to buy houses, and we have a budget. We’d like to maximize the value of the purchase.

Another example is investing in projects, where we can only be fully in or fully out of each project. The objective function gives the payoff from each project. The technology coefficients tell us how expensive each project is, for instance in terms of cash, time or mental energy. The objective is then to maximize payoff, given that we can only select a subset.

11
Q

Elaborate on the facility location problem

A

Choosing a set of facilities, and from these facilities, support a set of customers.

Suppose we have m potential facilities and n customers. Each facility i has a given capacity (supply) si and each customer j has a given demand dj.

We typically include a fixed cost for using a facility, and a unit cost for each unit that is transported between facility i and customer j.

This is a mixed integer programming problem, because it uses a fixed cost with a binary variable, while the flow of units is measured by continuous variables.

Typically, the idea is to minimize costs.

min z = ∑∑cij xij + ∑fi yi

s.t.

∑xij <= si yi (sum over j: a facility can only ship up to its capacity, and only if it is opened)

∑xij >= dj (sum over i: each customer's demand must be met)

Basically, choose a set of facilities in a way that meet the overall demand.

IMPORTANT: The customers are the same regardless of which facilities are chosen. The problem is therefore how to select facilities so as to balance the cost of each facility (fixed and variable) against the cost of transporting.
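
A small plain-Python sketch, with made-up data for 2 facilities and 3 customers, that verifies a candidate solution (y, x) against the constraints above and computes its cost.

# Hypothetical data: fixed costs f_i, capacities s_i, demands d_j, unit costs c_ij.
f = [100, 120]
s = [50, 60]
d = [20, 30, 25]
c = [[4, 6, 5],
     [5, 3, 4]]

def check_and_cost(y, x):
    m, n = len(f), len(d)
    for i in range(m):                       # capacity / linking: sum_j xij <= si*yi
        assert sum(x[i]) <= s[i] * y[i]
    for j in range(n):                       # demand: sum_i xij >= dj
        assert sum(x[i][j] for i in range(m)) >= d[j]
    transport = sum(c[i][j] * x[i][j] for i in range(m) for j in range(n))
    fixed = sum(f[i] * y[i] for i in range(m))
    return transport + fixed

# Open both facilities; facility 1 serves customers 1 and 3, facility 2 serves customer 2.
print(check_and_cost(y=[1, 1], x=[[20, 0, 25], [0, 30, 0]]))   # 515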

12
Q

elaborate on assignment problems

A

Objects in one set are to be assigned/allocated to objects in some other set.

For instance, assigning staff to various projects or tasks.

Typically, the goal is to minimize either cost or time. Therefore, depending on whether we solve for cost or time, we need to know how expensive the various assignments are.

Generally speaking, we say that the assignment problem is a special case of the transportation problem, defined on a bipartite graph.

We have one set of objects, which is the set of sources.
We have one set of objects, which is the set of sinks.

We assume there is an arc going from a source to all sinks, for all sources.

We need the following parameters:
aij = resource use if machine i is assigned to job j
cij = cost if machine i is assigned to job j
bi = resource capacity of machine i

Variable: xij = 1 if machine i has been assigned to job j, 0 otherwise.

Then the model becomes:

min z = ∑∑cij xij

s.t.
∑aij xij <= bi (sum over j, one constraint per machine i)

∑xij [i] = 1 (one constraint per job j)

The first constraint limits how much a single machine can be utilized. For an actual machine, this may be in terms of hours.

The second constraint makes sure that each job j is assigned to exactly one machine.
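
A brute-force sketch of this (generalized) assignment model in plain Python, with made-up data for 2 machines and 3 jobs: it enumerates all ways of giving each job to exactly one machine, keeps those that respect the capacities, and returns the cheapest.

from itertools import product

# Hypothetical data: resource use a_ij, costs c_ij, machine capacities b_i.
a = [[3, 4, 2],
     [2, 5, 3]]
c = [[8, 6, 7],
     [5, 9, 6]]
b = [6, 7]

def solve():
    m, n = len(b), len(a[0])
    best = (float("inf"), None)
    # assignment[j] = machine chosen for job j  (this encodes sum_i xij = 1)
    for assignment in product(range(m), repeat=n):
        load = [0] * m
        for j, i in enumerate(assignment):
            load[i] += a[i][j]
        if all(load[i] <= b[i] for i in range(m)):           # capacity constraints
            cost = sum(c[i][j] for j, i in enumerate(assignment))
            best = min(best, (cost, assignment))
    return best

print(solve())   # (17, (1, 0, 1)): machine 2 takes jobs 1 and 3, machine 1 takes job 2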

13
Q

elaborate on set-partitioning

A

We have m objects/tasks, and these objects/tasks are listed in a variety of different subsets so that some are included in some subsets, while not in others.

Our objective is to pick a certain selection of subsets so that all of the objects/tasks are represented. Typically with the lowest possible cost.

General model:

xj = 1 if alternative j is used. 0 otherwise

min z = ∑cj xj

s.t.

∑aij xj = 1

where aij is 1 if activity/task/object “i” is included in set j. 0 otherwise. We get one such constraint for each object that needs to be included.
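
A brute-force plain-Python sketch with a made-up instance (4 objects, 5 candidate subsets): it enumerates all selections and keeps the cheapest one in which every object is covered exactly once. Changing the "== 1" test to ">= 1" or "<= 1" turns it into set covering or set packing, respectively.

from itertools import product

# Hypothetical instance: candidate subsets of the objects {0, 1, 2, 3} and their costs.
subsets = [{0, 1}, {2, 3}, {0, 2}, {1, 3}, {3}]
cost = [5, 4, 3, 4, 2]
objects = {0, 1, 2, 3}

best = (float("inf"), None)
for x in product([0, 1], repeat=len(subsets)):
    chosen = [s for s, xj in zip(subsets, x) if xj]
    counts = {obj: sum(obj in s for s in chosen) for obj in objects}
    if all(cnt == 1 for cnt in counts.values()):        # "= 1": partitioning
        total = sum(cj for cj, xj in zip(cost, x) if xj)
        best = min(best, (total, x))
print(best)   # (7, (0, 0, 1, 1, 0)): pick {0, 2} and {1, 3}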

14
Q

What is the difference between set partitining and set covering?

A

Set partitioning is concerned with picking each object/task exactly once.

Set covering just wants to cover all tasks. Allows overlaps.

We get set covering when, instead of the = constraint used in set partitioning, we use a >= constraint.

15
Q

set packing?

A

Same as set covering and set partitioning, but with <= constraint instead

16
Q

elaborate on the travelling salesman problem

A

We need to visit n nodes exactly once each, starting and ending at the same place, with the aim of minimizing total distance.

There is a difference between the symmetric and asymmetric TSP. The symmetric problem has the same cost in both directions along an edge. The asymmetric variant has different costs for the two directions along the same edge.

17
Q

What constraints are important/necessary in the TSP

A

Recall that the variable xij is binary: it is 1 if edge ij is included in the tour and 0 otherwise.

We need 2 constraints that make sure we leave and enter each node exactly once:

∑xij [j] = 1
∑xij [i] = 1

These two ensure that the number of edges OUT from each node i is 1, and the number of edges IN to each node j is 1.

Then we need the subtour elimination constraints, which make sure that the tour is connected and does not just consist of separate subtours:

∑∑xij [i in S][j in S] <= |S| - 1, for every proper subset S of the nodes
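
A small plain-Python sketch of why the subtour constraints are needed: given a candidate solution that already satisfies the degree constraints (encoded as succ[i] = j meaning xij = 1), it lists the subtours.

def subtours(succ):
    # Follow successors and collect the cycles of the candidate solution.
    unvisited = set(range(len(succ)))
    tours = []
    while unvisited:
        node, tour = next(iter(unvisited)), []
        while node in unvisited:
            unvisited.remove(node)
            tour.append(node)
            node = succ[node]
        tours.append(tour)
    return tours

# Hypothetical 5-node solution 0->1->2->0 and 3->4->3: two subtours, so it violates
# the |S| - 1 constraint for S = {0, 1, 2} (and for S = {3, 4}).
print(subtours([1, 2, 0, 4, 3]))   # [[0, 1, 2], [3, 4]]
print(subtours([1, 2, 3, 4, 0]))   # [[0, 1, 2, 3, 4]]  -> one connected tour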

18
Q

Why are IP problems considered hard to solve, while LP is easier?

A

IP is hard because of exponential worst-case time complexity.

LP is easy because we know so much about where the optimal solution will be. Simplex utilizes the fact that an optimal solution must lie at a vertex of the feasible region.

19
Q

What 4 important strategies for solving IP problems do we have?

A

1) Enumeration methods
2) Relaxation and decomposition methods
3) Cutting plane methods
4) Heuristics

20
Q

What technique is branch and bound based on?

A

Enumeration. It uses implicit enumeration, where implicit enumeration refers to using various “tests” to conclude that some solutions cannot lead to better solutions than the best one available.

21
Q

elaborate on relaxation and decomposition methods

A

Relaxation is about solving an easier problem. Common ways are LP relaxation and Lagrangian relaxation.

Decomposition is about splitting up the problem, solving smaller problems in a systematic way, and later combining the findings to conclude with optimality.

22
Q

Elaborate on cutting plane methods

A

Repeatedly solve the LP relaxation of the IP problem. In each new iteration, we add a constraint so that the next LP relaxation comes closer and closer to the optimal IP solution.

Related to finding the convex hull.

23
Q

Elaborate on heuristics

A

No guarantee of finding the optimal solution, but they are very fast.

Utilize simple rules of thumb

24
Q

the difficulty in solving IP problems depends on …?

A

1) Number of integer variables
2) Problem structure

25
Q

In LP, what determines the difficulty in solving a problem?

A

Typically the number of constraints.

This is a contrast to IP, where there is more focus on the number of integer variables and the structure of the problem.

26
Q

For LP problems, how can we prove a solution is optimal or not?

A

KKT conditions along with convexity analysis.

27
Q

Are there optimality conditions for IP?

A

No.

28
Q

if we want to prove optimality for IP, what do we do?

A

We need to use some iterative procedure where we find optimistic and pessimistic bounds that are iteratively brought closer together. We stop when the error (upper bound minus lower bound) is smaller than some predefined error constant.

29
Q

elaborate on pessimistic bounds

A

A pessimistic bound is always given by any feasible solution to the problem. For max problems, a pessimistic bound is a lower bound.
For min problems, a pessimistic bound is an upper bound.

30
Q

elaborate on optimistic bounds

A

We find optimistic bounds by solving a relaxed variant of the problem. Relaxation can be done in a variety of ways:
1) Remove the integer constraints
2) Make the feasible region larger by removing one or several of the constraints

31
Q

What are pessimistic bounds alternatively called?

A

Primal bounds

32
Q

What are optimistic bounds alternatively called?

A

Dual bounds

33
Q

what happens if the relaxation of a problem has no feasible solution?

A

The problem (without relaxation) will not have any feasible solutions either

34
Q

elaborate on weak and strong LP formulations

A

A formulation is weak if its LP relaxation gives an optimal solution that is far away from the optimal solution of the IP problem.

The closer the LP relaxation optimum is to the IP optimum, the stronger the formulation.

We say that the formulation is IDEAL if the optimal solution is the same for both problems.

35
Q

what is the mathematical foundation we need to go further in the discussion on good models in IP?

A

Convex hull.

36
Q

Elaborate on convex hull

A

“Convex hull of feasible integer points”.

A point y is a convex combination of a set of points x^(k), k in K, if:
y = ∑lambda_k x^(k) [k in K], where ∑lambda_k [k in K]= 1, and lambda_k >= 0 for all k in K.

So, a point is called a convex combination of a set of points if it is a weighted average of those points, where the weights are non-negative and sum to 1.

Crucial about convex combinations: they define a convex set.

Then we can define the convex hull:
The convex hull of a set of points x^(k), k in K, consists of all possible convex combinations of the points.

So, the convex hull of a set of points refers to the entire region defined by considering all convex combinations of those points.
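
A tiny plain-Python check of the definition: given candidate weights, verify that a point really is a convex combination of a set of points (made-up 2D example).

def is_convex_combination(y, points, lambdas, tol=1e-9):
    # Weights must be non-negative and sum to 1, and the weighted average must equal y.
    if abs(sum(lambdas) - 1.0) > tol or any(l < -tol for l in lambdas):
        return False
    combo = [sum(l * p[d] for l, p in zip(lambdas, points)) for d in range(len(y))]
    return all(abs(combo[d] - y[d]) <= tol for d in range(len(y)))

# (1.5, 0.5) is the average of the integer points (1, 0) and (2, 1), so it lies
# in their convex hull; (3, 0) does not, at least not with these weights.
print(is_convex_combination((1.5, 0.5), [(1, 0), (2, 1)], [0.5, 0.5]))   # True
print(is_convex_combination((3.0, 0.0), [(1, 0), (2, 1)], [0.5, 0.5]))   # False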

37
Q

In IP, can the boundary of the convex hull consist of diagonal lines between points, or do we require lines parallel to the coordinate axes?

A

We allow arbitrary linear hyperplanes, so diagonal facets are fine. The general definition of the convex hull applies.

38
Q

Relate a convex hull to linear constraints

A

We can describe the convex hull by a set of linear inequalities; it is itself the feasible region of a (possibly very large) set of linear constraints.

39
Q

What is the important outcome of convex hulls?

A

If we had the convex hull available (described by linear constraints), we could solve the IP as an LP.

40
Q

Do we search for the convex hull in practice?

A

No, it is too time consuming. However, there are many methods that reduce the difference between the feasible region and the convex hull by making changes to the IP formulation.

41
Q

How can we achieve a smaller difference between the feasible region and the convex hull?

A

Initially choose a strong formulation

Add problem-specific valid inequalities (a topic covered later).

42
Q

What can we say about problem structure in terms of “better” or “worse” (stronger/weaker)

A

For IP, we can often find multiple ways of describing the very same feasible region, i.e. the same set of integer solution points. However, once we perform the LP relaxation, the different formulations give different feasible regions. It benefits us if the difference between the convex hull (spanned by the integer solution points) and the feasible region of the LP relaxation is as small as possible. Therefore, adding more constraints (while keeping the IP feasible region the same) can be beneficial for this purpose. Of course, doing so adds complexity since we get more constraints, but the problem structure is typically much more important than the extra constraints.

43
Q

Elaborate on the big-M constant

A

The smaller we can set it, the better (tighter) the LP relaxation will be.

It is based on the same principle as for the other constraints: the big M is part of a constraint, so its value naturally affects the feasible region of the LP relaxation. The point is to reduce the difference between the feasible region of the LP relaxation and the convex hull.

44
Q

What is a valid inequality?

A

A valid inequality is an inequality that is satisfied for all x in Xip.

Adding a valid inequality will not change the feasible region of the IP problem, but will reduce the difference between the feasible region of the LP relaxation and the convex hull.

45
Q

what is the advantage of adding valid inequalities?

A

Since it yields a better approximation of the convex hull, the bound from the LP relaxation will be better (at least not worse).

46
Q

Elaborate on the Face of a convex hull

A

We say that a face of the convex hull Xc is the set of points where a valid inequality ∑aj xj <= b holds with equality:

F = {x : x in Xc, ∑aj xj = b}

Note that not all valid inequalities will produce a (non-empty) face; the inequality must touch the convex hull.

47
Q

What is facet?

A

A face F of a convex hull Xc with dimension dim(Xc) = n is called a facet if dim(F) = n - 1.

In other words, if the face is of dimension one less than the convex hull, it is a facet.

48
Q

What shapes can the face be?

A

A valid inequality defines a hyperplane; in 3 dimensions, for instance, this is a plane of dimension 2. It can support the convex hull along a facet, an edge or a single point.

49
Q

broadly what is cutting plane methods about?

A

Approximating the convex hull better and better.

It is not a goal to construct the entire convex hull, as some areas (those near the optimum) are more important than others.

50
Q

what are the cuts in cutting plane?

A

Constraints: valid inequalities.

51
Q

why must the cuts be valid inequalities

A

If the cuts are not valid, they can remove integer points from the feasible region; this might cut off the optimal solution.

52
Q

What is the general algorithm for cutting plane?

A

1) Choose a suitable mathematical formulation of the problem. If possible, add some initial valid inequalities.

2) Solve the LP relaxation of the problem.
3) If the solution is integer, stop. If not, continue.

4) Add one or several valid inequalities that cut away the current LP optimal solution:
a) based on problem-specific cuts, or
b) based on a general cutting plane method, like Gomory's method.

5) Re-solve the LP relaxation and go to step 3.

53
Q

elaborate on Gomory’s method

A

Gomory's method generates valid inequalities based on the optimal simplex tableau of the LP relaxation. This is nice, because we do not need to know much about the structure of the IP problem.

One requirement for generating cuts: all coefficients in the problem must be integers. However, note that we can multiply a constraint by a suitable constant if they are not.

Each row in the system of equations (including the obj func row) that has a fractional RHS value can be used to generate a cut.
The constraint must cut away the current LP optimal solution, but not any integer solutions (just the same as saying we want to add a valid inequality).

Here is what we do:

We solve the LP relaxation. This gives us a basic solution. For each row in the optimal tableau whose RHS is not integer, we study it.
Each such row can be written as:

xi + ∑aij xj = bi

Since the xj in the sum are all non-basic (and hence 0), we currently have xi = bi.

Then we split bi and aij into their integer parts and fractional parts.

xi + ∑(int(aij) + frac(aij))xj = int(bi) + frac(bi)

Now we restructure the equation so that all the integer parts are on the LHS, while the fractional parts are on the RHS:

xi + ∑int(aij)xj - int(bi) = frac(bi) - ∑frac(aij)xj

Instead of writing int and frac, we let the plain symbols denote the integer (floor) parts and use “f” for the fractional parts:

xi - bi + ∑aij xj = fi - ∑fij xj

Since the LHS consists only of integers, it is an integer. Therefore, the RHS must also be an integer.
However, we also know that fi can NEVER be equal to 1; fi is always smaller than 1, since it is the fractional part of a fractional number.
Combining the facts that the RHS must be smaller than 1 and must be an integer, we know that the RHS (and hence the LHS) is 0 or smaller.
Sidenote: why do we know that ∑fij xj is non-negative? Because xj >= 0 and each fij is a non-negative fractional part.

So, we have:

xi - bi + ∑aij xj <= 0

Alternatively,

xi + ∑aij xj <= bi

Now, remember that aij and bi here denote the rounded-down (floor) values.

This final result is the Gomory cut. It is an integer cut, since it consists only of the integer parts.
We could also use the RHS result (fi - ∑fij xj <= 0, i.e. ∑fij xj >= fi), which is called the fractional cut.

One last thing: it is common to take the Gomory cut and substitute out the slack variables, so that the cut is stated in terms of the original variables. We therefore go back to the original problem description and find the suitable substitution before adding the constraint to the problem.
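
A minimal plain-Python sketch of the cut generation from a single tableau row (made-up numbers): it splits each coefficient into its floor and fractional part and returns both the integer cut and the fractional cut.

import math

def gomory_cuts(abar, bbar):
    # Row of the optimal tableau:  xi + sum_j abar_j * xj = bbar  (xj non-basic).
    # Integer cut:     xi + sum_j floor(abar_j) xj <= floor(bbar)
    # Fractional cut:  sum_j frac(abar_j) xj >= frac(bbar)
    floors = {j: math.floor(a) for j, a in abar.items()}
    fracs = {j: a - math.floor(a) for j, a in abar.items()}
    return (floors, math.floor(bbar)), (fracs, bbar - math.floor(bbar))

# Hypothetical row: x1 + 0.25*x3 + 1.75*x4 = 2.5
integer_cut, fractional_cut = gomory_cuts({"x3": 0.25, "x4": 1.75}, 2.5)
print(integer_cut)      # ({'x3': 0, 'x4': 1}, 2)         ->  x1 + x4 <= 2
print(fractional_cut)   # ({'x3': 0.25, 'x4': 0.75}, 0.5) ->  0.25 x3 + 0.75 x4 >= 0.5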

54
Q

Define a cover and elaborate

A

For a binary knapsack problem with feasible region:

X = {x in {0,1} : ∑aj xj <= b}

the set S is a cover if ∑aj [j in S] > b.

The set S is a minimal cover if removing any single element k from S makes S \ {k} no longer a cover.

So, this means that if the set of all indices is not a cover, the constraint can never be violated.

More importantly, setting xj = 1 for all j in a cover S will violate the constraint.

55
Q

What is the theorem we can gain from the minimal cover foundation

A

If S is a minimal cover, then the constraint

∑xj [j in S] <= |S|-1

must be a valid inequality for X.

The reasoning is that since S is a cover, we cannot include all of its items in any feasible solution; at most |S| - 1 of them can be chosen. Stating this explicitly gives a constraint that reduces the feasible region of the LP relaxation.
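
A small plain-Python sketch of the cover logic, with a made-up knapsack constraint: it checks whether an index set is a cover and whether it is minimal, which is exactly what is needed before writing down the inequality ∑xj [j in S] <= |S| - 1.

def is_cover(S, a, b):
    # S is a cover if choosing all its items violates the knapsack constraint.
    return sum(a[j] for j in S) > b

def is_minimal_cover(S, a, b):
    # Minimal: removing any single element destroys the cover property.
    return is_cover(S, a, b) and all(not is_cover(S - {j}, a, b) for j in S)

# Hypothetical constraint: 5x0 + 6x1 + 4x2 + 3x3 <= 10.
a, b = [5, 6, 4, 3], 10
print(is_minimal_cover({0, 1}, a, b))      # True  -> valid inequality x0 + x1 <= 1
print(is_minimal_cover({0, 1, 2}, a, b))   # False -> {0, 1} is already a cover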

56
Q

How can we strengthen the valid inequalities?=

A

We use the theorem that says that if S is a cover of X, then the augmented (extended) cover constraint

∑xj [j in E(S)] <= |S|-1

is a valid inequality if E(S) = S Union {j : aj >= ai, for all i in S}

In other words, we can add a variable to the mix IF its coefficient is greater than or equal to all coefficients in the set already.

57
Q

What is the general foundation of branch and bound problems?

A

Split the feasible region into smaller regions and solve relaxed problems. By making use of information from each subproblem, we can find the optimal solution to the original problem.

58
Q

In branch and bound, what do we “want”?

A

We want to collect information in areas of the feasible region where it is likely that solutions with good objective function values can be found, while at the same time avoiding searching unpromising areas.

59
Q

Why is it called “branching”?

A

We split a problem into subproblems with smaller feasible regions

60
Q

Why is it called “bounding”?

A

In each subproblem, we find an optimistic bound on the objective function value by solving its relaxed problem. This bound represents the best value this part of the problem can possibly achieve (not necessarily a value it will achieve, since it is an optimistic, not necessarily attainable, bound).

61
Q

how do we branch?

A

adding a constraint (or several) in a systematic way.

62
Q

What determines how good our “cuts” in the search tree can be?

A

The cuts (prunings) are based on comparing the current best pessimistic value against the optimistic bound of a new subtree. If the pessimistic value is at least as good, we can cut.

Therefore, a significant amount of the performance depends on being able to find good pessimistic values as early as possible.
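
The pruning test itself is a one-liner; a minimal sketch for a maximization problem (made-up incumbent value):

def can_prune(node_optimistic_bound, best_pessimistic_value):
    # Maximization: a subtree can be cut if even its optimistic bound
    # cannot beat the best feasible solution found so far.
    return node_optimistic_bound <= best_pessimistic_value

incumbent = 42.0                     # best feasible (pessimistic) value so far
print(can_prune(40.5, incumbent))    # True: prune the subtree
print(can_prune(47.0, incumbent))    # False: the subtree must be explored further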

63
Q

what methods of node selection do we have?

A

Depth first
Breadth first
Best first

64
Q

elaborate on depth first

A

Always pick a subproblem to solve that has been branched on the most times (the deepest node).

The advantage of depth first is that it quickly finds a feasible solution (a pessimistic bound).

65
Q

elaborate on breadth first

A

Completes a level before starting on the next level.

Breadth first works well in cases where it is possible to find good solutions without adding too many constraints.

66
Q

elaborate on best first

A

Move to the subproblem with the most promising optimistic bound.

A disadvantage of this is that we need to start over with simplex, since we cannot simply continue from the previous node.

67
Q

elaborate on termination criterion

A

In many practical applications, it is not feasible to investigate all subproblems. Therefore, it can be necessary to specify a termination criterion, basically a given level of accuracy that we are happy with. We refer to this error as “epsilon”.

Then we terminate the search once (best optimistic bound - best pessimistic bound)/best pessimistic bound is smaller than epsilon.

68
Q

In practice, how do we solve the LP relaxations in Land-Doig-Dakin?

A

We use the dual simplex method to re-optimize, starting from the previous optimal basis.

69
Q

how do we treat the variable selection in terms of choosing which variable (not which subproblem) to branch on?

A

Depends on the strategy.

If we use depth first to find a solution quickly, we want to pick the variable with the largest fractional value, because this reduces the search space the most.

other methods include:
- choosing the one closest to integer
- Select the variable with the best obj func coefficient
- select from a pre-determined list
- indices

70
Q

are all IP problems equally solvable in regards to branch and bound?

A

No. The problem structure is very important, because if we know something specifically, we can leverage this to make modifications to the branch and bound and solve more efficiently.

71
Q

What are the usual suspects in regards to problems that we can leverage certain tactics using branch and bound?

A

Knapsack
TSP
Job scheduling

there might be more. Perhaps set partition, covering and packing? What about network problems

72
Q

What is the motivation for the specialized knapsack branch and bound method?

A

LP relaxation can be solved by inspection
AND
We can always round off the relaxed solution and get a pessimistic bound.

73
Q

For the knapsack method, what is the upper bound of a variable?

A

For the linear problem, we can set the variable to be the ratio of the b-value and its coefficient.

upper bound variable j = b / aj

For the integer problem, we can round it down, so we get:
upper bound variable j = floor(b / aj)

The reason why we can round down like that is that if there are no other constraints, the variable is flexible.

74
Q

Elaborate on the procedure of knapsack branch and bound

A

Find the sorted order of the ratios cj/aj. This represents the order in which we want to choose items.

Then, given the current set of bound constraints on the integer variables:
Initialize all variables to their lower bounds. In the first subproblem, these are all likely 0. Later (after branching), some of them will be other than 0.

Based on the lower bounds, compute how many resources are left. Then we go through the sorted order and fill up the remaining capacity. This solves the LP relaxation by inspection.

If all variables are integer restricted, the solution can be rounded down, for instance from (1.75, 0, 0, 0) to (1, 0, 0, 0), which gives a feasible solution.
If all coefficients in the objective function are also integer, we can round down the optimistic bound as well.

This gives us 2 important things:
1) Each step gives us a pessimistic bound.
2) LP relaxation solved by inspection
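
A plain-Python sketch of this "solve by inspection" step for a made-up bounded knapsack instance: it fills variables greedily in cj/aj order to get the LP relaxation (optimistic bound), then rounds down to get a feasible point (pessimistic bound).

import math

def knapsack_bounds(c, a, b, upper):
    # LP relaxation by inspection: fill variables in decreasing cj/aj order.
    order = sorted(range(len(c)), key=lambda j: c[j] / a[j], reverse=True)
    x_lp, remaining = [0.0] * len(c), b
    for j in order:
        x_lp[j] = min(upper[j], remaining / a[j])
        remaining -= a[j] * x_lp[j]
    optimistic = sum(c[j] * x_lp[j] for j in range(len(c)))
    # Rounding down gives a feasible solution, i.e. a pessimistic bound.
    x_round = [math.floor(x) for x in x_lp]
    pessimistic = sum(c[j] * x_round[j] for j in range(len(c)))
    return optimistic, x_lp, pessimistic, x_round

# Hypothetical instance: max 8x1 + 5x2 + 3x3  s.t.  4x1 + 3x2 + 2x3 <= 7, 0 <= xj <= 2.
print(knapsack_bounds(c=[8, 5, 3], a=[4, 3, 2], b=7, upper=[2, 2, 2]))
# (14.0, [1.75, 0.0, 0.0], 8, [1, 0, 0])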

75
Q

what happens if a matrix is total unimodular?

A

If the A matrix is totally unimodular (and the right-hand side b is integer), the constraints

Ax <= b, x >= 0

define the convex hull of the integer solutions.

Therefore, solving a single LP relaxation solves the entire problem.

76
Q

how do we know if a matrix is total unimodular?

A

Each square submatrix must have determinant equal to 0, -1 or +1.

We don't do this in practice, but there are some easier ways.

We have some properties that guarantee that the matrix is total unimodular:

1) Each element is 0, 1 or -1
2) No more than 2 non-zero elements in each column
3) Rows can be split into two subsets P1 and P2 where
a) If a column contains two entries with the same sign (+1, +1 or -1, -1), the two rows have to be in different subsets.
b) If a column contains +1 and -1, both rows have to be in the same subset.

So, each column with 2 non-zero elements gives a condition, and we must be able to create one partition of the rows that satisfies all the columns simultaneously.
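
A brute-force plain-Python check of the definition itself (every square submatrix has determinant 0, +1 or -1). It is exponential and only meant for tiny, made-up matrices; in practice one uses sufficient conditions like the one above.

from itertools import combinations

def det(M):
    # Determinant by cofactor expansion along the first row (fine for tiny matrices).
    if len(M) == 1:
        return M[0][0]
    return sum((-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j in range(len(M)))

def is_totally_unimodular(A):
    m, n = len(A), len(A[0])
    for k in range(1, min(m, n) + 1):
        for rows in combinations(range(m), k):
            for cols in combinations(range(n), k):
                if det([[A[i][j] for j in cols] for i in rows]) not in (-1, 0, 1):
                    return False
    return True

# Incidence matrix of a directed cycle (TU) versus a matrix with a 2x2 determinant of 2.
print(is_totally_unimodular([[1, -1, 0], [0, 1, -1], [-1, 0, 1]]))   # True
print(is_totally_unimodular([[1, 1], [-1, 1]]))                      # False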

77
Q

what is the relationship between A and A transposed in regards to total unimodular?

A

If one is total unimodular, the other is as well

78
Q

The total unimodularity conditions above: are they a requirement, a necessity, or what?

A

It is a sufficient condition: if the matrix satisfies it, the matrix is guaranteed to be totally unimodular.

There are other matrices that do not satisfy the condition but are still totally unimodular.

79
Q

What is a smart choice of the two row subsets, typically?

A

One subset empty, and the other one including all the rows. In such a case, we must of course have that whenever a column has two non-zero elements, they have opposite signs.

80
Q

How can we model a constraint saying that the logical OR of a list of binary variables should be indicated by the value of some other binary variable?

A

y1 v y2 v y3 …

We can model this by

y1 + y2 + … <= n w

where n is the number of y variables. This forces w to be 1 if at least one of the variables on the LHS is 1. (If we also want w to be 0 when all the y variables are 0, we can add w <= y1 + y2 + ….)
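
A tiny exhaustive check of this formulation in plain Python (n = 3, with the optional extra constraint w <= ∑y included so that w equals the OR exactly):

from itertools import product

n = 3

def feasible_ws(ys):
    # w values allowed by  sum(y) <= n*w  and  w <= sum(y).
    return [w for w in (0, 1) if sum(ys) <= n * w and w <= sum(ys)]

# For every 0/1 combination of the y's, the only feasible w is their logical OR.
print(all(feasible_ws(ys) == [int(any(ys))] for ys in product((0, 1), repeat=n)))   # True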

81
Q

What is an indicator variable?

A

A binary variable that indicates a certain state.

For instance, a yes/no decision.

It is very common to distinguish between states where we have zero action and “some” action, for instance production at various facilities. We can use an indicator variable to indicate whether production at a facility is being done or not.

82
Q

for indicator variables, what is important regarding M?

A

We have a constraint like this:

x <= My

this will force y to be selected as 1 if x is going to be larger than 0.

The important part is what we choose M to be.
It should be as small as possible, but must not restrict x artificially. An upper bound on x is a safe value for M.

83
Q

How can we model a constraint that does this:

y = 1 –> x >= m

A

x >= my, where m is a small value

If y is set to 1, then we see that x must be larger than or equal to the lower bound “m”.

84
Q

How can we model

x = 0 –> y = 0

A

It is not actually possible to represent this exactly as a constraint.

y <= x
x <= My

85
Q

how to model “if A then B” as a constraint?

A

Say we have continuous variables xA and xB.

First we need indicator variables as to whether they are used or not:

xA <= MyA
xB <= MyB

Then we can use the indicator variables to enforce the relationship:

yA <= yB
Alternatively:
yA - yB <= 0

86
Q

How can we use indicator variable to show whether a constraint holds or not?

A

We add M multiplied by the indicator variable to the LHS, while adding M to the RHS. For instance:

∑ajxj + My <= b + M

This makes sure that y can only be 1 if the constraint holds; if the constraint does not hold, y is forced to 0.

The phrasing is a little weird. “Showing whether a constraint holds or not” must be considered in relation to the objective function, for instance when there is a reward for y = 1 (or a penalty for y = 0): then y becomes 1 exactly when the constraint holds.

It is actually a very powerful tool. Consider cases where we are allowed to violate a constraint against paying a fine or similar; in such cases, this indicator variable is necessary.

87
Q

Consider the constraint with indicator variable:

∑ajxj + My <= b + M

What should M be here?

A

M should be chosen such that

∑ajxj - b <= M

for all feasible x. In other words, M must be an upper bound on this LHS expression (the largest possible violation), which depends very much on what values x can take.

88
Q

Elaborate on enforcing the condition of:

x = 0 –> y = 0

A

The case the book tries to explain is that if we want the bidirectional relation to work, where setting y = 1 also enforces an x larger than 0, we need to add another constraint. However, adding this constraint still isn't perfect, because we need to choose a level (epsilon) that we define as the lower threshold. So we end up enforcing not just that x is positive when y = 1, but that it is at least the epsilon value. We can set epsilon extremely small and achieve roughly the desired effect, but it is not as elegant as we might have hoped for.

The constraint

x >= ey, where e is the epsilon value, forces x to be at least e if we set y = 1. If y = 0, nothing extra is enforced.

We can then combine them:

x >= ey
x <= My

Alternatively:

x >= my
x <= My

In any case, if y = 0 then x is forced to be 0.
If y = 1, x is forced to be at least “m” (or epsilon).
If x is larger than 0, y is forced to 1, which in turn forces x to be at least epsilon.

To sum up, this condition is impossible to model exactly without an epsilon.
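
A tiny plain-Python illustration of how the pair of constraints e y <= x <= M y behaves (assumed values e = 0.01, M = 100):

eps, M = 0.01, 100.0

def allowed(x, y):
    # Both linking constraints:  eps*y <= x  and  x <= M*y.
    return eps * y <= x <= M * y

print(allowed(0.0, 0))     # True:  y = 0 forces x = 0
print(allowed(0.5, 0))     # False: x > 0 is impossible while y = 0
print(allowed(0.005, 1))   # False: y = 1 forces x >= eps (here 0.01)
print(allowed(7.0, 1))     # True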

89
Q

elaborate on enforcing this logical expression as a constraint:

∑aj xj <= b –> y = 1

A

Recall that the other way around:

y=1 –> ∑aj xj <= b, is enforced by:

∑ajxj <= b + M(1-y)

However, this does not say that if y = 0, then the constraint cannot hold. It just removes it from consideration. The key here is that if y = 0, no relationship is enforced.

from the relation we now want to enforce, it can be structured as:

not(y=1) –> not (∑aj xj <= b)
which of course equals:

y=0 –> ∑aj xj > b

This is where the previous constraint does not help: it only says that if y = 0, nothing is required.
In order to enforce that ∑aj xj > b whenever y = 0, we need to add an epsilon again (a strict inequality cannot be modelled directly).

We then say:

∑aj xj >= b + e - My

so that y = 0 forces ∑aj xj >= b + e (the original constraint is violated by at least e), while y = 1 relaxes the requirement through the big M.