IP bank Flashcards

1
Q

elaborate on how we can model a non-convex feasible region

A

The thing about non-convex regions is that we cannot formulate a feasible region of the form "either in this region, or in that one" using regular LP. With IP, however, we can.

We think of it as "either we force set A of constraints to hold, or set B". This also generalizes further.

The typical way to do this is to add M(1-indicatorVariable) to the constraints of one set (and M*indicatorVariable to the other). In the case where we have only two regions, this is fine:
the indicator variable selects one of them.

If we have many regions, exactly one of which must be chosen, we can do this:
for each region, introduce an indicator variable.
Then add a constraint saying that the sum of the indicator variables MUST be equal to 1. This forces exactly one set of constraints to apply.
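The either-or idea can be sketched in a few lines of Python; the big M and the two example constraint sets (x <= 4 versus x >= 10) are made up for illustration:

```python
M = 1000  # big-M: assumed large enough to deactivate a constraint entirely

def feasible(x, y):
    """Either-or region via an indicator y in {0,1}:
    y = 1 activates set A (x <= 4), y = 0 activates set B (x >= 10).
    Set A is relaxed by M*(1-y), set B by M*y."""
    return x <= 4 + M * (1 - y) and x >= 10 - M * y

# x = 3 fits set A, x = 12 fits set B, x = 7 fits neither
print(any(feasible(3, y) for y in (0, 1)))   # True
print(any(feasible(12, y) for y in (0, 1)))  # True
print(any(feasible(7, y) for y in (0, 1)))   # False
```

A solver would choose y itself; here both values are brute-forced to show that exactly the union of the two regions, and nothing in between, is feasible.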

2
Q

elaborate on how we can model a non-convex function using IP

A

Assume it is piecewise linear. If not, we assume that we can approximate it using a piecewise linear approach.

Then we need a set of weights (one per breakpoint) and a set of indicator variables (one per segment). The breakpoints play the role of the A-matrix coefficients.

The standard (lambda) formulation: with breakpoints a_k, write x = ∑λk a_k and f(x) ≈ ∑λk f(a_k), with ∑λk = 1 and λk >= 0, plus the requirement that at most two λk are non-zero and adjacent (an SOS2 condition), enforced with the indicator variables.
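As a sketch of the lambda idea (breakpoints are made up): a point between two adjacent breakpoints is written as a convex combination of them, which is exactly a weight vector with at most two adjacent non-zeros:

```python
# Breakpoints (a_k, f(a_k)) of a piecewise-linear approximation (made-up data)
points = [(0, 0), (2, 3), (5, 4), (8, 9)]

def pw_eval(x):
    """Express x as lam*a_k + (1-lam)*a_(k+1) over one segment, i.e. an
    SOS2-feasible weight vector, and return sum(lam_k * f(a_k))."""
    for (a0, f0), (a1, f1) in zip(points, points[1:]):
        if a0 <= x <= a1:
            lam = (a1 - x) / (a1 - a0)  # weight on the left breakpoint
            return lam * f0 + (1 - lam) * f1
    raise ValueError("x outside breakpoint range")

print(pw_eval(2.0))  # 3.0 (hits a breakpoint exactly)
print(pw_eval(3.5))  # 3.5 (halfway between (2,3) and (5,4))
```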

3
Q

Elaborate on knapsack problems

A

Extremely important: the knapsack problem looks different for min and max problems.

For max problems, all constraints must be <=.
For min problems, all constraints must be >=.
This allows us to exploit the knapsack property: the ratio between objective function coefficients and the corresponding technology coefficients can be used to make greedy decisions.

The problem is generally defined as selecting a set of objects, with the aim of maximizing value, subject to capacity constraints.
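A sketch of the greedy ratio rule applied to the LP relaxation of a max 0/1 knapsack (values, weights and capacity are made up):

```python
c = [60, 100, 120]  # values (made-up data)
a = [10, 20, 30]    # weights
b = 50              # capacity

def fractional_knapsack(c, a, b):
    """Fill greedily in decreasing value/weight order; at most one item
    ends up fractional, which is exactly the LP-relaxation optimum."""
    order = sorted(range(len(c)), key=lambda j: c[j] / a[j], reverse=True)
    x, cap = [0.0] * len(c), b
    for j in order:
        take = min(1.0, cap / a[j])
        x[j] = take
        cap -= take * a[j]
        if cap <= 0:
            break
    return x, sum(c[j] * x[j] for j in range(len(c)))

x, value = fractional_knapsack(c, a, b)
print(x, value)  # items 0 and 1 taken fully, 2/3 of item 2; value near 240
```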

4
Q

Elaborate on assignment problem

A

There are two broad classes:
1) Specific
2) General

The specific assignment problem deals with two sets of equal size.
The general assignment problem has a different structure, as its sets typically differ in size.

The regular (specific) assignment problem minimizes cost, though it could just as well maximize something instead.
There are 2 constraint types, which look identical to each other except that one sums over j and the other over i: one for each set.

The generalized assignment problem is better described as: "We have a set of machines that can do tasks, and we have many tasks. These tasks need to be assigned to the machines".

Fun fact: if we take the generalized assignment problem, relax the constraint that forces each "task" to be assigned a machine, and swap the objective from minimization to maximization, we actually get the so-called "multi-knapsack problem". IMPORTANT: we must still force each item to be placed in at most ONE knapsack, so that the solver doesn't put the best object in every pack. This is done by making the ∑xij = 1 constraint an inequality instead of removing it: ∑xij <= 1. Now a specific item is not required to be placed in a pack, but if it is, it can only be placed in one.
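For intuition, the specific assignment problem on a tiny instance can be brute-forced over permutations (the cost matrix is made up):

```python
from itertools import permutations

# c[i][j]: cost of assigning worker i to task j (made-up data)
c = [[4, 2, 8],
     [4, 3, 7],
     [3, 1, 6]]

def solve_assignment(c):
    """Specific assignment problem: both sets have size n, each worker
    gets exactly one task and each task one worker; minimize total cost."""
    n = len(c)
    best = min(permutations(range(n)),
               key=lambda p: sum(c[i][p[i]] for i in range(n)))
    return best, sum(c[i][best[i]] for i in range(n))

assignment, total = solve_assignment(c)
print(total)  # 12
```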

5
Q

elaborate on the matching problem

A

The matching problem is about making pairwise connections between elements of a single set. Each element MUST be connected to one, and only one, other element.

The matching problem is really about understanding how the objective function plays along with the constraints. Because of this relation, there is no need to enforce symmetry or anything similar: if we pick (i,j) as an arc set to 1, there is no point in also adding (j,i); one variable per unordered pair is enough.

min z = ∑∑cij xij
s.t.
∑xij = 1, for each node (summing over the arcs incident to that node)
xij in {0,1}
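A brute-force sketch of minimum-cost perfect matching on four elements (the costs are made up), showing that picking the pair (i, j) automatically "uses up" both elements:

```python
# Pairing costs for elements 0..3, keyed by unordered pair (made-up data)
cost = {(0, 1): 3, (0, 2): 1, (0, 3): 4,
        (1, 2): 5, (1, 3): 2, (2, 3): 6}

def min_perfect_matching(n, cost):
    """Recursively pair the smallest unmatched element with every
    candidate partner; each element ends up in exactly one pair."""
    def rec(elems):
        if not elems:
            return 0, []
        i = elems[0]
        best = (float("inf"), None)
        for j in elems[1:]:
            rest = [e for e in elems if e not in (i, j)]
            sub, pairs = rec(rest)
            if cost[(i, j)] + sub < best[0]:
                best = (cost[(i, j)] + sub, [(i, j)] + pairs)
        return best
    return rec(list(range(n)))

print(min_perfect_matching(4, cost))  # (3, [(0, 2), (1, 3)])
```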

6
Q

elaborate on TSP

A

We are looking for a Hamiltonian cycle.
We have n nodes, and we must visit all of them, each exactly once.

There are two versions of this problem:
1) Symmetric
2) Asymmetric

The difference is whether arcs in opposite directions have the same cost or not.

There are 3 constraint types:
1) Enforce that exactly one arc leaves each node
2) Enforce that exactly one arc enters each node
3) Enforce that no sub-tours are allowed

The idea of the sub-tour elimination constraints is that within a set S of nodes, there can be at most |S|-1 chosen arcs. If there are more arcs than that inside S, then S contains a sub-tour.
IMPORTANT: these constraints apply only to proper subsets of N. The entire node set itself is not included here.

In real life, this gives an enormous number of constraints for this problem (one per subset).
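A small sketch of why sub-tour elimination is needed: a solution satisfying only the degree constraints can decompose into several cycles (the successor list is made up):

```python
def tour_from(succ):
    """Follow successor pointers from node 0 until we return to 0.
    If the resulting cycle is shorter than n, the solution has sub-tours."""
    tour = [0]
    while succ[tour[-1]] != 0:
        tour.append(succ[tour[-1]])
    return tour

# 5 nodes; every node has exactly one leaving and one entering arc,
# yet the solution splits into the sub-tours 0->1->0 and 2->3->4->2
succ = [1, 0, 3, 4, 2]
tour = tour_from(succ)
print(tour, len(tour) < len(succ))  # [0, 1] True -> a sub-tour cut is needed
```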

7
Q

elaborate on the symmetric TSP

A

since no direction is needed now, we drop an index:
xk represents edge k.
Each node must have exactly two incident edges chosen:
∑xk [k incident to node i] = 2, for all nodes i

8
Q

Elaborate on VRP

A

Vehicle Routing Problem

We care about the classical VRP.
We want to determine a set of routes for a set of vehicles so that given demands are satisfied.

There is a number of customers, and they have demands.
We have a depot, which is where we have to return each time we "use up" a vehicle's capacity.
There is a cost for using an arc.

If we have only one vehicle, and the vehicle has essentially unlimited capacity, the VRP reduces to the TSP.

There are many versions of VRP, but the classical one assumes that all vehicles have the same capacity.

We assume K vehicles.
We denote the demand of customer "i" as di.
cij represents the cost of travelling from customer i to j.
We use i=0 to indicate the depot.

We have variables:
yik = 1 if customer i is visited by vehicle k.
xijk = 1 if vehicle k goes directly from i to j.

The problem is a nightmare to set up due to its many constraints.
For simplicity, it is often assumed that we have already generated a set of candidate routes. Doing this removes capacity from consideration (each candidate route is already capacity-feasible).
So, we just want to select a number of the pre-specified routes so that we cover each customer exactly once.
This problem now becomes a set partitioning problem:
min z = ∑cjxj
s.t.
∑aij xj [over j] = 1, for each i in I
∑xj = K

So: set partitioning, plus the number of selected routes must equal the number of vehicles we have.
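A brute-force sketch of the resulting set partitioning problem (the candidate routes, their costs, and K are made up): pick exactly K routes that together cover every customer exactly once:

```python
from itertools import combinations

# Candidate routes (assumed capacity-feasible): covered customers + cost
routes = [({1, 2}, 10), ({3}, 4), ({1}, 6), ({2, 3}, 7)]
customers = {1, 2, 3}
K = 2  # number of vehicles: exactly K routes must be selected

def select_routes(routes, customers, K):
    """Cheapest choice of K routes that partitions the customer set."""
    best = None
    for combo in combinations(range(len(routes)), K):
        covered = [i for r in combo for i in routes[r][0]]
        if len(covered) == len(customers) and set(covered) == customers:
            total = sum(routes[r][1] for r in combo)
            if best is None or total < best[1]:
                best = (combo, total)
    return best

print(select_routes(routes, customers, K))  # ((2, 3), 13)
```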

9
Q

why are there so many different ways to solve IP problems?

A

They all have different characteristics. The point is that the structure of an IP problem matters a great deal for how easy or hard it is to solve. Some problems simply do not converge fast enough with the most general methods, like branch and bound. Therefore, there is a real need for methods tailored to various scenarios.

10
Q

what are the most important strategies for solving IP problems?

A

1) Enumeration
2) Relaxation and decomposition
3) Cutting plane methods
4) Heuristics

11
Q

When are we considering heuristics?

A

If the problem is very difficult to solve, and we still need a solution. With heuristics, there is no guarantee that the solution is optimal. This means such methods are mainly viable when we can argue that the error is small enough.

12
Q

Is the number of variables important in IP?

A

Yes, but the problem structure is as well. One typically says that the number of integer variables and the problem structure together determine how difficult a problem is to solve.

In comparison, the difficulty of LP problems is typically driven by the number of constraints.

13
Q

elaborate on constraints in IP

A

Constraints don't always make the solution time longer. This is because of so-called valid inequalities, which work very well with the relaxed problem and reduce running time.

14
Q

Elaborate on optimality conditions

A

In LP, we have KKT conditions that basically guarantee optimality.
No such conditions exist for IP.
The best way to discuss optimality for IP is to define a certain error we are happy with, and continuously compute pessimistic and optimistic bounds that grow towards each other (converge). The gap between them is the error.

15
Q

Why do we care about models in IP? Why is it not enough that they describe the correct constraints? given the same feasible region, each solver will find the same solution?

A

The reason is that different model formulations affect the LP relaxation. We use the LP relaxation as a tool for computing optimistic bounds. To get more realistic optimistic bounds, we benefit from a strong formulation of the model.

The question is then: what is a strong model?

16
Q

What is a strong or a weak model?

A

We say that a model's strength depends on the gap between the feasible region of the LP relaxation and the convex hull of the integer feasible points.

17
Q

Elaborate on the convex hull

A

The convex hull is the entire region/area/set of points defined as all convex combinations of a set of points.

In IP, we consider the set of feasible integer points and take all their convex combinations. The resulting region is the convex hull.

If we had a problem whose convex hull was available as linear constraints, we could run simplex (regular LP) and be guaranteed an integer solution. Since we would simultaneously get a feasible pessimistic solution and an optimistic bound that are equal, we would know that we are done.

18
Q

Define a valid inequality

A

A valid inequality is an inequality that is satisfied for all x in X. In other words, the inequality doesn't cut away any of the feasible points (the feasible points being, of course, the integer solutions).

19
Q

What is the point of valid inequalities?

A

To get a better approximation of the convex hull. This in turn makes our solvers run faster.

20
Q

when we consider a valid inequality, what is the goal of it?

A

It should be as strong as possible.

To relate strength to valid inequalities, we use the mathematical concept of a "face". The goal is to generate valid inequalities that define a face of the convex hull with the largest possible dimension. A face is a surface of the convex hull; the more of the surface we can cover with one valid inequality, the better our approximation will be.

21
Q

Define a face

A

F = {x | x in X_c, ∑ajxj = b}

In other words, a face is the set of points that belong to the convex hull AND lie on the hyperplane of the constraint.

22
Q

What are the strongest valid inequalities?

A

Facets. These have dimension n-1 when the convex hull has dimension n.

23
Q

aim of the cutting plane methods

A

To approximate the convex hull better and better. If we systematically add valid inequalities, we know that once we find an integer feasible solution, it is optimal (with respect to the LP relaxation).

24
Q

In an iterative cutting plane method approach, what is “important”?

A

We want the cutting plane to cut away the current LP optimal solution. If it does not, the solution will not change in the next iteration.

25
Q

There are typically two ways to add cuts to a solution. Name them

A

Use a general-purpose cutting plane method, like Gomory's.

Or we can use problem-specific cuts, e.g. generated by solving the separation problem.

26
Q

elaborate on Gomorys

A

Gomory’s method follows the general outline of a cutting plane method.

Start by solving the LP relaxation. If integer, we’re good.
If not, we must add a valid inequality that removes the current LP optimal solution.

Gomory uses only the information provided in the optimal simplex tableau.

Consider the current basic solution. Among the basic variables with fractional values, we typically pick the one with the largest fractional part.
We take this row as it is specified in the final tableau:

xi + ∑aijxj = bi

We separate the integer part and the fractional part. This is equivalent to taking a floor operation and keeping the remainder:

xi + ∑(int(aij) + frac(aij))xj = int(bi) + frac(bi)

Re-arrange:

xi - int(bi) + ∑int(aij)xj = frac(bi) - ∑frac(aij)xj

The LHS consists purely of integers, so the LHS is integer. Therefore, the RHS is integer.
frac(bi) is strictly smaller than 1, and frac(aij) and the xj are never negative, so the RHS is strictly smaller than 1.
An integer strictly smaller than 1 is at most 0. Therefore, the LHS is smaller than or equal to 0.

This gives us:

xi - int(bi) + ∑int(aij)xj <= 0

Re-arrange:

xi + ∑int(aij)xj <= int(bi)

This is a valid inequality. It removes the fractional part of a solution (a single basic variable) using only logic about what values it can take.

IMPORTANT: recall that the tableau row will generally involve slack variables. We don't want those in the cut. Therefore, we must perform a suitable substitution using the ORIGINAL problem formulation.
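The floor arithmetic of the cut itself is tiny; a sketch on a made-up tableau row:

```python
import math

def gomory_cut(row_coeffs, rhs):
    """From the tableau row x_i + sum(a_ij x_j) = b_i, return the cut
    x_i + sum(floor(a_ij) x_j) <= floor(b_i)."""
    return [math.floor(a) for a in row_coeffs], math.floor(rhs)

# Row (made up): x1 + 2.6*x3 - 1.4*x4 = 3.7
coeffs, cut_rhs = gomory_cut([2.6, -1.4], 3.7)
print(coeffs, cut_rhs)  # [2, -2] 3  -> cut: x1 + 2*x3 - 2*x4 <= 3
```

Note that `math.floor(-1.4)` is -2, not -1; the floor (not truncation) is what makes the validity argument go through.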

27
Q

Recall the aim with problem specific cuts

A

Use the separation problem to generate cover inequalities in a cutting plane method

28
Q

elaborate on covers in regard to the 0/1 knapsack problem

A

Recall that the feasible region of the 0/1 knapsack can be defined as:

X = {x in {0,1}^n | ∑ajxj <= b}

In other words, we have a set of variables which, when multiplied by their technology coefficients, cannot sum to more than b.

We say that a set of variables S is a cover if ∑aj [j in S] > b.
This just means that S is a cover if selecting all of its variables would exceed b.
We can extend this to the definition of a "minimal cover", which has the additional condition that removing any one of the j variables from the set makes it no longer a cover.

In other words, a minimal cover represents a set of variables that can never all be 1 simultaneously in a feasible solution.
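The two definitions translate directly to code (coefficients and capacity are made up):

```python
a = [4, 3, 5, 2]  # technology coefficients (made-up data)
b = 8             # capacity

def is_cover(S, a, b):
    """S is a cover if its coefficients sum past the capacity."""
    return sum(a[j] for j in S) > b

def is_minimal_cover(S, a, b):
    """Minimal: removing any single variable destroys the cover property."""
    return is_cover(S, a, b) and all(not is_cover(S - {j}, a, b) for j in S)

print(is_minimal_cover({0, 1, 3}, a, b))  # True: 9 > 8, every proper subset fits
print(is_minimal_cover({0, 1, 2}, a, b))  # False: dropping index 1 still covers
```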

29
Q

Why do we care about the idea of covers and minimal cover?

A

If S is a minimal cover, then the constraint

∑xj [j in S] <= |S|-1

is a valid inequality for X.

The idea is actually very simple: if we can find a set of variables that forms a minimal cover, meaning they can never all be selected at the same time, we can add a cover constraint specifying exactly that.

30
Q

Generalize the idea of a cover constraint

A

We need to find a set of variables that can never appear together because of some constraint. When we add the constraint specifying this relationship, we have added a cover constraint.

31
Q

can we make a cover constraint stronger?

A

Yes, via the idea of an "augmented cover constraint".

The augmented cover constraint is exactly the same as the regular cover constraint, but we now ask whether we can add variables to the LHS while keeping the constraint valid.

To do this, we use the relationship between technology coefficient sizes and the RHS value.

Define E(S) = S union {j : aj >= ai, for all i in S}.

Explanation: E(S) holds the variables from the original cover, plus all variables whose technology coefficient is larger than or equal to EVERY technology coefficient in the original cover set.

The augmented cover constraint is given as:

∑xj [j in E(S)] <= |S|-1

Note that this idea is not 0/1-knapsack specific. Adding variables increases the dimension of the face defined by the constraint, which makes it stronger.
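A sketch of computing E(S) for a made-up instance:

```python
a = [6, 5, 5, 4, 1]  # technology coefficients (made-up data)
b = 12

def extend_cover(S, a):
    """E(S): the cover plus every variable whose coefficient is at least
    as large as every coefficient inside the cover."""
    amax = max(a[i] for i in S)
    return set(S) | {j for j in range(len(a)) if a[j] >= amax}

S = {1, 2, 3}  # 5 + 5 + 4 = 14 > 12, so S is a cover
print(sorted(extend_cover(S, a)))  # [0, 1, 2, 3]
# Augmented cut: x0 + x1 + x2 + x3 <= |S| - 1 = 2
```

Variable 0 (coefficient 6) joins because it could replace any member of the cover; variable 4 (coefficient 1) does not.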

32
Q

elaborate on the mathematical way of asking whether there exists a subset S of the variables such that there exists a cover inequality for the problem

A

∑aj [j in S] > b

AND

∑(1-xj^(LP)) [j in S] < 1

The first condition basically just says that S must be a cover.
The second condition ensures that the current LP solution violates the cover inequality.

The absolutely CRUCIAL insight is that cover inequalities are always valid inequalities. By finding a cover, we automatically know the inequality is valid. By ensuring that it is violated by the current LP solution, we know it cuts in the right direction.

33
Q

elaborate on the separation problem

A

The separation problem generates a cover inequality.

alpha = min ∑(1 - xj^(LP))zj [j in N]
s.t.
∑ajzj [j in N] > b

zj in {0,1}

The z-variables are indicator variables for whether variable j is included in the cover or not.

The separation problem returns the cover that is most violated by the current LP solution, if any cover exists.

If alpha is smaller than 1, we have an inequality of the form:
∑xj [j : zj* = 1] <= ∑zj* [j] - 1

So basically: take all the z-variables with value 1, and the sum of the corresponding x-variables must be smaller than or equal to the number of those z-variables minus 1.
This is exactly the same as the earlier cover inequality result.

If alpha is larger than or equal to 1, the cover will not cut away the current LP optimal solution.
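On a tiny instance the separation problem can be brute-forced instead of solved as an IP (coefficients, capacity and the fractional LP point are all made up):

```python
from itertools import combinations

def separate(a, b, x_lp):
    """Over all covers S (sum of a_j > b), minimize
    alpha = sum over S of (1 - x_j_LP); alpha < 1 means the
    cover inequality cuts off the current LP point."""
    n, best = len(a), (float("inf"), None)
    for r in range(1, n + 1):
        for S in combinations(range(n), r):
            if sum(a[j] for j in S) > b:
                alpha = sum(1 - x_lp[j] for j in S)
                if alpha < best[0]:
                    best = (alpha, S)
    return best

a, b = [5, 4, 3], 7
x_lp = [1.0, 0.75, 0.0]  # fractional LP optimum (made up)
alpha, S = separate(a, b, x_lp)
print(alpha, S)  # 0.25 (0, 1) -> cut x0 + x1 <= 1, violated by the LP point
```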

34
Q

Some things worth noting regarding the separation problem

A

It is itself a hard (knapsack-type) problem.

In practice, we use heuristics to solve it fast.

If the result is a valid inequality with alpha less than 1, we know it cuts away the fractional solution by the amount 1 - alpha. Therefore, we have a measure of the strength of the inequality.

35
Q

general idea of branch and bound

A

Split the feasible region into smaller and smaller regions and solve a relaxed problem in each subproblem.

We collect information from each subproblem, and explore in a way that steers us towards regions where a good solution is more likely to be.

We progress the search via optimistic bounds that become more and more realistic. Along the way, we hope to find better and better feasible solutions, because we need them to prune.

Branching is done by adding extra constraints, typically by setting a binary variable to either 1 or 0.

36
Q

elaborate on the general steps of the general strategy for branch and bound

A

Relax, branch, prune and select.

We relax a problem, typically via the LP relaxation. This gives us an optimistic bound. The quality of the bound is determined by method specifics and problem structure.

Now we want to progress. We do this by branching on a variable. The general idea is that we have not yet found the optimal solution, so we must look into smaller regions and continue until we reach an IP solution. The crucial part is that we must not cut away any IP feasible solutions in the process.
Also crucial: we need to make sure that we remove the LP relaxation's optimal solution from the new subproblems, otherwise they will just find the same solution again.

Prune based on the current best pessimistic solution in combination with the optimistic bound of each subproblem.

We also need a selection policy for which subproblems to look into first. The usual suspects are:
1) Depth first
2) Breadth first
3) Best first

Depth first has the advantage of finding IP solutions relatively quickly, which can be used to prune.
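The relax/branch/prune loop in miniature, on a made-up 0/1 max-knapsack, using the greedy LP relaxation as the optimistic bound:

```python
# Minimal depth-first branch and bound for a 0/1 max-knapsack (made-up data)
c, a, b = [10, 6, 4], [5, 4, 3], 9

def bound(fixed):
    """Optimistic bound: value of the fixed variables plus the greedy
    LP relaxation over the still-free variables."""
    k = len(fixed)
    cap = b - sum(a[j] for j in range(k) if fixed[j])
    if cap < 0:
        return float("-inf")  # infeasible subproblem
    val = sum(c[j] for j in range(k) if fixed[j])
    for j in sorted(range(k, len(c)), key=lambda j: c[j] / a[j], reverse=True):
        take = min(1.0, cap / a[j])
        val += take * c[j]
        cap -= take * a[j]
    return val

best = [float("-inf")]  # incumbent (pessimistic bound)

def bnb(fixed):
    if bound(fixed) <= best[0]:   # prune: bound cannot beat incumbent
        return
    if len(fixed) == len(c):      # leaf: all variables fixed, bound is exact
        best[0] = bound(fixed)
        return
    bnb(fixed + [1])              # branch x_k = 1, then x_k = 0
    bnb(fixed + [0])

bnb([])
print(best[0])  # 16: items 0 and 1 (weight 9, value 16)
```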

37
Q

What is also typically included in a branch and bound strategy, especially in real life?

A

Some kind of termination criterion.

If we have many variables, it may take a lot of time to branch on each and grow the search tree all the way down. It might not even be practical, taking weeks or longer to finish.

Therefore, we typically define a tolerated error in the objective function value, and simply terminate the search once it is reached.

The termination criterion is typically based on the current optimistic and pessimistic bounds: the error is the difference between them, divided by the pessimistic bound to get a ratio.
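The criterion as a one-liner (the numbers are made up; this assumes a maximization problem with a positive incumbent):

```python
def relative_gap(pessimistic, optimistic):
    """Relative optimality gap: how far the best bound may still be
    above the best feasible (incumbent) solution."""
    return (optimistic - pessimistic) / pessimistic

# incumbent 95, best LP bound 100: terminate if the tolerance is >= ~5.3%
print(round(relative_gap(95, 100), 4))  # 0.0526
```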

38
Q

some comment worth mentioning regarding Land-Doig-Dakin?

A

We typically use dual simplex to re-optimize with the added constraints.

39
Q

What problems have specific methods in regards to branch and bound?

A

Knapsack (integer, but not 0/1)

TSP

Job scheduling

40
Q

elaborate on Branch and bound for knapsack problems

A

We can solve the LP relaxation very easily by making use of the relative contribution of each variable, comparing the ratios c/a. The variable with the best ratio has the most bang for the buck, and we want to exhaust it first.

Given a subproblem, we take the following steps:
1) Initialize all variables to their lower bounds (for most, this is probably 0). Also set k=1.

2) Compute the "remaining resource" RHS as ∆ = b - ∑ajxj, where xj are the lower bounds we set.
We assume a knapsack with only one constraint, so this ∆ represents how much capacity we have left.

3) Check if we need to stop. If ∆ < 0, there are no feasible solutions.
4) Check if we have an optimal solution. This is the case if ∆ = 0.

5) If ∆ > 0, we have more capacity, and we want to fill the knapsack more.
To do this, set xk = lowerBound_k + min{∆/a_k, u_k - lowerBound_k}.
Also update ∆ := ∆ - a_k(xk - lowerBound_k).
Then set k := k+1 and go back to step 3.

Explanation:
The index k represents the order of decreasing ratio c/a. We start with the best, and finish with the worst.
If we have space left in the capacity, we compare two quantities:
1) ∆/a_k
2) u_k - l_k

∆/a_k represents the amount of x_k we would pick if we could fill the sack with it alone.
u_k - l_k represents how much more of the item we have available. We cannot take more than that.

Upper bounds can be derived as u_j = floor(b / a_j).

The update of ∆ simply reduces it by the capacity we used, accounting for the coefficient.

With this approach we can always round fractional solutions down, giving a candidate pessimistic bound at each subproblem. This refers to rounding down the basic solution: for instance, (1.27, 0, 0, 0) can be rounded down to (1, 0, 0, 0).
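The steps above as a sketch (data, bounds and capacity are made up; sorting by ratio stands in for the pre-sorted k-order):

```python
c = [8, 6, 4]      # objective coefficients (made-up data)
a = [4, 4, 3]      # technology coefficients
lower = [0, 0, 0]  # variable bounds from the current subproblem
upper = [2, 1, 3]
b = 10             # capacity

def knapsack_lp(c, a, lower, upper, b):
    """Start all variables at their lower bounds, then raise them in
    decreasing c/a order until the remaining capacity Delta hits 0."""
    x = list(lower)
    delta = b - sum(a[j] * x[j] for j in range(len(a)))
    if delta < 0:
        return None  # infeasible subproblem
    for k in sorted(range(len(c)), key=lambda j: c[j] / a[j], reverse=True):
        if delta == 0:
            break
        step = min(delta / a[k], upper[k] - lower[k])
        x[k] += step
        delta -= a[k] * step
    return x, sum(c[j] * x[j] for j in range(len(c)))

print(knapsack_lp(c, a, lower, upper, b))  # ([2, 0.5, 0], 19.0)
```

Rounding the fractional entry down, (2, 0, 0), immediately gives a feasible pessimistic bound for this subproblem.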

41
Q

elaborate on using branch and bound on TSP

A

It is common to relax the sub-tour constraints. There is a good reason for this.

The relaxed problem becomes the assignment problem (with both sets being N). The good thing about the assignment problem is that its A-matrix is totally unimodular. Therefore, the LP relaxation always produces an integer solution.

If a sub-tour appears in the solution, we can branch to break it.
The breaking process amounts to forcing one of the sub-tour's arc variables to be 0.

42
Q

elaborate on the motivation behind branch and cut

A

Regular branch and bound, but we also add valid inequalities.

43
Q

elaborate on branch and cut

A

The exact same as branch and bound, but we add valid inequalities at nodes in the search tree.

We solve a subproblem, and then add a valid inequality that cuts away its solution.

Important about branch and cut: once we add a valid inequality at some subproblem, all remaining subproblems along that branch are strengthened as well.

44
Q

elaborate on constraint branching

A

Regular branching can actually be considered constraint branching; it just happens to involve only a single variable. The general idea of constraint branching is to include more than one variable in the branching constraint. The motivation is that some problems have structures where simply setting a binary variable to 0 or 1 does very little "damage" to the feasible region. In fact, there are problems where setting a specific variable to 1 automatically rules out a set of other variables (they must be 0). By exploiting such relationships (when they are apparent), we can define the new subproblem by ∑xj = 1, where the sum is over the variables that are mutually exclusive.

45
Q

recall set partitioning constraints.

Elaborate on how this structure can be utilized in constraint branching

A

For each item/object, we define a constraint. The constraint loops over all sets, and makes sure that the item is covered by exactly one chosen set.

So, if j indexes sets and i indexes items, we get:

∑aij xj [j in J] = 1, for i in I

This means that we KNOW that within each constraint, exactly one of the participating variables must be 1.

Because of this, we can select a pair of constraints p and q, and for each such pair define the set of variables that have non-zero technology coefficients in both constraints:

J_pq = {j | a_pj = 1 AND a_qj = 1}

So, for two constraints, we define the set of variables with non-zero coefficients in both. Since each constraint corresponds to a specific item, this is basically saying: "pick a pair of items, and find the set of variables (sets) that include both of them."

Now assume we have picked two items and found the set of variables that include both: we know that at most one of the variables in this set can legally be picked. Therefore, we can formulate two new constraints, one per subproblem, that define the branching:
1) ∑xj [j in J_pq] = 1 (the two items are covered together)
2) ∑xj [j in J_pq] = 0 (they are covered separately)
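Computing J_pq is a one-liner; the coverage matrix below is made up (this pair-branching scheme on set partitioning rows is often called Ryan-Foster branching):

```python
# a[i][j] = 1 if column (set) j covers item i (made-up data)
a = [[1, 0, 1, 0],
     [1, 1, 1, 0],
     [0, 1, 1, 1]]

def J_pq(a, p, q):
    """Variables with non-zero coefficients in both rows p and q."""
    return [j for j in range(len(a[0])) if a[p][j] == 1 and a[q][j] == 1]

print(J_pq(a, 0, 1))  # [0, 2] -> branch on x0 + x2 = 1 versus x0 + x2 = 0
```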

46
Q

Define SOS1

A

Set of variables (continuous or integer) where at most one variable is not zero

47
Q

Define SOS2

A

Set of variables where at most 2 variables are non-zero. These 2 variables must be consecutive in the given order.

48
Q

use case for SOS

A

piecewise linear function representation of a non-linear function

49
Q

Elaborate on motivation for special ordered sets

A

Some variables are better used (more efficiently handled) when we consider them as a single entity rather than as a collection of individual variables.

This is because it then becomes possible to branch on the entity rather than on individual variables.

50
Q

Consider SOS1. How do we treat it?

A

We can consider it as a set of many variables. We can split this set into two subsets. We know that either the first subset or the second must be all zero:

{x1, x2, …, xr} are all zero, OR
{x_(r+1), …, xn} are all zero

We can consider these as two branches.

51
Q

What happens if the set of special ordered variables have more than 1 non-zero element?

A

It violates SOS1. However, we still have SOS2 if at most 2 elements are non-zero (and they are adjacent).

52
Q

What is the purpose of a reference row?

A

The reference row holds the order of the variables in the special ordered set.
The reference row has values that are monotonically increasing.

We can then use the reference row to find a kind of weighted mid-point:

r-bar = ∑ajxj / ∑xj

summing over all the variables in the special ordered set.

We use r-bar as the branching marker where we split the set into subsets.
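A sketch of the marker computation (the reference row and LP values are made up):

```python
ref = [1, 2, 3, 4, 5]          # monotonically increasing reference row
x = [0.0, 0.5, 0.0, 0.5, 0.0]  # current fractional LP values (made up)

def branching_marker(ref, x):
    """Weighted average r_bar = sum(a_j x_j) / sum(x_j); the set is
    split into the variables on each side of r_bar."""
    return sum(a * v for a, v in zip(ref, x)) / sum(x)

print(branching_marker(ref, x))  # 3.0 -> split the set around this value
```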

53
Q

How do we find branching marker for S2?

A

We have to remove one element:
{x1, x2, …, x_(r-1)} are all zero
OR
{x_(r+1), …, xn} are all zero

We find the branching marker with the same method as before. Then, as above, the variable closest to the marker is excluded from both zero-sets, since it may be non-zero in either branch.

54
Q

how can we represent the logical structure of

x > 0 –> y=1

A

We add a simple indicator variable:

x <= My

If x is larger than 0, this forces y=1.

55
Q

When is it not enough to enforce x>0 —> y=1?

A

if we need to enforce the other direction as well:

y=0 –> x<=0

Given x >= 0, this is the same as saying:

y=0 –> x=0

In many cases, there is a cost associated with the indicator variable y, which means we never have to worry about this relation: the solver avoids y=1 unless forced. However, if there is no cost associated with y, the model needs to be told explicitly how to enforce the relation.

The problem is that we never want y=1 while x=0.

This is impossible to model perfectly with continuous x, so we use an approximation.

We say: "if x is below a certain threshold, we force y to be 0":

x >= my

Now, if x >= m, nothing extra is imposed by this constraint (and x <= My still forces y=1).
If x = 0, then due to the small m, y is forced to be 0 (y=1 would require x >= m).
There is a wiggle room equal to the size of m: combined with x <= My, values of x strictly between 0 and m become infeasible, so x must be either 0 or at least m. Since m is just a small threshold, this is usually fine.

The combination of

x <= My
x >= my

enforce the relations we want
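A quick check of the pair of constraints (M, m and the test values are made up):

```python
M, m = 1000, 0.01  # assumed upper bound on x and activity threshold

def link_ok(x, y):
    """x <= M*y forces y = 1 whenever x > 0;
    x >= m*y forces y = 0 whenever x = 0."""
    return x <= M * y and x >= m * y

print(link_ok(5.0, 1))  # True: active x with y = 1
print(link_ok(5.0, 0))  # False: positive x demands y = 1
print(link_ok(0.0, 1))  # False: y = 1 demands x >= m
print(link_ok(0.0, 0))  # True: both off
```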

56
Q

Say we have 2 variables, xa and xb that are proportions of ingredeitns in a blend. How can we model the relationship that if we have xa, we must alos have xb?

A

Since these are proportions, we cannot just use something like xa <= xb. That would work if xa and xb happened to be binary; however, they are continuous with an upper bound of 1.

We need to add an indicator variable y for whether xa is included or not:

xa >= my
xa <= My
Now, if xa is above 0, y is forced to 1 (by xa <= My).
If xa is 0, y is forced to 0 (by xa >= my, with the m-threshold wiggle room from before).

Now we can add the real relationship we're after:

xb >= my

57
Q

how do we represent the constraint:

xy = 0, where both x and y are binary

A

This is the same as saying:

x = 0
OR
y = 0

We can model this with:

x + y <= 1

58
Q

if the term xy (both binary) appears somewhere, what is the general procedure to make it linear?

A

We define a new binary variable z that replaces the product.

We need the logic saying that:

z = 1 <==> x=1 AND y=1

We can enforce this with:
z <= x
z <= y
z >= x+y-1 (alternatively written x+y-1 <= z, or x+y-z <= 1)
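Since everything is binary, the claim can be verified exhaustively:

```python
from itertools import product

def z_ok(x, y, z):
    """The three linear constraints replacing z = x*y."""
    return z <= x and z <= y and z >= x + y - 1

# For every (x, y), the only feasible z is exactly x*y
print(all(z_ok(x, y, x * y) and not z_ok(x, y, 1 - x * y)
          for x, y in product((0, 1), repeat=2)))  # True
```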

59
Q

say we have : yx, where y is binary and x is continuous. How do we linearize this?

A

Define a new continuous variable z (with z >= 0, so that y=0 really forces z to 0):
z <= My (if y is 0, then z is 0)
z <= x
x - z <= M(1-y) (this enforces that if y=1, then z = x)
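A quick feasibility check of the linearization (M and the sample values are made up):

```python
M = 100  # assumed upper bound on x

def lin_ok(x, y, z):
    """Linear constraints replacing z = y*x (y binary, 0 <= x <= M):
    z <= M*y, z <= x, x - z <= M*(1 - y), z >= 0."""
    return z <= M * y and z <= x and x - z <= M * (1 - y) and z >= 0

print(lin_ok(7.5, 1, 7.5), lin_ok(7.5, 1, 0.0))  # True False (y=1 forces z=x)
print(lin_ok(7.5, 0, 0.0), lin_ok(7.5, 0, 7.5))  # True False (y=0 forces z=0)
```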

60
Q

Define SOS1

A

A set of variables (integer or continuous) within which at most one variable can be non-zero.

61
Q

Define SOS2

A

A set of variables (integer or continuous) within which AT MOST 2 variables can be non-zero, AND these two variables must be adjacent.

62
Q

What is the reason for using special ordered sets?

A

Treating the set of variables as an entity, rather than treating each individual variable. The aim is to branch on entities.

63
Q

elaborate on the details surrounding SOS1

A

We have a set of variables:

{x1, x2, …, xn}
At most ONE can be non-zero.

We know that in a feasible solution, if one of these variables is non-zero, then it must lie in either:
{x1, x2, …, x_r}
OR
{x_(r+1), …, x_n}

The benefit is that we can branch on the condition/constraint that one of the two subsets is entirely zero (∑xi = 0 over that subset). This works the feasible region much harder than fixing one variable at a time.

As long as we are dealing with SOS1 (no adjacency requirement), we can choose any split of the variables into the two subsets.

64
Q

Elaborate on the details surrounding SOS2

A

We now additionally require adjacency, and that there is a monotonic reference row. The reference row is crucial because we need it to capture adjacency.

We refer to the index where we split as the "branching marker".

Use cases for SOS2 are those that require adjacency. Adjacency is tricky to model directly, because we would need constraints to enforce it along with binary variables for each "case". If we instead use SOS2, we can effectively branch on constraints.

We can declare SOS properties to a solver like Gurobi, which basically tells it that branching on individual variables in the set is useless, and that it is much better to branch on the entire entity.

65
Q

elaborate on SOS1 and SOS2

A

SOS1 is a property that we find in a great variety of cases. It basically just means that we can have at most one variable equal to 1. For instance, if a firm has a single position open for hire, they can only hire one candidate; if we view each candidate as a binary variable (hire/not hire), this is an SOS1 property.
SOS1 is very general, and can be applied every time we have a list of binary variables whose sum is constrained to be at most 1.
SOS2, on the other hand, is harder to apply. It is very useful for approximating a point on a non-linear function, regardless of dimension. So, if we have a function of time that represents cyclical patterns distorted by many different events, we can approximate its value at different points in time using SOS2 as a basis for branching.
