| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 12 | 15 | 16 | 17 | 18 | 19 | 20 |
Go back to the top of the page.
Go back to
the top of the page.
Go back to
the top of the page.
Go back to
the top of the page.
x~_i = { 2*x_i - ( u_i + l_i ) } / { ( u_i - l_i ) }
An important observation in relationship to uncriticality:
Minimize f = -x1
Minimize f = x1^2 + x2^2 - 2 x1 - 4 x2
Prior to discussing the monotonicity analysis, we had studied
the numerical methods for the unconstrained minimization of
many variables. Now, we will look at numerical methods for
CONSTRAINED optimization variables. We will begin with linear
problems first and then move to nonlinear problems.
Before presenting the Simplex method, we discuss the basis
for it first and see how it works.
Having discussed how LP problems can be solved, we now turn to
constrained NLP problems.
Next we move to another indirect, like the penalty
formulations, method for constrained problems.
Next, we move to some direct methods. That is, here
we don't solve an unconstrained or linear
problems as a short cut to solving constrained
problems. Rather, we solve the constrained problems directly.
We will discuss two direct methods, viz. the method of
feasible directions abd the Genralized Reduced
Gradient (GRG) method .
This is a "direct" numerical method for solving the
constrained minimization problems. In some ways, it works like
the Simplex algorithm. We convert the general form with inequality and
equality constraints to a "standard" form that has only equality
constraints by introducing slack variables. The theory behind the
reduced gradient was already discussed when the KKT conditions
were proved. The essence of this method is that each equality constraint
reduces the design space dimension by one. If the constraint is simple
and explicit enough, a variable can be easily eliminated, but if it is
not we indirectly reduce the search space dimension in the "gradient
space".
A feasible design point is needed to start the GRG method. Once it is
found, the algorithm proceeds to the boundary of the feasible region and
stays on it (like the Simplex algorithm) unitl the constrained minimum is
found.
Lecture 5 on 9/23/97
Contributed by Peng Song
(i) equality constraint: h = x^3-6x^2+5x+12=0
Feasible region = {-1,3,4}
(ii) inequality constraint: g = x^3-6x^2+5x+12 <= 0
Feasible region: (-infinity,-1] & [3,4]
Since any feasible movement from x* must be in the increasing direction of f(x)
i.e. if x = x* + alpha*d with alpha >= 0 & d is the feasible direction,
then f(x) >= f(x*)
Considering the I order Taylor's series expansion,
f(x) = f(x*)+f'(x*)(x-x*)+o(2)
f'(x) (alpha*d) = f(x)-f(x*) >= 0 with alpha >= 0
f'(x) d >=0 since alpha >=0
This is the necessary and sufficient conditon for a constraint local minimum.
Obviously, it also works for the multiple variables situation:
(gradient f(x))(delta x*) >= 0
Minimize f(x) where x = (x1,x2,...,xn)
I order Necessary conditions: (gradient)f(x*) = 0
II order Sufficient conditions: Hessian(f(x*) is positive definite.
II order Necessary conditions: Hessian(fx*) if positive semi-positive.
Positive definiteness of a matrix implies :
Hessian at x*
Eigenvalues
Nature of x*
Positive definite
All +ve
Local minimum
Positive semi-definite
All nonnegative
Probably a local minimum
Negative definite
All -ve
Local maximum
Negative Semi-definite
All nonpoositive
Probably a local maximum
Indefinite
Mixed signs
Saddle point
Minimize f = x1^2 + 1/x1 + x2 + 1/x2
gradient f = [2x1 -1/x1^2]=[0 0]
then x1* = [1/(2^(1/3)) 1] or x2* = [1/(2^(1/3)) 1]
H = [2 + 2/x1^3 0 ]
[0 2/x2^3]
H(x1*) positive definite : x1* is a minimum.
H(x2*) indefinite : x2* is a saddle point.
Graphical Interpretationintroduction
Several contour plots were drawn and explained.
That is, the gradient of the function and the gradient of the constraint are in the same direction.
gradient f(x*) + c1*g1'(x*) + c2*g2'(*) = 0
The dimension of the feasible region region decrease by one with each equality constraint. That is,
if we had a (x1,x2) space, with an equality constraint, we will be constrained to move only
along a curve -- meaning one dimension.
Lecture 6 on 9/25/97
Contributed by Joel Esposito
Go back to
the top of the page.
Lagrangian = L = f + sum_of( mu_i * h_i)
i = 1 ... M
Lecture 7 on 9/30/97
Go back to
the top of the page.
Minimize f = x1^2 + x2^2 - 4*x1 + 4
Subject to
.......... g1 = -x1 =< 0
.......... g2 = -x2 =< 0
.......... g3 = x2 - (1-x1)^3 =< 0
Note that this a weaker condition than requiring the positive
definiteness of the Hessian of the Lagrangian.
Minimize f = x1^2 + x2^2 - 3*x1*x2
Subject to
.......... g1 = x1^2 + x2^2 - 6 =< 0
Understand why we said the sufficient condition is "weaker" than the
positive definiteness condition by solving this example
Lecture 8 on 10/2/97
Contributed by Aveek Das
Go back to
the top of the page.
min f=x1^2 + x2^2 - 4x1 + 4
subject to
g1= -x1 =< 0
g2= -x2 =< 0
g3 = x2 - (1-x1)^3 =< 0
This example shows that the KKT theory does not apply because the minimum is at
an irregular point. Apply KKT conditions to see that there will be a
a conflict in solving them.
(Also refer to the Handout)
Solution of the second example problem given in the previous lecture,
including application of the KKT sufficiency condition to check the
minimum point obtained from the KKT necessary conditions.
The modified Lagrange multipliers are K times the values in the
original problem. We can easily see this from the KKT necessary
conditions.
The modified Lagrange multipliers are (1/K) times the values of
those in the original problem. This again follows from the necessary
conditions.
g =< epsilon
i.e., delta_g = epsilon
then
delta_f = -lambda * delta_g
subject to
g1 = x1^2 + x^2 - 6 <= 0
subject to
g1new = x1^2 + x2^2 - 6 =< 2
i.e., delta_g = 2
Lecture 9 on 10/7/97
Contributed by Tom Sugar
Click here for a summary of
this lecture.
Go back to
the top of the page.
Lecture 10 on 10/9/97
Contributed by Hong Zhang
Go back to
the top of the page.
If a minimization method always locates the minimum of a general
quadratic function in no more than a predetermined number of operations
and if the limiting number of operations is directly related to the
number of variables, then that method is said to be quadratically
convergent.
Sequential searching in N mutually A-conjugate directions leads
to the minimum of a quadratic function of N variables.
Statement of the theorem:
If a quadratic function Q(x) = 0.5*x'Ax + x'B + c is
minimized sequentially, nce along each direction of a set f N mutually A-
conjugate directions Si ( i = 1, 2,..., N), the minimum of the
function Q will be found at r before the Nth step irrespective f the starting
point or the order of searching in Si ( i = 1, 2,..., N) directions.
Powell's algorithm generates A-conjugate directions.
Why is -grad(f) the steepest descent direction?
Proof that this method also gives rise to the A-conjugate
directions is part of the Homework #5.
Lecture 11 on 10/16/97
Contributed by Xiaoye Wang
Go back to
the top of the page.
H(k+1)=H(k)+delta(H(k))
B(k)=inverse(H(k))
B(k+1)=B(k)+delta(B(k))
Lecture 12 on 10/23/97
Go back to
the top of the page.
x~ = D x
D can be chosen in many ways. Usually diagonal form is
preferred. The diagonal elements can be chosen to be either
the square root of the diagonal entries in the Hessian or
the reciprocal of the absolute value of the respective
variabel itself.
A simple transformation that makes all variables vary between
-1 and + 1 can be written as:
where l_i < x_i < u_i
grad(f(x~) = inv(D grad(f(x)
H(f(x~) = [inv(D]' H(f(x) [inv(D]
Using the above the old unscaled routines can be used for the
scaled problem.
Formulating an optimization problem for a real design problem
is the focus of modeling.
Papalambros and Wilde (1988) note that the most common mistake in modeling is
to leave something out. This type of mistakes will manifest in the form of
under-constrained or unbounded models.
In this section, we will only deal with positive variables (unless mentioned
otherwise) as the design variables are usually positive (e.g., length,
thickness, etc.).
A few important terms:
If minimum exists, the problem is said to be
well-boounded .
If a constrained minimum exists, the problem is said to be
well-constrained .
Note that "critical" is stronger than "active".
Removal of the constraint changes the minimum
Removal of the constraint changes some of the minima
Removal of the constraint does not change the minimum
Removal of the constraint makes the problem unbounded or
ill-constrained.
Fix all but one variable at a time and then minimize with respect to that
variable. It is called partial minimization
It tells helps us identify the critical connstraints and then eliminate a
variable along with the corresponding critical constraint.
A function is said to increase or be increasing monotonically with respect to
the single positive finite variable x, if for every x_2 > x_1,
f(x_2) > f(x_1). That is,
{ deltaf / deltax } > 0
Or if the function is differentiable,
df/dx > 0
The above definition talks about "positive monotonocity". Similarly, "negative
monotonocity" can also be defined.
Notation:
f(x+) indicates positive monotonocity of f w.r.t. x
f(x-) indicates negative monotonocity of f w.r.t. x
Similarly, f(x-) needs at least one g(x-).
In a well-constrained objective function, every "increasing" (decreasing) variable is bounded
from "below" (above)
by at least one active constraint.
Lecture 13 on 10/28/97
Go back to
the top of the page.
Monotonicities and critical constraints were identified.
Variable and the correspnding critical constraints were eliminated
one at a time.
Since the prooblem turns out to be unbounded, an extra constraint
was added.
With this, the solution is obtained without using KKT theory.
Various rules for discussed.
Minimize f = x1 - x2
Subject to
g1: 2 x1 + 3 x2 - 10 =< 0
g2: -5 x1 - 2 x2 + 2 =< 0
g3: -2 x1 + 7 x2 - 8 =< 0
Lecture 14 on 10/30/97
Go back to
the top of the page.
A non-monotonic constraint can sometimes be split into two or more monotonic constraints.
This concept also relates to the regional monotonicity concept wherein we can
identify different regions of the design space where the objective and constraints are
monotonic.
If more than one constraint is critical for a varible, we can immediately decide which
of them will be cirtical (active). So, we call them conditionally critical constraints.
If one constraint is critical for more than one variable, then it is said to be
multiply critical. If this happens, one of the variables needs to be eliminated
using the multiply critical constraint.
If a constraint is dominated by other constraints, then we can remove the dominated
constraint. Dominance of a constraint occurs if the feasible space of this constraint
is a subset of the feasible space of the other constraints. For example, if g_i < g_j
for all values of x, then g_i is dominated by g_j.
If there is a very complex constraint and we want to find if it is going t be active,
we can do the following:
Let the complex constraint be g_i.
Delete g_i and form a relaxed problem.
Solve the relaxed problem and find x as the solution.
If x satisfies g_i, g_i is said to be inactive for the original
problem.
If not, g_i should be active or semi-active for the orginal problem
In a well-constrained problem, if there are constraints that exhibit the same
monotonicity w.r.t. a variable as that of the of the objective, then they are called
uncritical .
If there is a constraint g_i that is critical w.r.t. x_k and there is another constraint
g_j that is uncritical w.r.t. x_k, and g_j depends on no other variables, then g_j is either
inactive or the constraints are incnsistent.
Equality constraints can be inactive r semi-active "in the sense that removal f this cnstraint
dooes not affect the value of the minimum for the problem."
This can happen if there are variables in the problem that are not in the
objective constraint.
Every monotoonic non-objective variable in a well-bounded problem is either
a) irrelevant and can be deleted from the problem together with all constraints in
which it ccurs, or
b) relevant and bunded by two active cnstraints, one from above and one from
below.
Consider:
Minimize f(x_i+)
Subject to
g(x_i-, x_j-) =< 0,
then f(x_i*) is indirectly negative montonic w.r.t. x_j.
This was proved in the class.
Subject to
g1: exp(x1) - x2 =< 0
g2: exp(x2) - x3 =< 0
g3: x3 - 10 =< 0
Subject to
g1: x1 + 4 x2 - 5 =< 0
g2: 2 x1 + 3 x2 - 6 =< 0
As we discussed the MP1 and MP2 principles, they can be
applied to only inequality constraints. So, if we have
any equality constraints, we need to convert them to
inequality constraints. The sign of the inequality ("less
than" or "greater than") is decided by the needed monotonicities
of the variables involved.
Lecture 15 on 11/6/97
Go back to
the top of the page.
Minimize f = c1*x1 + c2*x2 + ... + cN*xN
Subject to
a_1j * x1 + a_2j * x2 + ... + a_Nj * xN =< b_j {j = 1,...,M}
a_1j * x1 + a_2j * x2 + ... + a_Nj * xN = 0 {j = M+1,...,M+L}
N = number of variables
M = number of inequality constraints
L = number of equality constraints
This corresponds to a vertex of the feasible polyhedron.
What is the canonical form?
Lecture 16 on 11/11/97
Something interesting happens when we include the objective
function as the last row.
What are basic and non-basic variables?
"Moving to another basic feasible solution" imples replacing
one variable that is in the basis now with a non-basic variable.
So, we need to decide which one should leave the basis and which
one should enter the basis.
Go back to
the top of the page.
rp = penalty parameter (a scalar)
P(x) = penalty function that is formed from the
constraints.
And what is their graphical interpretation?
What is the role of the penalty parameter in each of those?
What are their advantages and disadvanatges?
Lecture 17 on 11/13/97
And then, we linearize about this new point and
go to another design point and so on sequentially.
There are commercial optimization programs that are
based on SLP.
Go back to
the top of the page.
Lecture 18 on 11/18/97
Go back to
the top of the page.
Lecture 19 on 11/20/97
Go back to
the top of the page.
Lecture 20 on 12/2/97
Go back to
the top of the page.