13.07.2015 Views

MPEC Problem Formulations in Chemical Engineering Applications

MPEC Problem Formulations in Chemical Engineering Applications

MPEC Problem Formulations in Chemical Engineering Applications

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>MPEC</strong> <strong>Problem</strong> <strong>Formulations</strong> <strong>in</strong> <strong>Chemical</strong>Eng<strong>in</strong>eer<strong>in</strong>g <strong>Applications</strong>B. T. Baumrucker ∗ , J. G. Renfro † , L.T. Biegler ‡June 16, 2007AbstractWith the development and widespread use of large-scale nonl<strong>in</strong>ear programm<strong>in</strong>g(NLP) tools for process optimization, there has been an associatedapplication of NLP formulations with complementarity constra<strong>in</strong>ts<strong>in</strong> order to represent discrete decisions. Also known as Mathematical Programswith Equilibrium Constra<strong>in</strong>ts (<strong>MPEC</strong>s), these formulations can beused to model certa<strong>in</strong> classes of discrete events and can be more efficientthan a mixed <strong>in</strong>teger formulation. However, <strong>MPEC</strong> formulations andsolution strategies are not yet fully developed <strong>in</strong> process eng<strong>in</strong>eer<strong>in</strong>g. Inthis study, we discuss <strong>MPEC</strong> properties, <strong>in</strong>clud<strong>in</strong>g concepts of stationarityand l<strong>in</strong>ear <strong>in</strong>dependence that are essential for well-def<strong>in</strong>ed NLP formulations.Nonl<strong>in</strong>ear programm<strong>in</strong>g based solution strategies for <strong>MPEC</strong>s arethen reviewed and examples of complementarity drawn from chemical eng<strong>in</strong>eer<strong>in</strong>gapplications are presented to illustrate the effectiveness of theseformulations.1 IntroductionMathematical programm<strong>in</strong>g methods have proven extremely valuable for thedesign and operation of chemical processes. This has been made possible bymore flexible model<strong>in</strong>g constructs, <strong>in</strong>creased comput<strong>in</strong>g power, and a betterfundamental understand<strong>in</strong>g of solution algorithms and problem formulations.Related optimization models have a number of options to represent discretedecisions, as <strong>in</strong> contact and friction problems <strong>in</strong> computational mechanics, equilibriumrelations <strong>in</strong> economics, and a wide variety of discrete events <strong>in</strong> processsystems. Many of these situations naturally lend themselves to complementarity∗ <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g Department, Carnegie Mellon University, Pittsburgh, PA 15213USA† Honeywell International, 1250 West Sam Houston Parkway South, Houston, TX 77042,USA‡ Correspond<strong>in</strong>g Author, <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g Department, Carnegie Mellon University,Pittsburgh, PA 15213 USA, lb01@andrew.cmu.edu1


Assum<strong>in</strong>g that x = 0 at the solution, then the active set c(x, y) = 0 consists ofg 1 and g 3 where:∇g 1 = [1 0] T (10)∇g 3 = [y 0] T (11)[ ] 1 0∇c T = ⇒ l<strong>in</strong>early dependent (12)y 0Failure of LICQ is noteworthy as uniqueness of the constra<strong>in</strong>t multipliers holds ifand only if LICQ is satisfied. A weaker condition is the Mangasarian-FromovitzConstra<strong>in</strong>t Qualification (MFCQ), which requires l<strong>in</strong>ear <strong>in</strong>dependence of theequality constra<strong>in</strong>ts and the existence of a search direction that lies both <strong>in</strong> thenull space of the equality constra<strong>in</strong>t gradients and po<strong>in</strong>ts <strong>in</strong>to the <strong>in</strong>terior ofthe region def<strong>in</strong>ed by the l<strong>in</strong>earized active <strong>in</strong>equality constra<strong>in</strong>ts. MFCQ is anecessary and sufficient condition for boundedness of the multipliers. However,MPCCs violate MFCQ at all feasible po<strong>in</strong>ts as well. Consequently, multipliersof the MPCC (3) will be non-unique and unbounded.Because these constra<strong>in</strong>t qualifications do not hold, it is not surpris<strong>in</strong>g thatf<strong>in</strong>d<strong>in</strong>g an MPCC solution may be difficult. In order to classify this solution, wedef<strong>in</strong>e a B-stationary po<strong>in</strong>t for an MPCC if it is feasible to the MPCC and d = 0is the solution to the follow<strong>in</strong>g L<strong>in</strong>ear Program with Equilibrium Constra<strong>in</strong>ts(LPEC):wherem<strong>in</strong>d∇f(w ∗ ) T d (13a)s.t. g(w ∗ ) + ∇g(w ∗ ) T d ≥ 0 (13b)h(w ∗ ) + ∇h(w ∗ ) T d = 0(13c)0 ≤ x ∗ + d x ⊥ y ∗ + d y ≥ 0 (13d)w ∗ = (x ∗ , y ∗ , z ∗ )I X = {i : x ∗ i = 0}I Y = {i : y ∗ i = 0}Verification of this condition may require the solution of 2 m l<strong>in</strong>ear programswhere m is the card<strong>in</strong>ality of the biactive set I X ∩ I Y . A stronger, but moretractable form of stationarity for the MPCC problem is strong stationarity. Apo<strong>in</strong>t is strongly stationary if it is feasible for the MPCC and d = 0 solves the5


follow<strong>in</strong>g relaxed problem:m<strong>in</strong>d∇f(w ∗ ) T d (14a)s.t. g(w ∗ ) + ∇g(w ∗ ) T d ≥ 0 (14b)h(w ∗ ) + ∇h(w ∗ ) T d = 0(14c)d x,i = 0, i ∈ I X \ I Y (14d)d y,i = 0, i ∈ I Y \ I X (14e)d x,i ≥ 0, i ∈ I X ∩ I Y (14f)d y,i ≥ 0, i ∈ I X ∩ I Y (14g)Strong stationarity implies no feasible descent direction for (14) and is equivalentto B-stationarity if the biactive set I X ∩ I Y is empty, or if the <strong>MPEC</strong>-LICQproperty holds [1]. <strong>MPEC</strong>-LICQ requires that the follow<strong>in</strong>g set of vectors bel<strong>in</strong>early <strong>in</strong>dependent:{∇g i (w ∗ )|i ∈ I g } ∪ {∇h(w ∗ )} ∪ {∇x i |i ∈ I X } ∪ {∇y i |i ∈ I Y } (15)whereI g = {i : g i (w ∗ ) = 0}.<strong>MPEC</strong>-LICQ implies that the multipliers of (14) are bounded and unique. Ithas been shown that <strong>MPEC</strong>-LICQ is a generic property of MPCCs and of bilevelproblems formulated as MPCC [27] and that this property can be expected tohold “almost everywhere”. That is, if <strong>MPEC</strong>-LICQ is violated for a particular<strong>MPEC</strong>, it can always be satisfied by small perturbations of that problem.Satisfaction of <strong>MPEC</strong>-LICQ leads to the follow<strong>in</strong>g result:Theorem 1 [1, 15, 26] If w ∗ is a solution to the MPCC (3) and <strong>MPEC</strong>-LICQholds at w ∗ , then w ∗ is strongly stationary.With this property, convergence to a strongly stationary po<strong>in</strong>t is much easier(and required for many NLP algorithms), s<strong>in</strong>ce only one set of <strong>MPEC</strong> multipliersis needed to verify optimality and, consequently, this property allows thereformulation of (3) to a number of equivalent nonl<strong>in</strong>ear programs. In the past,special <strong>MPEC</strong> solvers were required [4, 5, 17] to handle these difficulties. Withthis property, <strong>MPEC</strong>s can now be addressed directly through the formulationand direct solution as NLPs. These NLP reformulations are covered <strong>in</strong> the nextsection.F<strong>in</strong>ally, a related (but weaker and implied by <strong>MPEC</strong>-LICQ) constra<strong>in</strong>t qualificationis <strong>MPEC</strong>-MFCQ, which requires that there exist a nonzero vector6


d ∈ R n such that:d x,i = 0, i ∈ I X \ I Y (16a)d y,i = 0, i ∈ I Y \ I X (16b)∇h(w ∗ ) T d = 0(16c)∇g i (w ∗ ) T d > 0 i ∈ I g (16d)d x,i > 0, d y,i > 0 i ∈ I X ∪ I Y (16e){∇h(w ∗ )} ∪ {∇x i |i ∈ I X } ∪ {∇y i |i ∈ I Y } are l<strong>in</strong>early <strong>in</strong>dependent (16f)<strong>MPEC</strong>-MFCQ implies that the multipliers of (14) will be bounded.3 Solution StrategiesIf no reformulation of an MPCC is made and the complementarity is representedby (5), (6) or (7), the constra<strong>in</strong>t multipliers will be non-unique and unbounded,and dependent active constra<strong>in</strong>ts exist at every feasible po<strong>in</strong>t. However, someactive set NLP strategies may be able to overcome this difficulty as they weredeveloped to handle degenerate NLP formulations [2] and <strong>in</strong>crease robustness ofactive set methods. In particular, encourag<strong>in</strong>g results have been reported withthe Filter-SQP algorithm [6]. Moreover, elastic mode SQP algorithms havef<strong>in</strong>ite term<strong>in</strong>ation and global convergence properties when solv<strong>in</strong>g MPCCs [1].On the other hand, an active set algorithm’s performance is still affected bycomb<strong>in</strong>atorial complexity as the problem size <strong>in</strong>creases.The follow<strong>in</strong>g MPCC reformulations have been analyzed <strong>in</strong> [1, 11, 15, 22, 24]and allow standard NLP tools to be applied.Reg(ǫ): m<strong>in</strong> f(w) (17a)s.t. h(w) = 0 (17b)g(w) ≥ 0(17c)x, y ≥ 0 (17d)x i y i ≤ ǫ ∀i (17e)RegComp(ǫ): m<strong>in</strong> f(w) (18a)s.t. h(w) = 0 (18b)g(w) ≥ 0(18c)x, y ≥ 0 (18d)x T y ≤ ǫ(18e)7


RegEq(ǫ): m<strong>in</strong> f(w) (19a)s.t. h(w) = 0 (19b)g(w) ≥ 0(19c)x, y ≥ 0 (19d)x i y i = ǫ ∀i (19e)PF(ρ): m<strong>in</strong> f(w) + ρx T y (20a)s.t. h(w) = 0 (20b)g(w) ≥ 0(20c)x, y ≥ 0 (20d)Complementarity conditions may be relaxed and the problem reformulated as<strong>in</strong> the Reg(ǫ), RegComp(ǫ), or RegEq(ǫ) with a positive relaxation parameterǫ. The solution of the MPCC, w ∗ , can be obta<strong>in</strong>ed by solv<strong>in</strong>g a series of relaxedsolutions, w(ǫ), as ǫ approaches zero. These solution strategies are generally attractedto strongly stationary po<strong>in</strong>ts, but with some exceptions (see [1]). Underassumptions of <strong>MPEC</strong>-MFCQ and sufficient second order <strong>MPEC</strong> conditions [24],Reg(ǫ) and RegComp(ǫ) exhibit local convergence of ‖w(ǫ) − w ∗ ‖ ≤ O ( ǫ 1/2) .(These properties can be strengthened to O (ǫ) if we can assume strict complementarityof the bound multipliers for i ∈ I X ∪ I Y .) Reg(ǫ) has been implementedand tested <strong>in</strong> a version of IPOPT called IPOPT-C [22]. On the otherhand, RegEq(ǫ) will exhibit slower convergence with ‖w(ǫ) − w ∗ ‖ ≤ O ( ǫ 1/4)under the same assumptions.In a related development, Nonl<strong>in</strong>ear Complementarity <strong>Problem</strong> (NCP) functionsand smooth<strong>in</strong>g functions have also been used extensively to solve MPCCs[3, 10, 14, 28]. NCP functions replace the complementarity condition <strong>in</strong> theproblem with an equivalent nonl<strong>in</strong>ear equation. One widely studied NCP functionis the Fischer-Burmeister function:φ(x, y) = x + y − √ x 2 + y 2 (21)As this function is non-differentiable at x = y = 0, it is usually smoothed to theform:φ(x, y) = x + y − √ x 2 + y 2 + ǫ (22)for some small ǫ > 0. The solution to the orig<strong>in</strong>al problem is then recovered bysolv<strong>in</strong>g a series of problems as ǫ approaches zero. An equivalence can be madebetween this method and RegEq(ǫ). Accord<strong>in</strong>gly, the problem will converge atthe same slow convergence rate.In contrast to the regularized formulations, we also consider the exact l 1penalization shown <strong>in</strong> PF(ρ). Here the complementarity can be moved fromthe constra<strong>in</strong>ts to the objective function and the result<strong>in</strong>g problem is solvedfor a particular value of ρ. If ρ ≥ ρ c , where ρ c is the critical value of thepenalty parameter, then the complementarity constra<strong>in</strong>ts will be satisfied at8


the solution. Similarly, Anitescu et al. [1] consider a related “elastic mode”formulation, where artificial variables are <strong>in</strong>troduced to relax the constra<strong>in</strong>ts <strong>in</strong>PF(ρ) and an additional l ∞ constra<strong>in</strong>t penalty term is added. In both cases theresult<strong>in</strong>g NLP formulation has the follow<strong>in</strong>g properties:Theorem 2 [1] If w ∗ is a solution to PF(ρ) and w ∗ is feasible for the MPCC(3), then w ∗ is a strongly stationary solution to (3).However, the value of ρ c is not known beforehand. If the <strong>in</strong>itial value is toosmall a series of problems with <strong>in</strong>creas<strong>in</strong>g ρ values may need to be solved, andhere we can apply the follow<strong>in</strong>g result:Theorem 3 [15] If w ∗ is a limit po<strong>in</strong>t of a sequence of solutions {w k } toPF(ρ k ), <strong>MPEC</strong>-LICQ holds and ρ k x k i → 0 and ρ k yi k → 0 for i ∈ I X ∪ I Y ,then w ∗ is a strongly stationary solution to (3).Hence, the solution strategy is attracted to strongly stationary po<strong>in</strong>ts, andwith a large enough ρ this l 1 penalization is exact <strong>in</strong> the sense that all stronglystationary po<strong>in</strong>ts of the MPCC are local m<strong>in</strong>imizers to PF(ρ). PF(ρ) allowsany general nonl<strong>in</strong>ear programm<strong>in</strong>g solver to be used to solve a complementarityproblem. Provided that the penalty parameter is large enough, the MPCCmay then be solved as a s<strong>in</strong>gle problem, <strong>in</strong>stead of a series of problems. F<strong>in</strong>ally,<strong>in</strong> order to provide greater numerical efficiency, it has been suggested todynamically change the penalty parameter dur<strong>in</strong>g the course of the optimizationfor both active set and <strong>in</strong>terior po<strong>in</strong>t optimization algorithms [1, 15]. Thismodification was recently <strong>in</strong>troduced <strong>in</strong>to the solver KNITRO [16].F<strong>in</strong>ally, there is a strong <strong>in</strong>teraction between the MPCC reformulation andthe applied NLP solver. In particular, as noted <strong>in</strong> [15], if a barrier NLP method(like KNITRO or IPOPT) is applied to PF(ρ), then there is a one-to-one correspondenceof <strong>in</strong>termediate solutions of PF(ρ), Reg(ǫ) and RegComp(ǫ). Thisequivalence can be seen by compar<strong>in</strong>g the penalty parameter ρ to the multiplierassociated with the relaxed complementarity constra<strong>in</strong>ts <strong>in</strong> Reg(ǫ) andRegComp(ǫ). As a result, for a correspond<strong>in</strong>g set of parameters ρ and ǫ wewould expect a barrier NLP method to exhibit similar performance with eitherPF(ρ), Reg(ǫ) or RegComp(ǫ).4 <strong>MPEC</strong>Lib ResultsThe formulations of the previous section provide a straightforward way to representand address any MPCC through nonl<strong>in</strong>ear programm<strong>in</strong>g. An importantcomponent of this task is the efficient and reliability of the NLP solver. In thissection we compare and evaluate each of these formulations through a librarytest set provided with the GAMS model<strong>in</strong>g environment [8]. For this evaluationwe consider <strong>MPEC</strong>Lib, a large collection of test problems, and apply an automaticreformulation of <strong>MPEC</strong>s provided by the NLPEC meta-solver <strong>in</strong> GAMS.9


4.1 Automatic Reformulation of ComplementarityMPCC models written <strong>in</strong> the GAMS model<strong>in</strong>g language may be automaticallyreformulated as an NLP us<strong>in</strong>g several of the previously described complementaritysolution techniques. This can be accomplished with the use of the NLPECmeta-solver [8]. When called, NLPEC reformulates the complementarity constra<strong>in</strong>tsof a MPCC model with a user specified reformulation, <strong>in</strong>clud<strong>in</strong>g (17)-(20). NLPEC will then call a user-specified NLP solver to solve the reformulatedmodel. The f<strong>in</strong>al results from the NLP solver are then translated back <strong>in</strong>to theorig<strong>in</strong>al MPCC model. NLPEC verifies that the complementarities are satisfied<strong>in</strong> the result<strong>in</strong>g solution. This is particularly important when us<strong>in</strong>g the PF(ρ)formulation described above, as it may not converge to a stationary po<strong>in</strong>t of theMPCC if the penalty parameter is too small.4.2 Comparison of Solution Strategies on <strong>MPEC</strong>Lib<strong>MPEC</strong>Lib is a set of <strong>MPEC</strong> test problems ma<strong>in</strong>ta<strong>in</strong>ed by Dirkse [19]. The testset currently consists of 92 problems, which <strong>in</strong>clude both small scale modelsfrom the literature and several large <strong>in</strong>dustrial models.The performance and robustness of several NLP reformulations <strong>in</strong>clud<strong>in</strong>gPF(ρ) with ρ = 10, Reg(ǫ) with ǫ = 10 −8 , and NCP formulation x T y = 0,x, y ≥ 0 were compared on the <strong>MPEC</strong>Lib test set. CONOPT (version 3.14) andIPOPT (version 3.2.3) were used to the solve the result<strong>in</strong>g NLP problems. Allreformulations were performed automatically by the NLPEC solver describedearlier. Also <strong>in</strong>cluded <strong>in</strong> the comparison is the IPOPT-C solver, which is capableof process<strong>in</strong>g the complementarity constra<strong>in</strong>ts without reformulation. TheCONOPT and IPOPT runs were performed entirely with<strong>in</strong> the GAMS environment.However, IPOPT-C is not currently l<strong>in</strong>ked to GAMS. In order toperform the tests with IPOPT-C, the GAMS models were <strong>in</strong>stead convertedto the AMPL model<strong>in</strong>g language format. The problems were then run us<strong>in</strong>gAMPL and IPOPT-C.The results are presented <strong>in</strong> a Dolan-Morè plot. All results were obta<strong>in</strong>ed ona Intel Pentium 4, 1.8 GHz computer with 992 MB of RAM. This plot portraysboth robustness and relative performance of a set of solvers on a given problemset. Here we def<strong>in</strong>e a performance ratio calculated as:η p,s =t p,sm<strong>in</strong> {t p,s : 1 ≤ s ≤ n s }(23)where n s is the number of solvers, which are run over the n p problems <strong>in</strong> set P,and where t p,s is the solution time for solver s on problem p. For all solvers sthat fail to solve problem p, we set the performance ratio to an arbitrarily largenumber. The profiles are then def<strong>in</strong>ed by:p s (τ) = 1 n psize {p ∈ P : η p,s ≤ τ} (24)10


and p s (τ) is the percentage of problems solved with<strong>in</strong> τ ≥ η p,s times the bestperformance. This analysis can be performed automatically us<strong>in</strong>g the PAVERserver [18].Figure 1 shows the performance plot of different reformulations and solverstested on all 92 problems of the test set. S<strong>in</strong>ce the problems are <strong>in</strong>herentlynonconvex, multiple local solutions can be expected for each problem. Becauseof this, not all of the different reformulation and solver comb<strong>in</strong>ations convergedto the same local solutions. This makes a direct comparison of performancedifficult as the solvers often converge to different solutions. On the other hand,this figure is useful to demonstrate the robustness of the methods, as seen bylim τ →∞ p s (τ). For all but one solution method, the reformulation and solvercomb<strong>in</strong>ation was able to solve at least 84% of all of the test problems. Themost reliable solvers were IPOPT-Reg(ǫ) (99%), CONOPT-Penalty (98%) andIPOPT-Penalty (94 %). In contrast, IPOPT with the NCP formulation (listedas IPOPT-Mult on the figure) was only able to solve 57% of all of the problems.This is not entirely unexpected as IPOPT is an <strong>in</strong>terior po<strong>in</strong>t algorithm and theNCP formulation provides no strict <strong>in</strong>terior to the NLP problem.Figure 2 shows the same reformulations and solvers tested as before excepton only 22 of the 92 test problems. These 22 problems all converged to the samesolution for all reformulations and solvers when they converged. This allows fora fair comparison of the solvers’ performance. Table 1 displays these 22 testproblems and their respective objective function values.A few observations can be made on these results. CONOPT-Reg(ǫ) performedthe fastest on 73 % of the problems but it turns out to solve only 76%of them. Instead, CONOPT-Penalty and IPOPT-Penalty (PF(ρ) are the nextbest <strong>in</strong> performance and they also prove to the most robust, solv<strong>in</strong>g over 90%of the problems. These turn out to be the best all-around methods.For the results <strong>in</strong> Figure 2, the CONOPT formulations take advantage ofthe active set strategy and particularly the detection and removal of degenerateconstra<strong>in</strong>ts. As a result, the CONOPT-Mult (with the NCP reformulation)performs well; this method is fastest on 50% of the problems but no morerobust than CONOPT-Reg. Because the test problems are not large, CONOPTprovides the best performance on this test set.On the other hand, the IPOPT formulations (IPOPT-Penalty, IPOPT-Reg,IPOPT-C) follow similar trends with<strong>in</strong> Figure 2, with the penalty formulation asthe most robust. This can be expla<strong>in</strong>ed by the similarities among these methods,as analyzed <strong>in</strong> [15]. F<strong>in</strong>ally, IPOPT-Mult (with the NCP reformulation) is theworst performer as it cannot remove dependent constra<strong>in</strong>ts and therefore suffersmost from the <strong>in</strong>herent degeneracies <strong>in</strong> MPCCs.5 Process Eng<strong>in</strong>eer<strong>in</strong>g Examples of MPCCsComplementarity applications <strong>in</strong> chemical eng<strong>in</strong>eer<strong>in</strong>g are widespread for model<strong>in</strong>gdiscrete decisions. Their use for model<strong>in</strong>g and optimiz<strong>in</strong>g split range controllersand tiered splitters has been known for some time [7]. However, with a11


100Performance Profile9080Percent Of Models Solved7060504030CONOPT-PenaltyIPOPT-Penalty20CONOPT-MultIPOPT-MultCONOPT-Reg10IPOPT-RegIPOPT-CCAN_SOLVE01 10 100 1000Time FactorFigure 1: Performance Profiles of several reformulations on all of the 92<strong>MPEC</strong>Lib Test <strong>Problem</strong>s100Performance Profile9080Percent Of Models Solved70605040CONOPT-Penalty30IPOPT-PenaltyCONOPT-MultIPOPT-MultCONOPT-Reg20IPOPT-RegIPOPT-CCAN_SOLVE101 10 100 1000Time FactorFigure 2: Performance Profiles of several reformulations on 22 Select Test <strong>Problem</strong>s12


<strong>Problem</strong> Name Objective Function Value Constra<strong>in</strong>ts Variables Complementaritiesbard2 -6600 10 13 8bard3 -12.67872 6 7 4bartruss3 0 3.54545E-07 29 36 26bartruss3 1 3.54545E-07 29 36 11bartruss3 2 10166.57 29 36 6bartruss3 3 3E-07 27 34 26bartruss3 4 3E-07 27 34 11bartruss3 5 10166.57 27 34 6desilva -1 5 7 4ex9 1 4m -61 5 6 4f<strong>in</strong>db10s 2.02139E-07 203 198 176fjq1 3.207699 7 8 6gauv<strong>in</strong> 20 3 4 2kehoe1 3.6345595 11 11 5outrata31 2.882722 5 6 4outrata33 2.888119 5 6 4outrata34 5.7892185 5 6 4qvi 7.67061E-19 3 5 3three 2.58284E-20 4 3 1t<strong>in</strong>que dhs2 N/A 4834 4805 3552t<strong>in</strong>que sws3 12467.56 5699 5671 4480tollmpec -20.82589 2377 2380 2376Table 1: Objective Function Values of the 22 <strong>MPEC</strong>Lib Test <strong>Problem</strong>s ThatConverged to Same Solutions13


etter understand<strong>in</strong>g of NLP-based solution strategies for MPCCs, it is essentialto consider the formulation of well-posed complementarity models. In thissection, we consider a systematic MPCC model<strong>in</strong>g strategy and apply it to anumber of widely applied process examples.To develop these concepts, we aga<strong>in</strong> caution that complementarity formulationsare nonconvex problems and may lead to multiple local solutions. Consequently,with the application of standard NLP solvers, the user will need tobe satisfied with local solutions, unless additional problem-specific <strong>in</strong>formationcan be applied. Despite the nonconvexity <strong>in</strong> all MPCC formulations, it is clearthat poorly posed MPCC formulations also need to be avoided. For <strong>in</strong>stance,the feasible region for:0 ≤ y ⊥ (1 − y) ≥ 0 (25)consists of only two isolated po<strong>in</strong>ts, and the disjo<strong>in</strong>t feasible region associatedwith this example often leads to convergence failures, unless the model is <strong>in</strong>itializedclose to a solution with either y = 0 or y = 1. Moreover, without carefulconsideration, it is not difficult to create MPCCs with similar difficulties.Because the MPCC can be derived from an associated <strong>MPEC</strong>, we considerthe formulation <strong>in</strong> (2). To avoid disjo<strong>in</strong>t regions, we require that both θ(x, y)and C(x) be convex <strong>in</strong> y for all (x, y) ∈ Z. This leads to a well-def<strong>in</strong>ed solutionfor the <strong>in</strong>ner problem. Moreover, the result<strong>in</strong>g complementarities are then“strongly monotone”, which have been shown to be very useful <strong>in</strong> practice. Thisobservation leads to the follow<strong>in</strong>g guidel<strong>in</strong>es for the examples <strong>in</strong> this study:• Start with the <strong>in</strong>ner level problem:m<strong>in</strong>yϕ(x)y s.t. y a ≤ y ≤ y b (26)where ϕ(x) is a switch<strong>in</strong>g function for the discrete decision.• For (26) it is especially important that the upper level constra<strong>in</strong>ts <strong>in</strong> (2),i.e., (x, y) ∈ Z, not limit the selection of any value of y ∈ C(x).• The result<strong>in</strong>g <strong>MPEC</strong> is then converted to an MPCC. For <strong>in</strong>stance, apply<strong>in</strong>gthe KKT conditions to (26) we obta<strong>in</strong> the complementaritiesϕ(x) − s a + s b = 0 (27a)0 ≤ (y − y a ) ⊥ s a ≥ 0 (27b)0 ≤ (y b − y) ⊥ s b ≥ 0 (27c)• The relations <strong>in</strong> (27) can be further simplified through variable elim<strong>in</strong>ationand application of the complementarity conditions.• F<strong>in</strong>ally, the result<strong>in</strong>g complementarity conditions are <strong>in</strong>corporated with<strong>in</strong>NLPs us<strong>in</strong>g the formulations described <strong>in</strong> section 3.14


Note that <strong>in</strong> the absence of upper level constra<strong>in</strong>ts, the solution of (27)allows y to take any value <strong>in</strong> [y a , y b ] when ϕ(x) = 0. This is necessary to avoiddisjo<strong>in</strong>t feasible regions for the result<strong>in</strong>g MPCC. As a result, limitations on yfrom upper level constra<strong>in</strong>ts should be avoided, and this is frequently a problemspecific task. For <strong>in</strong>stance, complementarity formulations should not be used tomodel EXCLUSIVE OR (as <strong>in</strong> (25)) because they lead to disjo<strong>in</strong>t regions for y.In the rema<strong>in</strong>der of this section, we apply these guidel<strong>in</strong>es to develop complementaritymodels that arise <strong>in</strong> process applications. A number of these canalso be found <strong>in</strong> [21].5.1 Commonly Used FunctionsThe follow<strong>in</strong>g nonsmooth functions can be represented by complementarity formulations.• The signum or sign operation y = sgn(x) can be represented by a complementarityformulation:m<strong>in</strong> −y · x s.t. − 1 ≤ y ≤ 1 (28)and the correspond<strong>in</strong>g complementarity relations can be written as:x = s b − s a , 0 ≤ s a ⊥ (y + 1) ≥ 0, 0 ≤ s b ⊥ (1 − y) ≥ 0 (29)• The absolute value operator z = |f(x)| can be rewritten as:z = f(x)y,y = arg{m<strong>in</strong> −f(x)ŷ s.t. − 1 ≤ ŷ ≤ 1} (30)ŷThe correspond<strong>in</strong>g KKT conditions are:z = f(x)y, f(x) = s b − s a0 ≤ s b ⊥ (1 − y) ≥ 0, 0 ≤ s a ⊥ (y + 1) ≥ 0These relations can be simplified by apply<strong>in</strong>g the complementarity conditionsto elim<strong>in</strong>ate the variable y <strong>in</strong> the first two equations, lead<strong>in</strong>g to:z = s b + s a , f(x) = s b − s a (31)0 ≤ s b ⊥ s a ≥ 0• The max operator z = max{f(x), z a } can be rewritten as:z = f(x)+(z a −f(x))yy = arg{m<strong>in</strong>ŷ(f(x) −z a )ŷ s.t. 0 ≤ ŷ ≤ 1} (32)The correspond<strong>in</strong>g KKT conditions are:z = f(x) + (z a − f(x))y f(x) − z a = s a − s b0 ≤ s b ⊥ (1 − y) ≥ 0, 0 ≤ s a ⊥ y ≥ 015


Simplify<strong>in</strong>g the complementarity conditions to elim<strong>in</strong>ate the variable y <strong>in</strong>the first two equations leads to:z = f(x) + s b , f(x) − z a = s a − s b (33)0 ≤ s b ⊥ s a ≥ 0• The m<strong>in</strong> operator y = m<strong>in</strong> (f(x), y b ) can be treated <strong>in</strong> a similar way bydef<strong>in</strong><strong>in</strong>g the problem:z = f(x) + (f(x) − z b )yy = arg{m<strong>in</strong>ŷ(f(x) − z b )ŷ s.t. 0 ≤ ŷ ≤ 1} (34)Apply<strong>in</strong>g the KKT conditions and simplify<strong>in</strong>g leads to the follow<strong>in</strong>g complementaritysystem:5.2 Flow Reversalz = f(x) + s a , z b − f(x) = s b − s a (35)0 ≤ s b ⊥ s a ≥ 0Flow reversal can occur <strong>in</strong> many places <strong>in</strong> chemical processes <strong>in</strong>clud<strong>in</strong>g fuelheaders and pipel<strong>in</strong>e distribution networks. This is problematic for model<strong>in</strong>gs<strong>in</strong>ce physical properties and other equations implicitly assume the sign of theflowrate to be positive. These situations may be modeled with the absolute valueoperator, us<strong>in</strong>g complementarities as <strong>in</strong> (31). The magnitude of the flowrate canthen be used <strong>in</strong> the sign sensitive equations.On the other hand, it may not be possible to implement these changes <strong>in</strong>some model<strong>in</strong>g environments, such as commercial process optimization softwarewith directional streams and <strong>in</strong>tegrated thermo-physical models. In this case,each stream with the potential for flow reversal can be modeled as two parallelcounter-current streams. The stream flowrates are then complemented, suchthat either one stream or the other has zero flow. Us<strong>in</strong>g mixers and splitters asappropriate, the two streams can then be <strong>in</strong>tegrated <strong>in</strong>to any process flowsheet[9]. While this method may obscure the orig<strong>in</strong>al representation of the system,it rema<strong>in</strong>s equivalent.5.3 Relief Valves, Check Valves, Compressor Kick-BackRelief valves, check valves and compressor kick-back operation are all relatedphenomenon that can be modeled us<strong>in</strong>g complementarities <strong>in</strong> a manner similarto the functions used <strong>in</strong> Section 5.1.• Check valves prevent flows <strong>in</strong> reverse directions that may arise from changes<strong>in</strong> differential pressure. Assum<strong>in</strong>g that the directional flow is a monotonicfunction of the differential pressure, we model flow through the check valveas:flow = max{0, f(∆p)}and rewrite the max operator as the complementarity system given <strong>in</strong> (33).16


Figure 3: Flowsheet Example of Compressor Kick-Back• As shown <strong>in</strong> Figure 3, compressor kickback, where recirculation is requiredwhen the compressor feed drops below a critical value, can be modeled <strong>in</strong>a similar manner. Follow<strong>in</strong>g the stream notation <strong>in</strong> Figure 3 we can write:f S2 = max(f S1 , f crit ), f S5 = f crit − f S1and rewrite the max operator us<strong>in</strong>g the complementarities <strong>in</strong> (33).• Relief valves open and allow flow only when a pressure, p, or pressuredifferential, ∆p, is larger than a predeterm<strong>in</strong>ed value. Once open the flowis some function of the pressure, f(p). This can be modeled as:flow = f(p)y (36)m<strong>in</strong>ŷ(p max − p)ŷ (37a)s.t. 0 ≤ ŷ ≤ 1 (37b)The <strong>in</strong>ner m<strong>in</strong>imization problem sets y = 1 if p > p max and sets y = 0if p < p max . This zero-one behavior determ<strong>in</strong>es if the flowrate should bezero, when valve is closed, or determ<strong>in</strong>ed by the expression f(p), when thevalve is open. The related complementarity conditions are:flow = f(p)y(p max − p) = s 0 − s 10 ≤ y ⊥ s 0 ≥ 00 ≤ (1 − y) ⊥ s 1 ≥ 05.4 Piecewise FunctionsPiecewise smooth functions are often encountered <strong>in</strong> physical property models,tiered pric<strong>in</strong>g and table lookups. The composite function can be represented by17


the follow<strong>in</strong>g <strong>in</strong>ner m<strong>in</strong>imization problem and associated equation [21]:m<strong>in</strong>ys.t.N ∑i=1(x − a i )(x − a i−1 )y i (38a)N∑y i = 1i=1y i ≥ 0(38b)(38c)z =N∑f i (x)y i (39)i=1where N is the number of piecewise segments, f i (x) is the function over the<strong>in</strong>terval x ∈ [a i−1 , a i ], and z represents the value of the piecewise function. ThisLP will set y i = 1 and y j≠i = 0 when x ∈ [a i−1 , a i ]. The associated equationwill then set z = f i (x), which is the function value on the <strong>in</strong>terval. The NLP(38) can be rewritten as the follow<strong>in</strong>g complementarity system:N∑y i = 1 (40a)i=1(x − a i )(x − a i−1 ) − γ − s i = 0 (40b)0 ≤ y i ⊥ s i ≥ 0 (40c)In some cases, the function of <strong>in</strong>terest may only be piecewise cont<strong>in</strong>uousand y i can “cheat” by tak<strong>in</strong>g fractional values. For <strong>in</strong>stance, cost per unit may<strong>in</strong>crease stepwise over different ranges, and jumps <strong>in</strong> the costs may occur at a i .A way around this problem would be to def<strong>in</strong>e f i (x) as the total cost (i.e., unitcost times quantity), which is cont<strong>in</strong>uous everywhere, but not differentiable ata i .5.5 PI Controller SaturationPI controller saturation has been studied us<strong>in</strong>g complementarity formulations[30]. The PI control law takes the follow<strong>in</strong>g form:(u(t) = K c e(t) + 1 ∫ t)e(t ′ )dt ′ (41)τ Iwhere u(t) is the control law output, e(t) is the error <strong>in</strong> the measured variable,K c is the controller ga<strong>in</strong> and τ I is the <strong>in</strong>tegral time constant. The controlleroutput v(t) is typically subject to upper and lower bounds, v up and v lo , i.e.,v = max(v lo , m<strong>in</strong>(v up , u(t))). The follow<strong>in</strong>g <strong>in</strong>ner m<strong>in</strong>imization relaxes thecontroller output to take <strong>in</strong>to account the saturation effects:0m<strong>in</strong>y up,y lo(v up − u(t))y up + (u(t) − v lo )y lo (42)s.t. 0 ≤ y lo , y up ≤ 118


with v = u(t) + (v up − u(t))y up + (v lo − u(t))y lo . Apply<strong>in</strong>g the KKT conditionsto (42) leads to:v = u(t) + (v up − u(t))y up + (v lo − u(t))y lo (43)(v up − u(t)) − s 0,up + s 1,up = 0(u − v lo ) − s 0,lo + s 1,lo = 00 ≤ s 0,up ⊥ y up ≥ 0, 0 ≤ s 1,up ⊥ (1 − y up ) ≥ 00 ≤ s 0,lo ⊥ y lo ≥ 0, 0 ≤ s 1,lo ⊥ (1 − y lo ) ≥ 0Elim<strong>in</strong>at<strong>in</strong>g the variables y up and y lo and apply<strong>in</strong>g the complementarity conditionsleads to the follow<strong>in</strong>g simplification:v = u(t) + s 1,lo − s 1,up (44)(v up − u(t)) − s 0,up + s 1,up = 0(u − v lo ) − s 0,lo + s 1,lo = 00 ≤ s 1,up ⊥ s 0,up ≥ 0, 0 ≤ s 0,lo ⊥ s 1,lo ≥ 05.6 Disappearance of PhasesThe disappearance of phases <strong>in</strong> vapor liquid equilibrium systems was considered<strong>in</strong> [10]. Here the equilibrium between two phases is relaxed by an additionalvariable β and def<strong>in</strong>ed by y j = βK j x j . For a flash vessel with feed F, thecomplementarity system can be derived from the follow<strong>in</strong>g <strong>in</strong>ner m<strong>in</strong>imimizationproblem:m<strong>in</strong>L,Vlead<strong>in</strong>g to the follow<strong>in</strong>g complementarity system:(1 − β)(L − V ) (45a)s.t. L + V = F (45b)L, V ≥ 0 (45c)y j = βK j x jβ = 1 − s l + s v(46a)(46b)0 ≤ L ⊥ s l ≥ 0 (46c)0 ≤ V ⊥ s v ≥ 0 (46d)In this manner, phase existence can be determ<strong>in</strong>ed with<strong>in</strong> the context of anoptimization problem. If either liquid or vapor is absent, the correspond<strong>in</strong>gslack variable may become positive. This <strong>in</strong> turn relaxes the calculation ofthe phase equilibrium, which is appropriate s<strong>in</strong>ce only one phase is present.As seen <strong>in</strong> [10, 14, 20, 23], this formulation can also be extended to modeldistillation columns, and, as shown <strong>in</strong> [10], the conditions are also equivalent tom<strong>in</strong>imization of Gibbs free energy.19


6 Case StudiesFrom the results of Sections 3 and 4, we f<strong>in</strong>d that the penalty formulationperforms well, particularly with an active set solver such as CONOPT. With thisexam<strong>in</strong>ation of NLP reformulations for MPCC as well as a systematic strategyfor develop<strong>in</strong>g complementarity formulations, we now consider two large-scalecase studies that illustrate both of these concepts. The first considers discretedecisions <strong>in</strong> dynamic systems with the reformulation of the signum function,while the second considers the disappearance of phases <strong>in</strong> optimization modelsfor distillation.6.1 Dynamic Optimization with Direct TranscriptionThe goal of this case study is to demonstrate the NLP formulations <strong>in</strong> Section 3on an example with arbitrarily many complementarity conditions. The exampleproblem is derived from discretization of a differential <strong>in</strong>clusion (ẋ ∈ sgn(x)+2),where the derivative can be def<strong>in</strong>ed by a complementarity system [29]. While wedo not consider limit<strong>in</strong>g properties of the discretization, our numerical resultswill focus on how the solution time varies as a function of the problem size(controlled by the discretization strategy).The differential <strong>in</strong>clusion can be reformulated as the follow<strong>in</strong>g optimal controlproblem:∫ tendm<strong>in</strong> (x end − 5/3) 2 + x 2 · dt (47a)t 0s.t. ẋ = u + 2 (47b)x = s + − s −(47c)0 ≤ 1 − u ⊥ s + ≥ 0 (47d)0 ≤ u + 1 ⊥ s − ≥ 0 (47e)Apply<strong>in</strong>g the implicit Euler’s method with a fixed step size, the discretizedproblem becomes:NFE∑m<strong>in</strong> (x end − 5/3) 2 + h · x 2 ii=1(48a)s.t. ẋ i = u i + 2 i = 1, . . .,NFE (48b)x i = x i−1 + h · ẋ i i = 1, . . .,NFE (48c)x i = s + i− s − ii = 1, . . .,NFE (48d)0 ≤ 1 − u i ⊥ s + i ≥ 0 i = 1, . . .,NFE (48e)0 ≤ u i + 1 ⊥ s − i ≥ 0 i = 1, . . .,NFE (48f)The result<strong>in</strong>g discretized problem can be scaled up to be arbitrarily large by<strong>in</strong>creas<strong>in</strong>g the number of f<strong>in</strong>ite elements NFE, with a discrete decision <strong>in</strong> each20


element. This system was modeled <strong>in</strong> GAMS and solved via automatic reformulationus<strong>in</strong>g the NLPEC package. Additionally, the GAMS model was convertedto an AMPL model and the problems were solved us<strong>in</strong>g IPOPT-C.This dynamic optimization problem could also be modeled easily as anMIQP. However, f<strong>in</strong>d<strong>in</strong>g a solution is NP-Hard and with an <strong>in</strong>creas<strong>in</strong>g discretization,the problem can become expensive. Solv<strong>in</strong>g this system as an MPCCgenerally leads to only polynomial complexity. On the other hand, the MPCCmodel is nonconvex and global optimality cannot be ensured, although on thisexample a comparison with MIQP solvers showed that global solutions werealways obta<strong>in</strong>ed by the MPCC solver.The discretized MPCC was solved us<strong>in</strong>g the penalty reformulation (PF(ρ)with CONOPT us<strong>in</strong>g ρ = 1000). The results are shown <strong>in</strong> Table 2. It can be seenthat the number of iterations <strong>in</strong>creases almost l<strong>in</strong>early with the number of f<strong>in</strong>iteelements and this behavior is typical for active set strategies. In addition, thecomputational time per iteration <strong>in</strong>creases l<strong>in</strong>early with the number of f<strong>in</strong>iteelements, as the majority of CPU time is expected to be spent <strong>in</strong> the sparsel<strong>in</strong>ear solver. In this case, sparse l<strong>in</strong>ear solvers <strong>in</strong>crease approximately l<strong>in</strong>early<strong>in</strong> computation time with problem size. Consequently, the total computationaltime for CONOPT-PF grows approximately quadratically with problem size.IPOPT-C was also used to solve this problem and the results are shown <strong>in</strong>Table 2. It can be seen that the number of iterations <strong>in</strong>creases only weakly as afunction of problem size, with no clear trend. In fact, the iteration count seemsto saturate with problem size. In addition, the computational time per iterationalso <strong>in</strong>creases l<strong>in</strong>early with the number of f<strong>in</strong>ite elements. This is expected s<strong>in</strong>cethe largest computational cost of IPOPT is due to the l<strong>in</strong>ear solver. Here, thissolver cost <strong>in</strong>creases l<strong>in</strong>early with problem size. As a result, the total computationaltime <strong>in</strong>crease is only slightly superl<strong>in</strong>ear with problem size. Moreover, itis also notable that while CONOPT is faster for the smaller problems, IPOPT-C becomes faster for the larger problems. This is consistent with expectations.CONOPT must also deal with the comb<strong>in</strong>atorial complexity of identify<strong>in</strong>g theactive set at every iteration, whereas the barrier method IPOPT-C identifiesthe active set only at the solution.F<strong>in</strong>ally, the SBB solver, a simple branch and bound solver <strong>in</strong> GAMS, wasapplied to an MIQP formulation of this problem. Here, b<strong>in</strong>ary variables replaceu i <strong>in</strong> every f<strong>in</strong>ite element (48) and correspond<strong>in</strong>g relations are added to reflectthe sign of x i . From the results <strong>in</strong> Table 2, we can observe the NP-hard aspectof solv<strong>in</strong>g the MIQP with this approach. Solv<strong>in</strong>g the problem with 1000 f<strong>in</strong>iteelements requires over 400 CPUs and the problem with 2000 f<strong>in</strong>ite elementscannot be solved with<strong>in</strong> 6000 CPUs.6.2 Distillation OptimizationAs seen <strong>in</strong> Section 5.6, phase behavior of a vapor liquid system is determ<strong>in</strong>edby the m<strong>in</strong>imization of the Gibbs free energy and can be modeled throughcomplementarity constra<strong>in</strong>ts. These conditions allow phase disappearance tobe modeled <strong>in</strong> distillation systems for optimization of both steady state and21


Objective SBB CONOPT-PF IPOPT-CNFE Function CPU s. CPU s. Iterations CPU s. Iterations10 1.4738 0.242 0.047 15 0.110 17100 1.7536 50.266 0.234 78 1.250 411000 1.7864 410.195 9.453 680 28.406 782000 1.7888 >6000 35.359 1340 14.062 253000 1.7894 — 112.094 2020 106.188 844000 1.7892 — 211.969 2679 84.875 565000 1.7895 — 340.922 3342 199.391 876000 1.7898 — 468.891 3998 320.140 1157000 1.7896 — 646.953 4655 457.984 1418000 1.7898 — 836.891 5310 364.937 98Table 2: Solution times (Pentium 4, 1.8 GHz, 992 MB RAM) for differentsolution strategiesdynamic tray columns [14, 20, 23]. In these studies smooth<strong>in</strong>g methods andregularization methods were applied for distillation optimization, with the refluxratio, feedtray location and tray number as decision variables. In this section, werevisit these steady state distillation optimizations for the sake of a performancecomparison, particularly with the penalty formulation, PF(ρ).Here we def<strong>in</strong>e distillation column models that consist of MESH (Mass balance,Equilibrium, Summation, and Heat balance) equations that are modifiedto optimize the feed tray location and total tray count. As shown <strong>in</strong> Figure4, streams for the feed and the reflux are fed to all trays as dictated by twodiscretized Gaussian distribution functions. As developed and described <strong>in</strong> [14],both distribution functions are def<strong>in</strong>ed by a mean and standard deviation. Forthe feed and reflux distributions, their means represent the feedtray locationand the total number of trays, respectively, and these are used as cont<strong>in</strong>uousdecision variables <strong>in</strong> the distillation optimization. We also choose a standarddeviation of 0.5, so that distribution flow to trays away from the mean tapersoff smoothly to negligible values. Moreover, the grayed area <strong>in</strong> Figure 4 consistsonly of vapor traffic, and consequently, each tray model must <strong>in</strong>clude complementaritiesthat allow for disappearance of the liquid phase. As described <strong>in</strong>Section 5.6, phase equilibrium is relaxed by the complementarity constra<strong>in</strong>ts:y ij = β i K ij x ijβ i = 1 − s l i + sv i(49a)(49b)0 ≤ L i ⊥ s l i ≥ 0 (49c)0 ≤ V i ⊥ s v i ≥ 0 (49d)where i = {1, 2, . . .,N max } and N max is the maximum number of equilibriumstages, j is the component <strong>in</strong>dex, L i and V i are the liquid and vapor flow rateon tray i, and x and y are the respective mole fractions.The result<strong>in</strong>g complementarity distillation model is an alternative nonl<strong>in</strong>-22


Figure 4: Diagam of distillation column show<strong>in</strong>g that the feed and reflux flowsare distributed across multiple trays accord<strong>in</strong>g to the DDF. The grayed sectionof the column is above the primary reflux location and has negligible liquidflows.ear program for distillation design and optimization. As shown <strong>in</strong> [14, 23], itdeterm<strong>in</strong>es the optimal number of trays, reflux ratio and feedtray location fordetailed column models. As case studies we consider two systems drawn from[14]. The first system is a benzene/toluene separation and the second systemis an argon column from an air separation unit. Both models use ideal thermodynamics,are modeled <strong>in</strong> GAMS and solved with CONOPT. Three MPCCformulations were considered for these cases:• Penalty formulation, PF(ρ) with ρ = 1000 chosen as the penalty weightfor the benzene/toluene separation and ρ = 10 6 for the argon column.• Relaxed formulation, Reg(ǫ). This strategy was solved <strong>in</strong> two NLP stageswith ǫ = 10 −6 followed by ǫ = 10 −12 . This formulation was also appliedto distillation optimization <strong>in</strong> [23].• NCP formulation us<strong>in</strong>g the Fischer-Burmeister function (22). This approachis equivalent to RegEq(ǫ) and was solved <strong>in</strong> three NLP stages withǫ = 10 −4 followed by ǫ = 10 −8 and ǫ = 10 −12 . This strategy was appliedto distillation optimization <strong>in</strong> [14].Benzene-Toluene Column OptimizationThis b<strong>in</strong>ary column has a maximum of 25 trays, its feed is 100 mol/s of a70%/30% mixture of benzene/toluene and distillate flow is specified to be 50%23


of the feed. The reflux ratio is allowed to vary between one and 20, the feedtraylocation varies between 2 and 20 and the total tray number varies between 3and 25. The objective function for the benzene-toluene separation m<strong>in</strong>imizes:objective = wt · D · x D,Toluene + wr · r + wn · Nt (50)where Nt is the number of trays, r is the reflux ratio, D is distillate flow,x D,Toluene is the toluene mole fraction and wt, wr, and wn are the weigh<strong>in</strong>gparameters for each term; these weights allow the optimization to trade offproduct purity, energy cost and capital cost. The benzene-toluene optimizationswere <strong>in</strong>itialized with 21 trays, a feedtray location at the seventh tray and areflux ratio at 2.2. Temperature and mole fraction profiles were <strong>in</strong>itialized withl<strong>in</strong>ear <strong>in</strong>terpolations based on the top and bottom product properties. Theresult<strong>in</strong>g GAMS models consisted of 353 equations and 359 variables for theReg(ǫ) and NCP formulations, and 305 equations and 361 variables for thePF(ρ) formulation. The follow<strong>in</strong>g cases were considered:• Case 1 (wt = 1, wr = 1, wn = 1): This represents the base case with equalweights on toluene <strong>in</strong> distillate, reflux ratio and tray count. As seen <strong>in</strong> theresults <strong>in</strong> Table 3, the optimal solution has an objective function value of9.4723 with <strong>in</strong>termediate values of r and Nt. This is found quickly by thePF(ρ) formulation. On the other hand, the Reg(ǫ) formulation term<strong>in</strong>atesclose to this solution, while the NCP formulation term<strong>in</strong>ates early withpoor progress.• Case 2 (wt = 1, wr = 0.1, wn = 1): In this case, less emphasis is given toenergy cost. As seen <strong>in</strong> the results <strong>in</strong> Table 3, the optimal solution nowhas a lower objective function value of 6.8103 along with a higher value ofr and lower value of Nt. This is found quickly by both PF(ρ) and Reg(ǫ),although only the former satisfies the convergence tolerance. On the otherhand, the NCP formulation aga<strong>in</strong> term<strong>in</strong>ates early with poor progress.• Case 3 (wt = 1, wr = 1, wn = 0.1): In contrast to Case 2, less emphasisis now given to capital cost. As seen <strong>in</strong> the results <strong>in</strong> Table 3, the optimalsolution now has an objective function value of 2.9048 with lowervalues of r and higher values of Nt. This is found quickly by the Reg(ǫ)formulation. Although Reg(ǫ) does not satisfy the convergence tolerance,the optimum could also be verified by PF(ρ). On the other hand, PF(ρ)quickly converges to a slightly different solution, which it identifies as a localoptimum, while the NCP formulation requires more time to term<strong>in</strong>atewith poor progress.Argon Column OptimizationThis argon separation column has a maximum of 63 trays, its feed is 6546.54lbmol/h of a 0.005%/9.753%/90.24% mixture of nitrogen/argon/oxygen anddistillate flow is specified to be 202.4576 lbmol/h with less than 1 mol % oxygen.24


Cases 1 2 3 Argoniter-PF(ρ) 356 395 354 1182iter-Reg(ǫ) 538 348 370 10269iter-NCP 179 119 254 328CPUs-PF(ρ) 7.188 7.969 4.875 69.281CPUs-Reg(ǫ) 14.594 7.969 7.125 706.406CPUs-NCP 7.906 3.922 9.188 20.781obj-PF(ρ) 9.4723 6.8103 3.0053 77.248obj-Reg(ǫ) 9.5202* 6.8103* 2.9048* 82.491*obj-NCP 11.5904* 9.7288* 2.9332* 88.406*reflux-PF(ρ) 1.619 4.969 1.485 32.248reflux-Reg(ǫ) 1.811 4.969 1.431 33.491reflux-NCP 1.683 2.881 1.359 38.406Nt-PF(ρ) 6.524 5.528 8.844 45Nt-Reg(ǫ) 6.645 5.528 9.794 49Nt-NCP 9.437 9.302 9.721 50Table 3: Distillation Optimization: Comparision of MPCC <strong>Formulations</strong>. Allcomputations were performed on a Pentium 4 with 1.8 GHz and 992 MB RAM.*CONOPT term<strong>in</strong>ated at feasible po<strong>in</strong>t with negligible improvement of theobjective function, but without satisfy<strong>in</strong>g KKT tolerance.The reflux ratio is allowed to vary between 20 and 100, the feedtray locationvaries between 1 and 10 and the total tray number varies between 31 and 63.The objective function for the argon problem m<strong>in</strong>imizes: objective = r + Nt.The argon column optimizations were <strong>in</strong>itialized with 50 trays, a feedtray locationat the first tray and a reflux ratio at 32.8. Temperature and mole fractionprofiles were <strong>in</strong>itialized with l<strong>in</strong>ear <strong>in</strong>terpolations based on the top and bottomproduct properties. The result<strong>in</strong>g GAMS models consist of 1260 equations and1267 variables for the Reg(ǫ) and NCP formulations, and 1136 equations and1269 variables for the PF(ρ) formulation.As seen <strong>in</strong> the results <strong>in</strong> Table 3, the optimal solution has an objective functionvalue of 77.248. This was found relatively quickly by the PF(ρ) formulation.On the other hand, the Reg(ǫ) formulation term<strong>in</strong>ates close to this solution butrequires an order of magnitude more effort. Aga<strong>in</strong>, the NCP formulation term<strong>in</strong>atesearly with poor progress.Along with the numerical study <strong>in</strong> Section 4, these four cases demonstratethat optimization of detailed distillation column models can be performed efficientlywith MPCC formulations. In particular, it can be seen that the penaltyformulation (PF(ρ)) represents a significant improvement over the NCP formulationboth <strong>in</strong> terms of iterations and CPU seconds; PF(ρ) offers advantagesover the Reg(ǫ) formulation as well.25


7 ConclusionsWith the development and widespread use of large-scale nonl<strong>in</strong>ear programm<strong>in</strong>gtools for process optimization, there has been an associated applicationof complementarity formulations to represent discrete decisions. This study <strong>in</strong>vestigatesboth the formulation and solution strategies for the MathematicalPrograms with Compementarity Constra<strong>in</strong>ts (MPCCs). Because MPCCs havedependent consta<strong>in</strong>ts at all feasible po<strong>in</strong>ts along with unbounded multipliers,special care is needed <strong>in</strong> the formulation of associated optimization problemsalong with a reliable solution algorithm. In this study, we <strong>in</strong>vestigate and summarizea number of regularization and penalty MPCC formulations and f<strong>in</strong>dthat the penalty formulation is particularly advantageous, especially when coupledwith an active set NLP solver. This conclusion is observed <strong>in</strong> a numericalcomparison with a library of MPCC test problems.Because all MPCCs can be formulated as underly<strong>in</strong>g <strong>MPEC</strong> problems thatresult from nested optimization formulations, a necessary condition for wellbehavedMPCCs is that the <strong>in</strong>ner optimization problem be convex with solutionsthat are not restricted by upper level constra<strong>in</strong>ts. This condition is used to motivatea systematic strategy for the formulation of complementarity constra<strong>in</strong>ts.This strategy is applied to a number of widely-used examples <strong>in</strong> process eng<strong>in</strong>eer<strong>in</strong>g,<strong>in</strong>clud<strong>in</strong>g flow reversals, relief valves, compressor kickbacks, controllersaturation and disappearance of phases <strong>in</strong> equilibrium stages.This comb<strong>in</strong>ation of well-posed formulations and MPCC solutions strategiesis demonstrated on two large-scale case studies with as many as 8000 discretedecisions. First, <strong>in</strong> the discretized dynamic system with switches, both thepenalty and relaxed formulations are very effective with active set and barriersolvers, respectively. Second, <strong>in</strong> deal<strong>in</strong>g with disappear<strong>in</strong>g phases <strong>in</strong> distillationoptimization models, the penalty formulation shows significant performance improvementsover the NCP formulation <strong>in</strong> a previous study. From these results,we f<strong>in</strong>d that well-posed complementarities coupled with NLP problems basedon penalty formulations are efficient and effective strategies for the solution ofMPCCs <strong>in</strong> process eng<strong>in</strong>eer<strong>in</strong>g.AcknowledgementsFund<strong>in</strong>g from the National Science Foundation under grant #CTS-0438279 isgratefully appreciated. The authors also wish to thank Dr. Steven Dirkse forfruitful discussions and help with the NLPEC package.References[1] Anitescu, M., P. Tseng, and S.J. Wright, “Elastic-mode algorithms formathematical programs with equilibrium constra<strong>in</strong>ts: global convergenceand stationarity properties,” Math. Program., 110, pp. 337-371 (2007).26


[2] Burke, J.V. and S-P. Han, A Robust Sequential Quadratic Programm<strong>in</strong>gMethod, Mathematical Program<strong>in</strong>g, 43, pp. 277-303, (1989).[3] Chen, X. and M. Fukushima, “A Smooth<strong>in</strong>g Method for a MathematicalProgram with P-Matrix L<strong>in</strong>ear Complementarity Constra<strong>in</strong>ts,” ComputationOptimization and <strong>Applications</strong>, 27, pp. 223-246 (2004).[4] M. C. Ferris and J. S. Pang, (eds.) Complementarity and Variational <strong>Problem</strong>s:State of the Art, Philadelphia, Pennsylvania, 1997. SIAM Publications.[5] M. C. Ferris, “Complementarity <strong>Problem</strong>s and <strong>Applications</strong>,” presented atSIAM Conference on Optimization, Stockholm (2005)[6] Fletcher, R. and S. Leyffer, “Numerical experience with solv<strong>in</strong>g <strong>MPEC</strong>s asNLPs”, Numerical Analysis Report NA/210, Department of Mathematics,University of Dundee (2002).[7] Gallun, S.E., R.H. Luecke, D.E. Scott, A.M. Morshedi, “Use open equationsfor better models”, Hydrocarbon Process<strong>in</strong>g, pp. 78-90 (1992).[8] GAMS: The Solver Manuals, GAMS Development Corporation, Wash<strong>in</strong>gton,DC (2004)[9] Gett<strong>in</strong>g Started with ROMeo, SimSci-Esscor, 2004.[10] Gopal, V. and L.T. Biegler. “Smooth<strong>in</strong>g methods for Complementarity<strong>Problem</strong>s <strong>in</strong> Process Eng<strong>in</strong>eer<strong>in</strong>g” J. AIChE, 45, 7, pp. 1535-1547, (1999).[11] Hu, X. M. and D. Ralph, “Convergence of a Penalty Method of MathematicalProgramm<strong>in</strong>g with Complementarity Constra<strong>in</strong>ts,” Journal of OptimizationTheory and <strong>Applications</strong>, 123, 2, pp. 365-390 (2004).[12] Heemels, M. PhD Thesis, Technische Universiteit E<strong>in</strong>doven (1999).[13] Kameswaran, S., G. Staus and L. T. Biegler, “Parameter Estimation ofCore Flood and Reservoir Models,” Computers and <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g,29, 8, pp. 1787-1800 (2005).[14] Lang, Y-D and L.T. Biegler, “Distributed Stream Method for Tray Optimization”,AIChE Journal, 48, 3, pp.582-595 (2002).[15] Leyffer, S., G. Lopez-Calva and J. Nocedal, “Interior Methods for MathematicalPrograms with Complementarity Constra<strong>in</strong>ts,” SIAM Journal ofOptimization, 17, 1, pp. 52-77 (2006).[16] Lopez-Calva, G. “Regularization via Exact Penalty Methods” 4th InternationalConference on Complementarity <strong>Problem</strong>s, August 10th, 2005.[17] Luo, Z.Q., J.S. Pang, and D. Ralph, Mathematical Programs with EquilibriumConstra<strong>in</strong>ts, Cambridge University Press, Cambridge, 1996.27


[18] Mittelmann, H.D. and A. Pruessner, “A Server for Automated PerformanceAnalysis of Benchmark<strong>in</strong>g Data”, Optimization Methods and Software, 21,1, pp. 105-120 (2006).[19] <strong>MPEC</strong> Library, http://www.gamsworld.eu/mpec/mpeclib.htm[20] Raghunathan, A. and L. T. Biegler, “<strong>MPEC</strong> <strong>Formulations</strong> and Algorithms<strong>in</strong> Process Eng<strong>in</strong>eer<strong>in</strong>g,” Computers and <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g, 27, pp.1381-1392 (2003)[21] Raghunathan, A., PhD Thesis, Department of <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g,Carnegie Mellon University (2004)[22] Raghunathan, A.U. and L.T. Biegler, “Interior po<strong>in</strong>t methods for MathematicalPrograms with Complementarity Constra<strong>in</strong>ts (MPCCs),” SIAM J.Optimization, 15, 3, pp. 720-750 (2005)[23] Raghunathan, A., M.S. Diaz, L.T. Biegler “An <strong>MPEC</strong> Formulation for DynamicOptimization of Distillation Operation,” Computers and <strong>Chemical</strong>Eng<strong>in</strong>eer<strong>in</strong>g, 28/10 pp. 2037-2052 (2004)[24] Ralph, D. and S. J. Wright, “Some Properties of Regularization and PenalizationSchemes for <strong>MPEC</strong>s,” Optimization Methods and Software, 19,5, pp. 527-556 (2004)[25] Poku, M., L. Biegler, and D. Ternet, Real-Time Optimization (RTO) Onl<strong>in</strong>eTool - ROMeo & IPOPT, PREPRINT, 2005.[26] Scheel, H. and S. Scholtes, “Mathematical Programs with ComplementarityConstra<strong>in</strong>ts: Stationarity, Optimality, and Sensitivity,” Mathematics ofOperations Research, 25, 1, (2000)[27] Scholtes, S. and M. Stöhr, “How Str<strong>in</strong>gent is the L<strong>in</strong>ear Independence Assumptionfor Mathematical Programs with Complementarity Constra<strong>in</strong>ts?”Mathematics of Operations Research, 26, 4, pp. 851-863 (2001).[28] Ste<strong>in</strong>, O., J. Oldenburg, W. Marquardt, “Cont<strong>in</strong>uous Reformulations ofDiscrete-Cont<strong>in</strong>uous Optimization <strong>Problem</strong>s,” Computers & <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g,Vol. 28, No. 10, 2004, 1951 - 1966[29] Stewart, D. and M. Anitescu “Optimal Control of Systems with Discont<strong>in</strong>uousDifferential Equations,” Prepr<strong>in</strong>t number: ANL/MCS-P1258-0605,Mathematics and Computer Science Division, Argonne National Laboratory,June 2005.[30] Young, J.C.C., R. Baker, and C.L.E. Swartz, “Input saturation effects <strong>in</strong>optimiz<strong>in</strong>g control: <strong>in</strong>clusion with<strong>in</strong> a simultaneous optimization framework,”Computers and <strong>Chemical</strong> Eng<strong>in</strong>eer<strong>in</strong>g, 28, pp. 1347-1360 (2004)28

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!