mit15_093j_f09_sdp
TRANSCRIPT
-
8/12/2019 MIT15_093J_F09_sdp
1/51
Introduction to Semidefinite Programming (SDP)
RobertM.Freund
1 Introduction
Semidefinite programming (SDP) isthemost exciting development inmath-ematicalprogramminginthe1990s.SDPhasapplicationsinsuchdiversefields astraditionalconvex constrained optimization, controltheory, andcombinatorial optimization. BecauseSDP is solvable via interior pointmethods,mostoftheseapplicationscanusuallybesolvedveryefficientlyinpracticeaswellasintheory.
2 Review of Linear Programming
Considerthelinearprogrammingprobleminstandardform:
LP : minimize c xs.t. ai x=bi, i= 1, . . . ,m
n+.x
Herexisavectorofnvariables,andwewritec xfortheinnerproduct
jn=1cjxj,etc.
Also,n+ n x0},andwecalln+thenonnegativeorthant.n
:={x |In fact, isaclosed convex cone, whereKiscalledaclosedaconvexcone+ifKsatisfiesthefollowingtwoconditions:
1
-
8/12/2019 MIT15_093J_F09_sdp
2/51
3
Ifx,wK,thenx +wKforallnonnegativescalarsand.Kisaclosedset.
Inwords,LP isthefollowingproblem:
Minimize the linear function c x, subject to the condition that xmust solvemgiven equations ai x=bi, i= 1, . . . , m, and that xmust lie in the closedconvex cone K= n .+
Wewillwritethestandardlinearprogrammingdualproblemas:
m
LD: maximize yibii=1
m
s.t. yiai +s=ci=1
n+.s
GivenafeasiblesolutionxofLP anda feasiblesolution (y, s)ofLD,the
duality gap issimplyc xmi=1yibi = (cmi=1yiai) x=s x0,because x0ands0.WeknowfromLPdualitytheorythatsolongasthepri-malproblemLP isfeasibleandhasboundedoptimalobjectivevalue,thenthe primalandthe dual bothattain theiroptimawithno duality gap.Thatis,thereexistsx and (y, s) feasiblefortheprimalanddual,respectively,
m suchthatc x i=1yi bi =s x = 0.
Facts about Matrices and the Semidefinite Cone
IfXisannnmatrix,thenXisa positivesemidefinite (psd)matrix ifnvTXv0 forany v .
2
-
8/12/2019 MIT15_093J_F09_sdp
3/51
IfXisannnmatrix,thenXisa positive definite (pd)matrix ifnvTXv>0 forany v , v= 0.
LetSn nmatrices, and letSn +thesetof positivesemidefinite (psd)nnsymmetricmatrices. SimilarlyletSn denote thesetof positive definite (pd) n
nsymmetricmatrices.++
LetXandY beanysymmetricmatrices.WewriteX0todenotethatX issymmetricandpositivesemidefinite,andwewriteXYtodenotethatXY 0.WewriteX0todenotethatX issymmetricandpositivedefinite,etc.
2Remark1 Sn+
denote the set of symmetricn denote
{X Sn | X 0} is a closed convex cone in n(n+ 1)/2. of=dimension n
nTo see whythis remark is true, suppose thatX,W S
+
,0.Foranyv n,wehave:vT(X+W)v=vTXv+vTWv0,
. Pickanyscalars
wherebyX n nW S This shows thatS+ +++forwardtoshowthatSn
Recallthefollowingpropertiesofsymmetricmatrices:
IfX Sn,thenX =QDQT forsomeorthonormalmatrixQandsomediagonalmatrixD. (RecallthatQisorthonormalmeansthatQ1=QT,andthatDisdiagonalmeansthattheoffdiagonalentriesofDareallzeros.)
3
isacone.Itisalsostraight-.isaclosedset.
-
8/12/2019 MIT15_093J_F09_sdp
4/51
4
If X = QDQT
asabove, then the columns ofQ forma set of n orthogonaleigenvectorsofX,whoseeigenvaluesarethecorrespondingdiagonalentriesofD.
X 0ifandonlyifX = QDQT wherethe eigenvalues (i.e., thediagonalentriesofD) areallnonnegative.
X 0ifandonlyifX = QDQT wherethe eigenvalues (i.e., thediagonalentriesofD) areallpositive.
IfX0andifXii = 0,thenXij =Xji = 0forallj = 1, . . . , n.ConsiderthematrixMdefinedasfollows:
P vM= T ,v d
whereP 0,v isavector,andd isascalar. ThenM 0ifandonlyifd vTP1v >0.Semidefinite Programming
LetXSn.WecanthinkofXasamatrix,orequivalently,asanarrayofn2 componentsofthe form (x11, . . . , xnn). Wecanalso justthinkofXasanobject (avector)in thespaceSn. AllthreedifferentequivalentwaysoflookingatXwillbeuseful.
WhatwillalinearfunctionofXlooklike? IfC(X) isalinearfunctionofX,thenC(X) canbewrittenasC X,where
n n
C X:= CijXij .i=1j=1
IfX isasymmetricmatrix,thereisnolossofgeneralityinassumingthat
4
-
8/12/2019 MIT15_093J_F09_sdp
5/51
thematrixC isalsosymmetric. Withthisnotation,wearenowreadytodefineasemidefinite program. Asemidefinite program (SDP)isanopti-mizationproblemoftheform:
SDP : minimize C Xs.t. Ai X=bi , i= 1, ... , m,
X0,
Notice that inanSDP thatthevariable isthematrixX, butitmightbehelpfultothinkofXasanarrayofn2 numbersorsimplyasavectorinSn.TheobjectivefunctionisthelinearfunctionC XandtherearemlinearequationsthatXmustsatisfy,namelyAi X=bi , i= 1, . . . , m.ThevariableXalsomust lie in the (closedconvex)coneof positivesemidef-initesymmetricmatricesSn NotethatthedataforSDP consistsofthe+.symmetricmatrixC(whichisthedatafortheobjectivefunction) andthemsymmetricmatricesA1, . . . , Am,andthemvectorb,whichformthemlinearequations.
LetusseeanexampleofanSDP forn=3andm=2. Definethefollowingmatrices:
1 0 1 0 2 8 1 2 3
A1=0 3 7, A2=2 6 0, and C=2 9 0,1 7 5 8 0 4 3 0 7
andb1= 11andb2= 19.ThenthevariableXwillbethe3
3symmetric
matrix:
5
-
8/12/2019 MIT15_093J_F09_sdp
6/51
x11 x12 x13 X= x21 x22 x23 ,x31 x32 x33
andso,forexample,
C X = x11+ 2x12+ 3x13+ 2x21+ 9x22+ 0x23+ 3x31+ 0x32+ 7x33= x11+ 4x12+ 6x13+ 9x22+ 0x23+ 7x33.
since,inparticular,X issymmetric. ThereforetheSDP canbewrittenas:
SDP: minimize x11+ 4x12+ 6x13+ 9x22+ 0x23+ 7x33
s.t.x11+ 0x12+ 2x13+ 3x22+ 14x23+ 5x33 = 11
0x11+ 4x12+ 16x13+ 6x22+ 0x23+ 4x33 = 19 x11 x12 x13
X=x21 x22 x230.x31 x32 x33
NoticethatSDP looksremarkablysimilartoalinearprogram. However,thestandardLPconstraintthatxmust lie in the nonnegativeorthant is re-placedbytheconstraintthatthevariableXmust lie in theconeof positivesemidefinitematrices.Justasx0statesthateachofthencomponentsofxmustbenonnegative,itmaybehelpfultothinkofX
0asstating
thateachoftheneigenvalues ofXmustbenonnegative.
6
-
8/12/2019 MIT15_093J_F09_sdp
7/51
Itiseasytoseethata linearprogramLP isaspecialinstanceofanSDP.Toseeonewayof doingthis,suppose that (c, a1, . . . , am, b1, . . . , bm)comprisethedataforLP.Thendefine:
ai1 0 ... 0
c1 0 ... 0
0 ai2 ... 0 0 c2 ... 0
Ai = ... ... ... ... , i=1, ...,m, and C= ... ... ... ....
0 0 ... ain 0 0 ... cn
ThenLP canbewrittenas:
SDP : minimize C Xs.t. Ai X=bi , i= 1, ... ,m,
Xij = 0, i= 1, . . . , n, j =i + 1, ... , n,X0,
withtheassociationthat
x1 0 . . . 0 0 x2 . . . 0
X= . . . . .. . . . . . . . 0 0 . . . xn
Ofcourse,inpracticeonewouldneverwant toconvertaninstanceof LPintoaninstanceofSDP. TheaboveconstructionmerelyshowsthatSDPincludeslinearprogrammingasaspecialcase.
7
-
8/12/2019 MIT15_093J_F09_sdp
8/51
5 Semidefinite Programming DualityThe dual problemofSDP is defined (or derived fromfirst principles)to be:
m
SDD: maximize yibii=1
m
s.t. yiAi +S=Ci=1
S
0.
One convenient wayofthinkingabout this problem is as follows.Given mul-mtipliersy1, . . . , ym,theobjective istomaximizethe linear function i=1yibi.
TheconstraintsofSDDstatethatthematrixSdefinedas
m
S=C yiAii=1
mustbepositivesemidefinite.Thatis,
m
C yiAi 0.i=1
We illustrate this construction with the example presented earlier. Thedualproblemis:
SDD: maximize 11y1+ 19y2
1 0 1 0 2 8 1 2 3s.t. y10 3 7+ y22 6 0+ S=2 9 0
1 7 5 8 0 4 3 0 7
S0,8
-
8/12/2019 MIT15_093J_F09_sdp
9/51
whichwecanrewriteinthefollowingform:
SDD: maximize 11y1+ 19y2
s.t.
1 1y10y2 2 0y12y2 3 1y18y2 2 0y12y2 9 3y16y2 0 7y10y2 0.3 1y18y2 0 7y10y2 7 5y14y2
It is often easier toseeandto workwitha semidefinite program whenit is presented in the formatofthe dualSDD,sincethevariablesarethemmultipliersy1, . . . , ym.
Asinlinearprogramming,wecanswitchfromoneformatofSDP (pri-
malordual)toanyotherformatwithgreatease,andthereisnolossofgenerality inassuminga particular specific formatfor theprimal orthedual.
Thefollowingpropositionstates thatweakdualitymustholdfor theprimalanddualofSDP:
Proposition5.1 Given a feasible solution Xof SDP and a feasible solu-mtion (y, S)of SDD, the duality gap is C X i=1yibi =S X0. If
m
C X i=1yibi = 0, then Xand (y, S) are each optimal solutions to SDPand SDD, respectively, and furthermore, SX= 0.
Inorder to prove Proposition 5.1, itwill beconvenient toworkwiththetrace ofamatrix,definedbelow:
9
-
8/12/2019 MIT15_093J_F09_sdp
10/51
n
trace(M) = Mjj .j=1
Simplearithmeticcanbeusedto establishthefollowingtwo elementaryidentities:
trace(MN) =trace(NM)A B=trace(ATB)
Proof of Proposition 5.1. Forthefirstpartoftheproposition,wemustshowthatifS0andX0,thenS X0.LetS=PDPT andX=QEQT whereP,QareorthonormalmatricesandD,Earenonnegativediagonalmatrices.Wehave:
S X=trace(STX) =trace(SX) =trace(PDPTQEQT)
n=trace(DPTQEQTP) = Djj (P
TQEQTP)jj 0,j=1
wherethelastinequalityfollowsfromthefactthatallDjj 0andthefactthat the diagonalofthe symmetric positivesemidefinite matrixPTQEQTPmustbenonnegative.
To provethesecond part of the proposition,suppose thattrace(SX) = 0.Thenfromtheaboveequalities,wehave
n
Djj (PTQEQTP)jj = 0.
j=1
10
-
8/12/2019 MIT15_093J_F09_sdp
11/51
6
However, thisimplies thatforeachj = 1, . . . , n, eitherDjj =0orthe(PTQEQTP)jj =0. Furthermore,thelattercaseimpliesthatthej
th rowofPTQEQTP is allzeros. ThereforeDPTQEQTP =0, andsoSX =PDPTQEQT = 0.
q.e.d.
Unlikethecaseof linear programming,wecannot assertthateitherSDP
orSDDwillattaintheirrespectiveoptima,and/orthattherewillbenodualitygap,unlesscertainregularityconditionshold. OnesuchregularityconditionwhichensuresthatstrongdualitywillprevailisaversionoftheSlatercondition,summarizedinthefollowingtheoremwhichwewillnotprove:
Theorem5.1 Let zP and z denote the optimal objective function valuesD
of SDPand SDD, respectively. Suppose that there exists a feasible solutionX of SDP such that X 0, and that there exists a feasible solution (y,S)of SDDsuch that S0. Then both SDP and SDDattain their optimal
values, and zP =zD.
Key Properties of Linear Programming that do
not extend to SDPThefollowingsummarizessomeofthemoreimportantpropertiesoflinearprogrammingthatdonotextendtoSDP:
There maybe afinite or infinite duality gap.The primaland/or dualmayormaynotattain theiroptima. Asnoted above inTheorem5.1, bothprogramswillattaintheircommonoptimal value ifbothprogramshave feasiblesolutions in the interiorof thesemidefinitecone.
There isnofinite algorithmforsolvingSDP. Thereisa simplexalgorithm,butitisnotafinitealgorithm. ThereisnodirectanalogofabasicfeasiblesolutionforSDP.
11
-
8/12/2019 MIT15_093J_F09_sdp
12/51
Givenrational data,the feasibleregionmay havenorationalsolutions.Theoptimalsolutionmaynothaverationalcomponentsorrationaleigenvalues.
Givenrational datawhose binaryencoding issizeL,thenormsofanyfeasibleand/oroptimalsolutionsmayexceed22
L(orworse).
Givenrational datawhose binaryencoding issizeL,thenormsofanyfeasibleand/oroptimalsolutionsmaybelessthan22
L(orworse).
7 SDP in Combinatorial Optimization
SDP haswideapplicability incombinatorialoptimization. AnumberofNPhard combinatorialoptimization problems haveconvexrelaxationsthataresemidefiniteprograms. Inmanyinstances,theSDP relaxationisverytight in practice,and in certain instances in particular, theoptimalsolutiontotheSDP relaxationcanbeconvertedtoafeasiblesolutionfortheorigi-nalproblemwithprovablygoodobjectivevalue. AnexampleoftheuseofSDP incombinatorialoptimizationisgivenbelow.
7.1 An SDP Relaxation of the MAX CUT Problem
LetGbeanundirectedgraphwithnodesN=
{1, . . . , n
},andedgesetE.
Letwij =wji betheweightonedge (i, j),for (i, j)E. Weassumethatwij 0 forall (i, j) E.TheMAXCUTproblemistodetermineasubsetSofthenodesN forwhichthesumoftheweightsoftheedgesthatcrossfromStoitscomplementS ismaximized (where (S:=N\S) .WecanformulateMAXCUT asanintegerprogramasfollows.Letxj = 1
forjSandxj =1forjS.Thenourformulationis:n n
MAXCUT: maximizex1 wij(1 xixj)4
i=1j=1
s.t. xj {1, 1}, j = 1, ... , n.
12
-
8/12/2019 MIT15_093J_F09_sdp
13/51
Nowlet
Y=xxT ,
whereby
Yij =xixj i= 1, . . . , n, j = 1, ... , n.
AlsoletW bethematrixwhose(i, j)th element iswij for i = 1, . . . , n andj = 1, . . . , n.ThenMAXCUTcanbeequivalentlyformulatedas:
n n
MAXCUT : maximizeY,x1
i=1j=1wij WY4
s.t. xj {1, 1}, j = 1, . . . , n
Y=
xx
T
.
Notice inthisproblem that thefirst set ofconstraintsare equivalent toYjj = 1, j= 1, . . . , n.Wethereforeobtain:
n n
MAXCUT: maximizeY,x1
i=1j=1wij WY4
s.t. Yjj = 1, j = 1, . . . , n
Y=xxT .
13
-
8/12/2019 MIT15_093J_F09_sdp
14/51
8
Lastofall,noticethatthematrix Y =xxT
isasymmetricrank1posi-tivesemidefinitematrix. Ifwerelax thisconditionbyremovingtherank-1restriction,weobtainthefollowingrelaxtionofMAXCUT,whichisasemidefiniteprogram:
n n
RELAX: maximizeY14 wij W Y
i=1j=1
s.t. Yjj = 1, j = 1, . . . , n
Y0.
ItisthereforeeasytoseethatRELAXprovidesanupperboundonMAX-CUT,i.e.,
MAXCUTRELAX.
Asitturnsout,onecanalsoprovewithouttoomucheffortthat:
0.87856RELAXMAXCUTRELAX.
Thisisanimpressiveresult,inthatitstatesthatthevalueofthesemidefi-niterelaxation is guaranteedto benomore than 12% higher than thevalueofNPhardproblemMAXCUT.
SDP in Convex Optimization
Asstatedabove,SDP hasverywideapplicationsinconvexoptimization.The typesof constraintsthatcan bemodeled in theSDPframework include:linear inequalities, convexquadraticinequalities,lowerboundsonmatrix
14
-
8/12/2019 MIT15_093J_F09_sdp
15/51
norms, lowerboundsondeterminantsof symmetricpositive semidefinitematrices, lower bounds on the geometricmeanofanonnegativevector, plusmanyothers. Usingtheseandotherconstructions,thefollowingproblems(amongmanyothers)canbecastintheformofasemidefiniteprogram:linearprogramming,optimizingaconvexquadraticformsubjecttoconvexquadraticinequalityconstraints,minimizingthevolumeofanellipsoidthatcoversagiven setofpointsandellipsoids,maximizingthevolumeofanellipsoidthatiscontainedinagivenpolytope,plusavarietyofmaximumeigenvalueandminimumeigenvalueproblems.InthesubsectionsbelowwedemonstratehowsomeimportantproblemsinconvexoptimizationcanbereformulatedasinstancesofSDP.
8.1 SDP for Convex Quadratically Constrained Quadratic
Programming, Part I
Aconvexquadraticallyconstrainedquadraticprogramisaproblemoftheform:
QCQP: minimize xTQ0x+q0Tx+c0
xs.t. xTQix+qi
Tx+ci 0 , i= 1, ... , m,
wheretheQ00andQi 0, i= 1, . . . , m.Thisproblemisthesameas:
QCQP: minimize x,s.t. xTQ0x+q0
Tx+c00xTQix+qi
Tx+ci 0 , i= 1, ... , m.
WecanfactoreachQi into
Qi =MiTMi
15
-
8/12/2019 MIT15_093J_F09_sdp
16/51
forsomematrixMi.Thennotetheequivalence:
xTIMi
T ciMixqiTx 0 xTQix+qiT x+ci 0.
InthiswaywecanwriteQCQPas:
QCQP: minimize x,s.t.
I M0xxTM0
T c0q0Tx+ 0
I MixxTMT i
Tx0 , i= 1, ... ,m.
i ci q
Noticeintheaboveformulationthatthevariablesare andxandthatallmatrixcoefficientsarelinearfunctionsofandx.
8.2 SDP for Convex Quadratically Constrained Quadratic
Programming, Part II
Asitturnsout,thereisanalternativewaytoformulateQCQPasasemidefiniteprogram.Webeginwiththefollowingelementaryproposition.
Proposition8.1 Given a vector x k and a matrix W kk, then1 xT 0 ifandonlyif WxxT .x W
16
-
8/12/2019 MIT15_093J_F09_sdp
17/51
Usingthis proposition, it isstraghtforwardtoshowthatQCQPisequiv-alenttothefollowingsemidefiniteprogram:
QCQP: minimize x,W,
s.t.
c01 12q0T 1 xT 0q0 Q0
x W2
ci 21qiT 1 xT1qi Qi x W 0 , i= 1, . . . ,m2
1 xT.0
x W
Noticeinthisformulationthattherearenow(n+1)(2n+2) variables,butthat
theconstraintsareall linear inequalitiesasopposedtosemidefinite inequal-ities.
8.3 SDP for the Smallest Circumscribed Ellipsoid Problem
A givenmatrixR0anda given pointzcanbeusedtodefineanellipsoidinn:
ER,z :={y|(y z)TR(y z) 1}.
OnecanprovethatthevolumeofER,z isproportionaltodet(R1).
17
-
8/12/2019 MIT15_093J_F09_sdp
18/51
SupposewearegivenaconvexsetX n
describedastheconvexhullofkpoints c1, . . . , ck. We would like tofind an ellipsoidcircumscribingthesekpoints that hasminimumvolume.Our problemcan bewritten in the fol-lowingform:
MCP : minimize vol (ER,z)R,zs.t. ci ER,z, i= 1, ... , k,
whichisequivalentto:
MCP : minimize ln(det(R))R,zs.t. (ci z)TR(ci z) 1, i= 1, . . . , k
R0,
Now factorR =M2 whereM 0(thatis,M isasquarerootof R),andnowMCPbecomes:
MCP: minimize ln(det(M2))M,zs.t. (ci z)TMTM(ci z) 1, i= 1, ... , k,
M0.
Nextnoticetheequivalence:
I Mci Mz(Mci Mz)T 1 0 (ci z)TMTM(ci z) 1
18
-
8/12/2019 MIT15_093J_F09_sdp
19/51
InthiswaywecanwriteMCPas:
MCP: minimizeM,z
2 ln(det(M))
s.t.I
(Mci Mz)TMci Mz
10, i=1, ..., k,
M0.
Lastofall,makethesubstitutiony=Mztoobtain:
MCP: minimize 2 ln(det(M))M,y
s.t.(Mci
Iy)T Mci1y 0, i= 1, ... , k,
M0.
NoticethatthislastprograminvolvessemidefiniteconstraintswhereallofthematrixcoefficientsarelinearfunctionsofthevariablesMandy.How-ever, theobjective function isnot a linear function.It is possible toconvertthis problem further intoa genuine instanceofSDP, because there is awaytoconvertconstraintsoftheform
ln(det(X))
toasemidefinitesystem. Nevertheless,thisisnotnecessary, either from
a theoreticalor a practicalviewpoint, because it turns out that the functionf(X) =ln(det(X)) is extremelywellbehavedand is veryeasyto optimize(bothintheory andinpractice).
19
-
8/12/2019 MIT15_093J_F09_sdp
20/51
Finally,notethataftersolvingtheformulationofMCP above,wecanrecoverthematrixRandthecenterzoftheoptimalellipsoidbycomputing
R=M2 and z=M1y.
8.4 SDP for the Largest Inscribed Ellipsoid Problem
RecallthatagivenmatrixR0andagivenpointzcanbeusedtodefineanellipsoidinn:
ER,z :={x|(xz)TR(xz) 1},
andthatthevolumeofER,z isproportionaltodet(R1).
Supposewearegivenaconvexset X n describedastheintersectionofkhalfspaces{x|(ai)Txbi}, i= 1, . . . , k,thatis,
X={x|Axb}
wherethe ith rowof thematrix A consists of the entriesof the vectorai, i= 1, . . . , k. WewouldliketofindanellipsoidinscribedinXofmaxi-mumvolume.Ourproblemcanbewritteninthefollowingform:
MIP : maximize vol(ER,z)R,z
s.t. ER,z X,whichisequivalentto:
20
-
8/12/2019 MIT15_093J_F09_sdp
21/51
MIP : maximize det(R1)R,zs.t. ER,z {x|(ai)Txbi}, i=1, ..., k
R0,
whichisequivalentto:
MIP : maximize ln(det(R1))R,zs.t. maxx{aiTx|(xz)TR(xz) 1} bi, i= 1, . . . , k
R0.
Foragiven i= 1, . . . , k,thesolutiontotheoptimizationproblemintheithconstraintis
R1aix =z+aTR1aii
withoptimalobjectivefunctionvalue
aiT z+ ai
TR1ai ,
andsoMIP canberewrittenas:
21
-
8/12/2019 MIT15_093J_F09_sdp
22/51
MIP : maximize ln(det(R1))R,z
s.t. aTi z+ aTi R
1ai bi, i= 1, . . . , k R0.
NowfactorR1=M2whereM0 (thatis,M isasquarerootofR1),andnowMIPbecomes:
MIP : maximize ln(det(M2))M,z
s.t. aTi z+ aTi M
TMai bi, i= 1, . . . , k M0,
whichwecanrewriteas:
MIP : maximize 2 ln(det(M))M,zs.t. aTMTMai (bi aiTz)2, i= 1, . . . , k i
bi aiTz0, i= 1, . . . , k M0.
Nextnoticetheequivalence:
(bi ai z)I Mai aTMTMai (bi aTi z)2 T i (Mai)
Ti z)
0 and (bi aT bi aiTz0.
22
-
8/12/2019 MIT15_093J_F09_sdp
23/51
InthiswaywecanwriteMIPas:
MIP : minimize 2 ln(det(M))M,z
(bi aiTz)I Mais.t.(Mai)
T (bi aTi z) 0, i= 1, ... , k,M0.
NoticethatthislastprograminvolvessemidefiniteconstraintswhereallofthematrixcoefficientsarelinearfunctionsofthevariablesMandz.How-ever,theobjectivefunctionisnotalinearfunction. Itispossible,butnotnecessary in practice,toconvertthis problem further intoa genuine instanceofSDP,becausethereisawaytoconvertconstraintsoftheform
ln(det(X))
toa semidefinitesystem. Suchaconversionisnotnecessary,eitherfroma theoreticalor a practicalviewpoint, because it turns out that the functionf(X) =ln(det(X)) is extremelywellbehavedand is veryeasyto optimize(bothintheory andinpractice).
Finally,notethataftersolvingtheformulationof MIP above,wecanrecoverthematrixRoftheoptimalellipsoidbycomputing
R=M2.
8.5 SDP for Eigenvalue Optimization
Therearemanytypesofeigenvalueoptimizationproblemsthatcanbefor-mualatedasSDPs.Atypicaleigenvalueoptimizationproblemistocreate
23
-
8/12/2019 MIT15_093J_F09_sdp
24/51
amatrix
k
S:=B wiAii=1
givensymmetric datamatrices BandAi, i= 1, . . . , k,usingweightsw1, . . . , wk,insuchawaytominimizethedifferencebetweenthelargestandsmallesteigenvalueofS.Thisproblemcanbewrittendownas:
EOP : minimizew,S
max(S) min(S)k
s.t. S=Bi=1
wiAi,
wheremin(S)andmax(S)denotethesmallestandthelargesteigenvalueofS,respectively.WenowshowhowtoconvertthisproblemintoanSDP.
RecallthatScanbefactoredintoS=QDQT whereQisanorthonor-malmatrix (i.e.,Q1=QT)andDisadiagonalmatrixconsistingoftheeigenvaluesofS.Theconditions:
ISI
canberewrittenas:
Q(I)QT QDQT Q(I)QT .
After premultiplyingtheaboveQT and postmultiplyingQ,theseconditionsbecome:
24
-
8/12/2019 MIT15_093J_F09_sdp
25/51
9
IDI
whichareequivalentto:
min(S) and max(S) .
ThereforeEOPcanbewrittenas:
EOP : minimize w,S,,
k
s.t. S=B wiAii=1
ISI.
Thislastproblemisasemidefiniteprogram.
Usingconstructssuchasthoseshownabove,verymanyothertypesofeigenvalueoptimizationproblemscanbeformulatedasSDPs.
SDP in Control Theory
Avarietyofcontrolandsystem problemscan becastandsolvedas instancesofSDP.However,thistopicisbeyondthescopeofthesenotes.
25
-
8/12/2019 MIT15_093J_F09_sdp
26/51
10 Interiorpoint Methods for SDPAttheheartofaninteriorpointmethodisabarrierfunctionthatexertsarepellingforcefromtheboundaryofthefeasibleregion. ForSDP,weneedabarrierfunctionwhosevaluesapproach+ aspointsXapproachtheboundaryofthesemidefiniteconeSn+.
LetX S+n . ThenX willhaveneigenvalues, say1(X), . . . , n(X)(possiblycountingmultiplicities). Wecancharacterizetheinteriorofthesemidefiniteconeasfollows:
intSn 1(X) >0, . . . , n(X)+={XSn | >0}.
AnaturalbarrierfunctiontousetorepelX fromtheboundaryofSn+thenis
n n
ln(
i(X)) =
ln(
i(X)) =
ln(det(X)).
j=1 j=1
ConsiderthelogarithmicbarrierproblemBSDP()parameterizedbythepositivebarrierparameter:
BSDP() : minimize C X ln(det(X))s.t. Ai X=bi , i= 1, ... ,m,
X0.
Letf(X)denotetheobjectivefunctionofBSDP(). Thenitisnottoodifficulttoderive:
26
-
8/12/2019 MIT15_093J_F09_sdp
27/51
f(X) =CX1, (1)
andsotheKarushKuhnTuckerconditionsforBSDP() are:
Ai X=bi , i= 1, ... ,m,
X
0, (2) m CX1= yiAi.
i=1
BecauseX issymmetric,we can factorizeX intoX = LLT . We thencandefine
S=X1=LTL1,
whichimplies
1LTSL=I,
andwecanrewritetheKarushKuhnTuckerconditionsas:
27
-
8/12/2019 MIT15_093J_F09_sdp
28/51
Ai X=bi , i= 1, ... ,m, X0,X=LLT(3)m yiAi +S=C i=1 1I
LTSL= 0.
From the equationsof (3)it follows that if (X,y,S) is a solutionof (3), thenXisfeasibleforSDP, (y, S) isfeasibleforSDD,andtheresultingdualitygapis
n n n n
S X= SijXij = (SX)jj = =n.i=1j=1 j=1 j=1
Thissuggeststhatwe try solvingBSDP() for avariety ofvaluesof as 0.
However,wecannot usuallysolve (3)exactly, because the fourthequationgroup is not linear in the variables.We will instead define aapproximatesolutionoftheKarushKuhnTuckerconditions(3). Beforedoingso,weintroducethefollowingnormonmatrices,calledtheFrobeniusnorm:
n nM:=M M= M2 . ij
i=1j=1
Forsome important propertiesof the Frobeniusnorm,seethe lastsubsection
ofthissection. AapproximatesolutionofBSDP() isdefinedasanysolution (X,y,S) of
28
-
8/12/2019 MIT15_093J_F09_sdp
29/51
Ai X=bi , i= 1, ... ,m, X0,X=LLT(4)m yiAi +S=C i=1 1I
LTSL .
Lemma10.1 If (X, y,S) is a approximate solution of BSDP()and
-
8/12/2019 MIT15_093J_F09_sdp
30/51
10.1 The AlgorithmBasedon the analysis just presented,we are motivated to developthe fol-lowingalgorithm:
Step 0 . Initialization. Datais(X0, y0, S0, 0). k=0. Assumethat(X0, y0, S0) isa approximatesolutionofBSDP(0) forsome knownvalueofthatsatisfies
-
8/12/2019 MIT15_093J_F09_sdp
31/51
Apictureofthisalgorithmlookssomethinglikethis:
31
-
8/12/2019 MIT15_093J_F09_sdp
32/51
Someoftheunresolvedissuesregardingthisalgorithminclude:
howtosetthefractionaldecreaseparameter thederivationoftheNewtonstepD andthemultipliersy
whetherornotsuccessive iterativevalues (Xk, yk, Sk) areapproximatesolutionstoBSDP(k),and
howtogetthemethodstartedinthefirstplace.10.2 The Newton Step
SupposethatX isafeasiblesolutiontoBSDP():
BSDP() : minimize C X ln(det(X))s.t. Ai X=bi , i= 1, ... ,m,
X0.
LetusdenotetheobjectivefunctionofBSDP() byf(X),i.e.,
f(X) =C X ln(det(X)).
Thenwecanderive:
f(X) =CX1
andthequadraticapproximationofBSDP()atX=X canbederivedas:
32
-
8/12/2019 MIT15_093J_F09_sdp
33/51
minimize f(X) + (CX1) (XX) + 1X1(XX) X1(XX) 2 X
s.t. Ai X=bi, i= 1, ... , m.LettingD=XX,thisisequivalentto:
minimize (CX1) D +21X1D X1DD
s.t. Ai D= 0, i= 1, ... , m.
Thesolutionto thisprogramwillbetheNewtondirection. TheKarushKuhnTuckerconditionsforthisprogramarenecessaryandsufficient,andare:
m X1 X1 +X1D = yiAi Ci=1 (8)
Ai D= 0, i= 1, ... ,m.TheseequationsarecalledtheNormal Equations.LetD andy denotethesolution to the Normal Equations.Note in particular from thefirstequationin (8)thatD must besymmetric.Suppose that (D,y) is the (unique)so-lutionof the Normal Equations (8).Weobtain thenewvalueofthe primalvariableXbytakingtheNewtonstep,i.e.,
X =X+D .
Wecan producenewvaluesofthe dualvariables (y, S)bysettingthenewm
valueofytobey andbysettingS =C y Ai. Using (8),then,wei=1
havethat
33
-
8/12/2019 MIT15_093J_F09_sdp
34/51
S =X1X1D X1. (9)
Wehave thefollowingverypowerfulconvergence theoremwhichdemon-strates the quadraticconvergence ofNewtonsmethod forthisproblem,withanexplicit guaranteeoftherange inwhich quadraticconvergencetakesplace.
Theorem10.1 (ExplicitQuadraticConvergenceofNewtonsMethod). Suppose that (X,y, is a approximate solution of BSDP() and
-
8/12/2019 MIT15_093J_F09_sdp
35/51
S
=X1
X1D
X1
=LT
(IL1D
LT
)L1
.Wewillfirstshowthat L1DLT . It turnsoutthat thisisthecrucialfact fromwhicheverythingwillfollownicely. Toprovethis,notethat
m m m
yiAi +S =C= yi
Ai +S =
yiAi +LT(IL1D LT)L1.
i=1 i=1 i=1
TakingtheinnerproductwithD yields:
S D
=LTL1 D LTL1DLTL1 D,
whichwecanrewriteas:
LTSL L1D
LT =I L1D
LT L1D LT L1D LT ,whichwefinallyrewriteas:
L1D LT 2= I
1LTSL L1DLT .
InvokingtheCauchySchwartzinequalityweobtain:
L1D
LT
2
I
1LTSL
L1DLT
L1D
LT
,
fromwhichweseethatL1D LT .Itthereforefollowsthat
X
=L(I+L1D
LT)LT 0and
S
=LT(IL1D LT)L10,sinceL1D LT
-
8/12/2019 MIT15_093J_F09_sdp
36/51
wherewedefineL =LM.Thennotethat
I
1(L
)TSL
=I
1MLTSLM
L1) LT)M=I1MLT(LT(IL1DLT) LM=IM(IL1D
=IMM+M(L1D LT)M=IMM+M(MMI)M= (IMM)(IMM)= (L1D
LT)(L1D
LT).
Fromthiswenexthave:
1
I (L)
T
S
L=(L
1
D
LT
)(L1
D
LT
) (L1
D
LT
)2
2
.
Thisshows that (X
, y
, S
) isa2approximatesolutionofBSDP().q.e.d.
10.3 Complexity Analysis of the Algorithm
Theorem10.2 (Relaxation Theorem). Suppose that (X, y,S)is a -approximate solution of BSDP() and
-
8/12/2019 MIT15_093J_F09_sdp
37/51
+ 1
n = +
n n
= +
n n= .
q.e.d.
Theorem10.3 (ConvergenceTheorem). Suppose that (X0, y0, S0) is aapproximate solution of BSDP(0) and
-
8/12/2019 MIT15_093J_F09_sdp
38/51
Abetterpictureofthisalgorithmlookssomethinglikethis:
38
-
8/12/2019 MIT15_093J_F09_sdp
39/51
Theorem10.4 (ComplexityTheorem). Suppose that (X0, y0, S0)is a= 14approximate solution of BSDP(0). In order to obtain primal anddual feasible solutions (Xk, yk, Sk) with a duality gap of at most , one needsto run the algorithm for at most
k= 6
n ln1.25X0S00.75
iterations.
Proof: Letkbeasdefinedabove.Notethat1 1
= 1
+
n= 1 2+4n = 1 2 + 41n1 6
1
n.12
Therefore
k 1 61
n
k0.
Thisimpliesthat
CXk m biyik =Xk Sk kn(1 + ) 1
61
n
k(1.25n0)
i=1
k1 X0 S0 1 6n (1.25n) 0.75n ,
from (5).Takinglogarithms,weobtain
m ln C Xk
biy
k k ln 1 + ln 1.25X0 S0 i 1 6n 0.75 i=1
1.25X0
6
kn
+ ln0.75
S01.25X0 S0 1.25 ln0.75
+ ln
0.75X0S0 = ln().
m
ThereforeCXk i=1
biyik .
q.e.d.
39
-
8/12/2019 MIT15_093J_F09_sdp
40/51
10.4 How to Start the Method from a Strictly Feasible PointThealgorithmand its performancerelieson having astarting point (X0, y0, S0)thatisaapproximatesolutionofthe problemBSDP(0).Inthissubsec-tion,weshowhowtoobtainsuchastartingpoint,givenapositivedefinitefeasiblesolutionX0ofSDP.
Wesupposethatwearegivenatargetvalue0 ofthebarrierparam-eter, andwearegivenX = X0 that isfeasible forBSDP(0), that is,Ai X
0=bi, i= 1, . . . , m,andX00. Wewillattempttoapproximately
solveBSDP(0)startingatX=X0,usingtheNewtondirectionateachiteration.Theformalstatementofthealgorithmisasfollows:
Step0. Initialization. Datais (X0, 0).k= 0.AssumethatX0satisfiesAi X
0=bi, i= 1, ... ,m,X00.
Step1. SetCurrentvalues. X =Xk.FactorX =LLT .Step2. ComputeNewtonDirectionandMultipliers. ComputetheNewtonstepD
forBSDP(0) atX=X bysolvingthefollowingsystemofequations in thevariables (D, y):
m X1+0X1DX1 C0 = yiAii=1 (10)
Ai D= 0, i= 1, ... , m.
Denote thesolution to thissystem by (D,y).Set
m
S =C yiAi.i=1
Step 3. Test the Current Point. IfL1D LT 1 ,stop. Inthis4case,X isa14 approximatesolutionofBSDP(
0),along withthedualval-
ues (y,S).
40
-
8/12/2019 MIT15_093J_F09_sdp
41/51
Step4. UpdatePrimalPoint.X =X+D
where0.2
= .L1D LTAlternatively,canbecomputedbyalinesearchoff0(X+ D).
Step 5. Reset Counter and Continue. Xk+1 X , k k+1. GotoStep1.
ThefollowingpropositionvalidatesStep3ofthealgorithm:
Proposition10.1 Suppose that (D,y) is the solution of the Normal equa-tions (10) for the point X for the given value 0 of the barrier parameter,and that
L1DLT 14.
Then X is a 14approximate solution of BSDP(0).
m
Proof: Wemustexhibitvalues (y, S) thatsatisfy yiAi +S=Candi=1
1LTSL 1.I0 4m
Let (D,y) solvethe Normalequations (10),and letS =C yiAi.Then
i=1
wehavefrom (10)that
1 1I
0LTS
L =I0
LT(0(LTL1LTL1D LTL1))L =L1DLT ,
whereby
I10 L
T
S
L=L1D
LT
41.
q.e.d.
41
-
8/12/2019 MIT15_093J_F09_sdp
42/51
ThenextpropositionshowsthatwheneverthealgorithmproceedstoStep4,thentheobjectivefunctionf0(X) decreasesbyatleast0.0250:
Proposition10.2 Suppose that X satisfies Ai X =bi, i= 1, . . . , m, andX0. Suppose that (D,y)is the solution of the Normal equations (10)
for the point X for a given value 0of the barrier parameter, and that
L1DLT>1.4
Then for all [0, 1), 2
f0 X +L1
D
LT
D
f0(X) + 0
L1D
LT
+
2(1 ).
In particular,
0.2 f0 X + L1D LT D f0(X) 0.0250. (11)
Inorderto provethis proposition,wewillneedtwo powerful factsaboutthelogarithmfunction:
Fact1. Supposethat|x|
-
8/12/2019 MIT15_093J_F09_sdp
43/51
q.e.d.Fact2. SupposethatRSn andthatR
-
8/12/2019 MIT15_093J_F09_sdp
44/51
Then
f0 X +L1D
LTD
= f0 X +D
= C X +C D 0ln(det(L(I+L1DLT)LT))
= C X 0ln(det(X)) + C D 0ln(det(I+L1D LT)) f0(X) + CD 0IL1D LT +0 22(1)= f0(X) + C D
0LTL1 D +0
2(1
2)
= f0(X) + C0X1 D +0 2 2(1) .
Now, (D , y) solvetheNormalequations:
m X1+0X1D X1 C0 =y Ai ii=1 (12)
Ai D = 0, i= 1, ... ,m.
Takingtheinnerproductofbothsidesofthefirstequationabovewith Dandrearrangingyields:
0X1D
X1D
=(C0X1) D .Substitutingthisinourinequalityaboveyields:
44
-
8/12/2019 MIT15_093J_F09_sdp
45/51
f0 X +L1D
LTD
f0(X) 0X1D X1D +0 2(12)= f0(X) 0L1D LT L1D LT +0 2(12)= f0(X) 0L1D LT2+0 2(12)= f0(X) 0L1D LT+0 22(1)
= f0(X
) +
0
L1
D
LT
+2(1
2) .
Subsituting= 0.2andandL1DLT>14 yieldsthefinalresult.q.e.d.
Lastofall,weproveaboundonthenumberofiterationsthatthealgo-rithmwillneedinordertofinda14 approximatesolutionofBSDP(
0):
Proposition10.3 Suppose that X0satisfies Ai X0=bi, i= 1, . . . , m, andX00. Let 0 be given and let f
0 be the optimal objective function valueof BSDP(0). Then the algorithm initiated at X0will find a 14approximate
solution of BSDP(0) in at most
f0(X0) fk=
00.0250
iterations.
Proof: This follows immediatelyfrom (11).Each iteration that isnota14-approximatesolutiondecreasestheobjectivefunctionf0(X) ofBSDP(0)byatleast0.0250.Therefore,therecannotbemorethan
f0(X0) f00.025
0
iterationsthatarenot14 approximatesolutionsofBSDP(0).
q.e.d.
45
-
8/12/2019 MIT15_093J_F09_sdp
46/51
10.5 Some Properties of the Frobenius NormProposition10.4 If MSn, then
n
1. M = (j(M))2,where 1(M), 2(M), . . . , n(M) is an enu-j=1
meration of the neigenvalues of M.
2. If is any eigenvalue of M, then || M.3. |trace(M)| nM.4. If M0,andsoM0.q.e.d.
Proposition10.5 If A,BSn, then AB AB.Proof: Wehave 2 n n n
AB= AikBkji=1j=1 k=1
46
-
8/12/2019 MIT15_093J_F09_sdp
47/51
= =
n n n n
ik kj A2 B2i=1j=1 k=1 k=1
n n n nA2 B2ik kj AB.
i=1k=1 j=1k=1
q.e.d.
11 Issues in Solving SDP using the Ellipsoid Al-gorithm
Toseehowtheellipsoidalgorithmisusedtosolveasemidefiniteprogram,assumeforconveniencethattheformatoftheproblemisthatofthedualproblemSDD.Thenthefeasibleregionoftheproblemcanbewrittenas:
m
F= (y1, . . . , ym) m |Ci=1
yiAi 0 ,
andtheobjectivefunctionis mi=1yibi. NotethatF isjustaconvexre-gioninm.
Recallthatatany iterationoftheellipsoidalgorithm,thesetofsolutionsofSDDis knownto lie inthecurrentellipsoid,andthecenterofthecurrentellipsoidis,say,y= (y1, . . . ,ym). Ify F,thenweperformanoptimality
m mcutoftheform i=1yibi i=1yibi,andusestandardformulastoupdatetheellipsoidanditsnewcenter.If F,thenweperformafeasibilitycuty /bycomputingavectorh m suchthathTy > hTy forallyF.
TherearefourissuesthatmustberesolvedinordertoimplementtheaboveversionoftheellipsoidalgorithmtosolveSDD:
47
-
8/12/2019 MIT15_093J_F09_sdp
48/51
1.Testingify F. ThisisdonebycomputingthematrixS=Cmi=1yiAi. IfS 0,then y F. TestingifthematrixS 0can
bedonebycomputinganexactCholeskyfactorizationofS,whichtakesO(n3) operations,assumingthatcomputingsquarerootscanbeperformed (exactly)inoneoperation.
2.Computingafeasibilitycut. Asabove,testingifS 0canbecom-puted inO(n3) operations.IfS0,thenagainassumingexactarith-metic,wecanfindannvectorvsuchthatTS 3)oper-v v
-
8/12/2019 MIT15_093J_F09_sdp
49/51
determinedbyexaminingtheinputlengthofthedata,asisthecaseinlinearprogramming.Oneneedstoknowsomespecialinformationabout the specificproblemathand inorderto determinerbeforesolvingthesemidefiniteprogram.
12 Current Research in SDP
Therearemanyveryactiveresearchareasinsemidefiniteprogramminginnonlinear (convex) programming, in combinatorialoptimization,and in con-troltheory. Intheareaofconvexanalysis,recentresearchtopicsincludethe geometryand the boundarystructureofSDPfeasibleregions (including
notionsofdegeneracy)andresearchrelatedtothecomputationalcomplex-ityofSDP suchasdecidabilityquestions,certificatesofinfeasibility,anddualitytheory. Intheareaofcombinatorialoptimization,therehasbeenmuchresearchonthe practicalandthetheoreticaluseofSDPrelaxationsofhardcombinatorialoptimizationproblems.Asregardsinteriorpointmeth-ods,thereareahostofresearchissues,mostlyinvolvingthedevelopmentofdifferentinteriorpointalgorithmsandtheirproperties,includingratesofconvergence,performanceguarantees,etc.
13 Computational State of the Art of SDP
BecauseSDPhas so manyapplications,and because interior pointmethodsshowsomuchpromise,perhapsthemostexcitingareaofresearchonSDPhas to do withcomputation and implementationof interior pointalgorithmsforsolvingSDP. MuchresearchhasfocusedonthepracticalefficiencyofinteriorpointmethodsforSDP. However, intheresearchtodate,com-putationalissueshavearisenthataremuchmorecomplexthanthoseforlinearprogramming,andthesecomputationalissuesareonlybeginningtobe wellunderstood.They probablystem from a varietyof factors, includingthe factthatSDPisnot guaranteedto havestrictlycomplementaryoptimalsolutions (as is the case in linear programming). Finally, becauseSDP is
suchanewfield,thereisnorepresentativesuiteofpracticalproblemsonwhichtotestalgorithms,i.e.,thereisnoequivalentversionofthe netlibsuiteofindustriallinearprogrammingproblems.
49
-
8/12/2019 MIT15_093J_F09_sdp
50/51
Agoodwebsiteforsemidefiniteprogrammingis:
50
-
8/12/2019 MIT15_093J_F09_sdp
51/51
MIT OpenCourseWarehttp://ocw.mit.edu
15.093J / 6.255J Optimization Methods
Fall2009
For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
http://ocw.mit.edu/http://ocw.mit.edu/termshttp://ocw.mit.edu/termshttp://ocw.mit.edu/