thinking outside of the black boxw3.impa.br/~sagastiz/sagastizop11.pdf · black box information...

45
N ONSMOOTH O PTIMIZATION : T HINKING O UTSIDE OF THE BLACK BOX Claudia Sagastizábal mailto:[email protected], http://www.impa.br/~sagastiz SIAM Conference on Optimization, Darmstadt, May 18th 2011 With thanks to: AFOSR Grant FA9550-08-1-0370, NSF Grant DMS 0707205, and CNPq & Faperj from Brazil

Upload: others

Post on 18-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

NONSMOOTH OPTIMIZATION:THINKING OUTSIDE OF

THE BLACK BOX

Claudia Sagastizábal

mailto:[email protected], http://www.impa.br/~sagastizSIAM Conference on Optimization,

Darmstadt, May 18th 2011

With thanks to:AFOSR Grant FA9550-08-1-0370, NSF Grant DMS 0707205,

and CNPq & Faperj from Brazil

Page 2: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

NSO AlgorithmsFor a convex nonsmooth function, solving

minf(x)

with a black box method

x

f(x)

g(x) ∈ ∂f(x)is doomed to slow convergence speed.

Page 3: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

NSO AlgorithmsFor a convex nonsmooth function, solving

minf(x)

with a black box method

x

f(x)

g(x) ∈ ∂f(x)is doomed to slow convergence speed.Better performance possible by exploiting structure

Page 4: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How does structure appear?

– Explicitly

as a sum

as a composition

– Implicitly

U-Lagrangian

VU-decomposition

partly smooth functions

Page 5: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How does structure appear?

– Explicitly

as a sum

as a composition

– Implicitly

U-Lagrangian

VU-decomposition

partly smooth functions

Page 6: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How does structure appear?

– Explicitly

as a sum

as a composition

6= black boxes

– Implicitly

U-Lagrangian

VU-decomposition

partly smooth functions

Page 7: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How does structure appear?

– Explicitly

as a sum

as a composition

6= black boxes

– Implicitly

U-Lagrangian

VU-decomposition

partly smooth functions

digging tools

Page 8: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Explicit Structure:

Opening the Black Box

Page 9: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

A convex partly nonsmooth function

For x ∈ IRn, given matrices A� 0, B� 0,f(x) =

√x>Ax+x>Bx

has a unique minimizer at 0.On N (A) the function is not differentiable, andthe first term vanishes: f|N (A) looks smooth.

R(A) N (A)

Page 10: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresIf no structure at all

f(x) =√x>Ax+x>Bx

This defines a sum black box:

Page 11: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresIf no structure at all

f(x) =√x>Ax+x>Bx

This defines the black box :

x

f(x)

g(x) ∈ ∂f(x)

Page 12: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresSum structure

f(x) = f1(x)+ f2(x) with

f1(x) =√x>Ax

f2(x) = x>Bx

This defines a sum black box:

Page 13: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresSum structure

f(x) = f1(x)+ f2(x) with

f1(x) =√x>Ax

f2(x) = x>Bx

This defines a sum black box:

x

f1(x), f2(x)

gj(x) ∈ ∂fj(x)j=1,2

Page 14: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresComposite structure

f(x)= (h◦c)(x) with

c(x) =(x,x>Bx

)∈ IRn+1

h(C) =√C>1:nAC1:n+Cn+1

for C smooth and h positively homogeneous

This defines a composite black box: aeriou

Page 15: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresComposite structure

f(x)= (h◦c)(x) with

c(x) =(x,x>Bx

)∈ IRn+1

h(C) =√C>1:nAC1:n+Cn+1

for C smooth and h positively homogeneous

This defines a composite black box:

x

C := c(x) and h(C)

Jacobian Dc(x) and

G(C) ∈ ∂h(C)

Page 16: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresMissing information structure

Suppose not all of A/B is known/accessible,

so that only estimates are available for f

Page 17: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

This function has several interesting structuresMissing information structure

Suppose not all of A/B is known/accessible,

so that only estimates are available for f

This defines a noisy black box:

x

fx ≈ f(x)

gx ≈ g(x) ∈ ∂f(x)

Page 18: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How to use explicit structure in an algorithm?

Black box information defines pieces that puttogether create a model ϕ of the function f.

The model is used to define iterates not too faraway from a “good” past iterate, x̂. At iteration i,

xi+1 minimizes ϕ(x)+1

2µ|x− x̂|2

“pieces” chosen to make minimization simple (QP)

aeriou

Page 19: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How to use explicit structure in an algorithm?

Black box information defines pieces that puttogether create a model ϕ of the function f.

The model is used to define iterates not too faraway from a “good” past iterate, x̂. At iteration i,

xi+1 minimizes ϕ(x)+1

2µ|x− x̂|2

“pieces” chosen to make minimization simple (QP)

for example, “piece”=linearization:

xifi = f(xi)

gi = g(xi)=⇒ϕ(x) = maxi {fi+gi>(x−xi) }

Page 20: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

How to use explicit structure in an algorithm?

Black box information defines pieces that puttogether create a model ϕ of the function f.

The model is used to define iterates not too faraway from a “good” past iterate, x̂. At iteration i,

xi+1 minimizes ϕ(x)+1

2µ|x− x̂|2

“pieces” chosen to make minimization simple (QP)

for example, “piece”=linearization:

xifi = f(xi)

gi = g(xi)=⇒ϕ(x) = maxi

{fi+gi>(x−xi)

}

Page 21: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

xi+1 = argminxϕ(x)+

1

2µ|x− x̂|2

for example, “piece”=linearization:

xifi = f(xi)

gi = g(xi)=⇒ϕ(x) = maxi

{fi+gi>(x−xi)

}

Some jargonx̂ is a serious point⋃i

(xi, fi,gi) is the bundle B

If xi+1 gives sufficient decrease for f, it becomes the next x̂

Otherwise, it is declared a null point

Page 22: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

xi+1 = argminxϕ(x)+

1

2µ|x− x̂|2

for example, “piece”=linearization:

xifi = f(xi)

gi = g(xi)=⇒ϕ(x) = maxi

{fi+gi>(x−xi)

}

Some jargonx̂ is a serious point⋃i

(xi, fi,gi) is the bundle B

If xi+1 gives sufficient decrease for f, it becomes the next x̂

Otherwise, it is declared a null point

Page 23: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

xi+1 = argminxϕ(x)+

1

2µ|x− x̂|2

for example, “piece”=linearization:

xifi = f(xi)

gi = g(xi)=⇒ϕ(x) = maxi

{fi+gi>(x−xi)

}xdifferent boxes

Some jargon different modelsx̂ is a serious point⋃i

(xi, fi,gi) is the bundle B

If xi+1 gives sufficient decrease for f, it becomes the next x̂

Otherwise, it is declared a null point

Page 24: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Structured models for f

No structureϕ(x) = maxi

{fi+gi>(x−xi)

}= maxi

{(fi1+ f

i2)+(gi1+g

i2)>(x−xi)

}

Sum structureϕ(x) = maxi

{fi1+g

i1>(x−xi)

}+maxi

{fi2+g

i2>(x−xi)

}

Page 25: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Structured models for f

No structureϕ(x) = maxi

{fi+gi>(x−xi)

}= maxi

{(fi1+ f

i2)+(gi1+g

i2)>(x−xi)

}

Sum structureϕ(x) = maxi

{fi1+g

i1>(x−xi)

}+maxi

{fi2+g

i2>(x−xi)

} Larger

QP

Page 26: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Structured models for f

Composite structure ϕ(x) = maxi{Gi>

(c(x̂)+Dc(x̂)(x− x̂)

)}≈h(c(x̂)+Dc(x̂)(x− x̂))

Page 27: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Structured models for f

Composite structure ϕ(x) = maxi{Gi>

(c(x̂)+Dc(x̂)(x− x̂)

)}≈h(c(x̂)+Dc(x̂)(x− x̂))

Good

near x̂

Page 28: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Inexact models for f

Missing structure ϕ(x) = maxi{fi+gi>(x−xi)

}

Page 29: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Inexact models for f

Missing structure ϕ(x) = maxi{fi+gi>(x−xi)

}

ϕ may

cut f

excessive noise is attenuated via µ

Page 30: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Stochastic Programming Applications

Mid-term planning for power generation

Scenario tree with 50,000 nodes

Nuclear LPs with 100,000 variables and 300,000 constraints

Page 31: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Stochastic Programming Applications

Mid-term planning for power generation

Incremental Bundle

fx and gx

Skips Nuclear LPs (alternating) ≡ noisy black box

25% less CPU time than exact bundle, same accuracy

Page 32: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Stochastic Programming Applications

2-stage stochastic linear programs

L-shaped decomposition into N scenarios

Page 33: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Stochastic Programming Applications

2-stage stochastic linear programs

Inexact Bundle

Qx and gx

Skips 80% LPs solution ≡ noisy black box

4 times faster than L-shaped, same accuracy

Page 34: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Combinatorial Optimization Applications

Exponential number of hard constraints

Lagrangian Relaxation

Page 35: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Combinatorial Optimization Applications

Exponential number of hard constraints

Dynamic Bundle

fx and gx

Like “Relax-and-cut”

with increased stability

Page 36: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Extracting

Implicit Structure

Page 37: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

VU Algorithm

Recall that f|N (A) is nice:

R(A) N (A)

V U

Page 38: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

VU Algorithm

Recall that f|N (A) is nice:

R(A) N (A)

bundle QP Newton-move

Page 39: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

VU Algorithm

Recall that f|N (A) is nice:

R(A) N (A)

bundle QP Newton-move

V ?? U

Page 40: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

VU Algorithm

Recall that f|N (A) is nice:

R(A) N (A)

bundle QP Newton-move

V ?? U

Answer: Bundle QP identifies the “ridge” of nonsmoothness

Solve a 2nd QP to create a model of V using ∂ϕ

Page 41: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

VU Algorithm: superlinear “serious” subsequence

Page 42: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Across borders

Constrained problems

minf(x) s.t. c(x)≤ 0

ϕ models the Improvement Function

maxx

{f(x)− f(x̂), [c(x)]+}

(changes with each serious point x̂)

Page 43: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Across borders

Nonconvex problems

ϕ models the Local Convexification

f(x)+1

2η|x− x̂|2

(changes with each serious point x̂)

Page 44: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Across borders

Combinations:

Page 45: THINKING OUTSIDE OF THE BLACK BOXw3.impa.br/~sagastiz/sagastizOP11.pdf · Black box information defines pieces that put together create a model ’ of the function f. The model is

Closing credits: co-authors

– Robert Mifflin

– Alexandre Belloni

– Aris Daniilidis

– Grégory Emiel

– Warren Hare

– Elizabeth Karas (with A. Ribeiro)

– Claude Lemaréchal (with F. Oustry)

– Welington Oliveira (with S. Scheimberg)

– Mikhail Solodov

– Wim Van Ackooij (with R. Henrion and R. Zorgati)