0% found this document useful (0 votes)
137 views16 pages

Closed Functions - Conjugate Function

The document discusses conjugate functions. It defines a conjugate function as the supremum of xy - f(x) for all x in the domain of f. The conjugate of a function is always closed and convex, even if the original function is not. Properties of conjugate functions include Fenchel's inequality and that the second conjugate of a closed convex function is equal to the original function. Examples of calculating conjugates of common functions like norms and indicator functions are provided.

Uploaded by

hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
137 views16 pages

Closed Functions - Conjugate Function

The document discusses conjugate functions. It defines a conjugate function as the supremum of xy - f(x) for all x in the domain of f. The conjugate of a function is always closed and convex, even if the original function is not. Properties of conjugate functions include Fenchel's inequality and that the second conjugate of a closed convex function is equal to the original function. Examples of calculating conjugates of common functions like norms and indicator functions are provided.

Uploaded by

hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

L.

Vandenberghe EE236C (Spring 2013-14)

8. Conjugate functions

• closed functions

• conjugate function

8-1
Closed set

a set C is closed if it contains its boundary:

xk ∈ C, xk → x̄ =⇒ x̄ ∈ C

operations that preserve closedness

• the intersection of (finitely or infinitely many) closed sets is closed

• the union of a finite number of closed sets is closed

• inverse under linear mapping: {x | Ax ∈ C} is closed if C is closed

Conjugate functions 8-2


Image under linear mapping

the image of a closed set under a linear mapping is not necessarily closed

example (C is closed, AC = {Ax | x ∈ C} is open):

R2+
 
C = {(x1, x2) ∈ | x1x2 ≥ 1}, A= 1 0 , AC = R++

sufficient condition: AC is closed if

• C is closed and convex


• and C does not have a recession direction in the nullspace of A. i.e.,

Ay = 0, x̂ ∈ C, x̂ + αy ∈ C ∀α ≥ 0 =⇒ y=0

in particular, this holds for any A if C is bounded

Conjugate functions 8-3


Closed function

definition: a function is closed if its epigraph is a closed set

examples

• f (x) = − log(1 − x2) with dom f = {x | |x| < 1}


• f (x) = x log x with dom f = R+ and f (0) = 0
• indicator function of a closed set C: f (x) = 0 if x ∈ C = dom f

not closed

• f (x) = x log x with dom f = R++, or with dom f = R+ and f (0) = 1


• indicator function of a set C if C is not closed

Conjugate functions 8-4


Properties

sublevel sets: f is closed if and only if all its sublevel sets are closed

minimum: if f is closed with bounded sublevel sets then it has a minimizer

common operations on convex functions that preserve closedness

• sum: f + g is closed if f and g are closed (and dom f ∩ dom g 6= ∅)

• composition with affine mapping: f (Ax + b) is closed if f is closed

• supremum: supα fα(x) is closed if each function fα is closed

Conjugate functions 8-5


Outline

• closed functions

• conjugate function
Conjugate function

f (x)
xy
the conjugate of a function f is

f ∗(y) = sup (y T x − f (x))


x∈dom f
x

(0, −f ∗(y))

f ∗ is closed and convex even if f is not

Fenchel’s inequality

f (x) + f ∗(y) ≥ xT y ∀x, y

(extends inequality xT x/2 + y T y/2 ≥ xT y to non-quadratic convex f )

Conjugate functions 8-6


Quadratic function

1 T
f (x) = x Ax + bT x + c
2

strictly convex case (A ≻ 0)

1
f (y) = (y − b)T A−1(y − b) − c

2

general convex case (A  0)

1
f ∗(y) = (y − b)T A†(y − b) − c, dom f ∗ = range(A) + b
2

Conjugate functions 8-7


Negative entropy and negative logarithm

negative entropy

n
X n
X
f (x) = xi log xi f ∗(y) = eyi−1
i=1 i=1

negative logarithm

n
X n
X
f (x) = − log xi f ∗(y) = − log(−yi) − n
i=1 i=1

matrix logarithm

f (X) = − log det X (dom f = Sn++) f ∗(Y ) = − log det(−Y ) − n

Conjugate functions 8-8


Indicator function and norm

indicator of convex set C: conjugate is support function of C



0 x∈C
f (x) = f ∗(y) = sup y T x
+∞ x ∈
6 C x∈C

norm: conjugate is indicator of unit dual norm ball



0 kyk∗ ≤ 1
f (x) = kxk f ∗(y) =
+∞ kyk∗ > 1

(see next page)

Conjugate functions 8-9


proof: recall the definition of dual norm:

kyk∗ = sup xT y
kxk≤1

∗ T

to evaluate f (y) = supx y x − kxk we distinguish two cases

• if kyk∗ ≤ 1, then (by definition of dual norm)

y T x ≤ kxk ∀x

and equality holds if x = 0; therefore supx (y T x − kxk) = 0

• if kyk∗ > 1, there exists an x with kxk ≤ 1, xT y > 1; then

f ∗(y) ≥ y T (tx) − ktxk = t(y T x − kxk)

and r.h.s. goes to infinity if t → ∞

Conjugate functions 8-10


The second conjugate

f ∗∗(x) = sup (xT y − f ∗(y))


y∈dom f ∗

• f ∗∗ is closed and convex

• from Fenchel’s inequality (xT y − f ∗(y) ≤ f (x) for all y and x):

f ∗∗(x) ≤ f (x) ∀x

equivalently, epi f ⊆ epi f ∗∗ (for any f )

• if f is closed and convex, then

f ∗∗(x) = f (x) ∀x

equivalently, epi f = epi f ∗∗ (if f is closed convex); proof on next page

Conjugate functions 8-11


proof (f ∗∗ = f if f is closed and convex): by contradiction
suppose (x, f ∗∗(x)) 6∈ epi f ; then there is a strict separating hyperplane:
 T  
a z−x
≤c<0 ∀(z, s) ∈ epi f
b s − f ∗∗(x)

for some a, b, c with b ≤ 0 (b > 0 gives a contradiction as s → ∞)

• if b < 0, define y = a/(−b) and maximize l.h.s. over (z, s) ∈ epi f :

f ∗(y) − y T x + f ∗∗(x) ≤ c/(−b) < 0

this contradicts Fenchel’s inequality


• if b = 0, choose ŷ ∈ dom f ∗ and add small multiple of (ŷ, −1) to (a, b):
 T  
a + ǫŷ z−x ∗ T ∗∗

≤ c + ǫ f (ŷ) − x ŷ + f (x) < 0
−ǫ s − f ∗∗(x)

now apply the argument for b < 0

Conjugate functions 8-12


Conjugates and subgradients

if f is closed and convex, then

y ∈ ∂f (x) ⇐⇒ x ∈ ∂f ∗(y) ⇐⇒ xT y = f (x) + f ∗(y)

proof: if y ∈ ∂f (x), then f ∗(y) = supu(y T u − f (u)) = y T x − f (x)

f ∗(v) = sup (v T u − f (u))


u

≥ v T x − f (x)
= xT (v − y) − f (x) + y T x
= f ∗(y) + xT (v − y)

for all v; therefore, x is a subgradient of f ∗ at y (x ∈ ∂f ∗(y))

reverse implication x ∈ ∂f ∗(y) =⇒ y ∈ ∂f (x) follows from f ∗∗ = f

Conjugate functions 8-13


Some calculus rules

separable sum

f (x1, x2) = g(x1) + h(x2) f ∗(y1, y2) = g ∗(y1) + h∗(y2)

scalar multiplication: (for α > 0)

f (x) = αg(x) f ∗(y) = αg ∗(y/α)

addition to affine function

f (x) = g(x) + aT x + b f ∗(y) = g ∗(y − a) − b

infimal convolution

f (x) = inf (g(u) + h(v)) f ∗(y) = g ∗(y) + h∗(y)


u+v=x

Conjugate functions 8-14


References

• J.-B. Hiriart-Urruty, C. Lemaréchal, Convex Analysis and Minimization


Algoritms (1993), chapter X

• D.P. Bertsekas, A. Nedić, A.E. Ozdaglar, Convex Analysis and


Optimization (2003), chapter 7.

• R. T. Rockafellar, Convex Analysis (1970)

Conjugate functions 8-15

You might also like