In mathematics and mathematical optimization, the convex conjugate of a function is a generalization of the Legendre transformation which applies to non-convex functions. It is also known as LegendreโFenchel transformation, Fenchel transformation, or Fenchel conjugate (after Adrien-Marie Legendre and Werner Fenchel). The convex conjugate is widely used for constructing the dual problem in optimization theory, thus generalizing Lagrangian duality.
Definition
[edit]Let ๐ {\displaystyle X}
be a real topological vector space and let ๐ {\displaystyle X^{*}}
be the dual space to ๐ {\displaystyle X}
. Denote by
the canonical dual pairing, which is defined by ๐ {\displaystyle \left\langle x^{*},x\right\rangle =x^{*}(x).}
For a function ๐ {\displaystyle f:X\to \mathbb {R} \cup \{-\infty ,+\infty \}}
taking values on the extended real number line, its convex conjugate is the function
whose value at ๐ {\displaystyle x^{*}\in X^{*}}
is defined to be the supremum:
or, equivalently, in terms of the infimum:
This definition can be interpreted as an encoding of the convex hull of the function's epigraph in terms of its supporting hyperplanes.[1]
Examples
[edit]For more examples, see ยง Table of selected convex conjugates.
- The convex conjugate of an affine function ๐ {\displaystyle f(x)=\left\langle a,x\right\rangle -b}
is ๐ {\displaystyle f^{*}\left(x^{*}\right)={\begin{cases}b,&x^{*}=a\\+\infty ,&x^{*}\neq a.\end{cases}}} - The convex conjugate of a power function ๐ {\displaystyle f(x)={\frac {1}{p}}|x|^{p},1<p<\infty }
is ๐ {\displaystyle f^{*}\left(x^{*}\right)={\frac {1}{q}}|x^{*}|^{q},1<q<\infty ,{\text{where}}{\tfrac {1}{p}}+{\tfrac {1}{q}}=1.} - The convex conjugate of the absolute value function ๐ {\displaystyle f(x)=\left|x\right|}
is ๐ {\displaystyle f^{*}\left(x^{*}\right)={\begin{cases}0,&\left|x^{*}\right|\leq 1\\\infty ,&\left|x^{*}\right|>1.\end{cases}}} - The convex conjugate of the exponential function ๐ {\displaystyle f(x)=e^{x}}
is ๐ {\displaystyle f^{*}\left(x^{*}\right)={\begin{cases}x^{*}\ln x^{*}-x^{*},&x^{*}>0\\0,&x^{*}=0\\\infty ,&x^{*}<0.\end{cases}}}
The convex conjugate and Legendre transform of the exponential function agree except that the domain of the convex conjugate is strictly larger as the Legendre transform is only defined for positive real numbers.
Connection with expected shortfall (average value at risk)
[edit]Let F denote a cumulative distribution function of a random variable X. Then (integrating by parts),
๐ {\displaystyle f(x):=\int _{-\infty }^{x}F(u)\,du=\operatorname {E} \left[\max(0,x-X)\right]=x-\operatorname {E} \left[\min(x,X)\right]}
has the convex conjugate
๐ {\displaystyle f^{*}(p)=\int _{0}^{p}F^{-1}(q)\,dq=(p-1)F^{-1}(p)+\operatorname {E} \left[\min(F^{-1}(p),X)\right]=pF^{-1}(p)-\operatorname {E} \left[\max(0,F^{-1}(p)-X)\right].}
Ordering
[edit]A particular interpretation has the transform
๐ {\displaystyle f^{\text{inc}}(x):=\arg \sup _{t}t\cdot x-\int _{0}^{1}\max\{t-f(u),0\}\,du,}
as this is a nondecreasing rearrangement of the initial function f; in particular, ๐ {\displaystyle f^{\text{inc}}=f}
for f nondecreasing.
Properties
[edit]The convex conjugate of a closed convex function is again a closed convex function. The convex conjugate of a polyhedral convex function (a convex function with polyhedral epigraph) is again a polyhedral convex function.
Order reversing
[edit]Declare that ๐ {\displaystyle f\leq g}
if and only if ๐ {\displaystyle f(x)\leq g(x)}
for all ๐ {\displaystyle x.}
Then convex-conjugation is order-reversing, which by definition means that if ๐ {\displaystyle f\leq g}
then ๐ {\displaystyle f^{*}\geq g^{*}.}
For a family of functions ๐ {\displaystyle \left(f_{\alpha }\right)_{\alpha }}
it follows from the fact that supremums may be interchanged that
and from the maxโmin inequality that
Biconjugate
[edit]The convex conjugate of a function is always lower semi-continuous. The biconjugate ๐ {\displaystyle f^{**}}
(the convex conjugate of the convex conjugate) is also the closed convex hull, i.e. the largest lower semi-continuous convex function with ๐ {\displaystyle f^{**}\leq f.}
For proper functions ๐ {\displaystyle f,}
- ๐ {\displaystyle f=f^{**}}
if and only if ๐ {\displaystyle f}
is convex and lower semi-continuous, by the FenchelโMoreau theorem.
The Fenchel inequality (below) implies ๐ {\displaystyle f^{**}\leq f}
. More precisely, ๐ {\displaystyle f^{**}}
is the greatest lower semicontinuous convex function not exceeding ๐ {\displaystyle f}
, often described as the closed convex envelope of ๐ {\displaystyle f}
. In particular, by the FenchelโMoreau theorem, a proper function is equal to its biconjugate if and only if it is convex and lower semicontinuous.[2][3]
Fenchel's inequality
[edit]For any function f and its convex conjugate f *, Fenchel's inequality (also known as the FenchelโYoung inequality) holds for every ๐ {\displaystyle x\in X}
and ๐ {\displaystyle p\in X^{*}}
:
Furthermore, the equality holds only when ๐ {\displaystyle p\in \partial f(x)}
, where ๐ {\displaystyle \partial f(x)}
is the subgradient.
The proof follows from the definition of convex conjugate: ๐ {\displaystyle f^{*}(p)=\sup _{\tilde {x}}\left\{\langle p,{\tilde {x}}\rangle -f({\tilde {x}})\right\}\geq \langle p,x\rangle -f(x).}
Convexity
[edit]For two functions ๐ {\displaystyle f_{0}}
and ๐ {\displaystyle f_{1}}
and a number ๐ {\displaystyle 0\leq \lambda \leq 1}
the convexity relation
holds. The ๐ {\displaystyle {*}}
operation is a convex mapping itself.
Infimal convolution
[edit]The infimal convolution (or epi-sum) of two functions ๐ {\displaystyle f}
and ๐ {\displaystyle g}
is defined as
The operation ๐ {\displaystyle \operatorname {\Box } }
is symmetric (commutative) and associative, i.e.
Let ๐ {\displaystyle f_{1},\ldots ,f_{m}}
be proper, convex and lower semicontinuous functions on ๐ {\displaystyle \mathbb {R} ^{n}.}
Then the infimal convolution is convex and lower semicontinuous (but not necessarily proper),[4] and satisfies
or, equivalently,
which expresses the behaviour of convex conjugation with respect to sums of functions.
The infimal convolution of two functions has a geometric interpretation: The (strict) epigraph of the infimal convolution of two functions is the Minkowski sum of the (strict) epigraphs of those functions.[5]
Maximizing argument
[edit]If the function ๐ {\displaystyle f}
is differentiable, then its derivative is the maximizing argument in the computation of the convex conjugate:
- ๐ {\displaystyle f^{\prime }(x)=x^{*}(x):=\arg \sup _{x^{*}}{\langle x,x^{*}\rangle }-f^{*}\left(x^{*}\right)}
and - ๐ {\displaystyle f^{{*}\prime }\left(x^{*}\right)=x\left(x^{*}\right):=\arg \sup _{x}{\langle x,x^{*}\rangle }-f(x);}
hence
- ๐ {\displaystyle x=\nabla f^{*}\left(\nabla f(x)\right),}
- ๐ {\displaystyle x^{*}=\nabla f\left(\nabla f^{*}\left(x^{*}\right)\right),}
and moreover
- ๐ {\displaystyle f^{\prime \prime }(x)\cdot f^{{*}\prime \prime }\left(x^{*}(x)\right)=1,}
- ๐ {\displaystyle f^{{*}\prime \prime }\left(x^{*}\right)\cdot f^{\prime \prime }\left(x(x^{*})\right)=1.}
Scaling properties
[edit]If for some ๐ {\displaystyle \gamma >0,}
๐ {\displaystyle g(x)=\alpha +\beta x+\gamma \cdot f\left(\lambda x+\delta \right)}
, then
Behavior under linear transformations
[edit]Let ๐ {\displaystyle A:X\to Y}
be a bounded linear operator. For any convex function ๐ {\displaystyle f}
on ๐ {\displaystyle X,}
where
is the preimage of ๐ {\displaystyle f}
with respect to ๐ {\displaystyle A}
and ๐ {\displaystyle A^{*}}
is the adjoint operator of ๐ {\displaystyle A.}
[6]
A closed convex function ๐ {\displaystyle f}
is symmetric with respect to a given set ๐ {\displaystyle G}
of orthogonal linear transformations,
if and only if its convex conjugate ๐ {\displaystyle f^{*}}
is symmetric with respect to ๐ {\displaystyle G.}
Table of selected convex conjugates
[edit]The following table provides Legendre transforms for many common functions as well as a few useful properties.[7]
See also
[edit]References
[edit]- ^ "Legendre Transform". Retrieved April 14, 2019.
- ^ Rockafellar 1970.
- ^ Zฤlinescu 2002, pp. 75โ79.
- ^ Phelps, Robert (1993). Convex Functions, Monotone Operators and Differentiability (2 ed.). Springer. p. 42. ISBN 0-387-56715-1.
- ^ Bauschke, Heinz H.; Goebel, Rafal; Lucet, Yves; Wang, Xianfu (2008). "The Proximal Average: Basic Theory". SIAM Journal on Optimization. 19 (2): 766. CiteSeerX 10.1.1.546.4270. doi:10.1137/070687542.
- ^ Ioffe, A.D. and Tichomirov, V.M. (1979), Theorie der Extremalaufgaben. Deutscher Verlag der Wissenschaften. Satz 3.4.3
- ^ Borwein, Jonathan; Lewis, Adrian (2006). Convex Analysis and Nonlinear Optimization: Theory and Examples (2 ed.). Springer. pp. 50โ51. ISBN 978-0-387-29570-1.
- Arnol'd, Vladimir Igorevich (1989). Mathematical Methods of Classical Mechanics (Second ed.). Springer. ISBN 0-387-96890-3. MR 0997295.
- Rockafellar, R. Tyrrell; Wets, Roger J.-B. (26 June 2009). Variational Analysis. Grundlehren der mathematischen Wissenschaften. Vol. 317. Berlin New York: Springer Science & Business Media. ISBN 9783642024313. OCLC 883392544.
- Rockafellar, R. Tyrell (1970). Convex Analysis. Princeton: Princeton University Press. ISBN 0-691-01586-4. MR 0274683.
- Zฤlinescu, Constantin (30 July 2002). Convex Analysis in General Vector Spaces. River Edge, N.J. London: World Scientific Publishing. ISBN 978-981-4488-15-0. MR 1921556. OCLC 285163112 โ via Internet Archive.
Further reading
[edit]- Touchette, Hugo (2014-10-16). "Legendre-Fenchel transforms in a nutshell" (PDF). Archived from the original (PDF) on 2017-04-07. Retrieved 2017-01-09.
- Touchette, Hugo (2006-11-21). "Elements of convex analysis" (PDF). Archived from the original (PDF) on 2015-05-26. Retrieved 2008-03-26.
- Ellerman, David Patterson (1995-03-21). "Chapter 12: Parallel Addition, Series-Parallel Duality, and Financial Mathematics". Intellectual Trespassing as a Way of Life: Essays in Philosophy, Economics, and Mathematics (PDF). The worldly philosophy: studies in intersection of philosophy and economics. Rowman & Littlefield Publishers, Inc. pp. 237โ268. ISBN 0-8476-7932-2. Archived (PDF) from the original on 2016-03-05. Retrieved 2019-08-09.
Series G - Reference, Information and Interdisciplinary Subjects Series
[1] (271 pages)
- Ellerman, David Patterson (May 2004) [1995-03-21]. "Introduction to Series-Parallel Duality" (PDF). University of California at Riverside. CiteSeerX 10.1.1.90.3666. Archived from the original on 2019-08-10. Retrieved 2019-08-09. [2] (24 pages)
