Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Master of Arts
In
Mathematics
by
Miriam Schussler
May 2013
Copyright by
Miriam Schussler
2013
CERTIFICATION OF APPROVAL
Sergei Ovchinnikov
Professor of Mathematics
Eric Hayashi
Professor of Mathematics
Alex Schuster
Professor of Mathematics
DIFFERENCE OF CONVEX FUNCTIONS
Miriam Schussler
San Francisco State University
2013
The main goal of this thesis is to describe a wide class of functions on a closed
bounded interval that are representable as the difference of two convex functions
(DC functions). The necessary background for this thesis is given in Chapter 1.
Then, in Chapter 2 we begin with three proofs that a piecewise linear function (PL
function) is a lattice polynomial on its linear components. We then show that any
PL function is representable as the difference of two convex functions. In Chapter 3
we investigate DC functions from an analytic standpoint. To motivate our proof we
represent a PL function as the indefinite integral of the step function which is its left
derivative. We then use the positive and negative variation of this step function to
construct two convex functions whose difference is equal to our original PL function.
This construction is possible because a PL function is the integral of a function of
bounded variation (BV functions). We conclude this thesis with an extension of this
proof to functions on a closed interval which are indefinite integrals of BV functions.
I certify that the Abstract is a correct representation of the content of this thesis.
I would also like to offer my thanks to Dr. Hayashi and Dr. Schuster for
being on my thesis committee, and thank them for carefully reading my
thesis and suggesting improvements.
v
TABLE OF CONTENTS
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Definition and Examples of PL-functions . . . . . . . . . . . . . . . . 2
1.2 Lattices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.1 R as a Distributive Lattice . . . . . . . . . . . . . . . . . . . . 5
1.2.2 Lattice polynomials . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2.3 Translation Invariance . . . . . . . . . . . . . . . . . . . . . . 7
3 Analytic Proofs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3.1 A PL function is the difference of the indefinite integrals of increasing
step functions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3.2 Indefinite Integrals of BV Functions are DC Functions . . . . . . . . 27
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
vi
LIST OF FIGURES
vii
Chapter 1
Introduction
1
2
Definition 1.1. Let R denote the set of real numbers, and let I = [a, b] be a closed,
bounded, nondegenerate interval in R. A function F on I is said to be piecewise
linear (in a narrow sense) if there is a strictly increasing sequence a = x0 , . . . , xn = b
of points in I and a sequence of (not necessarily distinct) linear functions G1 , . . . , Gn
on I such that
F (x) = Gk (x), if x ∈ Ik , for 1 ≤ k ≤ n,
1.2 Lattices
In this section we review the definition of a lattice and some properties of lattices
which will be used in Chapter 2.
Definition 1.2. [2, p.6] A lattice is a poset P any two of whose elements has a
greatest lower bound or “meet” denoted by x ∧ y , and a least upper bound or “join”
denoted by x ∨ y.
Lemma 1.1. [2, p.8] In any lattice the operations of meet and join satisfy the
following laws:
L1. x ∧ x = x, x ∨ x = x (Idempotent)
L2. x ∧ y = y ∧ x (Commutative)
L3. x ∧ (y ∧ z) = (x ∧ y) ∧ z, x ∨ (y ∨ z) = (x ∨ y) ∨ z (Associative)
L4. x ∧ (x ∨ y) = x ∨ (x ∧ y) = x (Absorbtion)
x ∧ y = x and x ∨ y = y.
Definition 1.3. [2, p. 12] A lattice L is distributive if and only if the following
properties hold for all x, y, z ∈ L:
5
L5. x ∧ (y ∨ z) = (x ∧ y) ∨ (x ∧ z)
L6. x ∨ (y ∧ z) = (x ∨ y) ∧ (x ∨ z)
It follows from L2 and L3 that we can denote the meet and join of a finite number
^ _
of elements by and respectively.
Let J be a finite set of indices, and each Sj , j ∈ J, be a subset of {1, . . . , n}.
^ _ ^ _
We omit brackets in expressions such as ai and write ai .
j∈J i∈Sj j∈J i∈Sj
a, if a ≤ b,
a, if a ≥ b,
a∧b= and a∨b=
b,
if a > b, b,
if a < b,
for a, b ∈ R.
Theorem 1.2. The set R, equipped with operations ∧ and ∨ is a distributive lattice.
Proof. The number x ∧ y is the lesser of x and y while x ∨ y is the greater of x and
y. Both x ∧ (y ∨ z) and (x ∧ y) ∨ (x ∧ z) are equal to x if x is smaller than y or z.
6
Definition 1.4. [2, p.30] Expressions involving the symbols ∧, ∨ and letters are
called lattice polynomials. More formally, let us define individual letters x, y, z,
. . . as polynomials of weight one. Recursively, if p and q are lattice polynomials of
weights w and w0 , respectively, then p ∧ q and p ∨ q are called lattice polynomials of
weight w + w0 .
^ _
p(a1 , . . . , an ) = ai ,
j∈J i∈Sj
where J is a set of indices and each Sj , j ∈ J, is a subset of the set {1, . . . , n}.
1
In fact a lattice satisfies L5 if and only if it satisfies L6. Also, the proof given here that R is a
distributive lattice is valid not only for R but for any chain. [2, pp.11-12]
7
^ ^
ai + u = (ai + u) (i)
i∈S i∈S
_ _
ai + u = (ai + u) (ii)
i∈S i∈S
p(a1 , . . . , an ) + u = p(a1 + u, . . . , an + u)
Proof.
^ _
p(a1 , ..., an ) + u = ai + u by Lemma 1.3
j∈J i∈Sj
^ _
= ( ai + u) by Lemma 1.4(i)
j∈J i∈Sj
^ _
= (ai + u) by Lemma 1.4(ii)
j∈J i∈Sj
^ _
(F + L)(x) = (Gi + L)(x),
j∈J i∈Sj
^ _
then F (x) = Gi (x).
j∈J i∈Sj
^ _
(F + L)(x) = (Gi + L)(x),
j∈J i∈Sj
then
^ _
F (x) = (F + L)(x) − L(x) = (Gi + L)(x) − L(x)
j∈J i∈Sj
^ _
= Gi (x) + L(x) − L(x)
j∈J i∈Sj
^ _
= Gi (x) + L(x) − L(x) by Lemma 1.5
j∈J i∈Sj
^ _
= Gi (x).
j∈J i∈Sj
Chapter 2
9
10
rigorous proof.
in this case.
F (b)−F (a)
Let L be a linear function defined by L(x) = F (a) + b−a
· (x − a). For
each 1 ≤ k ≤ n, we have F − L = (Gk − L) over Ik , so F − L is a PL function
with components G1 − L, ..., Gn − L (See Figure 2.1). Without loss of generality we
can assume that (F − L)(x) 6= 0 for some x ∈ I. We consider the case in which
(F − L)(x) > 0. The maximum of a PL function occurs at a vertex. Because
(F − L)(a) = (F − L)(b) = 0, the maximum of F − L cannot be at an endpoint.
Therefore max{(F − L)(x)} = (F − L)(xk ) for some k ∈ {1, ..., n − 1}.
Let us define two PL-functions F1 and F2 on I by:
Suppose that the induction hypothesis holds when a PL function has fewer than
n components. We see that (F − L)(x) = (F1 − L)(x) ∧ (F2 − L)(x). Thus we can
write (F − L)(x) as a lattice polynomial and therefore as a meet of joins (Lemma
12
1.3) :
^ _
(F − L)(x) = (Gi − L)(x).
j∈J i∈Sj
^ _
F (x) = Gi (x).
j∈J i∈Sj
If min{(F − L)(x)} < 0, we may choose the minimum rather that the maximum
value and the proof is similar.
For the second proof, we begin by introducing two sets in the plane defined by a
function F on I. The epigraph of F is a subset of the plane defined by
⇔ y ≥ F (x) ∨ G(x)
\ _ [ ^
epi(Fj ) = epi Fj and epi(Fj ) = epi Fj , (2.1)
j∈J j∈J j∈J j∈J
for a finite family of functions {Fj }j∈J on I. (For hypographs we have ‘dual’ rela-
tions: hyp(F ) ∩ hyp(G) = hyp(F ∧ G) and hyp(F ) ∪ hyp(G) = hyp(F ∨ G).)
We now find an expression for the epigraph of a PL function in terms of unions
and intersections, and then use the relationship in Equation 2.1 to write the function
as a lattice polynomial.
FA ⊆ epi(F).
Proof. The set FA is not empty because it contains A. Suppose that there is a point
B such that B ∈ FA and B ∈
/ epi(F). Then B belongs to the interior of hyp(F ).
Let C be the point closest to A at which the segment AB intersects the graph of F .
The point C exists because AB and the graph of F are closed subsets of the plane.
There is an an edge ek , which contains C. Therefore, ek is visible from A but B is
in the interior of hyp(Gk ). This contradicts our assumption that B ∈ FA . Hence,
FA ⊆ epi(F).
[
Proof. By Lemma 2.2, FA = epi(F ). Since each JA is a subset of a finite set,
A∈epi(F )
there are only a finite number of distinct sets JA . Let {Sj }j∈J be the finite family
of distinct subsets JA . Then we have
[ [ \
epi(F ) = FA = epi(Gi ).
j∈J j∈J i∈Sj
By (2.1),
^ _
epi(F ) = epi Gi (x) .
j∈J i∈Sj
^ _ ^ _
epi(F ) = epi Gi (x) implies F (x) = Gi (x).
j∈J i∈Sj j∈J i∈Sj
F (x) ≥ L(x) for all x ∈ [a, c], or F (x) ≤ L(x) for all x ∈ [a, c].
17
We consider the case in which the graph of F is above above the graph of L. In
this case, G1 (x) > L(x) for all x > a. Therefore G1 (b) > L(b) = F (b) and G1
satisfies the first Separation Property. To find a component of F which satisfies
the second Separation Property, choose the smallest k such that c ∈ Ik . We have
Gk (c) = L(c) so Gk (x) > L(x) for all x < c, and Gk (x) < L(x) for all x > c.
Therefore Gk (a) ≥ F (a) and Gk (b) ≤ F (b) so Gk satisfies the second Separation
Property.
In the case that F (x) ≤ L(x) on [a, c], Gk satisfies the first Separation Property
and G1 the second.
Using the Separation Properties we give our final proof of Theorem 2.1.
_
Gi (x) ≥ F (x).
i∈Sc
18
_
Gi (c) = F (c).
i∈Sc
It follows that
^_
Gi (x) = F (x).
c∈I i∈Sc
There are only finite number of distinct sets Sc . Let {Sj }j∈J be the family of distinct
sets Sc . Then we have the following representation of F by a lattice polynomial:
^ _
F (x) = Gi (x).
j∈J i∈Sj
PL function
We begin by recalling some basic definitions and properties of convex sets and
functions.
An arbitrary function F on an interval I is said to be convex if each point on
the chord between (x, f (x)) and (y, f (y)) is above the graph of F for any x, y ∈ I.
19
[3, p.113]
The following lemma is a well known fact that we state without proof.
Lemma 2.4. A function f on an interval I is convex if and only if the set epi(f )
is convex.
The interval with endpoints (x, F (x)) and (y, F (y)) is a subset of the plane in
the form
{(tx + (1 − t)y, tF (x) + (1 − t)F (y)) ∈ R2 : 0 ≤ t ≤ 1}.
Lemma 2.5. The family of convex functions on I is closed under addition and
operation ∨.
Proof. Suppose F is a PL function such that F (x) = Gk (x) on Ik and Gk has slope
mk . Let M = max |mk |.
Let H1 (x) = 0 and Hk (x) = 2kM (x − xk−1 ) + Hk−1 (xk−1 ) for 1 < k ≤ n. Define
H by H(x) = Hk (x) for x ∈ Ik . Then H is a convex function [3][5.18].
The function F + H has slope 2kM + mk on Ik and 2(k + 1)M + mk+1 on
Ik+1 . Since 2kM + mk ≤ (2k + 1)M and 2(k + 1)M + mk+1 ≥ (2k + 1)M , the
slopes of successive components, and therefore the left hand derivative of F + H is
nondecreasing. Thus for the same reason as for H, the function F + H is convex.
We have F = (F + H) − H, and both F + H and H are convex functions.
Proof.
^ _
F (x) = Gi (x),
j∈J i∈Sj
21
for some family {Sj }j∈J of subsets of {1, . . . , n}. For each j ∈ J let us define the
function Hj by
_
Hj (x) = Gi (x).
i∈Sj
X X
Hj (x) = Hk (x) − Hk (x).
k∈J k∈J
k6=j
Therefore,
^ X _X
F (x) = Hj (x) = Hk (x) − Hk (x).
j∈J k∈J j∈J k∈J
k6=j
X _X
By Lemma 2.5, Hk (x) and Hk (x) are both convex PL functions.
k∈J j∈J k∈J
k6=j
Chapter 3
Analytic Proofs
22
23
Lemma 3.1. Given a step function f on I = [a, b], the function F defined by
Z x
F (x) = f
a
is a PL function.
Proof. Let a = x0 < x1 < ... < xn = b be a partition of [a, b] such that
Z xk
f = mk (xk − xk−1 ).
xk−1
Z x Z xk−1 Z x Z xk−1
f= f+ f= f + mk (x − xk−1 ), for each x ∈ [xk−1 , xk ].
a a xk−1 a
R xk−1
For each k the quantity a
f is constant. Therefore the last equality shows that
24
Rx
a
f defines a linear function on each interval [xk−1 , xk ]. It is clear that the integral
is a PL function on I.
In the next two proofs (Lemma 3.2 and our second proof of Theorem 2.6) we
refer to the notation given in the following paragraphs.
Suppose F is a PL function on I = [a, b] with components G1 , ..., Gn . We denote
0
the value of the constant function Gk by the real number mk , 1 ≤ k ≤ n.
We define two increasing step functions p and n on I as follows:
k
1X
p(x) = [(mi − mi−1 ) + |mi − mi−1 |], if x ∈ (xk−1 , xk ], for 2 ≤ k ≤ n,
2 2
and
k
1X
n(x) = [|mi − mi−1 | − (mi − mi−1 )], if x ∈ (xk−1 , xk ], for 2 ≤ k ≤ n,
2 2
Lemma 3.2. Every PL function F on the interval I = [a, b] is the indefinite integral
of a left continuous step function.
Rx
Suppose for some k, F (x) = F (a) + a
f for all x in Ik . Then for all x in Ik+1 ,
Rx
Since F (x) = F (a) + xo =a
f for all x ∈ I1 , by induction we have for each x ∈ I,
Z x
F (x) = F (a) + f. (3.2)
a
Z x2
M1 (x2 − x1 ) ≤ f ≤ M2 (x2 − x1 )
x1
Rx
Let F (x) = a
f , x ∈ I. For any x1 , x2 ∈ I, and any y ∈ [x1 , x2 ] we have
Z y
f = F (y) − F (x1 ) = c0 (y − x1 ) for some c0 ∈ [f (x1 ), f (y)] (i)
Zx1x2
f = F (x2 ) − F (y) = c1 (x2 − y) for some c1 ∈ [f (y), f (x2 )] (ii)
y
Z x2
f = c0 (y − x1 ) + c1 (x2 − y) (iii)
x1
Let L be the linear function whose graph contains the points (x1 , F (x1 )) and
(x2 , F (x2 )). Then
F (x2 ) − F (x1 )
L(y) = F (x1 ) + · (y − x1 )
(x2 − x1 )
c0 (y − x1 ) + c1 (x2 − y)
= F (x1 ) + · (y − x1 ) (equation iii)
(x2 − x1 )
(y − x1 )
= F (x1 ) + · [c0 (y − x1 ) + c1 (x2 − y)]
(x2 − x1 )
Z x Z x
F (x) = F (a) + (f (a) + p) − n (3.3)
a a
Rx Rx
Let F1 (x) = F (a) + a
(f (a) + p) and F2 (x) = a
n. By Lemma 3.1, F1
and F2 are PL functions. To show convexity of F1 and F2 note that f (a) + p and
Rx
n are increasing functions. Thus by Lemma 3.3, a (f (a) + p) and F2 are convex
functions. Recall that the sum of two convex functions is convex, so F1 is also
convex. Therefore Equation 3.3 shows F written as the difference of two convex PL
functions.
The second proof of Theorem 2.6 depends on the fact that we could write the step
function f as the difference of two increasing functions. In fact we can write any
function of bounded variation (a BV function) as the difference of two increasing
functions [3][Theorem 5.5]. Perhaps the class of functions which is representable as
the difference of convex functions consists of just those functions which are indefinite
28
integrals of BV functions. Unfortunately, as the next example shows, this is not the
case.
√
Example 3.1. let F = − 1 − x2
F is certainly a convex function defined on the closed interval [−1, 1]. On any
0 0 0 Rx 0
closed interval [a , b ] ⊂ (a, b) F can be written as F (a ) + a0 F . Yet, there is no
Rx
increasing function f such that F (x) = F (−1) + −1 f . The problem occurs at
x = −1 and x = 1 where the derivative of F does not exist.
Indeed, if in addition to convexity we require that a function is differentiable at
the endpoints of [a, b] then we we can write the function as the indefinite integral of
an increasing function. We show this in the next proof.
The function f is increasing on [a, b] so the left hand limit exists at each point and
0 1 0
is finite. Therefore, F (x) = limx→b b−x
(F (b) − F (x)) exists. Similarly, F (a) exists.
0 0
For the converse, assume F is convex on [a, b] and F (a) and F (b) exist. For
a convex function on (a, b) the right and left hand derivatives of F exist and are
0 0
increasing on (a, b). Since F (a) and F (b) exist, we know that both left and right
hand derivatives are bounded on [a, b]. This implies that F is absolutely continuous
on [a, b] [3][Chapter 5, Problem 20] and therefore is an indefinite integral of its
Rx
derivative [3][Theorem 5.14]. Since F is continuous at a, F (x) = F (a) + a f where
0
f is the left hand derivative of F on (a, b] and f (a) = F (a).
Theorem 3.5. Consider the class of functions F on [a, b] representable in the form
F = G − H where G and H are convex and differentiable at the endpoints of [a, b].
Then a function belongs to this class if and only if it is an indefinite integral of a
function of bounded variation.
Z x
(G − H)(x) = G(a) − H(a) + (g − h).
a
30
[1] S Bartels, Ludwig Kuntz, and Stefan Scholtes, Continuous selections of linear
functions and nonsmooth critical point theory, Nonlinear Analysis 24 (1995),
385–407.
[2] Garrett Birkhoff, Lattice theory, third ed., American Mathematical Society Col-
loquium Publications, vol. 25, American Mathematical Society, Providence, R.I.,
1979. MR 598630 (82a:06001)
[3] H. L. Royden, Real analysis, third ed., Macmillan Publishing Company, New
York, 1988. MR 1013117 (90g:00004)
31