Concave and Quasiconcave Functions

Concave and Quasiconcave Functions
Reference: Simon & Blum, chap. 21, Mas-Collel’s mathematical appendix

MC, MD.
1 Convex Sets
• A set U ⊆ Rn is convex if whenever two points x,y ∈ U then the line
connecting x, y completely lies in U .
• Some examples:
• Formally U ⊆ Rn is convex if whenever x, y ∈ U then for every 0 ≤ α ≤ 1
αx + (1 − α)y ∈ U
2 Concave Functions
• Do not confuse concavity/convexity of functions with that of sets!
• Why we care about concave functions
– concave functions represent diminishing marginal returns which makes

sense for utility or some production functions.
∗ u0 (x) ≈ u(x + 1) − u(x) represents marginal utility, if we expect

the marginal utility to decrease then we should have u0 be a
decreasing function, meaning u00 < 0, implying the concavity of
u.
1
– concave utility functions represent risk aversity which a reasonable
assumption for human behavior
∗ A risk averse person prefers a definite outcome to a risky out-

come even if the risky outcome has the same expectation. For
example, consider you can choose one of these cases: case one,
1 1
with probability 2 you get $100 and with probability 2 you get
nothing, or case 2 , you can get $50 for sure. Usually people
choose the not risky case. Meaning if they have utility function
u then
1 1
u($50) ≥ u($100) + u($0)
2 2
Concave functions satisfy these kind of inequalities.
• Definition: A real valued function f defined on a convex subset U ⊆ Rn

is concave if for all x, y ∈ U and for all 0 ≤ α ≤ 1
f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)
similarly f is called convex if
f (αx + (1 − α)y) ≤ αf (x) + (1 − α)f (y)
• What the definitions means graphically
check the meaning of above definitions graphically for
– f (x) = x2 convex
– f (x) = ln x concave
2
Some Properties of concave functions
• If f, g : U ⊆ Rn → R are concave(convex) then f + g is concave(convex).
Proof: based on definition, we need to show for every x, y ∈ U and 0 ≤ α ≤ 1

we have
(f + g)(αx + (1 − α)y) ≥ α(f + g)(x) + (1 − α)(f + g)(y)
that follows directly by summing up corresponding inequalities for f, g :
f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)
g(αx + (1 − α)y) ≥ αg(x) + (1 − α)g(y)
• If f is concave then −f is convex.
Proof: for all x, y ∈ U and 0 ≤ α ≤ 1 we have
f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)
by taking each term to the opposite side we get
α(−f )(x) + (1 − α)(−f )(y) ≥ (−f )(αx + (1 − α)y)
which means −f is convex.
• For specific functions , not general cases, when we have the formula of the
function in hand, it is easier to use other criteria for concavity/convexity
than the basic definition. We have the following theorem
Theorem 1 (Criteria for concavity ) Suppose f : U ⊆ R → R is C 2 ,i.e. f 00

is continuos, then f is concave if and only if f 00 (x) ≤ 0 for all x ∈ U . Similarly
f is convex if and only if f 00 (x) ≥ 0 for all x ∈ U .
• There are other characterizations of concavity, for example the following

theorem. usually we do not use the following theorem as a way to show
concavity. However we state and prove it here because it implies some
important properties of concave functions in optimization.
3
Theorem 2 Let f : U ⊆ R → R be a C 1 function, i.e. continuously differen-
tiable. Then f is concave if and only if
f (y) − f (x) ≤ f 0 (x)(y − x) for all x, y ∈ U (1)
the function f is convex if and only if
f (y) − f (x) ≥ f 0 (x)(y − x) for all x, y ∈ U
Proof. We prove the claim about concave functions only. The convex part
is similar. First suppose f is concave and we want to show (??) holds. Let
x, y ∈ U and 0 < α ≤ 1 then we have
αf (y) + (1 − α)f (x) ≤ f (αy + (1 − α)x)
or
αf (y) − αf (x) ≤ f (αy + (1 − α)x) − f (x)
then
f (αy + (1 − α)x) − f (x)

f (y) − f (x) ≤
α
f (αy + (1 − α)x) − f (x)
= (x − y)
α(x − y)
now letting α → 0 we get
f (αy + (1 − α)x) − f (x)

→ f 0 (x)
α(x − y)
and therefore condition (??) follows.

On the other hand, suppose (??) holds for all x, y ∈ U , then
f (x) − f ((1 − α)x + αy) ≤ f 0 ((1 − α)x + αy)(x − (1 − α)x − αy)
= −αf 0 ((1 − α)x + αy)(y − x)
similarly
f (y) − f ((1 − α)x + αy) ≤ (1 − α)f 0 ((1 − α)x + αy)(y − x)
4
multiply the first inequality by (1 − α) and the second inequality by α , we get
(1 − α)f (x) − (1 − α)f ((1 − α)x + αy) ≤ −α(1 − α)f 0 ((1 − α)x + αy)(y − x)
αf (y) − αf ((1 − α)x + αy) ≤ α(1 − α)f 0 ((1 − α)x + αy)(y − x)
then add them up to get
(1 − α)f (x) + αf (y) ≤ f ((1 − α)x + αy)
The natural generalization of this result to f : Rn → R is
Theorem 3 Let f : U ⊆ Rn → R be continuously differentiable, then f is

concave if and only if
f (y) − f (x) ≤ Df (x)(y − x) for all x, y ∈ U
equivalently
h i
f (y) − f (x) ≤ ∂f ∂f ∂f (y − x)
∂x1 (x) ∂x2 (x) ... ∂xn (x)
∂f ∂f
= (x)(y1 − x1 ) + ... + (x)(yn − xn )
∂x1 ∂xn
One important implication of this theorem is the following
Corollary 4 Let f : U ⊆ Rn → R be concave and continuously differentiable.

If x0 ∈ U is such that Df (x0 ) = 0, i.e.
∂f
(x0 ) = 0 for i = 1, 2, ..., n
∂xi
then x0 is a global maximum of f in the domain.
Proof. Using the previous theorem for all y ∈ U we have
f (y) − f (x0 ) ≤ Df (x0 )(y − x0 ) = 0
therefore for all y ∈ U we have f (y) ≤ f (x0 ) which means x0 is the global
maximum.
5
• The importance of the result is that in an optimization of a concave func-
tion if we only let the first order conditions to be equal to 0 then we are
sure that we are in a maximum and we do not need to check the second
order conditions.
• Suppose f : U ⊆ Rn → R is concave and continuously differentiable.

Consider a point x0 ∈ U , if Df (x0 )(y − x0 ) ≤ 0 for some y ∈ U then
since f (y) − f (x0 ) ≤ Df (x0 )(y − x0 ) we get
Df (x0 )(y − x0 ) ≤ 0 =⇒ f (y) ≤ f (x0 )
Notice that
Df (x0 )(y − x0 ) = ∇f (x0 ) · (y − x0 )
so Df (x0 )(y − x0 ) ≤ 0 means that the two vectors ∇f (x0 ) and (y − x0 )

make an obtuse angle. This implies the upper contour set
{z : f (z) ≥ f (x0 )}
lies above the tangent hyperplane. This result is called the separating
hyperplane result. Graphically:
6
• Similar to concavity for f : R → R meaning f 00 ≤ 0, there is some criteria
about the Hessian matrix of f : Rn → R to guarantee the concavity. It
requires D2 f be a negative semi definite matrix. You will see what this
means in detail in Catherine’s lecture.
3 Quasiconcavity
• Although concavity has desirable implications for optimization, there
are weaker conditions on a functions that also imply nice properties with
regard to optimization.
• having convex upper contour sets is a nice property of a function with

regard to optimization. For example consider the problem of
max u(x1 , x2 ) subject to p1 x1 + p2 x2 ≤ w

x1 ,x2
graphically:
• Definition: A function f : Rn → R is quasiconcave if
f (αx + (1 − α)y) ≥ min{f (x), f (y)} for all x, y ∈ Rn and 0 ≤ α ≤ 1
A function f is quasiconvex if −f is quasiconcave.
The important property of quasiconcave functions is
Theorem 5 f : Rn → R is quasiconcave if and only if the upper contour sets

of f are convex, i.e. the sets like
{x ∈ Rn : f (x) ≥ a} (2)
7
Proof. First suppose f is quasiconcave. To show A = {x ∈ Rn : f (x) ≥ a}
is convex, let’s x, y ∈ A, then
f (x) ≥ a and f (y) ≥ a
for 0 ≤ α ≤ 1 we have
f (αx + (1 − α)y) ≥ min{f (x), f (y)} ≥ a
which implies A is convex. On the other hand, suppose each set of form (??)
is a convex set. To show quasiconcavity, consider x, y ∈ Rn and 0 ≤ α ≤ 1.
Without loss of generality suppose a = f (x) = min{f (x), f (y)} and let A =
{x ∈ Rn : f (x) ≥ a}. Obviously x, y ∈ A and by concavity of A,
αx + (1 − α)y ∈ A
hence by definition of A we have
f (αx + (1 − α)y) ≥ a = min{f (x), f (y)}
this shows f is quasiconcave.
Other properties
• if f, g quasiconcave then f + g is not necessarily quasiconcave.
• If f is concave then f is quasiconcave.
• If f : R → R is monotone then f is both quasiconcave and quasiconvex,
• A C 2 function f : R2 → R is quasiconcave if and only if
2fx fy fxy − fx2 fyy − fy2 fxx ≥ 0
• In general, f : Rn → R is quasiconcave if and only if
whenever f (y) ≥ f (x) then Df (x)(y − x) ≥ 0
• some examples of quasiconcave functions(check it):
8
– f (x) = ln x (also concave)
– any monotone function f : R → R
– f (x, y) = ln x + ln y
1 1
– f (x, y) = x 4 y 4
– f (x, y) = xa y b for 0 < a, b < 1 and a + b ≤ 1
– f (x, y) = −xy

Concave and Quasiconcave Functions

Caricato da

Informazioni sul documento

Descrizione originale:

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Concave and Quasiconcave Functions

Caricato da

Copyright:

Formati disponibili

Concave and Quasiconcave Functions

Reference: Simon & Blum, chap. 21, Mas-Collel’s mathematical appendix

• Formally U ⊆ Rn is convex if whenever x, y ∈ U then for every 0 ≤ α ≤ 1

• Why we care about concave functions

– concave functions represent diminishing marginal returns which makes

∗ u0 (x) ≈ u(x + 1) − u(x) represents marginal utility, if we expect

∗ A risk averse person prefers a definite outcome to a risky out-

• Definition: A real valued function f defined on a convex subset U ⊆ Rn

f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)

similarly f is called convex if

f (αx + (1 − α)y) ≤ αf (x) + (1 − α)f (y)

• What the definitions means graphically

check the meaning of above definitions graphically for

• If f, g : U ⊆ Rn → R are concave(convex) then f + g is concave(convex).

Proof: based on definition, we need to show for every x, y ∈ U and 0 ≤ α ≤ 1

(f + g)(αx + (1 − α)y) ≥ α(f + g)(x) + (1 − α)(f + g)(y)

that follows directly by summing up corresponding inequalities for f, g :

f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)

g(αx + (1 − α)y) ≥ αg(x) + (1 − α)g(y)

• If f is concave then −f is convex.

Proof: for all x, y ∈ U and 0 ≤ α ≤ 1 we have

f (αx + (1 − α)y) ≥ αf (x) + (1 − α)f (y)

by taking each term to the opposite side we get

α(−f )(x) + (1 − α)(−f )(y) ≥ (−f )(αx + (1 − α)y)

which means −f is convex.

Theorem 1 (Criteria for concavity ) Suppose f : U ⊆ R → R is C 2 ,i.e. f 00

• There are other characterizations of concavity, for example the following

f (y) − f (x) ≤ f 0 (x)(y − x) for all x, y ∈ U (1)

the function f is convex if and only if

f (y) − f (x) ≥ f 0 (x)(y − x) for all x, y ∈ U

αf (y) + (1 − α)f (x) ≤ f (αy + (1 − α)x)

f (αy + (1 − α)x) − f (x)

now letting α → 0 we get

f (αy + (1 − α)x) − f (x)

and therefore condition (??) follows.

f (x) − f ((1 − α)x + αy) ≤ f 0 ((1 − α)x + αy)(x − (1 − α)x − αy)

= −αf 0 ((1 − α)x + αy)(y − x)

f (y) − f ((1 − α)x + αy) ≤ (1 − α)f 0 ((1 − α)x + αy)(y − x)

αf (y) − αf ((1 − α)x + αy) ≤ α(1 − α)f 0 ((1 − α)x + αy)(y − x)

then add them up to get

(1 − α)f (x) + αf (y) ≤ f ((1 − α)x + αy)

The natural generalization of this result to f : Rn → R is

Theorem 3 Let f : U ⊆ Rn → R be continuously differentiable, then f is

f (y) − f (x) ≤ Df (x)(y − x) for all x, y ∈ U

One important implication of this theorem is the following

Corollary 4 Let f : U ⊆ Rn → R be concave and continuously differentiable.

then x0 is a global maximum of f in the domain.

Proof. Using the previous theorem for all y ∈ U we have

f (y) − f (x0 ) ≤ Df (x0 )(y − x0 ) = 0

• Suppose f : U ⊆ Rn → R is concave and continuously differentiable.

Df (x0 )(y − x0 ) ≤ 0 =⇒ f (y) ≤ f (x0 )

so Df (x0 )(y − x0 ) ≤ 0 means that the two vectors ∇f (x0 ) and (y − x0 )

• having convex upper contour sets is a nice property of a function with

max u(x1 , x2 ) subject to p1 x1 + p2 x2 ≤ w

• Definition: A function f : Rn → R is quasiconcave if

f (αx + (1 − α)y) ≥ min{f (x), f (y)} for all x, y ∈ Rn and 0 ≤ α ≤ 1

A function f is quasiconvex if −f is quasiconcave.

The important property of quasiconcave functions is

Theorem 5 f : Rn → R is quasiconcave if and only if the upper contour sets

f (x) ≥ a and f (y) ≥ a

f (αx + (1 − α)y) ≥ min{f (x), f (y)} ≥ a

hence by definition of A we have