Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Anneli Kaasa
Tartu 2014
PREFACE
Economics describes and analyses the economical processes around us. At that often mathematical
models are used that are analysed with mathematical methods. In order to understand what is
written in textbooks or scientific articles, often knowledge of (higher) mathematics is needed. Also,
when conducting research, the empirical analysis is mostly based on theoretical framework that is
often expressed with the help of mathematics.
The following material intends to help those who need to restore or improve their mathematical
knowledge for economics. After introducing mathematical methods, examples are given about
possible uses of those methods in different fields of economics. The examples illustrate the only the
use of a particular mathematical method in economics and have no intention to completely interpret
a model and results used as an example. This has been left to microeconomics, macroeconomics
and other fields of economics, understanding of which is hopefully easier with the help of this
material.
2
CONTENTS
1. Mathematical models in economics ............................................................................................................... 4
1.1. Expression of mathematical models ........................................................................................................ 4
1.2. Elements of mathematical models ........................................................................................................... 7
1.3. Analysis of mathematical models ............................................................................................................ 9
2. Diferentiation ............................................................................................................................................... 12
2.1. Derivative, difference and differential ................................................................................................... 12
2.2. Applications of derivative...................................................................................................................... 15
2.3. Partial derivatives .................................................................................................................................. 24
2.4. Total differential and total derivative .................................................................................................... 28
2.5. Derivative of an implicit function.......................................................................................................... 31
3. Integration .................................................................................................................................................... 36
3.1. Indefinite integral................................................................................................................................... 36
3.2. Definite integral ..................................................................................................................................... 38
3.3. Relationship between total and marginal function................................................................................. 42
4. Matrix algebra .............................................................................................................................................. 45
4.1. Matrices and determinants ..................................................................................................................... 45
4.2. Linear equation systems ........................................................................................................................ 47
4.3. Input-output models ............................................................................................................................... 49
4.4. Leontiefs model .................................................................................................................................... 50
4.5. Calculations in Leontiefs model ........................................................................................................... 53
5. Optimization ................................................................................................................................................. 58
5.1. Extremums of functions of one variable ................................................................................................ 58
5.2. Extremums of a function of two variable .............................................................................................. 63
5.3. Optimization of a function of n variables .............................................................................................. 67
6. Optimization with constraints ...................................................................................................................... 71
6.1. Extremums in an interval ....................................................................................................................... 71
6.2. Optimizing the function of two variables with one constraint ............................................................... 72
6.3. Lagrange method ................................................................................................................................... 74
6.4. Optimization of a function of n variables with many constraints .......................................................... 83
7. Comparative statics ...................................................................................................................................... 85
7.1. Qualitative and quantitative analysis ..................................................................................................... 85
7.2. Using Jacobians in more complex models ............................................................................................. 88
8. Difference equations..................................................................................................................................... 99
8.1. Dynamic analysis, difference and differential equations ....................................................................... 99
8.2. Solving a difference equation .............................................................................................................. 100
8.3. Assessing the stability of equilibrium.................................................................................................. 102
8.4. Cobweb model ..................................................................................................................................... 105
8.5. Phase diagrams .................................................................................................................................... 108
9. Differential equations ................................................................................................................................. 115
9.1. Solving a differential equation............................................................................................................. 115
9.2. Assessing the stability of equilibrium.................................................................................................. 116
9.3. Phase diagrams .................................................................................................................................... 119
10. Exponential functions ............................................................................................................................... 127
10.1. Specificity of exponential functions .................................................................................................. 127
10.2. Growth rate ........................................................................................................................................ 129
10.3. Financial mathematics ....................................................................................................................... 132
10.4. Optimal timing ................................................................................................................................... 137
Literature ........................................................................................................................................................ 138
Appendices ..................................................................................................................................................... 139
Appendix 1.................................................................................................................................................. 139
Appendix 2.................................................................................................................................................. 139
Appendix 3.................................................................................................................................................. 142
Appendix 4.................................................................................................................................................. 142
3
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
q S quantity supplied;
p price.
4
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
As it can be seen, the relationships correspond to those described verbally: as price increases,
quantity demanded decreases and quantity supplied increases.
The market equilibrium is reached, if the quantity demanded is equal to the quantity supplied.
Hence, the equilibrium condition and the third equation of this model is:
qD = qS .
As the rules are known that determine the pairs corresponding to the demand and supply, the same
relationships can be expressed, for example using the notation of sets:
{( )
D = p, q D 3 p + q D = 24 , }
S = {( p, q ) 4 p q
S S
}
=4 .
Demand D is a set of those pairs of the values of price and (demanded) quantity (ordered pairs) that
follow the rule 3 p + q D = 24 . Supply S is a set of those pairs of the values of price and (supplied)
quantity that follow the rule 4 p q S = 4 . The market equilibrium is a pair (E) of the values of price
and quantity that belongs both to the set D and the set S:
E = DS.
These relationships can also be described graphically. Demanded and supplied quantities are often
depicted on the same axis in order to find the equilibrium of demand and supply. Hence, it is
possible to describe both demand and supply using only two dimensions.
The shapes of the demand and supply curves are determined by the demand and supply functions.
In our example both are linear functions, hence, the graphs are straight lines and hence, in this case
only two points are needed to draw a graph. Intersection points (vertical and horizontal intercepts)
with axes can be used, for example, but another widely used option is to draw a graph on the basis
of the vertical (or horizontal) intercept and the slope.
Usually, the values of the function (dependent variable y) are depicted on the vertical axis and
values of the argument (independent variable x) on the horizontal axis. According to that, quantity
should be depicted on the vertical axis and price on the horizontal axis. However, according to the
tradition in economics, price is usually depicted on the vertical and quantity on the horizontal axis.
When using the intercepts, this is not a problem. However, when using the vertical intercept and
slope method, one has to use the inverse demand function and the inverse supply function. The
curves, however, are still called demand and supply curves and not inverse demand curve or inverse
supply curve.
Solving both functions for p gives:
1 1
p = 8 q D and p = 1 + q S .
3 4
If price is 0, then the demanded quantity is q D = 24 3 0 = 24 . In order the demanded quantity to
1
be 0, price has to be p = 8 0 = 8 . Hence, the vertical intercept of the demand curve is in the
3
point (0 ; 8) and the horizontal intercept in (24 ;0) . Analogically, the intercepts can be found for the
supply curve: (0 ; 1) and ( 4 ;0 ) , respectively. The situation is depicted on Figure 1.1.
5
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
p
quadrant II quadrant I
8 S
1
-4 24 q
8
D
S
E
1
24 q
Figure 1.2. Demand and supply curves as they are usually depicted in economics
The equilibrium point E is in the intersection point of two curves.
When describing the economic phenomena graphically, sketching is often used: a figure has to be
only as precise as needed for the analysis. Sometimes the only important thing is the question about
which of the two lines is steeper. In that case it is reasonable to use the vertical (or horizontal)
1
intercepts and slopes method for sketching. Let us look at the functions p = 8 q D and
3
6
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
1 S
p = 1+ q . First, the vertical intercepts are: 8 for the inverse demand function and 1 for the
4
1 1
inverse supply function. Next, the slopes are for the inverse demand function and for the
3 4
inverse supply function.
The slope of a straight line shows how fast is the variable on the vertical axis increasing (or
decreasing, if the slope is negative) as the variable on the horizontal axis increases. In our case the
demand curve is negatively sloping and the supply curve positively sloping. When comparing the
1 1
absolute values of the slopes: > , we can conclude that in this example the demand curve
3 4
decreases faster than the supply curve increases the demand curve is steeper than the supply
curve, as can be seen on Figure 1.2. It has to be pointed out here that if one compares not the slopes
of the inverse functions but the slopes of the demand and supply functions the result is opposite:
3 < 4 . Hence, it is important to pay attention to the notation of axes: which variable is on which
axis.
Knowing the intercepts and how the slopes relate to each other, we can sketch Figure 1.3 that
describes the same situation as previous Figures although the scales are different, the demand
curve is still steeper than the supply curve.
8 D
S
E
1
q
Figure 1.3. Demand and supply curves in the case of different scales
the value is not known or a generalization is made, parameters can also be denoted with symbols
(for example a, b, c or , , ). For example, the portion of income that is used for consumption
is constant for one person, but for another person it may have different value.
The general form of a market equilibrium model introduced before is:
q S = a + bp ,
q D = c dp ,
qD = qS ,
where a, b, c, d > 0 .
8
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
The definitional equations are usually rules that apply always. Behavioural equations describe a rule
from what deviations can be found in empirical research. Profit is always calculated in one way, but
peoples behaviour does not strictly follow the rule. The equilibrium conditions, in turn are satisfied
only at certain (equilibrium) values of variables.
In our case the equilibrium price appears to be p* = 4 and the equilibrium quantity q* = 12 .
Asterisk (*) is often used in economics for denoting equilibrium or optimal value.
Now we can also depict the equilibrium point on figure (see Figure 1.4).
p
8 S
D
4 E
12 24 q
9
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
Sometimes, when needed, new aspects are added to a simple model. For example, it is possible to
analyse the influence of excise tax on the market equilibrium. Assume that an excise tax of T per
unit is imposed for the good under discussion. T is a parameter, whose value is not known at the
moment.
Now the price that is paid by consumers is divided into two parts: one part (T) goes to the
government as a tax revenue and the other part goes to producers. This can be expressed by a
definitional equation:
pTD = pTS + T ,
where:
pTS price for suppliers (producers) after imposing excise tax,
D
qT = qT ,
S
pD = pS + T.
T T
This system can be solved by replacing pTD with pTS + T . This way we get a system of three
variables and three equations:
( )
qTD = 24 3 pTS + T ,
S
qT = 4 + 4 pT ,
S
D
qT = qT .
S
The further solution is analogical to the solution used before and the result is
D 12
qT = qT = 12 7 T ,
S
D 4
pT = 4 + T ,
7
S 3
pT = 4 7 T .
From here, it can be concluded that since T is positive, then regardless of the size of the excise tax,
imposing it decreases the equilibrium quantity. The market price (price for consumers) increases,
while the price decreases for producers. We can also see how the tax burden is divided between
4 3
consumers and producers: of the tax is added to the price for consumers and of the tax is
7 7
taken away from the price for producers. Since the producers now have to ask the price that is
higher (in order to still get the price they accept, after they have paid tax), on figure the supply
curve is shifted up by T (see Figure 1.5).
10
MATHEMATICAL MODELS IN ECONOMICS Anneli Kaasa
The intersection point of the new supply curve and the demand curve gives us the new equilibrium
price (for consumers). The price for the producers is smaller by T. As the vertical difference of two
supply curves is exactly T, then the new price for the producers can be found by moving down from
the new equilibrium point to the old supply curve. The horizontal line that corresponds to the initial
equilibrium price p* divides the vertical line segment between two supply curves into two parts that
show how the tax burden is divided between consumers and producers. pTD p * shows the price
change for the consumers and pTS p * the price change for the producers. As it can be seen also
from Figure 1.5 in this example consumers have to pay the larger part of tax.
p
8
D S'
T
4
T E' S
pTD 7
p* E
pTS
3
1 T
7
qT q* q
11
2. DIFERENTIATION
The first two notations refer that the derivative (that is a function as well) is derived from the initial
function y = f ( x ) . The last notation refers to the interpretation of derivative: a derivative gives an
y
approximate estimation of the quotient of absolute changes (the smaller x , the more accurate
x
dy
the estimation). The derivative shows approximately the change in the value of a function ( y )
dx
dy y
that comes with the unitary (one unit) growth of the argument ( x = 1 ): .
dx x
The geometrical interpretation of derivative is the slope of the tangent line of the functions graph.
(see Figure 2.1). By finding the value of the derivative at a particular point, we get the slope of the
functions graph at that point. At the same time it is also the slope of the function itself at that point.
Hence, the derivative helps in drawing the graph of a function.
12
DIFFERENTIATION Anneli Kaasa
dy and dx are called differentials of variables y and x, respectively. For an argument the
differential is assumed to be equal to the difference: dx = x . The differential of a function is an
approximate estimation of the actual change of the value of function y that comes with the change
of argument ( x ): dy y . It can be seen from Figure 2.1 that the smaller is the change of
argument, the less the differential and the actual change (difference) of a function differ from each
other. Estimation of the actual change of the value of a function can be found by multiplying the
derivative (approximate change of a function in the case of unitary change of argument) by the
change of argument dx = x :
dy = f ( x ) x .
If we use another notation for the derivative and take into account that dx = x , we can write:
dy
dy = dx , which is definitely true. For example, let there be a function y = 3 x 3 + 4 x 2 10 and a
dx
given time point, where x = 1 and x = 0,02 . The estimation of the actual change y can be found:
y dy = f ( x)x = (9 x 2 + 8 x)x = (9 1 + 8 1) 0,02 = 0,34 .
y
y=f(x)
dy
y dy
x
x = dx
(a ) = a
x x
ln a =
ax
log a e
,
13
DIFFERENTIATION Anneli Kaasa
(ln x ) = 1 ,
x
(log a x ) = 1 1
= log a e .
x ln a x
In the case of sum, difference, product or quotient of functions, following rules can be used. Let
functions u and v be functions of x: u = f ( x ) and v = g ( x ) .
If y = u v , then y = u v ;
if y = uv , then y = u v + u v ;
u u v u v
if y = , then y = .
v v2
In the case of a composite function (function of a function) chain rule can be used. Lets assume that
y = f (u ) , u = u (v) and v = v(x) .
If y = f (u (v( x))) , then y ( x) = y (u ) u (v) v ( x) .
It is also useful to know that:
1
x ( y ) = .
y ( x)
If a derivative is taken from a derivative, a second-order derivative can be found. When continuing
to take derivatives even higher-order derivatives can be found. A second-order derivative can be
d2y
denoted in following ways: y = f ( x) = 2 . The higher-order derivatives are denoted
dx
analogically, but starting from fourth- of fifth-order derivative the order is described by a number in
d5y
brackets: y (5 ) = f (5 ) ( x) = 5 .
dx
14
DIFFERENTIATION Anneli Kaasa
15
DIFFERENTIATION Anneli Kaasa
derivative being negative: R (q ) = 1 ).We also know the maximum point and that if the quantity is
0, the revenue is also 0. Hence, we can sketch the revenue function as shown on Figure 2.2. .
10 q
MR
10
10 q
d2y d 2y
y = = MY > 0 y = = MY < 0
dx 2 dx 2
y =
dy
= MY > 0 y , MY , y , MY
dx accelerated increase decelerated increase
dy y , MY MY y MY MY
y = = MY < 0
dx decelerated decrease accelerated decrease
Let us use a demand function q = q D ( p ) as an example. The price elasticity of demand can be
denoted as pD D stands for the dependent variable (demanded quantity) and p stands for the
independent variable (price). The absolute change of the demanded quantity is q = qt qt 1 . The
relative change shows, how big is the absolute change relative to the value of this variable at the
q qt qt 1
moment. This is usually found in a following way: = . However, that would mean that
qt 1 qt 1
when looking at two absolute changes with different signs but with the same absolute value, the
corresponding relative changes would be different. In the case of calculating elasticities, the
absolute change is divided by the arithmetical average of the first (time t-1) and second (time t)
value:
q qt qt 1 q + qt 1
= , where q = t .
q q 2
For the price elasticity of demand pD we need also the relative change of price:
p pt pt 1 p + pt 1
= , where p = t .
p p 2
Elasticity is then calculated as the quotient of the relative change of the demanded quantity and the
relative change of price:
Dq
q
=
D
p
Dp
p
Dq Dp
that can be rearranged as pD = or:
q p
Dq p
pD = .
Dp q
This is typical form of the formula used for calculating elasticities. As this formula allows us to
calculate elasticity on the arc between two points observed, it is called arc elasticity.
For example, if the price of a good increases from 40 to 50 per kg, then a consumer buys 0,5 kg in a
week now instead of previous 1 kg. The change of the quantity is then q = 0,5 and the average
quantity q = 0,75 . The change of the price is p = 10 and the average price p = 45 .
q p 0,5 45
Elasticity: pN = = = 3 .
p q 10 0,75
If the relative change of the value of the function is larger than the relative change of the argument,
then > 1 and the function is known as elastic. In the opposite case < 1 and the function is
inelastic. If = 1 , it is called unitary elasticity (an unit elastic function). In our example the
function is elastic meaning that the demand is relatively sensitive to changes in price.
The formula of arc elasticity gives so-called average elasticity on an arc, while the elasticities at
different points on this arc may differ from each other. A more precise result is obtained by the
formula of point elasticity. Calculation of a point elasticity assumes infinitely small changes in the
q dq
argument. In that case the changes can be replaced by the differentials and replaced by
p dp
17
DIFFERENTIATION Anneli Kaasa
(derivative). As there is only one point under consideration, there is no need for calculating
averages and there are the values of the function and the argument at a particular point in the
formula instead:
dq p
pD = .
dp q
dq
should be treated here as one symbol referring to a derivative.
dp
For example, let us find the price elasticity of a demand function q = 20 2 p in a point where the
price is 4. If the price is 4, then the quantity is q = 20 2 4 = 12 . The derivative is q = 2 and the
elasticity:
4 2
pD = 2 = ,
12 3
hence, this demand function is inelastic at the price 4.
Analogically other types of elasticities can be calculated: income elasticity of demand, price
elasticity of supply, input elasticities of output etc.
In the case of a linear function the arc elasticity on a segment of line is equal to the point elasticity
at the centre point of that segment. In the case of a curve, arc elasticity is equal to the point
elasticity in a point where tangent line is parallel with the line drawn between the two points used in
calculating the arc elasticity. On Figure 2.3 the arc elasticity between points A and B is equal to the
point elasticity at point C, where tangent line is parallel to the line drawn between A and B.
p
pt-1 A
p
C
pt B
qt-1 q qt q
are dealing with economic-related problems, then we should not use x and y, but the variables in our
problem instead.
If xy > 1 , we say that y is elastic with respect to x, and if xy < 1 inelastic.
In the case of a linear demand curve p = a bq the price elasticity of demand is:
dq p 1 p p
qp = = = .
dp q b (a p ) b a p
Hence, in the case of a price that is half of the price (a) at which the consumers stop consuming,
0,5a
then qp = = 1 . If the price is higher, then qp > 1 , if lower, then qp < 1 (see also
a 0,5a
Figure A1.1).
a
In the case of a hyperbolic demand curve p = the price elasticity of demand is:
q
dq p a p
qp = = 2 = 1 .
dp q q a p
p p
elastic
a
unit elastic
unit elastic
inealstic
0,5a
q q
if = 1 , then MY = AY ;
19
DIFFERENTIATION Anneli Kaasa
The geometrical interpretation of a derivative (marginal function) is the slope of the tangent line in
a particular point where the value of the argument is x0 :
MY ( x0 ) = (x0 ) .
dy
dx
The value of an average function for x0 :
y0
AY ( x0 ) =
x0
can be graphically interpreted as the slope of a straight line that connects the point ( x0 , y 0 ) and the
origin (see Figure A1.2). This line is called radius vector.
In order a function to be elastic ( > 1 ) in a particular point, MY > AY should be true and thus
the absolute value of the slope of the tangent line should be larger than the slope of the radius vector
of that point (the slope of the radius vector is always positive in the first quadrant). Hence, the
tangent line has to be steeper than the radius vector. ). For unit elasticity ( = 1 ) the absolute values
of the slopes of the tangent line and radius vector should be the same (they either coincide or cross).
If the tangent line is flatter than the radius vector, the function is inelastic ( < 1 ).
y y
y0
y0
x0 x x
x0
20
DIFFERENTIATION Anneli Kaasa
p p
elastic
pmax
unit elastic
inelastic
0,5pmax
e = 1
D
D
q
0,5qmax qmax
Figure A1.3 Qualitative assessment of the elasticity of a linear and hyperbolic demand curve
In the case of a hyperbolic demand function the absolute values of the slopes in all points are equal
to the slope of a corresponding radius vector. This is in accordance with the constant unit elasticity.
Differentiation enables to prove the well known relationship from microeconomics between the
graphs of the marginal and revenue functions of a variable (cost, revenue, product etc. functions)
the curve of a marginal function MY intersects the curve of an average function AY in the
extremum (maximum or minimum) point of the average function AY
The average function is a quotient of the value of the function and the value of the argument:
y ( x)
AY = .
x
For finding the extremum of the average function AY the derivative of the average function has to
be equal to 0 (see Chapter 5.1 for optimization). According to the rule of differentiating a quotient
of functions:
d y ( x )
d ( AY ) x y ( x ) x y ( x ) 1 !
= = =0 .
dx dx x2
Hence:
1 y (x ) y(x ) y(x )
y (x ) = 0 or if x > 0 , then y ( x ) = 0 and y ( x ) = or
x x x x
MY = AY .
21
DIFFERENTIATION Anneli Kaasa
and hence, the difference between the marginal indicator and average indicator is always:
x AY (x) .
This relationship has an interesting application, for example on the revenue function R(q ) = pq .
Here, a monopolist and a firm that operates in perfect competition have to be distinguished. For a
monopolist price is related to the quantity according to the consumers demand curve p = p(q ) ,
thus the revenue function is actually: R(q ) = p(q ) q . The average revenue curve coincides then
with the demand curve:
R(q ) p(q ) q
AR(q ) = = = p(q ) .
q q
Since mostly a negatively sloping demand curve is assumed, AR is negatively sloping as well.
Hence, AR (q ) < 0 . We know that MR(q ) = q AR (q ) + AR (q ) . Since quantity can also be assumed
to be mostly positive, then MR(q ) AR(q ) = q AR (q ) < 0 and always
MR < AR .
In the case of a linear demand curve p (= AR ) = a bq , the marginal revenue curve starts from the
same vertical intercept as the demand curve, but decreases two times faster:
MR = q ( b ) + (a bq ) = a 2bq .
a
In the case of a constantly unit elastic demand curve p (= AR ) = (hyperbolic graph) the
q
marginal revenue is 0 and thus the revenue is constant (all points on the demand curve give the
same revenue for the monopolist):
a a
MR = q 2 + = 0 .
q q
In general, if the shape of the demand (and average revenue) curve is known, it is possible to
derive the shape of the marginal revenue curve using the relationship derived before. Namely, at
a particular value of the argument n the difference between the average and marginal revenue is
AR (n) MR(n) = n AR (n) . The derivative of the average revenue is the slope of the average
revenue curve at the quantity n that can by found as
dAR
(n ) = a (see Figure A1.4). The
dq n
reasoning behind this quotient can be explained as follows. In the case of the quantity n the slope of
dAR
an average revenue curve (n) is equal to the slope of a tangent line in the point corresponding
dq
to the quantity n. The tangent line is a straight line and in the case of a straight line the slope of it
always shows exactly (no estimation errors) the change in the value of a function per unit of change
dy y
in the argument (in general form = ). Because of that, for determining the slope of a straight
dx x
line one can always choose two points on that line, for example the quantities 0 and n, and find the
changes that occur when moving from the first point to the second. If quantity increases from 0 to n:
q = n , then the corresponding change on the vertical axis is AR = a . Hence, the formula
dAR
(n ) = a can be used here.
dq n
22
DIFFERENTIATION Anneli Kaasa
AR,
MR,
p
AR(n)
n AR,
a D
MR(n)
n q
a AR, D AR, D
MR
q q
Figure A1.5, Finding the marginal revenue curve in the case of a linear and a constantly unit elastic
demand curve
23
DIFFERENTIATION Anneli Kaasa
If a firm operates in perfect competition, the market price is fixed for the firm p = p 0 and the
average revenue is equal to that price AR = p 0 . At the same time the marginal revenue is also equal
to the price (price does not depend on the quantity):
dp 0
MR = q + p0 = q 0 + p0 = p0 .
dq
In that case both marginal revenue and average revenue curves coincide with a horizontal straight
line at the level p = p 0 .
One important application of derivatives is optimization that is discussed separately.
24
DIFFERENTIATION Anneli Kaasa
z
A partial derivative shows approximately the change of the value of the function ( z ) that
x
comes with the unitary growth of the argument ( x = 1 ) holding other arguments constant (in our
z y
case y = 0 ): . Also, similarly to simple derivative, the smaller is the change dx = x
x x y =0
given to argument, the more precisely is the actual change in the value of a function z estimated
by the differential.
The partial derivative assumes holding other variables constant. Hence, when finding partial
derivatives all other variables can be viewed as constants. When finding a partial derivative of
z = f ( x, y ) with respect to x all usual rules of differentiation apply and at that y has to be treated as
a constant (similarly to the numbers possibly included into a function).
For example, given the function z = 3 x 2 y + 3 x 3 + y 10 , the partial derivative of z with respect to x
is z x = 6 xy + 9 x 2 + 0 0 , and with respect to y: z y = 3 x 2 + 0 + 1 0 .
Regarding the geometrical interpretation of a partial derivative, the geometrical expression of a
function with two arguments is a surface in a three-dimensional space. For example, one can
imagine a room, where a vertical z-axis is going up from one lower corner and the x- and y-axes are
going from the same corner to the two sides along the edges of the floor. Let us call the walls above
these edges x- and y-walls. A surface has a tangent plane in every point (one can think of a piece of
cardboard touching a ball). Partial derivative of z with respect to x shows the slope of the tangent
plane with respect to x-axis. In imagination, the plane can be divided into straight lines (very thin
strips of cardboard) that are very close to each other (creating thus a plane) and parallel to the x-
wall. If we project one of these lines to the x-wall, the slope of this line on the set of xz-plane (on
the x-wall) is shown by the partial derivative z x . One can imagine looking at one strip of cardboard
(parallel to x-wall) when facing straight to the x-wall; in that case the slope z x can be seen on the
background of the x-wall. Looking straight, the third dimension (y-wall) has no influence, that is in
accordance with the presumption of the partial derivative that at the same time dy = 0 .
Analogically, the partial derivative with respect to y shows the slope of the tangent plane with
respect to y-axis.
Higher-order partial derivatives are found analogically, for example second-order partial
derivative is found by taking derivative twice with respect to a particular variable:
z xx = f xx = 2 =
( )
z
2 z x
,
x x
z
2z y
z yy = f yy = 2 = .
y y
Mixed (or cross) partial derivatives are found by differentiating first with respect to one variable
and then with respect to another variable:
z xy = f xy = =
z( )
2 z x
,
xy y
z
2z y
z yx = f yx = = .
yx x
25
DIFFERENTIATION Anneli Kaasa
According to Youngs theorem for functions used in economics the result does not depend on the
order of differentiation:
( x) = z y .
z
y x
Given z = 3 x 2 y + 3 x 3 + y 10 for finding z xx a derivative with respect to x is taken again of the
first-order derivative with respect to x: z x = 6 xy + 9 x 2 resulting in z xx = 6 y + 18 x . If we take a
derivative of z y = 3 x 2 + 1 again with respect to y, we get z yy = 0 . Mixed derivative can be found by
taking derivative of z x = 6 xy + 9 x 2 with respect to y or by differentiating z y = 3 x 2 + 1 with respect
to x. Both methods result in z xy = z yx = 6 x .
A partial derivative is used in finding marginal functions from multivariable initial functions. For
example, let us assume that quantity q produced in a firm in a period depends both on the number of
workers L (labour) and on the number of machines used in production that can be thought of as
capital K. Let the production function be:
q = q ( K , L) = 0,5 K 2 2 KL + L2 .
Marginal product of labour MPL shows, how the quantity produced changes as the quantity of
labour is increased by one unit. MPL can be found taking derivative of production function with
respect to L:
MPL = q L = 2 K + 2 L .
Marginal product of capital MPK can be found analogically:
MPK = q K = K 2 L .
Often, it is possible to create new indicators by combining various marginal indicators. For
dR
example, when multiplying the marginal revenue MR = by the marginal product of an input
dq
dq dR dq dR
(denoted with x) MPx = , we get the marginal revenue of this input: MR x = = .
dx dq dx dx
Partial derivatives are also used in calculating elasticities. Let us assume that there are two goods, A
and B and we know that the demanded quantity of A in a period depends on the price of this good
p A , price of the other good p B and the income m (money):
q AD = 12 4 p A + 0,04m + 0,5 p B .
Elasticity of the demand for A with respect to p A is calculated using a partial derivative in a
following way:
q AD p A p
AD = = 4 DA .
p A q A
D
qA
This elasticity shows the relative change (in percents) in the demanded quantity of A as the price of
this good increases by 1% and the price of another good and income do not change.
The elasticity of the demand for A with respect to the price of the other good is calculated as:
q AD p B p
BD = = 0,5 DB ,
p B q A
D
qA
26
DIFFERENTIATION Anneli Kaasa
and hence, if + > 1 , there are increasing returns to scale and if + < 1 , decreasing returns to
scale. It can be said that this function is homogeneous to a degree of + .
In the case of a production function in general q = q(K , L ) the differential of the product is
q q dn dK dL dn dn
dq = dK + dL . As = = , then dK = K and dL = L and
K L n K L n n
q dn q dn
dq = K+ L.
K n L n
1 n
When multiplying both sides by an expression , we get the elasticity of scale on the left side
dn q
and the sum of production elasticities with respect to both inputs on the right side:
dq n q dn 1 nK q dn 1 nL q K q L
nq = = + = + = Kq + Lq .
dn q K n dn q L n dn q K q L q
The latter means that the elasticity of scale can be calculated as the sum of production elasticities
with respect to all inputs.
q K q L K L
In our example: nq = Kq + Lq = + = K 1 L + K L 1 = + .
K q L q K L K L
27
DIFFERENTIATION Anneli Kaasa
Again, one important application of partial derivatives is optimization that is discussed separately.
28
DIFFERENTIATION Anneli Kaasa
indirect impact into account. The first term in the formula shows the direct impact of x to z and the
second term the indirect impact: the impact of x to y is multiplied by the impact of y to z.
Let us look at one example from microeconomics. Let us assume that the demand for a good
depends on the price p and real income mr :
q = f (mr , p) ,
where it is assumed that:
q
> 0 , since the demanded quantity increases as the (real) income increases;
mr
q
< 0 , since the demanded quantity decreases as the price increases (this good is substituted with
p
another);
and the real income depends on the price:
mr = g ( p) ,
dmr
where < 0 , since the real income (goods that can be consumed) decreases as the price
dp
increases.
To find the total impact of the change in price we find the total differential of q:
q q
dq = dp + dmr
p mr
and from that the total impact of p on q:
dq q q dmr
= + .
dp p mr dp
As we know the direction (sign) of the relationships between variables, we can analyse, whether the
total impact is positive or negative:
dq q q dmr
= + < 0.
dp p mr dp
() (+) ()
It can be concluded that in this case, taken all relationships into account, when price increases, the
demanded quantity decreases. In microeconomics, the direct impact of price is called substitution
effect and the direct impact through income is called income effect.
In more complex cases the total derivative can be found analogically using the total differential by
substituting all derivatives that correspond to non-existing relationships with 0. If there is more than
one independent variable in the system, the total derivative describing their total impact is called
partial total derivative.
For example, if z = f (x, y, u, v, w) , u = g ( x, y ) , v = h( x ) and w = j ( y ) , then total differential of z
is:
z z z z z
dz = dx + dy + du + dv + dw
x y u v w
and dividing by dx gives us the partial total derivative of z with respect to x (while no initial change
is given to y):
29
DIFFERENTIATION Anneli Kaasa
dz z dx z dy z du z dv z dw
= + + + + =
dx dy = 0 x dx y dx u dx v dx w dx
z z du z dv
= 1 + 0 + + +0=
x u dx v dx
z z du z dv
= + + .
x u dx v dx
z
This partial total derivative can also be denoted as .
x
As another example, let the demanded quantity q1 depend on the price p1 , the price of another
good p 2 and the real income mr :
q = f (mr , p1 , p 2 ) ,
where it is assumed that:
q
> 0 , since the demanded quantity increases as the (real) income increases;
mr
q1
< 0 , since the demanded quantity decreases as the price increases (this good is substituted with
p1
another;
q1
> 0 , since the demanded quantity of the other good decreases as its price increases and this
p 2
another good is substituted with the first good;
and the real income depends on the prices:
mr = g ( p1 , p 2 ) ,
dmr dmr
where < 0 and < 0 , since the real income (goods that can be consumed) decreases as the
dp1 dp 2
prices increase.
q1
To find the total impact of the price of the other good on the demanded quantity . we find the
p2
q1 q q
total differential: dq1 = dp1 + 1 dp 2 + 1 dmr . And the partial total derivative with respect
p1 p 2 mr
to p 2 :
q1 q1 dp1 q1 dp 2 q1 dmr q1 q q dmr
= + + = 0 + 1 1 + 1
p 2 p1 dp 2 p 2 dp 2 mr dp 2 p1 p 2 mr dp 2
q1 q1 q1 dmr
or: = + .
p 2 p 2 mr dp 2
+ +
AE SE
We can see that this impact depends on whether the substitution or the income effect dominates:
30
DIFFERENTIATION Anneli Kaasa
q1
if SE > IE , then > 0 ; as p 2 increases, q1 increases and this means that these goods are
p2
substitutes for this consumer;
q1
if SE = IE , then = 0 ; as p 2 increases, q1 does not change and these goods are independent
p2
for this consumer;
q1
if SE < IE , then < 0 ; as p 2 increases, q1 decreases and this means that these goods are
p2
complements for this consumer.
If the chains of indirect impact are longer, consisting of three or more impacts, then the number
of the multiplied derivatives in the terms must be larger, respectively. The procedure can be viewed
as a decomposition of the total differential. For example, let us look at the system z = f ( x, y, u ) ,
x = g ( y ) and y = h(u ) . Here, the total impact of u on z can be found as follows:
z z z
dz = dx + dy + du =
x y u
z dx z z
= dy + dy + du ,
x dy y u
dz z dx dy z dy z du z dx dy z dy z
from here: = + + = + + ,
du x dy du y du u du x dy du y du u
direct
indirect
where the first term describes the impact of u on z through x and y, the second term describes the
impact of u on z through y (in this case the chain goes directly to z from y and not through x) and
the last term describes the direct impact of u on z..
31
DIFFERENTIATION Anneli Kaasa
F
dy
= x = Fx .
dx F Fy
y
F
Here it must be noted that the denominator cannot be 0: = Fy 0 . If a derivative function has
y
been calculated using this formula, it can include both variables (differently from the usual case
where only arguments can be seen in a derivative function).
For example, given x 2 + y 2 = 9 the derivative can be found as follows:
dy F 2x x
= x = = , y 0.
dx Fy 2y y
q2
32
DIFFERENTIATION Anneli Kaasa
u
dq 2 q1 MU 1
= = ,
dq1 u MU 2
q 2
where MU 1 stands for the marginal utility of the first good and MU 2 stands for the marginal utility
of the second good. In our case:
dq 2 0,6q10, 4 q 20, 7 6q
= 0, 6 0,3
= 2 .
dq1 0,7 q1 q 2 7 q1
The slope of an indifference curve also gives the marginal rate of substitution. For example in the
case of the function z = f ( x, y ) the marginal rate of substitution shows the change of y ( y ) that is
needed as x increases by one unit ( x = 1 ), in order to hold z constant. The value of z does not
change, if the total differential is equal to 0:
z z
dz = dx + dy = 0 .
x y
Hence, we can find a necessary change of y to hold z constant at the unitary change of x:
y dy z
= x .
x dx zy
Given u = 12q10, 6 q 20, 7 a necessary change of the quantity of the second good to hold utility constant
when the quantity of the first good is increased by one unit, is:
q 2 MU 1 6q
= 2 .
q1 MU 2 7 q1
If the consumer has 10 units of both goods, and needs two additional units of the first good, then
what has to happen to the quantity of the second good to keep the same utility level? This can be
found in a following way:
6 q2 6 10
q2 q1 = 2 1,71 .
7q1 7 10
Hence, the consumer has to give up 1,71 units of the second good.
A function in the form z = c x a y b , where a, b and c are positive parameters, is called Cobb-
Douglas type function and is used often in economics as utility or production or other functions.
The formation of the indifference curves can be described as follows. In general, in economics, a
situation can be described with the help of a function of two arguments z = f ( x, y ) . The value of
the function z can take on different values and here, the relationship between x and y is of interest at
a fixed value of z. In every case of a fixed value of z, we have an implicit function F ( x, y ) = c that
determines the relationship between x and y. All relationships corresponding to different constant
values of z can be depicted on the xy-plane as the indifference curves. The formation of these
curves can be imagined as follows.
The geometrical representation of the function z = f ( x, y ) in a three-dimensional space is often a
mountain or a valley. A well known example is the utility mountain. In the case of a mountain
depicted on Figure A1.6 every value of z corresponds to a horizontal cut (parallel to the xy-plane):
the higher the cut, the larger the value of z. When the intersection lines (circles) of these planes and
the surface z = f ( x, y ) are projected (let to fall down) onto the xy-plane, the situation is depicted on
the xy-plane (only two dimensions needed).
33
DIFFERENTIATION Anneli Kaasa
y y
x x
Joonis A1.6. The horizontal cuts of a surface and their projections on the xy-plane
Now, there are curves on the xy-plane (circles in our case) and every curve incorporates those
bundles of the values of x and y that give a certain value of z. Those curves are called indifference
curves. The slope of theses indifference curves on the xy-plane can be calculated with the help of an
dy F z
implicit function derivative rule: = x = x .
dx Fy zy
Often only a part of the indifference curves are actually shown in traditional figures in economics.
For example, in the case of the utility function u = u (q1 , q 2 ) , it is assumed that the households do
not operate in the areas (quadrants) where additional consumption decreases utility. That means
that always MU 1 , MU 2 > 0 and thus a q1 q 2 -plane the slope of the indifference curve is always
negative:
(+)
dq 2 MU 1
= < 0.
dq1 MU 2
(+)
This condition is satisfied only in two quadrants of the q1 q 2 -plane (upper right and lower left on
Figure A1.7), in one of them are the curves convex and in the other concave. Why one on them is
not suitable can be seen by looking at the mountain from the sky. When moving up on the figure,
the utility is first increasing and the decreasing after the dotted line. When moving to the right on
the figure, again the utility is first increasing and then decreasing. Hence, the only part where both
marginal utilities are positive, is on the lower left, where the indifference curves are negatively
sloping and convex.
MU 2 < 0 MU 2 < 0
MU1 < 0
MU1 > 0
MU 2 > 0
MU 2 > 0
q1
34
DIFFERENTIATION Anneli Kaasa
Here, it has to be noted, that in the case of the Cobb-Douglas type functions z = x y the
geometrical representation is not a mountain in the sense used before, but rather an infinitely
increasing surface (maximization is possible only with constraints). In that case one can imagine a
quarter of the base of the mountain described before (without any maximum point). In that case the
intersections of the horizontal cuts of this surface appear to be arcs that we are used to see on the
figure in microeconomics (see Figure A1.8).
y y
x x
Figure A1.8. The horizontal cuts of the infinitely increasing surface and their projections on the xy-
plane
35
3. INTEGRATION
f (x ) dx .
There is an integral sign before the function that is integrated and there is also the differential of the
variable with respect to which the function is integrated in the notation. Integration should bring us
back to the initial function y = F ( x ) + C . If there is a constant in the initial function, the taking
derivative of it gives 0 and that means that only on the basis of a derivative function it is not
possible to determine the constant of the initial function. Therefore, in mathematics the constant of
integration C is always added to represent the constant of the initial function (if the value of this
constant is not known):
f (x ) dx = F ( x) + C ,
Knowing that f ( x ) dx = dy = dF ( x ) , it can also be written that:
dF (x ) = F (x) ,
confirming that we are dealing with opposite operations.
The rules of integration can be derived from the rules of differentiation. Here are the rules for
integrating the functions usually used in economics:
0 dx = C ,
b +1
(a x )dx = a b + 1 + C ,
b x
e dx = e x + C ,
x
ax
a dx = + C = a x log a e + C ,
x
ln a
1
x dx = ln x + C .
Integration allows us to find the initial or total function from a marginal function. If a marginal
function is given, then knowing that a marginal function has been found by differentiating the initial
function, this initial function can be found by using the opposite operation. When in mathematics a
constant of integration in general form C is added, in economics often the value of this constant is
known or there is a notation (symbol) that can be used to indicate the interpretation of the constant.
Let the revenue function be MR = 100 8q , the marginal cost
marginal function
MC = 9q 36q + 40 and fixed costs equal to 200. Finding the total cost function:
2
(
C = MC dq = 9q 2 36q + 40 dq = )
= 3q 3 18q 2 + 40q + const .
Since we know that fixed costs are equal to 200, we can rewrite:
36
INTEGRATION Anneli Kaasa
If no units are sold ( q = 0 ) then there will be no revenue as well, hence, the constant of this
function is 0:
R = 100q 4q 2 .
Integration is an opposite operation of taking derivatives. Assume we have a function
y = F (x ) + C (we separate the constant C from the remaining function). This is our starting or
initial function. If we take a derivative of it, we get y = F ( x ) + 0 . Let us assume F ( x ) = f (x ) .
Then: y = F ( x ) + 0 = f ( x ) + 0 . If we now integrate this function, we have to get back the initial
function: f (x ) dx = F ( x) dx =F ( x) + C . Hence, the constant C represents the constant of the
initial function.
If we know marginal function MY = y = F ( x ) + 0 = f ( x ) + 0 , we can find total function by
integrating marginal function: y = MY dx = f (x ) dx = F ( x) dx =F ( x) + C .
In the case of functions used in economics the value of the constant of integration (constant of the
initial function) mostly shows the value of the initial function if the argument is equal to 0. For
example if the initial function is y = ax 2 + bx + c , its derivative is y = ax + b and integrating this
y dx = (ax + b)dx = ax + bx + C . The constant of integration (and the constant of the
2
gives us
initial function) C = c is equal to the value of the initial function for the value of argument equal to
0: y (0) = a 0 2 + 0 x + c = c .
That is not always the case, however. For example if the initial function is y = ae x + b , its
derivative is y = ae x and integrating this gives us y dx = ae dx = ae x + C . At that the constant
x
of integration (and the constant of the initial function) C = b is not equal to the value of the initial
function for the value of argument equal to 0: y (0 ) = ae 0 + b = a + b .
The relationship similar to that between the total and marginal functions can be found at other
variables as well. Although the names of these variables are not so obviously referring to their
relationship, the nature of the relationship is similar: a variable and another variable that describes
the change of the first variable the value added of the first variable that is added. For example,
the value of capital forms with the help of investment: the investment of a particular period is equal
to the capital that is added in that period:
I (t ) = = K or K (t ) = I (t )dt .
dK
dt
(the dot over the symbol K denotes the derivative with respect to time (see the chapter about the
growth rate).
37
INTEGRATION Anneli Kaasa
f(x)
a b x
a a
First, the function is integrated. Next, the value of the resulting function at the lower limit and the
same at the upper limit are calculated and then the former is subtracted from the latter. This
b
difference can also be denoted with a vertical line with the lower and upper limits: F ( x) .
a
f (x ) dx = F ( x) = F (b ) F (a ) (see
b
A definite integral is defined by the lower and upper limits:
a a
upper figure) and its geometrical representation is an area between the curve and an axis between
the limits (lower figure).
y=F(x)+C
F (b ) F (a )
a b x
y'=F'(x)=f(x)
F (b ) F (a )
a b x
38
INTEGRATION Anneli Kaasa
p
consumer's S
p1 sruplus
E producer's
p* surplus
D
p0
q* q
S consumer = q D ( p ) dp .
p*
39
INTEGRATION Anneli Kaasa
There are analogical options for the producers surplus. According to the first option, the area under
the supply curve can be subtracted from the rectangle representing the producers revenue
R = p *q * . For that we need to find a definite integral between 0 and q* of the inverse supply
function (p solved for q):
q*
S producer = p * q * p S (q ) dq .
0
The supply curve shows the price the producers are willing to get at a particular quantity and that
comes from the marginal costs (a supply curve is also a marginal cost curve in most cases). Hence
this method subtracts the variable costs (the marginal cost of a fixed cost is 0) from the revenue.
Another option is to calculate directly the area between the supply curve and the vertical axis by
integrating the supply function (q solved for p) and determining the limits on the p-axis ( p 0 and
p*):
p*
S producer = q S ( p )dp .
p0
For example, let the inverse demand function be p D = 40 3q and the inverse supply function
p S = 0,5q 2 q + 10 . In order to find equilibrium point we can use the condition:
p D = p S , hence 40 3q = 0,5q 2 q + 10 and 0,5q 2 + 2q 30 = 0 .
The solutions for this quadratic equation are q1 = 10 and q 2 = 6 , however, the equilibrium
quantity cannot be negative. The price corresponding to q* = 6 is p* = 40 3 6 = 22 .
The situation is depicted on Figure 3.3. The demand curve is a negatively sloping straight line with
the slope 3 and the p-intercept 40. The p-intercept of the parabolic graph of the supply curve can
be found taking quantity equal to 0: p = 0,5 0 2 0 + 10 = 10 . For value q = 1 the price is smaller:
p = 0,5 12 1 + 10 = 9,5 . Hence, the supply curve is first negatively and then positively sloping.
p
40 S
E
22
10
D
6 q
S consumer =
(40 22) 6 = 54 .
2
When using integral, the monetary utility:
6
2
(40 3q ) dq = 40q
3q 6
=
0 2 0
40
INTEGRATION Anneli Kaasa
3 62 3 02
= 40 6 40 0 = 186 ,
2 2
consumers expenditures: p * q* = 22 6 = 132 , and consumers surplus:
S consumer = 186 132 = 54 .
40 1
For another option, the demand function should be found: q = p . Then the consumers
3 3
surplus:
40 1
40
40 p p 2 40
S consumer = p dp = =
22
3 3 3 6 22
40 40 40 2 40 22 22 2
= = 54 .
3 6 3 6
The producers surplus can only be calculated with the help of integrals. As the inverse supply
function is a quadratic function, it is rather inconvenient to find the supply function (solve it for q)
Hence, it is sensible to calculate the area under the supply curve first:
q3 q2
(0,5q )
6 6
2
q + 10 dq = + 10q =
0 6 2 0
63 6 2 03 0 2
+ 10 6 + 10 0 = 78 ,
6 2 6 2
and then to subtract it from the revenue R = p * q* = 22 6 = 132 to get the producers surplus:
S producer = 132 78 = 54 .
It is worth noting here that the producers surplus is not equal to the profit, but is larger because
of the fixed costs: = R VC FC = H tootja FC .
Also, if, for example, the demand curve has no intersection point with the price-axis (for example
a a
a hyperbolic demand curve p = or q = ), then the area that estimates the consumers surplus is
q p
not a finite number and the consumers surplus cannot be calculated in this case. For the hyperbolic
function using the concept of limits (it is not possible to logarithm an infinity or 0) gives us:
q* a q*
S consumer = dq p * q* = a ln q p * q* = a(ln q * ln 0 ) p * q* =
q
0 0
or
= a(ln ln p *) = lim a(ln b ln p *) = a( ln p *) = .
a
S consumer = p* p dp = a ln p p* b
41
INTEGRATION Anneli Kaasa
c
x x
a b
the procedure is different. If we are calculating the area between the curve and horizontal (x-)axis,
then the limits are defined on horizontal (x-)axis, the function has to be a function of x, and we
integrate with respect to x
b
y(x ) dx = = .
b
a a
If we are calculating the area between the curve and vertical (y-)axis, then the limits are defined on
vertical (y-)axis, the function has to be a function of y, and we integrate with respect to y.
d
x( y ) dy = = .
d
c c
As a different example, the definite integral enables to find the size of the capital accumulated in a
time period as a result of the investments. For example, the capital accumulated between the periods
t1 and t 2 can be found as follows:
t2 t2
I (t ) dt = K (t )
t1 t1
= K (t 2 ) K (t1 ) .
A definite integral between 0 and a time point n gives the size of the capital that has been
accumulated in n periods:
n
I (t ) dt = K (t ) = K (n ) K (0)
n
0 0
and hence, the total value of the capital at a particular time point n is:
n
K (n ) = I (t ) dt + K (0 ) .
0
If, for example, the initial capital is 100 and the investments depend on the time as follows:
I = 30t 0,5 , then the value of the capital after 10 periods is:
( )
10
K (10) = 30t 0,5 dt + 100 = 20t 1,5
10
+ 100 = 20 101,5 0 + 100 20 31,6 + 100 = 732,5 .
0 0
corresponds to this utility function. At a particular value q = n the value of the marginal utility
function MU n = MU (n ) shows the utility received from consuming the n -th (last) unit. If we sum
all marginal utilities obtained from all additional units starting from the first unit until the last, n-th
unit, the total utility of n units can be found as a result:
n
MU 1 + MU 2 + + MU n = MU i = u (n ) .
i =1
u(n)
n q
MU
u(n)
MU(n)
n q
MU (q ) dq = u(n ) .
0
So, the area under the marginal utility curve between 0 and n gives us the total utility of n units.
That is because the utility of 0 units is assumed to be 0. Analogically, summing the marginal
revenues obtained from all additional units sold starting from the first unit until the last, n-th unit,
gives us the total revenue of the quantity n. Since the revenue is 0 when nothing is sold, there is no
constant in the revenue function and the area under the marginal revenue curve between 0 and n
gives us directly the total revenue of n units. Summing the marginal costs needed for all additional
units from the first unit until the last, n-th unit, gives us a function that describes the quantity-
dependent part of the cost function the variable costs for the quantity n. The fixed costs are
described by the constant of the total cost function and hence, they are not reflected in the marginal
cost function. So, the area under the marginal cost curve (mostly coinciding with the supply
curve) between 0 and n gives us not the total cost of n units, but the variable cost of n units.
Summing the marginal profits of all additional production units from the first unit until the n-th unit
gives us the quantity-dependent part of the profit function (the difference between the revenue and
costs) that is actually equal tot the producers surplus. So the area under the marginal profit
43
INTEGRATION Anneli Kaasa
curve between 0 and n gives us not the total profit of n units, but the producers surplus or
variable costs of n units.
In general: the definite integral of a marginal function between 0 and a particular value of the
argument x0 gives us the value of the initial or total function at x0 , but without the constant of the
initial function (in the case of the utility function constant is usually 0, as no utility is obtained when
nothing is consumed):
x0
f (x ) dx = F (x ) , where F (x ) = f (x ) .
0
0
In graphical context that can be described as follows for the example of utility. A marginal utility of
a unit of good is represented as a vertical line from the q-axis to the marginal utility curve. The
marginal utilities of all infinitely small quantities added can be imagined as lines closely next to
each other that are creating an area. Summing marginal utilities of all infinitely small quantities
added from 0 to n gives an area between the q-axis and the marginal utility curve corresponding to
the total utility of n units (lower part of Figure 3.4) that also can be depicted as a vertical line
segment between the q-axis and the total utility curve (upper part of Figure 3.4) Hence, in the case
of proportional figure, the area under the marginal utility curve has to be equal to the height of the
total utility function at n.
If the argument is time t, then
t
F (t ) = f (t ) dt + F (0 ) .
0
Thus, the value of F (t ) at a time point where t = n , is equal to the sum of the initial value and the
value accumulated during n periods.
x0
y ( x0 ) = F ( x0 ) + C ).
When finding a definite integral of a marginal function f ( x ) so that we replace the upper limit
with the variable x, we get:
x
f ( x ) dx = F ( x) = F ( x ) F (a ) .
x
a a
F ( x ) here is actually the total function (without the constant) and the second term is a constant that
is one possible value of the constant of integration. We can denote this constant as F (a ) = C .
Hence, a definite integral in this form can also be viewed as an indefinite integral:
x
f (x ) dx = F ( x) = F (x ) F (a ) = F (x ) + C = f (x )dx ,
x
a a
x
that is: f (x ) dx = f (x )dx .
a
44
4. MATRIX ALGEBRA
The first subscript of a matrix element shows, in which row it is and the second subscript, in which
column this element is. A matrix with only one row or column is called vector (row vector or
column vector, respectively). In economics mostly square matrices are used ( m = n ).
A matrix whose columns are the same as the rows of the initial matrix (and rows the same as the
columns of the initial matrix) is called transposed matrix and denoted as A' or AT . Matrix B is
equal to a transposed matrix AT , if:
[b ] = [a ] .
ij ji
In the case of square matrix, the transposed matrix is a reflection of the initial matrix over its
principal diagonal (from upper left to lower right corner).
Next, the basic operations with matrices are introduced. When adding matrices, the corresponding
elements (that are in the same place) in both matrices will be summed:
[a ] + [b ] = [a
ij ij ij + bij .]
When subtracting, analogically the elements of one matrix are subtracted from the corresponding
elements of the other matrix:
[a ] [b ] = [a
ij ij ij bij .]
For adding and subtracting, the matrices have to have the same number of rows and columns.
When multiplying matrices, in order to get a particular element (in i-th row and j-th column) the
elements of i-th row of the first matrix and the elements of j-th column of the second matrix are
multiplied with each other and these products are then summed:
[a ][b ] = [a
ij ij i1 b1 j + ai 2 b2 j + + ain bmj . ]
As it can be seen from the formula, the number of columns in the first matrix has to be equal to the
number of rows in the second matrix (in the case of square matrices with the same number of rows
and columns this is always so). Differently from the multiplication of numbers, in the case of
matrices the order of matrices is important when multiplying.
When a matrix is multiplied by a number, all elements of the matrix are multiplied by this number:
[ ] [ ]
k aij = k aij .
45
MATRIX ALGEBRA Anneli Kaasa
An analogue to the division in matrix algebra is multiplying a matrix with an inverse of the other
matrix. An inverse matrix A 1 is a matrix that, when multiplied with the initial matrix (both from
left and right), gives identity (unit) matrix E as a result:
1 00
0 10
A A 1 = A 1 A = E = .
0 01
Hence, only square matrices can have an inverse matrix. There are many ways to find an inverse
matrix, one of them is introduced later.
When dealing with matrices, determinant is also an important notion. A determinant is a certain
numerical value that corresponds to a particular matrix and is calculated according to certain rules.
A determinant (here a determinant of matrix A) can be denoted in different ways:
a11 a12 a1n
a a 22 a2n
A = aij = 21 = DA .
a n1 an2 a nn
When calculating the value of a determinant, all different sets of elements are found that have only
one element from every row and one element from every column. In the case of n-th order square
matrix every set has n elements. Then products of the elements for each set are found and summed
multiplying with the number 1 all those sets that are parallel with the secondary diagonal.
For example, the determinant of a second-order matrix is calculated as:
a11 a12
A= = a11 a 22 a 21 a12 .
a 21 a 22
Finding higher-order determinants can be simplified by finding minors corresponding to every
element of a, for example the first, row. For finding a minor, a row and a column where the
corresponding element is (for element aij i-th row and j-th column) are omitted and then the
determinant of the remaining matrix is calculated. When a minor is multiplied with a number
(1)i+ j , we get a subdeterminant that is called cofactor ( Aij ). For the first element in the first row
the sum i + j is 2, for the second element in first row 3 etc. Hence, when the first row was chosen,
every other minor has to be multiplied by 1 . The sum of the products of elements of the chosen
row and corresponding cofactors is equal to the determinant of the whole matrix. For example, the
determinant of the third-order matrix can be found as follows:
a11 a12 a13
A = a 21 a 22 a 23 =
a31 a32 a33
a 22 a 23 a a 23 a a 22
= a11 a12 21 + a13 21 .
a32 a33 a31 a33 a31 a32
In the case of higher-order determinants analogical decomposition is used.
46
MATRIX ALGEBRA Anneli Kaasa
A matrix is a rectangular array of elements (constants, parameters, variables) with m rows and n
a11 a12 a1n
a a 2 n
[ ]
columns, for example: A = aij = 21
a 22
.
a m1 a m 2 a mn
47
MATRIX ALGEBRA Anneli Kaasa
After denoting q D = q S = q , the model can be reduced to two equations and two unknowns:
bp q = a,
dp + q = c.
Often the models in economics have much more equations and unknowns. The market equilibrium
model can also be extended by adding equations describing more aspects (for example adding
excise tax to the model). In the case of more complex models it is rational to use matrix algebra.
Let there be a linear equation system that intends to analyse a problem in economics (it is sensible
to have the number of unknowns equal to the number of equations, let this number be n):
a11 x1 + a12 x 2 + + a1n x n = b1 ,
a x + a x + + a x = b ,
21 1 22 2 2n n 2
a n1 x1 + a n 2 x 2 + + a nn x n = bn .
The same relationships can be expressed with the help of matrices and vectors. At first, this
equation system can be rewritten as an equality of two vectors:
a11 x1 + a12 x 2 + + a1n x n b1
a x + a x + + a x b
21 1 22 2 2n n
= 2 .
a n1 x1 + a n 2 x 2 + + a nn x n bn
The vector on the left side can be rewritten as a product of a matrix and a vector based on the rules
of multiplying matrices. For example, the first element in the left-side vector is actually a first
element of a vector that could be obtained by multiplying the matrix of coefficients A and vector of
unknowns X:
a11 a12 a1n x1 b1
a a 22 a 2 n x 2 b2
21 =
a n1 an2 a nn x n bn
or
AX = B .
Examples of using matrix equations can be seen in comparative statics, for example, but one simple
example is provided here as well. For example, let there be a chocolate factory that produces three
types of chocolates: dark (D), milk (M) and white (W) chocolate. For producing one box of dark
chocolate 4 units of cocoa, 1 unit of sugar and 6 units of milk powder is needed. For producing one
box of milk chocolate 2 units of cocoa, 4 units of sugar and 5 units of milk powder is needed. For
producing one box of white chocolate 5 units of sugar and 6 units of milk powder is needed. There
are 100 units of cocoa, 230 units of sugar and 330 units of milk powder in the storeroom. What has
to be the production plan to finish all stocks?
This can be answered with the help of a system, where every equation describes using one input.
The quantity of one type of chocolate multiplied by the quantity of the input that is needed for one
unit of that chocolate gives a need for that input for that type of chocolate. By adding the needed
quantities of that type of input for all types of chocolate we can set it equal to the quantity of this
input in the storeroom. Three equations describing using the different types of inputs are:
48
MATRIX ALGEBRA Anneli Kaasa
4 D + 2M + 0W = 100,
1D + 4 M + 5W = 230,
6 D + 5M + 6W = 330.
There are many ways to solve a matrix equation. One possibility is to use Cramers rule. According
to that, xi is equal to a quotient, where the determinant of a coefficients matrix A serves as a
denominator and the same determinant, but with the i-th column replaced by the vector of constants
B (let us denote this determinant as Ai ) stands in the place of numerator.
In our case:
100 2 0
230 4 5
A1 330 5 6 440
D= = = = 10 ,
A 44 44
4 100 0
1 230 5
A2 6 330 6 1320
M = = = = 30 ,
A 44 44
4 2 100
1 4 230
A3 6 5 330 880
W = = = = 20 .
A 44 44
Hence, the production plan that spends all inventories includes 10 boxes of dark chocolate, 30
boxes of milk and 20 boxes of white chocolate. In this case, the same solution could be also found
with usual methods of solving an equation system. In the case of more complex systems matrix
algebra may provide a possibility to find the solution in a more effective way, although in more
complex cases calculating determinants becomes more time-consuming as well. Hence, let the
choice of method be a decision of the solver.
49
MATRIX ALGEBRA Anneli Kaasa
that reaches the consumers (can be called final demand or external demand) as y1 and y 2 , it can be
written:
x1 tx 2 = y1 ,
sx1 + x 2 = y 2 .
The output of the first industry minus the output of this industry that is consumed (used) in the
second industry gives what is left for the final demand (of the first industry consumed by the
consumers). The second equation describes the same for the second industry.
As a matrix equation:
1 t x1 y1
s 1 x = y .
2 2
Using Cramers rule gives us formulas for finding the need for output when the final demand is
known:
y1 t
y 1 y + ty 2
x1 = 2 = 1 ,
1 t 1 st
s 1
1 y1
s y2 y + sy1
x2 = = 2 .
1 t 1 st
s 1
x
j =1
ij + y i = xi ,
where:
xij output of i-th industry that is used in j-th industry,
xi total output of i-th industry,
y i final demand of i-th industry, that is consumed outside industries.
50
MATRIX ALGEBRA Anneli Kaasa
n
The equation of a first row for example is then: x
j =1
1j + y1 = x1 .
When looking at an industry as a consuming industry, it can be said that to produce the output of
one industry inputs from many other industries and besides that also the value added (can be called
also the primary input, denoted here with z; for example wages of workers, taxes, etc) are needed.
This is usually presented in the columns of input-output tables and that can be expressed as a
general equation of a j-th column:
n
x
i =1
ij + zj = xj ,
where:
z j value added in j-th industry,
x j total output in j-th industry.
n
The equation of a first column for example is then: x
i =1
i1 + z1 = x1 .
value
yi =
i
z1 zj zn
added = zj
j
total
x i =
i
x1 xj xn
output = xj
j
The rows show how the output is consumed in different industries and the columns how it is
produced in different industries.
It may be that sometimes the final demand changes. Then a new plan for total output may be
needed. In some other case, it could be that a new plan of total output is known and the quantities of
final demand that will be available at this new plan are of interest. For solving these problems,
matrix algebra can be used.
The row equations in from the input-output table can be expressed as a matrix equation:
x11 + + x1n y1 x1
+ = .
x n1 + + x nn y n x n
51
MATRIX ALGEBRA Anneli Kaasa
If the technology remains constant, then the coefficients that show the need of one product for
producing one unit in another industry are constant. Input coefficient aij shows how much of the
output of the i-th industry is spent in the j-th industry for producing one unit:
xij
aij = .
xj
All input coefficients constitute a matrix A:
a11 a1n
A = .
a n1 a nn
a11 x1 + + a1n x n y1 x1
+ = .
a n1 x1 + + a nn x n y n x n
It needs a little more effort to find total output, when input coefficients and final demand is known.
Let us solve the equation ( E A) X = Y for vector X. For that we may multiply this equation with a
matrix ( E A) 1 from the left side:
( E A) 1 ( E A) X = ( E A) 1 Y or
X = ( E A) 1 Y .
The elements of matrix ( E A) 1 show the need for an output of one industry for producing one
unit in another industry taking into account all intermediate demands, including the need for the
output of the first industry in other industries (that produce inputs for the second industry).
The solution of an input-output model (solution of the matrix equation) gives such structure of the
output of the economy that enables to satisfy the final demand without any deficits or surpluses. As
it is about aiming the equality of the demand and supply, it can be viewed as aiming an equilibrium
as well.
52
MATRIX ALGEBRA Anneli Kaasa
It is possible to transform this model into a closed model, where the households and government
sector (demanding, that is consuming the final demand and providing the value added) are viewed
as one industry. The inputs for this industry are actually the final demand and the output is the value
added. In that case the equations of the rows and columns are simpler:
n n
xij = xi and
j =1
x
i =1
ij = xj .
In that case the sum of the input coefficients of a column is always equal to 1 (because the
coefficients are calculated by dividing the intermediate demands of one industry for the quantities
from all industries by the quantity produced in this industry and in the input-output table the sum of
all previous elements in one column gives the last element in that column):
n n zj
aij = 1 (analogically to
i =1
a i =1
ij +
xj
= 1 on an open model).
From here:
0 = X AX = ( E A) X .
In this case it is not possible to determine a unique solution of the total output (the system has an
infinite number of solutions).
This can be seen also from the fact that the sums of the elements in the matrix A by columns are
1 a11 a1n
equal to 1 and in the matrix E A = these sums are equal to 0. The columns
a n1 1 a nn
of both matrices are linearly dependent. For example any row of the matrix E A can be found by
subtracting all other rows from a row where all elements are equal to 0. Hence, the determinant of
the matrix E A is equal to 0 and the matrix equation has no unique solution.
If the new planned outputs are 1100 for the first industry and 1400 for the other, then assuming that
the technology is constant, we can find the amounts left for final demand of these industries. First,
the matrix of the input coefficients can be found by dividing the intermediate demands with the total
output in a particular column:
53
MATRIX ALGEBRA Anneli Kaasa
0,2 0,1
A= .
0,5 0,4
The matrix E A can be found:
1 0,2 0 0,1 0,8 0,1
EA= = .
0 0,5 1 0,4 0,5 0,6
and multiplying it by the vector of total output gives the vector of final demand:
0,8 0,1 1100
Y = ( E A) X = =
0,5 0,6 1400
0,8 1100 0,1 1400 740
= = .
0,5 1100 + 0,6 1400 290
Hence, the mentioned plan leaves 740 and 290 units, respectively, for final demand.
If this plan seems inappropriate, then we can start with determining the final demand, for example
800 for the first industry and 300 for the second. The vector of total output can be found as follows:
X = ( E A) 1 Y . For that we need a matrix ( E A) 1 , the inverse matrix of
0,8 0,1
EA= .
0,5 0,6
An inverse matrix can be found using following formula:
1
A 1 = adj A ,
A
~
where A is a determinant of A and adj A (also denoted as A ) an adjunct matrix of A. The latter
can be found by replacing every element aij in the initial matrix by the corresponding cofactor Aij
and by transposing the result.
a11 a12 a1n
a a 22 a 2 n
If A = 21 ,
a n1 an2 a nn
0,8 0,1
In our case the cofactors are but one element. Form the matrix E A = , first the
0,5 0,6
matrix of minors is:
0,5
[M ] = 00,6,1
ij
0,8
,
the matrix of cofactors:
54
MATRIX ALGEBRA Anneli Kaasa
[A ] = 00,,16
ij
0,5
0,8
and by transposing it we find:
0,6 0,1
adj [E A] = .
0,5 0,8
The determinant of E A :
E A = 0,8 0,6 ( 0,5) ( 0,1) = 0,48 0,05 = 0,43
and the inverse matrix:
(E A)1 = adj [E A] =
1
EA
1 0,6 0,1 1,40 0,23
= =
0,43 0,5 0,8 1,16 1,86
.
If the new final demand vector is (800,300), we get the new total outputs:
1,40 0,23 800
X = ( E A) 1 Y = =
1,16 1,86 300
1,40 800 + 0,23 300 1189
= = .
1,16 800 + 1,86 300 1486
The new input-output table is then:
industry 1 2 y x
1 237,8 148,6 800 1189
2 594,5 594,4 300 1486 .
z 356,7 743 1100
x 1189 1486 2675
As an alternative, the new total outputs can also be found from the equation ( E A) X = Y or
0,8 0,1 x1 800
0,5 0,6 x = 300 using the Cramers rule as follows (differences are caused by
2
rounding):
The inverse matrix ( E A) 1 can also be estimated with the help of an approximation using the
formula: ( E A) 1 = E + A + A + A 3 + .
Let us study this formula. When multiplying a matrix and its inverse matrix we get a unit matrix:
(E A)(E A)1 = E .
Thus, in order E + A + A + A 3 + + A m to be an inverse of E A the following must be true:
55
MATRIX ALGEBRA Anneli Kaasa
(E A)(E + A + A + A3 + + A m ) = E
or when rearranging:
E + A + A 2 + A 3 + 2 + A m A A 2 A 3 2 A m +1 = E A m +1 = E .
Hence, the more similar is the matrix A m +1 to the null matrix (all zeros), the better is the estimation
that E + A + A + A 3 + + A m gives to the inverse matrix ( E A) 1 .
As the input coefficients are all positive and smaller than 1, then when multiplying the matrix A
continuously by itself the elements will approach to 0 and as m , the matrix A m +1 approaches
null matrix ( A m +1 [0] ) and E + A + A 2 + A 3 + 2 + A m ( E A) 1 .
Thus, the multiplication of the matrix A with itself has to be stopped at the moment when the
elements seem to be close enough to 0 depending on the admissible error.
Let us try to find an approximation ( E A) 1 E + A + A 2 + A 3 for the example used before:
(E A)1 =
1,40 0,23
.
1,16 1,86
The approximation method is related to another method for finding the solution in the input-
output model. This method is not used widely because of its labour intensity, but it enables to
clarify the relationships shown before.
First let us denote the matrix ( E A) 1 with B. The elements bij of ( E A) 1 = B show how much
of the output of the i-th industry is needed in order to produce one unit in the j-th industry
(including also all amounts of the i-th product that are used in other industries for producing the
inputs for the j-th industry). Let us try to find the coefficients b11 (the need of the product of the first
industry for producing one unit in the first industry) and b21 (the need of the product of the second
industry for producing one unit in the first industry).
One unit of the first product has to be produced. We can see from the matrix of input coefficients:
0,2 0,1
A=
0,5 0,4
that for producing one unit in the first industry 0,2 units of the first product and 0,5 units of the
second product is needed. Now, for producing 0,2 units of the first product 0,2 0,2 = 0,04 units of
the first product and 0,2 0,5 = 0,1 units of the second product are needed. When continuing this we
can construct the following scheme ( H 1 and H 2 in brackets specify which product is meant:
0,5( H 1 ) means that 0,5 units of the first product is needed):
56
MATRIX ALGEBRA Anneli Kaasa
0,008 ( H 1 )
0,04 ( H 1 )
0,02 ( H 2 )
0, 2 ( H 1 )
0,1 ( H ) 0,01 ( H 1 )
2
0,04 ( H 2 )
1 (H1 ) .
0, 01 ( H 1 )
0,05 ( H 1 )
0,5 ( H 2 ) 0,025 ( H 2 )
0,2 ( H ) 0,02 ( H 1 )
2
0,08 ( H 2 )
By summing all amounts of the first product that are needed, we get the approximation of b11 , by
summing all amounts of the second product that are needed, we get the approximation of b21 :
b11 1 + 0,2 + 0,04 + 0,05 + 0,008 + 0,01 + 0,01 + 0,02 = 1,338 ,
b21 0,5 + 0,1 + 0,2 + 0,02 + 0,04 + 0,025 + 0,08 = 0,965 .
The difference from the actual coefficients comes from the fact that not enough addends were
included to the formula of approximation (the admissible error was relatively large). If this scheme
were expanded, the results would be more precise. However, the coefficients found here are equal
to those found before with the help of the approximation method at the precision level m = 3 .
57
5. OPTIMIZATION
58
OPTIMIZATION Anneli Kaasa
The first-order condition brings out all points where there may be an extremum, but this condition
itself does not guarantee he existence of an extremum. Hence, it is necessary, but not sufficient
condition. Next, a sufficient condition is needed.
y
y'=0
y'=0
y'=0
y=f(x)
A B x
y'
y'=f'(x)
0
x
59
OPTIMIZATION Anneli Kaasa
60
OPTIMIZATION Anneli Kaasa
found successively until the first higher-order derivative is found that is not equal to 0 any more.
Hence, an n-th order derivative is aimed, for which it holds that
f ( n ) ( x0 ) 0,
( n 1)
f (x0 ) = 0.
The order n enables to determine the character of a critical point. If n is an odd number, it is an
inflection point. If n is an even number, it is an extremum. The character of an extremum is found
analogically to the second-order condition:
if f ( x0 ) = 0 and f ( n ) ( x0 ) < 0 , it is a local maximum point;
For example, given y = (3 x) 4 , critical points can be found by equating the first-order derivative
with 0:
y = 4 (3 x) 3 (1) = 4 (3 x) 3 = 0 , from here:
(3 x) 3 = 0 and
x0 = 3 .
Hence, there is one critical point with x equal to 3. Second-order derivative:
y = 4 3 (3 x) 2 (1) = 12 (3 x) 2 .
If x0 = 3 , then y (3) = 12 (3 3) 2 = 0 . Hence, the n-th order derivative test has to be used. The
value of the third-order derivative
y = 12 2 (3 x) (1) = 24 (3 x)
is also 0, if x0 = 3 :
y (3) = 24 (3 3) = 0 .
The value of the fourth-order derivative is different from 0:
y ( 4 ) = 24 (1) = 24 0 , y ( 4 ) (3) = 24 .
Since n = 4 is an even number, then this is an extremum point. Since y ( 4 ) (3) = 24 > 0 , we can
conclude that it is a minimum point.
As another example, let us assume that we have to find the quantity produced that guarantees the
maximum total revenue for the firm who faces a demand function q = 20 2 p (inverse demand
function: p = 10 0,5q ).
First, the function of total revenue can be found by multiplying price and quantity:
R = p q = (10 0,5q ) q = 10q 0,5q 2 .
The necessary condition is that the derivative of the revenue function that is also known as marginal
revenue has to be equal to 0 (the exclamation mark indicates that two sides of equations have been
set to be equal):
!
R = MR = 10 q = 0 ,
and the optimal quantity is probably q* = 10 . In order to test, whether it really is a maximum point,
we find the second-order derivative:
R = MR = 1 .
61
OPTIMIZATION Anneli Kaasa
As it is negative, the quantity q* = 10 really ensures the maximum revenue that is:
Rmax = 10 10 0,5 10 2 = 50 .
The situation is presented on Figure 5.1 with quantity on the horizontal axis. On the vertical axis
three variables are depicted: price, total revenue and marginal revenue. The vertical intercept of
demand and marginal revenue functions is the same, but the marginal revenue curve decreases two
times faster. If no unit are sold, then there is no revenue either. Hence, the total revenue curve starts
from the origin. At the same time, if price is 0, the revenue is 0 as well, so the total revenue curve
intersects the horizontal axis in the same point where demand curve intersects the horizontal axis.
The maximum point of total revenue is at the same quantity where the marginal revenue curve
intersects the horizontal axis (marginal revenue is 0 there).
p,
R,
MR
MR=10-q
pD=10-0,5q
10 R=10q-0,5q2
10 20 q
62
OPTIMIZATION Anneli Kaasa
63
OPTIMIZATION Anneli Kaasa
z z
y
x y
x
Figure 5.3. Maximum and minimum points of a function with two arguments
It can be seen from Figure 5.3 that in the extremum points the tangent plane is parallel to the xy-
plane. The partial derivative f x gives the slope of a tangent line in the particular point that is
parallel to the xz-plane. As mentioned at the geometrical interpretation of partial derivatives, when a
straight line parallel to the xz-plane that is on the tangent plane is projected to xz-plane, the slope of
this projection is f x . In the case of a critical point this straight line is horizontal with a slope equal
to 0 and the partial derivative f x has to be 0. Analogically, f y also has to be equal to 0.
The first-order condition is necessary for the extremums, but again not sufficient. A critical point
may appear to be a saddle point, where the value of z is maximal with respect of one argument, but
minimal with respect to another argument. It may also be an inflection point with respect to one or
both arguments. Hence, again a second-order condition is needed.
z z
x y y
x
64
OPTIMIZATION Anneli Kaasa
As it can be seen, instead of f xx we can write H 1 . If f xx f yy > f xy2 , then f xx f yy f xy2 > 0 and
H 2 > 0 . Thus, the first and second-order condition can be rewritten as:
65
OPTIMIZATION Anneli Kaasa
q = 3 q1 + 5 + q 2 = 0,
1
= q + 24 4q = 0.
!
q 2 1 2
Solving the equation system gives one critical point, where q1 * = 4 and q 2 * = 7 . The profit would
then be:
= 1,5 4 2 + 5 4 + 4 7 + 24 7 2 7 2 5 = 89 .
If it is a maximum, can be tested with the help of a Hessian:
f xx f xy 3 1
H = =
f yy 1 4
.
f yx
66
OPTIMIZATION Anneli Kaasa
point;
if f1 = f 2 = ... = f n = 0 and H 1 > 0 , H 2 > 0 , H 3 > 0 etc. or H i > 0 , it is a local minimum
point.
Hence, if the necessary condition is satisfied and all principal minors are positive, one can conclude
a local minimum. If the necessary condition is satisfied and the sign of principal minors alternates, a
local maximum can be concluded. In other cases the extremum cannot be confirmed, but this
possibility cannot be ruled out as well, because further analysis (that is not discussed here) may still
confirm the minimum or maximum.
Optimizing the function z = f ( x1 , x 2 , 2 , x n ) :
First-order condition, also known as necessary condition:
z
z1 = z 2 = 2 = z n = 0 (where f i = ), finding derivatives and setting them equal to 0, gives a
xi
system of n equation and n variables, solving it gives critical values of x1 , x 2 ,2 , x n a critical point
or points.
67
OPTIMIZATION Anneli Kaasa
d q
= Rn (q n ) c (q ) = Rn (q n ) c (q ) = MRn MC = 0 .
dq n q n
Hence, the profit is maximized if:
68
OPTIMIZATION Anneli Kaasa
MR1 = = MRn = MC .
Form here, usually a condition is derived that includes the elasticity. For that a transformation is
made with all marginal revenue functions in a following way ( p(q ) is an inverse demand function):
dR d (q p(q )) q dp q dp 1
= 1 p(q ) + q = p(q )1 +
dp
MR = = = = p1 + = p 1 + ,
dq dq dq p (q ) dq p dq
where is the price elasticity of demand that shows how sensitive the consumers are with respect
to the changes in price. (It has to be kept in mind here that in the case of traditional demand
functions is always negative.)
Hence, the profit-maximizing condition is:
1 1
p1 1 + = = p n 1 + = MC .
1 n
Assuming that the marginal cost is positive, the marginal revenues should be positive as well. For
1
that (since the price is positive) the expression 1 + should be positive. Since is negative,
1
then it must hold that < 1 and for that it must hold that > 1 .
Hence, in the case of a linear demand function, a monopolist chooses between such prices that are
higher than the half of the maximum price (at which consumers stop buying). It is really logical that
if the revenue started to decrease while moving further, there would be no point to expand the
production.
p
a
R
0,5a
MR D
69
OPTIMIZATION Anneli Kaasa
1
must hold that p1 + = MC , then the larger is the expression in brackets, the lower has to be the
price. In the case of more sensitive consumers lower price has to be asked.
70
6. OPTIMIZATION WITH CONSTRAINTS
y
local
maximum y=f(x)
local
minimum
a b x
As an example from economics, let us find the quantity that has to be produced to maximize profit
of a firm with the cost function C = 100 + 10q and a productive capacity of 150 units, if the firm
faces the inverse demand function p = 50 0,1q . First, we can derive the revenue function from the
demand:
R = q p = q (50 0,1q ) = 50q 0,1q 2 ,
and then the profit function as a difference between the revenue and costs:
= R C = (50q 0,1q 2 ) (100 + 10q ) or
= 0,1q 2 + 40q 100 .
The local extremums can be found with the help of a derivative:
d !
= 0,2q + 40 = 0 .
dq
The critical point is then q* = 200 , but it is outside the interval q = [0 ;150] . Hence, we have to
consider the limit points: if nothing is produced, then:
(0) = 0,1 0 2 + 40 0 100 = 100 ,
if maximum capacity is used, then:
(150) = 0,1 150 2 + 40 150 100 = 3650 .
Hence, it is optimal for this firm to produce as much as possible, 150 units. The situation is also
depicted on Figure 6.2.
3650
to minimize costs to produce a certain quantity. When optimizing with constraints the domain of the
function, from where the optimal solution can be found, is limited.
Sometimes the function has a form that makes optimizing without any constraints impossible. For
example, in the case of a simple Cobb-Douglas type utility function u = q1 q 22 , as quantities
increase, the utility increases and that can continue infinitely. However, in reality there is a budget
constraint and so the highest utility level can be found at a certain budget constraint.
Optimizing a function of more than one variable with constraints is in some sense similar to the
optimization in an interval. When optimizing with constraints the extremums are looked for in a
situation where the arguments of the function have to satisfy one or more equations that relate the
arguments to each other. These equations are called constraints. The function that is optimized is
called objective function. In the case of optimizing a function of two arguments with one
constraint, the objective function is:
z = f ( x, y )
and the constraint may be expressed as:
g ( x, y ) = b ,
where g ( x, y ) is a constraint function and b can be called a constraint constant.
The optimization problem then is to find minimum or maximum of the objective function
z = f ( x, y ) as the arguments x and y have to satisfy the constraint g ( x, y ) = b .
The extremums subject to constraints may not coincide with the free extremums (found without any
constraints), a maximum with constraints can be equal or smaller than the free maximum, a
minimum with constraints can be equal or larger than the free minimum.
y
x
number of arguments in objective functions will decrease by the number of constraints. Then, the
objective function is optimized with respect to the arguments that are still in the objective function
and the values of other arguments can be found from the constraint equations.
Let us try this method on an example from economics. Let the utility function that is to be
maximized, be u = q1 q 22 . The sum of expenditures for both goods (the price pi multiplied by the
quantity qi ) has to be equal to the consumers income m:
p1 q1 + p 2 q 2 = m .
Let the prices be 4 and 2, respectively, and the income 60. Hence, the budget constraint is:
4q1 + 2q 2 = 60 .
First, we can solve the constraint for the first quantity: q1 = 15 0,5q 2 and substitute it to the
objective function:
u = (15 0,5q 2 ) q 22 = 15q 22 0,5q 23 .
Now, we can take derivative and set it equal to 0:
du !
= 30q 2 1,5q 22 = 0 , from here
dq 2
(30 1,5q 2 ) q 2 = 0 .
There are two solutions: q 21 = 0 and q 22 = 20 . In order to determine the quantity that gives the
maximum utility, we determine the signs of the second-order derivative for both solutions:
d 2u
(0) = 30 3q 2 = 30 3 0 = 30 > 0 ,
dq 22
d 2u
(20) = 30 3q 2 = 30 3 20 = 30 < 0 .
dq 22
hence, q 21 = 0 gives the minimum and q 22 = 20 the maximum utility.
If q 2 * = 20 , the corresponding quantity of the first good is q1 * = 15 0,5 20 = 5 and the maximum
utility is u* = 5 20 2 = 2000 .
Although this method seems simple, the replacements and transformations may become quite
complex in the case of more arguments and constraints.
In the case of an objective function of two arguments and one constraint, the new function (L stands
for Lagrangian function) gets the following form:
L = f ( x, y ) + [b g ( x, y )].
It can be proved that the free extremums of Lagrangian function (with respect to all variables,
including Lagrange multiplier(s)) coincide with the extremums of the objective function at the
constraints that were inserted to the Lagrangian function. Hence, to optimize the objective function
with constraints, the extremums of Lagrangian function can be found.
In the case of a objective function of two arguments and one constraint, according to the first-order
condition in critical points the derivatives of Lagrangian function with respect to all variables ( x, y
and ) have to be equal to 0:
L !
x = L x = f x g x = 0,
L !
= L y = f y g y = 0,
y
L
= L = b g ( x, y ) = 0.
!
fx fy f fy
From the first two equations we get: = and = , hence x = . The last equation
gx gy gx gy
fx fy
represents the constraint: g ( x, y ) = b . Hence, the constraint and the condition = serve as the
gx gy
conditions for the extremums of the Lagrangian function.
Let us recall that when optimizing function z = f ( x, y ) without constraints the condition for critical
the points was that the value of a function neither increases nor decreases ( dz = 0 ):
dz = f x dx + f y dy = 0 .
Here, the values of arguments x and y have to satisfy the constraint g ( x, y ) = b . To hold the value of
the function g ( x, y ) constant (b), the total differential of g ( x, y ) has to be 0:
dg = g x dx + g y dy = 0 .
Hence, when optimizing with constraint, two conditions have to be satisfied in a critical point:
f x dx + f y dy = 0,
g x dx + g y dy = 0.
dy dy f dy g
When solving both equations for , we get = x and = x . From that:
dx dx fy dx gy
fx gx f fy
= or x = .
fy gy gx gy
Hence, indeed, the critical points (when optimising the objective function z = f ( x, y ) with the
f fy
constraint g ( x, y ) = b ) have to satisfy the condition x = and the constraint itself: g ( x, y ) = b
gx gy
and using a Lagrangian function for optimizing with constraint is appropriate.
75
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
For investigating the critical point, the determinant of this bordered Hessian is calculated (see
Appendix 2):
H = 2 L xy g x g y L xx g y2 L yy g x2 .
It appears that the second-order condition for a maximum is the positive determinant of bordered
Hessian ( H > 0 ) and for a minimum the negative determinant ( H < 0 ).
At that, sufficient condition is not necessary, again. If the sufficient condition is not satisfied, one
cannot rule out the possibility that there still is an extremum in a critical point.
Taking together the necessary and sufficient conditions
if L x = L y = L = 0 and H > 0 , it is a local maximum point;
76
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
Hence, the Lagrange multiplier shows, what happens to the optimal value of objective function as
the constraint constant b is increased by one unit.
From the Lagrangian function L = f ( x, y ) + [b g ( x, y )] we can see, that is equal to a
dL
derivative of the Lagrangian function with respect to b: = . If the constraint is satisfied, the
db
Lagrangian function and the objective function coincide and hence, the derivatives of the
Lagrangian and objective functions are equal:
dL dz
= = .
db db
If the constraint is included in the Lagrangian function in the form of g ( x, y ) b instead of
b g ( x, y ) , then the extremums remain the same, but the Lagrangian multiplier has an opposite
sign that has to be taken into account when interpreting the results. The same happens (opposite
sign of ), if we set minus before in the Lagrangian function:: L = f ( x, y ) [b g (x, y )] . If,
however, both changes are made L = f ( x, y ) [g ( x, y ) b] , nothing changes.
77
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
dL L z opt
can be interpreted as = = , z opt can be either z max or z min . means that as b
db b b
increases by 1 unit, z opt increases/decreases (depending on the sign of ) approximately by
units.
Let us solve using Lagrangian method the example used before for illustrating substitution method.
The aim is to maximize the utility function u = q1 q 22 (objective function) subject to the budget
constraint 4q1 + 2q 2 = 60 . The constraint function is then g = f (q1 , q 2 ) = 4q1 + 2q 2 . The
Lagrangian function is in this case:
L = q1 q 22 + (60 4q1 2q 2 ) .
Next, we can find derivatives with respect to all variables (derivative of the Lagrangian function
with respect to qi is denoted as Li ):
!
L1 = q 2
2 4 = 0,
!
L2 = 2q1 q 2 2 = 0,
!
L = 60 4q1 2q 2 = 0.
One possibility to solve the equation system is to solve first two equations for :
q 22 2q1 q 2
= = or q 2 = 4q1 .
4 2
Now we can replace q 2 in third equation:
4q1 + 2 4q1 = 60
and find the optimal quantity of the first good: q1 * = 5 . The optimal quantity of the second good is
then: q 2 * = 4 5 = 20 . The maximum utility that is achieved is u* = 5 20 2 = 2000 .
In order to test, whether it is really a maximum point a following bordered Hessian can be
constructed:
0 g1 g 2 0 4 2
H = g1 L11 L12 = 4 0 2q 2 .
g 2 L21 L22 2 2q 2 2q1
In the critical point, the determinant is:
0 4 2
H = 4 0 40 = 0 + 320 + 320 0 160 0 = 480 > 0 .
2 40 10
Hence, it really is a maximum point.
is equal to a derivative of the objective function (utility function u) with respect to the constraint
du
constant (income m): = . Hence, shows approximate change in utility as income increases
dm
by one unit. The value of can be calculated from both the first and second equation:
q 22 20 2 2q q 2 5 20
= = = 100 or = 1 2 = = 100 .
4 4 2 2
78
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
Hence, if a consumer gets one additional unit of income (euro, for example, if the income and prices
are measured in euros), his or her maximum utility increases approximately by 100 units.
Interesting results can be found when solving the problems in a general form. For example let us
maximize a Cobb-Douglas type utility function u = q1 q 2 subject to a budget constraint
p1 q1 + p 2 q 2 = m .
The Lagrangian function is then:
L = q1 q 2 + (m p1 q1 p 2 q 2 ) .
When taking derivatives with respect to q1 , q 2 and and setting them equal to 0, we can solve the
first two equations for :
q1 1 q 2 q1 q 2 1 q 2 p p
= = or = 1 or q 2 = 1 q1 .
p1 p2 q1 p2 p 2
This equation determines the optimal proportion of the two goods in a consumption bundle.
Replacing into the last equation (budget constraint) gives:
p 1 +
p1 q1 + p 2 q1 = p1 q1 = m
p 2
and solving this for q1 gives the demand function of the first good:
m
q1 = .
+ p1
Analogically, the demand function of the second good can be obtained:
m
q2 = .
+ p2
These results indicate that the share of in + determines, how much will the optimum demand
m
of the first good be relative to the amount that could be consumed when all the income would
p1
be used on the first good. Analogically the share of in + determines the optimum q 2
m
relative to the maximum possible amount .
p2
We can calculate the value of for example from the first equation:
1 +
q1 1 q 2 m m m +
= = = .
p1 p1 + p1 + p 2 p1 p 2 + m
The maximum utility is then:
+
m m m
u mx = = .
p1 + p 2 + p1 p 2 +
du
dm = + . This expression is an analogue of a growth rate (see Chapter 10.2)
Hence =
u mx u mx m
and shows the relative increase in utility as one unit is added to the budget constraint.
79
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
When the same preferences are described by a transformed utility function obtained with the help
of logarithms: v = ln q1 + ln q 2 (see Chapter 10.1 for this transformation), then the Lagrangian
function is:
Lv = ln q1 + ln q 2 + lv (m p1 q1 p 2 q 2 ) .
When taking derivatives with respect to q1 , q 2 and and setting them equal to 0, we can solve the
first two equations for :
v = = ,
p1 q1 p2 q2
p 1
and from that the optimal proportion q 2 = q1 can be obtained that appears to be the same as
p 2
the proportion found for the untransformed form of the Cobb-Douglas type utility function. Since
the budget constraint has also remained the same, the optimal bundle and the demand functions are
the same as well.
The maximized utility and the value of the Lagrange multiplier are of course different. Since
v = ln u , then the Lagrange multipliers from two problems are related as follows:
dv
lv dv d ln u 1
= dm = = = .
l du du du u
dm
In order to confirm this, we can calculate v from the first equation:
+
v = = .
m m
p1
+ p1
a+
v m 1
Hence = = .
a a m a + u max
a +
p1a p 2 a + m
The maximum transformed utility is then
a
m a m m a m
v max = a ln + ln = ln =
p1 a + p2 a + p1 a + p 2 a +
a a m a +
= ln a .
p1 p 2 a +
80
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
both cases it is an optimization with a constraint. In the first case the amount of production is
maximized at fixed costs and the Lagrangian function (since labour is denoted with L, in order to
avoid confusion the Lagrangian function is denoted here with ) can be written as follows:
L = q(K , L ) + (c rK wL ) .
When taking derivatives with respect to K, L and and setting them equal to 0, we get from the
first two equations:
MPK MPL q
MPK r = 0 and MPL w = 0 or = = . Since = , then the Lagrange multiplier
r w c
shows here the marginal product of money: the amount of the product that is added when an
additional money unit is spent on the input. Hence, the marginal products of money have to be
equal: there should be no difference whether to spend the additional money unit on one or another
input. From the last equation we get the equation of the isocost line.
In the second case the costs are minimized for producing a certain amount of production:
L = rK + wL + + (q q(K , L )) .
When taking derivatives with respect to K, L and and setting them equal to 0, we get from the
first two equations:
r w
r MPK = 0 and w MPL = 0 or = = .
MPK MPL
c
Since = , then the Lagrange multiplier shows here the marginal cost. Hence, the marginal
q
costs for different inputs have to be equal: there is should be no difference whether to achieve the
unitary increase in the output by increasing the amount of one or another input. From the last
equation we get the equation of the isoquant curve.
The optimal proportion of the inputs is the same however, both when the production is maximized
at fixed costs or when the costs are minimized for producing a certain amount of production. As a
remark: the same applies in the household theory: the optimal proportion of goods in a
consumption bundle is the same both when the utility is maximized at fixed costs or when the costs
are minimized for achieving a certain amount of utility.
In the case of maximizing the profit for the inputs a free optimization without any constraints is
used. When maximizing the profit (assuming the perfect competition and thus, fixed price):
p = R c = p q(K , L ) rK wL
we take derivatives with respect to K and L and set them equal to 0:
r w
p MPK r = 0 and p MPL w = 0 or p = = .
MPK MPL
When denoting the Lagrange multiplier in the production-maximizing problem as q and in the
cost-minimizing problem as c we can write:
1
p = c = .
q
The Lagrange multiplier of the cost-minimizing problem c gives the price, at which the
production-maximizing input bundle maximizes the profit as well. In another words: when we
maximize the profit knowing the price, the marginal cost of the optimal bundle has to be equal to
81
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
the price. This result confirms the well known condition that for the maximum profit the price has
c
to be equal to the marginal cost: c = = MC , and the equation of the supply curve thus has to be
q
p = MC .
MPK MPL
We can also see that the condition for the optimal input bundle = (that gives the
r w
optimal proportion of the inputs) ensures the maximum profit as well. The optimal values of the
inputs K and L in an optimal input bundle are found from the equation of the isocost line or the
isoquant curve. When maximizing the profit, the optimal values of the inputs K and L are
determined by the price p.
MPK MPL
The condition = has a geometrical interpretation as well. As mentioned, this
r w
condition gives a proportion for the optimal values of the inputs (in the household theory the
analogical condition gives the proportion for the optimal quantities of goods in a consumption
bundle). In the case of the Cobb-Douglas type functions this proportion is always expressed by a
linear relationship, for example K = nL , where n is a constant that describes the relationship).
This proportion can be depicted as a curve on a traditional figure that describes the optimization
problem (see Figure A6.1). For our example, this curve is a straight line K = nL . Every point of
that curve is a possible optimal solution that is optimal at a particular level of costs or production
(depending on whether we are minimizing the costs or maximizing the product). This curve is often
called as an expansion path.
K
isoquants
K=nL
or the expansion path
isocosts
82
OPTIMIZATION WITH CONSTRAINTS Anneli Kaasa
K* w
d
L* = r = 1 . Thus, in the case of this type of production function,
and the elasticity =
w
w
d
r r
w
as the ratio of prices increases by 1% (for example the price of labour increases by 1%), then the
r
K*
ratio also increases by 1% (the amount of capital is increased and/or the amount of labour
L*
decreased).
g m ( x , x ,2, x ) = b .
1 2 n m
functions with respect to the arguments. These two submatrices reflect each other over the principal
diagonal. On the upper right there is a m m matrix of 0-s.
The bordered Hessian in a general case is:
0 0 0 g11 g 12 g 1n
0 0 0 g12 g 22 g n2
g1m g 2m g nm
H = 1
0 0 0
,
g g 2
g1m L11 L12 L1n
11 1
g2 g 2
2 g 2m L21 L22 L2 n
g 1n g n2 g nm Ln1 Ln 2 Lnn
where g ij stands for a derivative of i-th constraint function with respect to j-th argument.
For the second-order condition, the principal minors are found. Let H i be a principal minor that
includes i rows and columns from the lower right submatrix of the derivatives of the Lagrangian
function. Thus, H i is a determinant of a (m + i ) (m + i ) matrix. For example H 1 includes m + 1
rows and columns (starting from upper left corner), H 2 m + 2 rows and columns and so on.
When investigating the critical points, those principal minors are considered that are of m + 1 or
higher-order:
H i , where i = (m + 1), , (m + n ) .
if the considered principal minors are with the same sign ( 1) , that is (1) m H i > 0 , it is a local
m
minimum point.
Hence it depends on the number of constraints, whether for the minimum all principal minors have
to be positive or negative. If there is an odd number of constraints, then ( 1) < 0 and all principal
m
minors have to be negative, in the case of even number of constraints ( 1) > 0 and all principal
m
minors have to be positive. It depends also on the number of constraints, from which sign starts the
alternating of the sign for the maximum. If there is an odd number of constraints, then ( 1) > 0
m +1
and alternating starts from a positive principal minor, in the case of an even number of constraints
( 1)m+1 < 0 and alternating starts from a negative principal minor,.
In a case of two arguments and one constraint ( m = 1 ) the first principal minor considered is of
order m + 1 , that is H 2 . For a minimum all principal minors (in that case only H 2 that is equal to
H ) have to be negative, since ( 1) = ( 1) < 0 . For a maximum the sign of the principal minors
m 1
has to alternate starting from ( 1) = ( 1) > 0 , in this case H 2 has to be positive. This is in
m +1 2
84
7. COMPARATIVE STATICS
p*' p*
p* p*'
D
D D' D'
q q
c+em c+em' c+em
Figure A7.1. Changes of the demand curve caused by an increase in income m (left) and an increase
in the parameter d (right)
Hence, when income increases, the equilibrium price increases. But what happens to the
equilibrium price, if the slope of the demand curve decreases? If the slope of demand curve is
85
COMPARATIVE STATICS Anneli Kaasa
smaller, the parameter d (in the demand function) is larger (because usually demand curves are
drawn so that price is on vertical and quantity on horizontal axis, that is, based on an inverse
demand function). Hence, we are interested in the change caused by the increase of the parameter d.
Using the rule of the derivative of a quotient:
(+)
p * (a + c + em)
= < 0.
d (b + d )2
(+)
As income is assumed to be positive, it can be concluded that if the demand curve becomes flatter
the equilibrium price will be smaller.
For a quantitative comparative statics we need to know the values of the parameters. Let the
demand function be q D = 24 3 p + 0,01m and the supply function q S = 4 + 4 p . The equilibrium
price is then:
4 + 24 + 0,01m m
p* = = 4+ .
4+3 700
p* 1 1
Since = , as income increases by one unit, then the equilibrium price increases by
m 700 700
1
unit. Or as income increases by 1000 euros, the equilibrium price increases by 1000 1,43
700
euros.
Often only the general form of functions and the signs of the partial derivatives are known. In that
case, a derivative of implicit function becomes useful. For example, if the demand and supply
functions are defined as follows:
q D = q D ( p, m ) ,
q S = q S ( p)
and the equilibrium condition is:
q S ( p ) = q D ( p, m )
or
q S ( p ) q D ( p, m ) = 0 ,
we can denote f ( p, m ) = q S ( p ) q D ( p, m ) . If we need to know the impact of income on the
equilibrium price, we can find it with the help of a derivative of the implicit function. For that we
need the partial derivatives of f ( p, m ) with respect to p and m ( f p and f m ). The derivative of the
implicit function:
p* f qD qD
= m = S mD = S m D .
m fp qp qp qp qp
Knowing that the supplied quantity depends positively on the price ( q Sp > 0 ) and the demanded
quantity depends negatively on the price ( q pD < 0 ) and positively on the income ( q mD > 0 ), we can
determine the sign:
(+)
p* qD
= S m D > 0.
m qp qp
(+) ()
86
COMPARATIVE STATICS Anneli Kaasa
C = C 0 + c (Y T ) , 0 c 1.
Taxes are constituted of the autonomous taxes and a portion (determined by a parameter t) of the
total income:
T = T0 + t (Y ) , 0 t 1.
Three equilibrium conditions are then:
Y C I 0 G0 = 0,
C C 0 c (Y T ) = 0.
T T t Y = 0.
0
Solving this model, for example, for the variable Y (the other dependent variables are C and T) we
get: Y = C 0 + c (Y (T0 + t (Y ))) + I 0 + G0 or:
C 0 + I 0 + G0 cT0
Y* = .
1 c + ct
Now we can analyse qualitatively (a quantitative analysis needs known values of the parameters)
the impact of different independent variables on the equilibrium value of Y:
Y * Y * Y * 1 Y * c
= = = > 0, = < 0.
C 0 0 G0 1
c + ct
T0 1 c + ct
(+) (+) (+) (+)
We can also analyse the impact of parameters, for example the impact of the marginal propensity to
consume (c):
Y * T0 (1 c + ct ) (C 0 + I 0 + G0 cT0 )( 1 + t ) T0 + (1 t )Y *
= = =
c (1 c + ct )2 (1 c + ct )
(+)
*)+ )
Y * T0 tY * Y * T *
= = >0
(1 c + ct ) (1 c + ct )
))(
(+)
87
COMPARATIVE STATICS Anneli Kaasa
Hence, an increase in the autonomous consumption, the government expenditures and the marginal
propensity to consume increases and an increase in the autonomous taxes and the tax rate decreases
the equilibrium national income.
Comparative statics analyses the change of equilibrium values caused by the change of other
variables or parameters using derivatives. In the case of quantitative analysis, the amount of change
of the equilibrium value is calculated, which means the amount of the change causing it also has to
be known. More often only qualitative analysis is performed and that means only the sign of change
is determined. The interpretation of the results is the same as the interpretation of a derivative.
The matrix that consists of derivatives of functions F 1 to F n with respect to the dependent
variables, is called Jacobian and denoted with J. Here:
F 1 F 1
y y 2
J = 12 .
F F 2
y
1 y 2
Using Cramers rule we can find the impact of x1 on the dependent variables y i : we divide the
determinant of matrix J i (where the i-th column is replaced with the vector from the right side on
equation) by the determinant of Jacobian J:
88
COMPARATIVE STATICS Anneli Kaasa
y i Ji
= .
x1 J
F 1 F 1
y1 x1
F 2 F 2
y 2 J2 y1 x1
For example, = = .
x1 J J
If the impact of x 2 is of interest, the derivatives of the functions derived from the equilibrium
conditions have to be taken with respect to x 2 . The matrix equation includes then x 2 instead of x1 ,
but Jacobian remains the same that makes analysing the impacts of different variables easier.
In general, the number of dependent variables should be equal to the number of equilibrium
conditions. Next, briefly the main steps are shown that are used in general case. Let there be a
model with n equilibrium conditions and dependent variables and m independent variables
The equilibrium conditions can be expressed as follows:
F 1 ( y1 ,2, yn ; x1 ,2, xm ) = 0,
2
F ( y1 ,2, yn ; x1 ,2, xm ) = 0,
F n ( y ,2, y ; x ,2, x ) = 0,
1 n 1 m
and the matrix equation for examining the impact of i-th independent variable:
F 1 F 1 F 1 y1 F 1
y
12 y 2 y n xi xi
F F 2 F 2 y 2 F 2
y y 2 y n xi = xi .
1
n
n y
F F n F n F n
y1 y 2 y n xi xi
Now, with the help of Cramers rule it is possible to estimate the direction of the impact of i-th
independent variable (qualitative analysis) and with the help of known values of parameters also the
extent of the impact (quantitative analysis). In the case of more complex analyses the fact that the
same Jacobian can be used repeatedly, becomes very useful.
In general, it is always possible to transform n equilibrium conditions to a form where a function of
all variables is on the one side and 0 on the other side of equation:
F 1 ( y1 , , y n ; x1 , , x m ) = 0,
F n ( y , , y ; x , , x ) = 0,
1 n 1 m
For analysing the impact of the independent variables on the equilibrium values of the dependent
variables, partial total derivatives with respect to a particular independent variable can be used.
According to the formula for a total differential:
89
COMPARATIVE STATICS Anneli Kaasa
F 1 F 1 F 1 F 1
dy + + dy + dx + + dx n = 0 ,
1 n 1
y1 y n x1 x n
F n F n F n F n
dy1 + + dy n + dx1 + + dx m = 0.
y1 y n x1 x m
For analysing the impact of a particular independent variable (for example x1 ) on the equilibrium
values of the dependent variables, we divide the equations obtained by the differential of this
particular independent variable ( dx1 in our example). This brings us into a situation where on the
left side there are the partial total derivatives of the functions F 1 , , F n with respect of this
particular independent variable ( x1 here):
1 0
F 1
F y1
1
F y n F x1
1 1
F x n
1
x = y x + + y x + x x + + x x = 0 ,
1 1 1 n 1 1 1 n 1
F n F n y F n y n F n x1 F n x n
= 1
++ + ++ = 0.
x1 y1 x1 y n x1 x1 x1 x n x1
1 0
Since the independent variables are not related to each other, the corresponding derivatives are
x
equal to 0 and also we know that 1 = 1 :
x1
F 1 F 1 y1 F 1 y n F 1
= ++ = ,
x1 y1 x1 y n x1 x1
F n F n y F n y n F n
= 1
++ = .
x1 y1 x1 y n x1 x1
This system can be written as a matrix equation that enables to find the impact of x1 on the
equilibrium values:
F 1 F 1 y1 F 1 F 1 x1 F 1
y
1 y n x1 x1 x n x1 x1
n = =
F F n y n F n F n x n F n
y y n x1 x1 x n x1 x1
1
Analogically we can derive the matrix equations for analysing the impact of the other independent
variables x 2 , 2 , x m . In general, for an independent variable xi the matrix equation:
F 1 F 1 y1 F 1
y1 y n xi xi
n = .
F F n y n F n
y
y n xi xi
1
90
COMPARATIVE STATICS Anneli Kaasa
A matrix that consists of the derivatives of different functions with respect to the dependent
variables, is called Jacobian:
F 1 F 1
y1 y n
J =
F n F n
y1 y n
The elements of this matrix and the value of its determinant is the same regardless of the
independent variable xi ( i = 1, , m ) whose impact is under consideration at the moment. Therefore
once the value of this determinant has been found, it can be used repeatedly for analysing the
impact of any independent variable or parameter on the equilibrium values of the dependent
variables.
Using the Cramers rule, we can find the impact of a particular variable xi on the dependent
variable y j : we divide the determinant of a matrix J j , where the j-th column is replaced by the
vector of constants on the right side of matrix equation, by the determinant of the Jacobian J:
y j Jj
= .
xi J
At that, the vector of constants used is found by taking derivatives of the functions F 1 , , F n with
respect to xi and multiplying them by 1 . Hence, the dependent variable under discussion
determines, which column is replaced, and the independent variable under discussion determines
with what it is replaced.
When an impact of a parameter is of interest, the procedure is analogical and a particular parameter
is in the place of xi (the vector of constants is the found by taking derivatives of the functions
F 1 , , F n with respect to this parameter and multiplying them by 1 ).
Let us look at a national income model from macroeconomics, where the national income Y is
equal to the total expenditures that is divided between consumption C , investments I and
government expenditures G and the last two are independent variables denoted with a subscript 0:
Y = C + I 0 + G0 .
Consumption, in turn, depends on income. From the income after taxes (income minus taxes T)
only a certain portion (expressed by a parameter c) is used for consumption and in addition, there is
an autonomous consumption C 0 :
C = C 0 + c (Y T ) , 0 c 1.
Taxes are constituted of the autonomous taxes and a portion (determined by a parameter t) of the
total income:
T = T0 + t (Y ) , 0 t 1.
Three equilibrium conditions are then:
Y C I 0 G0 = 0,
C C 0 c (Y T ) = 0,
T T t Y = 0.
0
91
COMPARATIVE STATICS Anneli Kaasa
When constructing a matrix equation for examining the impact of a particular variable of parameter,
it is important to use the same order of dependent variables (here Y , C and T ) in constructing
Jacobian and in constructing the vector of the impacts of interest on the left side of the matrix
equation. The choice of order itself is not important, it just has to be used consistently.
For example, for the impact of investments I 0 on the equilibrium values of the dependent variables
the following matrix equation is used:
1 1 0 YI FI 1
1
c 1 c C = F 2 = 0
I I
t 0 1 TI FI3 0
and the impact of investments, for example, on the equilibrium income is:
1 1 0
0 1 c
(+)
Y 0 0 1 1 1
= = = > 0.
I 0 1 1 0 1 + tc c (1 c )+ tc
(+) (+)
c 1 c
t 0 1
Hence, investments have a positive impact on the equilibrium income.
The same method can be used for examining the impact of parameters, for example the impact of
the tax rate t on the equilibrium values of the dependent variables can be analysed with the help of
the following matrix equation:
1 1 0 Yt Ft 0
1
c 1 c C = F 2 = 0
t t
t 0 1 Tt Ft 3 Y
and the impact of the tax rate on the equilibrium income is:
0 1 0
0 1 c
(+)
Y Y cY
0 1 cY
= = = < 0.
t 1 1 0 1 + tc c (1 c )+ (tc+ )
(+)
c 1 c
t 0 1
Since 0 c 1 , then 1 c 0 and the impact of the tax rate on the equilibrium income is negative:
as the tax rate increases, the equilibrium income decreases.
These results are in accordance with those obtained before with a qualitative analysis performed
without using matrix algebra.
92
COMPARATIVE STATICS Anneli Kaasa
Next, let us look at the comparative statics analysis in the case of the IS-LM model covering both
the market for goods and the money market. The goods market is assumed to be described by the
following equations. The investments depend negatively on the interest rate:
I = I (i ) , I i < 0 ,
the savings depend positively on the interest rate and on the national income: S = S (i, Y ) , S i > 0 ,
SY > 0 ;
the government expenditures are exogenous:
G = G0 ,
and the governments tax revenue depends positively on the national income:
T = T (Y ) , TY > 0 ;
the amount of imports depends positively on the national income:
IM = IM (Y ) , IM Y > 0 ,
and of exports is exogenous:
X = X0 .
The money market is assumed to be described by the following equations. The money supply is
exogenous:
M = M0,
and the demand for money depends positively on the national income and negatively on the interest
rate:
L = L(i, Y ) , Li < 0 , LY > 0 .
The first equilibrium condition states that the leakages have to be equal to the injections:
S (i, Y ) + T (Y ) + IM (Y ) = I (i ) + G0 + X 0
or the difference between the savings and investments have to be equal to the sum of the budget
deficit and the net exports:
S (i, Y ) I (i ) = G0 T (Y ) + X 0 IM (Y ) .
The second equilibrium condition states that the money supply has to be equal to the demand for
money:
M 0 = L(i, Y ) .
Hence:
F 1 (i, Y ; G0 , X 0 , M 0 ) = S (i, Y ) + T (Y ) + IM (Y ) I (i ) G0 X 0 = 0,
2
F (i, Y ; G0 , X 0 , M 0 ) = M 0 L(i, Y ) = 0.
There are two dependent variables in this system: Y and i and the independent variables are G0 ,
X 0 and M 0 (S, I, T, IM and L denote functions in this model).
Let us find the determinant of the Jacobian:
93
COMPARATIVE STATICS Anneli Kaasa
F 1 F 1
J = i 2 Y = S i I i S Y + TY + IM Y
= LY S i I i + Li S Y + TY + IM Y < 0
F F 2 Li LY (+) (+) () () (+) (+) (+)
i Y
In order to analyse the impact of the government expenditures, we can use a matrix equation:
i * F
1
S i I i S Y + TY + IM Y G0 G0 1
L LY Y * =
F 2 = 0 .
i
G0 G0
From here, the impact of the government expenditures on the equilibrium interest rate and
equilibrium national income:
1 S Y + TY + IM Y
(+)
i * J1 0 LY LY
= = = > 0,
G0 J J J
()
Si I i 1
()
Y * J 1 Li 0 L
= = = i > 0.
G0 J J J
()
This result is in accordance with the theoretical assumptions: an increase in the government
expenditures shifts the IS-curve to the right and thus increases the equilibrium interest rate and the
equilibrium income (see Figure A7.1).
i
LM
IS'
IS
S dS i dS Y
= + > 0.
G0 di G0 dY G0
(+) (+) (+) (+)
Comparative statics analysis can also be performed when optimizing. For example the necessary
conditions for optimizing the function z = f ( x1 , x 2 , 2 , x n ) form a system:
f1 = 0
f =0
n
and for that system the element of the Jacobian jij is a the derivative of the function f j based on
the j-th condition with respect to the i-th variable xi . It appears that this Jacobian coincides with
the Hessian that is used for investigating whether the sufficient conditions are satisfied:
f11 f1n
J = = H .
f n1 f nn
Hence, if the sign of the determinant of a Hessian is known, the sign of the determinant of the
corresponding Jacobian is also known, since J = H .
For example, when maximizing the profit p = p q(K , L ) rK wL the necessary conditions are:
(K and L are the dependent variables here):
p K = F 1 ( K , L; r , w, p ) = p MPK r = 0,
p L = F 2 ( K , L; r , w, p ) = p MPL w = 0.
The Jacobian that is the Hessian as well:
F 1 F 1
p q KL p KK p KL
J = K2 L = p q KK = =H.
F F 2 p q LK p q LL p LK p LL
K L
Since for confirming a maximum profit the determinant of the Hessian should be positive, then:
J = H > 0.
We can examine, for example, the impact of the changes in price on the optimal quantities of the
inputs K and L by constructing a matrix equation:
K * F
1
p q KK p q KL p p MPK
= =
pq p q LL L * F 2 MPL
.
p p
LK
From here, the impact of the price on the optimal quantity of capital, for example, is:
MPK p q KL (+)
(+) () (+) (+)
p MPK q LL + MPL q KL
K * J 1 MPL p q LL
= = = > 0.
p J J J
(+)
95
COMPARATIVE STATICS Anneli Kaasa
The sign of this derivative can be found in a following way. A necessary condition for maximizing
the profit is that p MPK r = 0 and p MPL w = 0 . Since we assume all prices (p, w, r) to be
positive, then MPK and MPL have to be positive (it is logical: production increases as the input
increases). According to the second-order condition the first-order principal minor p KK = p q KK
should be negative and thus q KK should be negative. Since the order of variables is not important
when optimizing, then it can also be said the first-order principal minor should be p LL = p q LL
instead and thus, q LL should be negative as well. For determining the sign of q KL it is logical to
assume that as the amount of capital increases, the marginal product of labour increases, hence q KL
can be assumed to be positive. Taking all together it can be concluded that an increase in the price
of a product increases the optimal quantity of capital (the derivative of the equilibrium value of
capital with respect to price is positive).
In the case of optimization with constraints, the Jacobian used for comparative statics is similar
to the bordered Hessian used for optimization and the determinants of those two matrices are
equal. This can be easily shown by using a certain order for the optimality conditions, but in the
other cases also the transformations (one can change the signs of an even number of rows or
columns of a matrix or switch the rows or columns an even number times without changing the
value of the determinant of this matrix) can be made in order to show that these two matrices
coincide.
For example, when optimizing the objective function z = f ( x, y ) subject to a constraint g ( x, y ) = b
the optimality conditions can be found by taking derivatives of the Lagrangian function
L = f ( x, y ) + [b g ( x, y )] with respect to , x and y:
L = b g ( x, y ) = 0,
L x = f x g x = 0,
L = f g = 0.
y y y
For finding the Jacobian we take the derivatives of these functions L , Lx and L y again with
respect to the variables , x and y:
L L L
x y L
Lx Ly 0 gx gy
L Lx Lx
J = x = L Lxx Lxy = g x f xx g xx f xy g xy .
y
x
x
L L y
L y L y L yx L yy g y f yx g yx f yy g yy
y
x y
The commonly known form of the bordered Hessian for this optimization problem is:
0 gx gy 0 gx gy
H = g x L xx L xy = g x f xx g xx f xy g xy ,
g y
L yx L yy g y f yx g yx f yy g yy
by multiplying the first row and the first column by 1 (does not change the value of the
determinant) we get the Jacobian:
96
COMPARATIVE STATICS Anneli Kaasa
0 gx gy 0 gx gy
H = gx f xx g xx f xy g xy = g x f xx g xx f xy g xy = J .
gy f yx g yx f yy g yy gy f yx g yx f yy g yy
However, it must be noted that if the bordered Hessian is obtained by differentiating the first-order
conditions again, then it is exactly equal to the Jacobian.
In the case of the problem of production-maximization at fixed costs (Lagrangian function
L = q(K , L ) + (c rK wL ) ) the optimality conditions are:
= F 1 ( , K , , r , w, c) = c rK w = 0,
K = F ( , K , , r , w, c) = MPK r = 0,
2
= F ( , K , , r , w, c) = MP w = 0.
3
0 r w * c 1
r q
q KL K * = 0 .
KK
c
w q LK q LL L * 0
c
For example, the impact of the expenditures that can be made on the marginal product of money:
1 r w
0 q KK q KL
() () (+)
* J1 0 q LK q LL q KK q LL + q 2
= = = KL
.
c J J J
(+)
*
In the case where q KK q LL + q KL
2
> 0, > 0 and hence, as the expenditures increase, every
c
additional unit of money brings about more and more product. In the opposite case the conclusion is
the opposite.
97
COMPARATIVE STATICS Anneli Kaasa
However, we can use the connection of this problem of production-maximization at fixed costs with
the problem of profit-maximization. There, in order to maximize the profit, the Hessian should be
positive:
p q KK p q KL
H =
p q LK p q LL
(
= p q KK q LL q KL
2
)
>0
x n = bn1 y1 + bn 2 y 2 + 2 + bnn y n .
The coefficient bij can then be viewed as a derivative of the total output with respect to the final
demand:
xi x
bij = i .
y j y j
Hence, the coefficient bij shows the increase (approximately) of the total output in the i-th industry
xi , as the final demand of the j-th industry y j increases by one unit.
98
8. DIFFERENCE EQUATIONS
+ u (t ) y = w(t ) ,
dy
dt
where u and w are, like y, functions of time. If u and w are constants (for example a and b,
respectively), the equation is:
dy
+ ay = b .
dt
When solving a differential equation, the result is a function y = f (t ) that describes how variable y
depends on time. The solution of a differential equation y (t ) , thus is an expression where time is
the only variable (no derivatives or differentials).
In the case of discrete time the value of y changes as t changes to a new integer value (see Figure
8.1). Because of this, the graph of the function is discontinuous. In that case a function is not
differentiable and use of derivatives and differentials is impossible. Instead, differences are
analysed and the time path of y is described with the help of a difference equation. In the case of
discrete time, often the values of t are interpreted not as time moments, but as periods the value
of the function remains the same during the whole period between two moments. The change of y in
99
DIFFERENCE EQUATIONS Anneli Kaasa
a time unit is now described as the difference between the values in two time periods:
y = y t y t 1 .
y y
y=f(t) y=f(t)
0 1 2 3 4 5 t t
Figure 8.1. Graph of a function in the case of discrete and continuous argument
dy dy
If we replace in the simple differential equation + cy = b the derivative (that describes the
dt dt
change over time) with the difference y = y t y t 1 and the value of y with the initial value y t 1 ,
we get an equation:
y t + cyt 1 = b or y t y t 1 + cyt 1 = b .
After rearranging:
y t = (1 c ) y t 1 + b .
After denoting 1 c = a , we get a simple linear first-order difference equation:
y t = ay t 1 + b .
A difference equation includes the variable y in time period t ( y t ) and in previous time periods
( y t 1 , y t 2 etc.). The greatest number of periods lagged in the equation determines the order of the
equation.
When solving a difference equation, the result is a function y = f (t ) as well. The solution of a
difference equation y t , is an expression where time is the only variable (no values of y in different
time periods).
100
DIFFERENCE EQUATIONS Anneli Kaasa
dependent part of the function describes the difference of y in a particular period from the
equilibrium value.
An equilibrium can be understood as a stable state where there is no tendency to change. Hence, in
an equilibrium there is no change over time:
y* = y t = y t 1 .
After replacing in the difference equation all values of y in different time periods with y * , we get:
y* = ay * + b ,
and hence, the formula for finding the equilibrium value, is:
b
y* = , where a 1 .
1 a
The case where a = 1 , will be discussed later.
It can be proved (see Appendix 3) that the complementary function in the case under consideration
here is in a following form:
( y 0 y *) a t ,
where y 0 is the initial value of y (the solution of the difference equation y t describes the time path
of y after taking on the initial value y 0 ). The general solution of difference equation y t = ay t 1 + b
can be found as a sum of the equilibrium value and the complementary function:
y t = y * +( y 0 y *) a t .
0
It has to be pointed out that if constant b is 0, then the equilibrium value is y* =
= 0 and the
1 a
general solution is equal to the complementary function y t = y 0 a t . The derivation of the
complementary function that can be seen in Appendix 3 actually lies in solving the equation
y t = ay t 1 .
b
Since it is known that y* = , the formula:
1 a
b b t
yt = + y0 a
1 a 1 a
can be used as well.
For example, given a difference equation y t = 3 y t 1 8 , the equilibrium value can be found as:
8
y* = = 4,
1 3
and the general solution is:
y t = 4 + ( y 0 4 )3t .
If the initial value y 0 is known, a definite solution can be found. For example, if y 0 = 2 , then:
y t = 4 + (2 4)3t = 4 + ( 2 )3t .
101
DIFFERENCE EQUATIONS Anneli Kaasa
b b
In a special case, where a = 1 , the formula for the equilibrium value y* = = cannot be used.
1 a 0
The equation is then y t = y t 1 + b and it can be seen that in every period constant b is added to the
value of previous period:
y1 = y 0 + b ,
y 2 = y1 + b = y 0 + 2b ,
y t = y 0 + tb .
Hence, if a = 1 , the general solution is y t = y 0 + tb .
102
DIFFERENCE EQUATIONS Anneli Kaasa
The behaviour of a t over time is determined by the parameter a (that is actually the coefficient of
y t 1 in the difference equation). Namely:
if a > 1 , then at , as t ;
if a = 1 , then at = 1, as t ;
if 0 < a < 1 , then at 0 , as t ;
if a = 0 , then at = 0 , as t .
If a = 0 , the difference equation takes a following form y t = 0 y t 1 + b = b the value of y t is the
same (constant b) in every period and it is not possible that system leaves the equilibrium. Hence,
this case is left out of consideration.
If a = 1 , the general solution is y t = y 0 + tb and the value of y t converges to the positive or
negative (depending on the sign of b) infinity. As in this case converging of y t to the equilibrium
value is not possible as well, this case is also left out.
The value of the variable converges to its equilibrium value, if the complementary function
approaches 0 for that a t has to approach 0. This is possible, if the absolute value of the
parameter a remains between 0 and 1: 0 < a < 1 . In that case the process that follows to the event
that brought y out of the previous equilibrium is converging to the equilibrium value and the
equilibrium is stable. If a is positive, then y takes values only above (if initial value is larger than
equilibrium value, see Figure 8.2) or below (in the opposite case) the equilibrium value. If a is
negative, the value of y oscillates above and below the equilibrium value when converging to it.
If the absolute value of a is larger than 1 ( a > 1 ), then a t approaches the infinity and thus, the
value of the complementary function increases infinitely as well. Hence, the variable y diverges
from its equilibrium value and the equilibrium is unstable. Here, also the process is oscillatiory, if a
is negative.
Concluding:
if a > 1 , then y t , as t , and it is a diverging process and the equilibrium is unstable;
if 0 < a < 1 , then y t y * , as t , and it is a converging process and the equilibrium is stable;
If a = 1 , the sign of the complementary function alternates over time: ( y 0 y *)( 1) = const and
t
the value of y oscillates between two values. If t is an even number, then the value of y is equal to
the initial value: y t = y * +( y 0 y *)1 = y 0 ; if t is an odd number, then the value of y is:
y t = y * + ( y 0 y *)( 1) = 2 y * y 0 . Hence, if an initial value is given to y that is different from the
equilibrium value ( y 0 y * ), y never reaches the equilibrium value and hence, the equilibrium is
unstable.
All this can be illustrated by Figure 8.2 that presents the graphs of the function y t = f (t ) for
different values of a.
103
DIFFERENCE EQUATIONS Anneli Kaasa
The sign of the complementary function also depends on the sign of ( y 0 y *) , that is whether the
initial value is larger or smaller than the equilibrium value. The graphs on Figure 8.2 correspond to
the case where y 0 > y * , that is where ( y 0 y *) is positive. If the initial value is smaller than the
equilibrium value ( y 0 < y * ), then the graphs of y = f (t ) for different values of a are reflections of
those on Figure 8.2 over the horizontal line y t = y * . If y 0 = y * , the equilibrium holds and the
graph of y = f (t ) is a horizontal line: y t = y * .
When sketching graphs like these, following method can be used. If a sum of two functions has to
be graphed, first one can graph both functions separately and then add one to the other in the
direction of vertical axis (the values of the function and not of the argument are summed). One can
imagine that when sketching the second function, the graph of the first function is viewed as a
replacement to the horizontal axis. Here, the function y t = f (t ) is a sum of the constant y * and
complementary function ( y 0 y *) a t . Hence, we can first sketch the horizontal line corresponding
to the constant ( y t = y * ) and then sketch the complementary function using the line y t = y * as the
replacement of the horizontal axis. The latter is an exponential function multiplied with a constant.
As the time is discrete, the graph is discontinuous (value of y changes every period).
yt yt
a>1 a<-1
y* y*
yt t yt t
0<a<1 -1<a<0
y* y*
t yt t
a=-1
y*
Figure 8.2. Graphs describing different processes dependent on the value of parameter a
Last, it has to be pointed out that in the case of a converging process, the value of y becomes closer
and closer to the equilibrium value, but actually never exactly reaches the equilibrium value. The
difference between the actual value y t and equilibrium value y * decreases, though, and it depends
on the researcher, when he or she decides to consider this difference equal to 0. For example, if the
104
DIFFERENCE EQUATIONS Anneli Kaasa
admissible error is = 0,01 , then the value of y is considered as the equilibrium value, if the
absolute value of the difference is smaller than 0,01: y t y * < 0,01 . The admissible difference can
also be expressed in the following way: y t = y * 0,01 .
105
DIFFERENCE EQUATIONS Anneli Kaasa
a+c bc ad
p* = and q* = .
b+d b+d
Next, we can analyse whether the market equilibrium is stable: if some event causes the system to
leave the equilibrium, does the system itself reaches the equilibrium again after some time? Figure
8.3 shows demand and supply curves of the market of a hypothetical agricultural product. The
intersection point E shows the equilibrium ( p * and q * ).
p
S
A B
p0
E
p*
C
p1
F
D
q0 q2 q* q1 q
Figure 8.3. Adjustments in the cobweb mode in the case of stable equilibrium
Let us assume an event that rules out holding the present equilibrium, for example there is lots of
rain in a year t = 0 and because of that, it is not possible to supply the equilibrium quantity and thus
the supplied quantity q 0 is smaller. Now, the equilibrium price does not hold and the new price in
that period ( p 0 ) is probably higher than the equilibrium price. If the quantity in the market is
known, then the price with what this quantity is sold can be found from the demand curve: moving
up from the quantity q 0 to the demand curve (point A) and then to the left leads us to the price axis
and from there we read the new price p 0 . Next spring, producer decides the quantity for the next
year q1 on the basis of the price p 0 according to the supply function. Moving to the right to the
supply curve (point B) and then down leads us to the quantity axis and from there we read the
quantity q1 . As this quantity is relatively large, consumers are less willing to pay. Moving up to the
demand curve (point C) and then to the right gives us the price p1 . Then the producer decides q 2
on the basis of p1 (point F on the supply curve) and so on. The name of the model comes from the
fact that the graphical representation of these adjustments is similar to a cobweb.
It has to be mentioned that if, differently from the example here, price is depicted on the horizontal
and quantity on the vertical axis, then the adjustment path consisting of arrows is going not
clockwise, but counter-clockwise.
Whether the adjustment path leads to the equilibrium point or more away from it, depends on the
slopes of the demand and supply curve. If the supply curve is steeper than the demand curve, as on
Figure 8.3, market reaches the equilibrium after some time, but if the demand curve is steeper than
the supply curve, then the process is diverging from the equilibrium and the path leads away from
the equilibrium point (Figure 8.4, left). If the absolute values of the slopes are equal (Figure 8.4
right), then price and quantity will oscillate between two values (one larger and the other smaller
than the equilibrium value).
106
DIFFERENCE EQUATIONS Anneli Kaasa
p p
S
S
E
E
D
D
q q
a+c b
The time path of price is described by the equation: pt = pt 1 . The solution of that is:
d d
t
b
pt = p * +( p 0 p *)
d
or
t
c+a c + a b
pt = + p0 .
d +b d + b d
The first term is the equilibrium price and the second term describes the behaviour of price after
taking on the initial value p 0 . In order the price to converge to the equilibrium value over time, the
absolute value of the coefficient of pt 1 in difference equation has to be between 0 and 1:
b
0< < 1 or d > b , as found before.
d
b b
In that case, the equilibrium is stable. If, however > 1 , the process is diverging and if = 1 ,
d d
the process is neither converging nor diverging. In both cases the equilibrium is unstable. As the
b
coefficient is always negative ( b, d > 0 ), then the process of adjustments is always oscillatory
d
in the cobweb model.
For the previously used example, the demand and supply functions are qtD = 24 3 pt and
qtS = 4 + 4 pt 1 . From the condition qtD = qtS the difference equation for price is:
24 3 pt = 4 + 4 pt 1 or
28 4
pt = pt 1 .
3 3
107
DIFFERENCE EQUATIONS Anneli Kaasa
p
6,3
4
3
0 1 2 3 t
108
DIFFERENCE EQUATIONS Anneli Kaasa
y t is placed on the vertical and y t 1 on the horizontal axis, hence the difference equation is
transformed to the form y t = f ( y t 1 ) . The graph of this function forms a phase line (see Figure 8.6).
At that mostly only that quadrant is considered where both variables have positive values.
In the case of the y t 1 y t -plane the value of the variable y in different periods is equal ( y t = y t 1 ) on
the 45 line. Hence, the value of y at which the phase line intersects the 45 line is the equilibrium
value. The points, where the phase line intersects or touches the 45 line, are the equilibrium
points. Whether the equilibrium is stable or not can be found out by drawing the time path of y on
phase diagram. That can be explained with the help of Figure 8.6, where one hypothetical phase line
is shown. There is an equilibrium point in the intersection point (E) of the phase line and the 45
line if the variable takes on the value y * , then this value will hold until a change in the
conditions.
yt
phase line
y* E
y2
A yt = f ( yt -1 )
y1
45
y0 y1 y2 y* yt-1
109
DIFFERENCE EQUATIONS Anneli Kaasa
conditions must hold at least in the neighbourhood that covers the change that is given to the
variable.
yt yt
yt yt-1 yt yt-1
yt-1 yt-1
110
DIFFERENCE EQUATIONS Anneli Kaasa
If finding the equilibrium values is not possible, then the intersection point of the phase line and the
45 line is not known. Despite that analysing the shape of function y t = f ( y t 1 ) enables to make
some conclusions about the stability.
For example the phase diagram that corresponds to the difference equation for price from the
a+c b
previous chapter: pt = pt 1 , in the case of a stable equilibrium (since for that it must hold:
d d
b
0 < < 1 , then the phase line is flatter than the 45 line) is depicted on Figure A8.1:
d
pt
a+c
d
p*
a+c b
pt = - pt -1
d d
45
p0 p* pt-1
Figure A8.1. The phase diagram of the market equilibrium model in the case of stable equilibrium
Let us look at the example of a difference equation for price from previous chapter:
28 4
pt = pt 1 . The equilibrium price has already been found: p* = 4 , at that price the phase line
3 3
intersects the 45 line. The phase line of this linear difference equation is a straight line with the
4 4
slope . A negative slope refers to a negatively sloping line and as > 1 , then the phase line
3 3
is steeper than the 45 line. Hence, we can draw a phase line as show on Figure 8.8.
pt
28
3
28 4
4 pt = - pt -1
3 3
p0 4 pt-1
111
DIFFERENCE EQUATIONS Anneli Kaasa
y* = y *0,5 ,
y * y *0 , 5 = 0 ,
y* y* = y* ( )
y * 1 = 0 .
This gives us two equilibrium values: y1 * = 0 and y 2 * = 1 .
In order to examine, for example, the stability of y 2 * = 1 , we can find the derivative of the function
y t = y t0,15 :
dy t
= 0,5 y t01,5 ,
dy t 1
and find its value in the equilibrium point:
dy t
(1) = 0,5 10,5 = 0,5 .
dy t 1
As the derivative is positive and its absolute value is smaller than 1, then the phase line is
positively sloping and flatter than the 45 line around the equilibrium value y 2 * = 1 . In this case
the process is analogical to that depicted on Figure 8.6. The value of y converges to its equilibrium
value from one side and the equilibrium is stable.
Phase diagrams allow us to analyse more complex difference equations, but at the same time they
can add an additional view to the solutions of quite simple equations, such as the equations that
describe the change of the value of a single amount of money and the annuity over time:
y t = (1 + r ) y t 1 and y t = (1 + r ) y t 1 + B (see Chapter 10.3), respectively. The graphs of these
functions are both positively sloping straight lines steeper than the 45 line ( 1 + r > 1 ), but they
have different vertical intercepts (see Figure A8.2).
yt
yt = B + (1 + r ) yt -1
yt = (1 + r ) yt -1
45
y0=0 y0=A yt-1
Figure A8.2. Phase diagram for a single amount of money and for the annuity
Although financial mathematics does not discuss the terms like equilibrium or stability, this phase
diagram illustrates that the value of money increases over time acceleratingly.
The dynamic analysis is mostly used in macroeconomics. The number of the difference or
differential equations included in the model depends on how many variables are included in the
model with lags. As an example of a simple model with one difference equation, the national
112
DIFFERENCE EQUATIONS Anneli Kaasa
income model can be used that is also known as the income-expenditure model. Let us assume that
the national income ( Y ) is defined to be equal to the total expenditures (E) that is divided between
consumption C and investments I (government and foreign sector are left out here). Consumption
is assumed to consist of the autonomous consumption AC and a certain portion (expressed by a
parameter c) of the income in the previous period. Taking all together:
Yt = Et ,
Et = Ct + I ,
C = AC + cY .
t t 1
Yt=Et
Yt = AC + I + cYt -1
AC + I
45
Yt-1
increase in y (moving up when y t is depicted on the vertical axis) y t should be increasing and
hence, before the line y t = 0 y t is negative (y decreases over time and the direction of the phase
path is opposite to the y-axis (down)) and after the line y t = 0 y t becomes positive and the phase
path goes to the same direction as the y-axis (up)). Analogical reasoning applies for the case when
y t
< 0 and for the changes around the xt = 0 curve. The demarcation curves divide the xt y t -
y t
plane into four segments. In order to describe the direction of movements of these two variables in
these segments, similarly to the case of continuous time, small arrows describing the direction of
movements are marked. On the basis of these arrows now the phase paths can be sketched starting
from different initial points in different segments. The phase path may converge to the equilibrium
point, but can also diverge from it.
114
9. DIFFERENTIAL EQUATIONS
115
DIFFERENTIAL EQUATIONS Anneli Kaasa
dy dy
In a special case, where a = 0 , the equation + ay = b can be rewritten as = b . To solve this
dt dt
equation we can integrate both sides with respect to time:
dy
dt dt = bdt .
After cancelling differentials dt on the left side, an integral of 1 with respect to y remains. The
constants of integration can be both brought to one side and denoted as C. Hence:
y (t ) = bt + C .
At time t = 0 the value of y is y (0 ) = b 0 + C = C constant of integration C actually can be
viewed as the initial value of y. Hence, if a = 0 , then the general solution is:
y (t ) = bt + y (0 ) .
equilibrium value, then the complementary function is positive and the value of y diverges from the
equilibrium value to the direction of positive infinity ( y t + ). In the opposite case, the
complementary function is negative and y approaches the negative infinity ( y t ).
Concluding:
if a > 0 , then y t y * , as t , and it is a converging process and the equilibrium is stable;
y(t) y(t)
a>0 a<0
y* y*
t t
117
DIFFERENTIAL EQUATIONS Anneli Kaasa
The differential equation that describes this process can be derived form the three equations:
q S = a + bp ,
q D = c dp ,
= k (q D q S ),
dp
dt
where a, b, c, d > 0 .
When replacing the demand and supply in the last equation with the expressions from the first two
equations, we get:
= k (c dp + a bp ) .
dp
dt
Regrouping gives us a more familiar form:
+ k (b + d ) p = k (a + c ) .
dp
dt
dp
The equilibrium value can be found using a formula or by replacing: = 0 and p = p * . The
dt
result is the same as found before:
k (a + c ) a + c
p* = = .
k (b + d ) b + d
The general solution then is:
p(t ) = p * +[ p(0) p *] e k (b + d )t .
Next, the question arises about the values of parameters that are needed for the price converging to
the equilibrium price. As we know already, in order the equilibrium to be stable, the parameter a in
dy
the equation + ay = b has to be positive. Hence, in our example, the stability condition is
dt
k (b + d ) > 0 in that case the absolute value of complementary function decreases
( e k (b + d ) t 0 ). Since b, d > 0 , the process that takes place is converging and the equilibrium is
stable, if the parameter k is positive.
that a positive value of k ( k > 0 ) means that in the case of a positive excess demand ( q D q S > 0 )
dp
also known as deficit, price increases over time ( > 0 ). Hence, the market stabilizes itself, if
dt
price starts to increase because of the deficit. As the price increases, the quantity demanded
decreases and the quantity supplied increases the difference between the demand and supply
starts to decrease until at the equilibrium price they are equal (see Figure 9.2).
118
DIFFERENTIAL EQUATIONS Anneli Kaasa
p
S
p*
p0
excess demand D
q
q*
Figure 9.2. Possibilities fro stabilizing the market when price increases
119
DIFFERENTIAL EQUATIONS Anneli Kaasa
dy dy
dt dt
y y
d dy
dt ( y *) < 0 .
dy
In the opposite case, if the slope of the phase line in the equilibrium point is positive:
d dy
dt ( y *) > 0 ,
dy
the phase line is positively sloping and the equilibrium is unstable.
Let us continue the example of the market equilibrium model with the differential equation:
+ k (b + d ) p = k (a + c ) .
dp
dt
First, we have to transform the equation to the form:
= k (b + d ) p + k (a + c ) .
dp
dt
As it is a linear equation, the phase line is a straight line. We know that b, d > 0 , so the phase line
is determined by the sign of the parameter k. Previously we concluded that the market equilibrium
is stable, if k is positive. In the case of positive k the slope of the phase line is negative:
k (b + d ) < 0 and the phase line is thus a negatively sloping straight line with positive vertical
intercept k (a + c ) (we also know that a, c > 0 ). This phase line is depicted on Figure 9.4. We can
see that in that case the process indeed converges and the equilibrium is stable.
120
DIFFERENTIAL EQUATIONS Anneli Kaasa
dp
dt
= k (a + c ) k (b + d ) p
dp
k (a + c ) dt
p* p
Figure 9.4. Phase diagram showing the time path of price ( k > 0 )
dy
As another example, given differential equation = 8 y 2 y 2 , the equilibrium values can be
dt
dy
found by replacing y = y * and = 0:
dt
4 y * y *2 = 0 ,
y * (4 y *) = 0 .
resulting in two values: y1 * = 0 ja y 2 * = 4 .
For analysing stability, we take derivative:
d dy
( )
dt = d 8 y 2 y = 8 4 y ,
2
dy dy
and find its values in equilibrium points:
d dy
dt (0) = 8 4 0 = 8 > 0 ,
dy
d dy
dt (4) = 8 4 4 = 8 < 0 .
dy
Since the derivative for the equilibrium value y1 * = 0 is positive, then the phase line is positively
sloping in that point and equilibrium, thus unstable. For the other equilibrium value y 2 * = 4 the
derivative is negative and the phase line thus negatively sloping. Hence, y 2 * = 4 is an equilibrium
value that is stable: the value of y converges around that equilibrium over time. These conclusions
are illustrated by the phase diagram on Figure 9.5.
dy
dt
0 4 y
121
DIFFERENTIAL EQUATIONS Anneli Kaasa
Again, although the phase diagrams allow us to analyse more complex difference equations, at the
same time they can add an additional view to the solutions of quite simple equations, such as the
differential equations that describe the change of the value of a single amount of money and the
dy dy
annuity over time: = ry and = ry + B (see Chapter 10.3), respectively. The corresponding
dt dt
phase lines can be seen on the Figure A9.1.
dy dy
= B + ry
dt dt
dy
= ry
dt
B
0
y
Figure A9.1. Phase diagram for a single amount of money and for the annuity in the case of
continuous time
Again, although financial mathematics does not discuss terms like equilibrium or stability, this
phase diagram illustrates that the value of money increases over time acceleratingly.
The dynamic analysis is mostly used in macroeconomics. The number of the difference or
differential equations included in the model depends on how many variables are included in the
model as changing over time. When, for example, two variables changing over time are included
into the model, then the model includes two differential equations forming an equation system. In
this case the principles of drawing the phase diagrams are somewhat different.
In the case of two time-dependent variables (x and y) and two differential equations ( x = f ( x, y )
and y = g ( x, y ) ) the phase diagram is depicted on a xy -plane (see Figure A9.2 in the following
example). First, the demarcation curves are drawn. A demarcation curve x = 0 includes all pairs of
the values of x and y, at which x has no tendency to change and the other demarcation curve y = 0
includes all those pairs at which y has no tendency to change. Hence, a demarcation curve covers all
possible optimal solutions for a particular variable. The intersection point of these demarcation
curves can then be viewed as the equilibrium point of the model. For drawing, the demarcation
curves are expressed in the form: y = f (x) ).
When crossing the demarcation curve, hence, the sign of x or y , respectively, should change. This
y
can be marked on the figure with the help of plus and minus signs. If > 0 , then when moving in
y
the direction of the increase in y (moving up when y is depicted on the vertical axis) y should be
increasing and hence, before the line y = 0 y is negative (y decreases over time and the direction
of the phase path is opposite to the y-axis (down)) and after the line y = 0 y becomes positive and
y
the phase path goes to the same direction as the y-axis (up)). If < 0 , the opposite is true.
y
Analogically the direction of the phase path can be determined relative to the x-axis. If for example
x
> 0 , then when moving in the direction of the increase in x (moving to the right when x is
x
depicted on the horizontal axis), before the line x = 0 the direction of the phase path is opposite to
122
DIFFERENTIAL EQUATIONS Anneli Kaasa
the x-axis (to the left)) and after the line y = 0 to the same direction as the x-axis (to the right). If
x
< 0 , the opposite is true again.
x
The demarcation lines divide the xy-plane into four segments. In order to describe the direction of
movements of the two variables in these segments, small arrows describing the direction of
movements are marked. On the basis of these arrows now the phase paths can be sketched starting
from different initial points in different segments. The phase path may converge to the equilibrium
point, but can also diverge from it.
y x
Let us look at an example, where > 0 and > 0 . Then before the line y = 0 y is negative (y
y x
decreases over time, minus signs below this line) and after this line y is positive plus signs above
this line). Before the line x = 0 y time and the direction of the phase path is opposite to the y-axis
(down)) and after the line y = 0 x is negative (minus signs on the left of this line) and after this
line positive (plus signs to the right on this line). Then the arrows and possible time paths in each
segment can be sketched as shown on Figure A9.2. In this case whether the equilibrium is reached
depends on the starting point.
+
y(t) y = 0
-
-
+ x = 0
y*
-
+
+
-
x* x(t)
Figure A9.2. An example of a phase diagram in the case of two differential equations
Let us look at an example of the IS-LM model for a closed economy covering both the market for
goods and the money market. Let the goods market and capital market be described by the
following equations. In the equilibrium the national income is equal to the total expenditures:
Y (t ) = E (t ) ,
that, in turn is divided between consumption C and investments I and government expenditures G
(foreign sector is left out here):
E (t ) = C (t ) + I (t ) + G .
Consumption is assumed to consist of the autonomous consumption AC and a certain portion
(expressed by the propensity to consume c) of the income after taxes (g tax rate):
C (t ) = AC + c(1 g )Y (t ) , 0 < c, g < 1 , thus 0 < c(1 g ) < 1 ,
Investments are assumed to consist of the autonomous investments AI and an interest-rate
dependent part (depends negatively on the interest rate i):
I (t ) = AI + ir (t ) , i < 0 .
123
DIFFERENTIAL EQUATIONS Anneli Kaasa
Let the money market be described by the following equations. In the equilibrium the demand for
money is equal to the supply of money:
L(t ) = M ,
where the money supply is exogenous (time-independent) and the demand for money consist of the
transactional demand that depends positively on the national income and the speculative demand
that depends negatively on the interest rate:
L(t ) = yY (t ) + kr (t ) , y > 0 , k < 0 .
Let us assume that the change of the national income over time depends positively on the excess
demand on the goods and capital market and the change of the interest rate over time depends
positively on the excess demand on the money market:
Y = a(E (t ) Y (t )) , a > 0 ,
r = b(L(t ) M ) , b > 0
or
Y = a[C (t ) + I (t ) + G Y (t )] = a[AC + c(1 g )Y (t ) + AI + ir (t ) + G Y (t )] ,
r = b[ yY (t ) + kr (t ) M ] .
When solving this system the equilibrium values of the national income and the interest rate can be
found. However, since the solutions are expressions of a large number of parameters, the
interpretation could be quite troublesome and an alternative method here is to depict the situation in
a phase diagram.
Here, the phase diagram is drawn on the Yr -plane. First, the demarcation curves Y = 0 and r = 0
are drawn. (When these curves have a common point, it is the equilibrium point of this system
both variables have reached their equilibrium values). For that we set Y and r equal to 0:
Y = a[AC + c(1 g )Y (t ) + AI + ir (t ) + G Y (t )] = 0 (IS curve),
r = b[ yY (t ) + kr (t ) M ] = 0 (LM curve).
Basically these equations set the equilibrium conditions for the goods and capital market and the
money market, since for these conditions to be true in both cases the demand has to be equal to the
supply (excess demand is equal to 0). The Y = 0 is also known as the IS curve and r = 0 as the LM
curve.
Since r is usually depicted on the vertical axis and Y on the horizontal axis, then for drawing these
conditions have to be transformed into a form r = f (Y ) ).
Solving both conditions for the interest rate gives:
() ()
))) ))) )) ))
)()
) ) ) )) ()
Knowing the signs of the parameters and independent variables (AC, AI, G and M can be expected
to be positive) we can conclude that Y = 0 or the IS-curve is a negatively sloping straight line with
a positive vertical intercept and r = 0 or the LM-curve is a positively sloping straight line with a
negative vertical intercept (see Figure A9.3).
Now we can find the direction of the change in r and Y on both sides of the demarcation curves. The
derivatives:
Y r
= a[c(1 g ) 1] < 0 and = bk < 0 .
Y r
Hence, for both variables as the value of the variable increases, the derivative with respect to time
changes from positive to negative. We mark plus signs ( r > 0 ) below the r = 0 curve and minus
signs ( r < 0 ) above this curve. Also, we mark plus signs ( Y > 0 ) to the left of the Y = 0 curve and
minus signs ( Y < 0 ) to the right of this curve. Also, we can draw small arrows describing the
direction of movements in different segments.
After that we can sketch a hypothetical phase path starting from a hypothetical initial point A, in
order to determine whether the equilibrium can be reached or not. In the case depicted on Figure
A9.3 the phase path is going in circles counter-clockwise. When in doubt, one can first sketch the
diagonal (dotted) lines parallel to the average direction of the arrows in each segment.
r(t) r = 0
IS
-
+ - +
r*
- A
+ Y = 0
LM
+ -
Y* Y(t)
125
DIFFERENTIAL EQUATIONS Anneli Kaasa
When, for example, the government decides to decrease the money supply, then the decrease in M
M M
also decreases the absolute value of . Since the vertical intercept of the LM-curve is
k k
negative, then the vertical intercept is shifted upwards. As the change in M does not change the
slope of the LM-curve, the change can be depicted as a parallel shift upwards (or to the left, see
Figure A9.4).
Comparatively, it can be said that the new equilibrium is in the intersection point of the new LM
curve and IS-curve. How this point is reached, is described by the phase path. For this, now the new
LM curve has to be viewed as a demarcation curve r = 0 . The former equilibrium is now just the
initial point.
r(t)
IS LM'
LM
(2)
(1)
Y(t)
Figure A9.4. The IS-LM model after the change in the money supply
Analogically, the impact of other independent variables or parameters can be investigated, finding
the corresponding changes in the curves. If the independent variable or parameter that changes is
included in an expression for the slope, this curve has to be rotated accordingly (keeping the vertical
intercept in place), if the expression of the vertical intercept is changed, then there has to be a
parallel shift. (It may also happen that both changes occur). The new equilibrium in a new
intersection point gives the new equilibrium values of Y and r.
It has to be noted that when no parameter or independent variable has changed, the equilibrium
point cannot change as well. In the case of temporary deviations (as the point A on Figure A9.3) the
question is about converging to or diverging from the initial equilibrium point.
126
10. EXPONENTIAL FUNCTIONS
b
In economics often the natural exponential function is used, where the base is the number e:
y = ex .
Wide usability is reasoned by the simplicity of taking derivative. The derivative of an exponential
function is:
( )
a x = a x ln a .
If the base is e, then:
( )
e x = e x ln e = e x .
That makes many transformations and calculations much simpler.
Let us consider the base of natural exponential function. The value of the number e is related to the
n
1
function f (n ) = 1 + . If n increases, the value of this function also increases, but deceleratingly:
n
1
1
if n = 1 , then f (1) = 1 + = 2 ;
1
127
EXPONENTIAL FUNCTIONS Anneli Kaasa
2
1
if n = 2 , then f (2 ) = 1 + = 2,25 ;
2
3
1
if n = 3 , then f (3) = 1 + = 2,37 etc.
3
It is known that as n , then the value of this function approaches the number e:
n
1
lim1 + = e = ,7188
n
n
In economics the natural logarithmic function y = ln x is also used widely. Often the aim is to
bring a non-linear function to a linear form to use linear (matrix) algebra. For example, the Cobb-
Douglas type function is non-linear:
z = c xa yb .
When taking natural logarithm from both sides:
ln z = ln c + a ln x + b ln y ,
the resulting function is linear with respect to the logarithms ln x and ln y or log-linear.
In microeconomics, often the utility function is transformed, when some characteristics of the utility
function are aimed (it can be done, because there are no units for measuring utility in an absolute
scale and utility is measured on a relative scale: only the order of consumption bundles is important
and not the absolute value of utility). For example, in the case of the Cobb-Douglas type utility
function u = q1 q 2 the transformation often used is v = ln u = ln q1 + ln q 2 .
Logarithmic transformations are also often used in econometrics, because estimating linear
relationships is much simpler than estimating non-linear relationships. In addition, often the
elasticity (of the variable y with respect to the variable x) is of interest: the aim is to find the relative
change in y brought about by an 1% change in x. This can be found by finding a ratio that describes
how the natural logarithms of the variables ( ln x and ln y ) relate to each other. A derivative
d (ln y )
estimates the absolute change in ln y when ln x increases by one unit. At the same time, it
d (ln x )
gives the elasticity mentioned before, because we can make the following transformations in the
formula for this elasticity:
dy x dy 1 dy 1 1 dy d (ln y ) 1
= x = = =
dx y dx y dx y 1 dx dy d (ln x )
x dx
dy d (ln y ) dx d (ln y )
= = .
dx dy d (ln x ) d (ln x )
When a relationship y = ax is estimated in econometrics, then the coefficient a estimates the
y
quotient that describes how y changes (in units) as x increases by one unit. Analogically, when
x
(ln y )
a relationship ln y = b ln x is estimated, then the coefficient b estimates the quotient that
(ln x )
describes how ln y changes (in units) as ln x increases by one unit. At the same time b estimates
y y
the elasticity : it describes how y changes (in percents) as x increases by 1%.
x x
128
EXPONENTIAL FUNCTIONS Anneli Kaasa
129
EXPONENTIAL FUNCTIONS Anneli Kaasa
dv
1
(1000e ) t
1
rv = dt = 2 t
= .
t
v 1000e 2 t
Hence, the growth rate depends on time (is different in different time points) and as t
1
0 , so the growth is decelerating.
2 t
In order to simplify calculations, sometimes a following method is used: first a natural logarhitm is
taken from a function and then a derivative is taken from that with respect to time. This is reasoned
by the following conversion:
dy
ry = dt = 1 dy .
y y dt
1
The expression can be viewed as a derivative of ln y with respect to y. Hence, it can be written:
y
1 dy d (ln y ) dy d (ln y )
ry = = = .
y dt dy dt dt
d (ln y )
Hence, for calculating growth rate a formula ry = can be used as well.
dt
For example, in the case of y = a e ct the logarithm is
ln y = ln a + ct ln e = ln a + ct ( ln e = 1 )
and
d (ln y )
ry = = 0 + c = c.
dt
Given v = 1000e t
the logarithm is
ln v = ln 1000 + t ln e = ln 1000 + t and
d (ln v ) 1
rv = = .
dt 2 t
Growth rate is used in the case of functions like y = f (t ) (t is time) and it describes the relative
change of y per time unit (absolute change of time!). Growth rate can be calculated as
dy
dy 1
ry = dt = . We can also first find ln y and then take a derivative of it with respect to t,
y dt y
1 dy d (ln y ) dy d (ln y )
because: ry = = = .
y dt dy dt dt
dy 1
!!! ry = means that as t increases by 1 unit, y (depending on the sign the sign of ry being
dt y
positive of negative, respectively) increases/decreases approximately by ry 100 %. !!! NB! If
we are dealing with economic-related problems, then we should not use x and y, but the variables in
our problem instead.
130
EXPONENTIAL FUNCTIONS Anneli Kaasa
dy 1 dy 1
From the formula ry = or, if time t as an argument is denoted with x: ry = we can see
dt y dx y
dy
that in a sense the formula for a growth rate is placed between the formulas of the derivative
dx
dy x dy
and elasticity . When the derivative describes the chance in y in units per one unit change
dx y dx
dy y
in x and the elasticity describes the relative chance in y in percents per 1% change in x, then
dx x
dy y
the growth rate describes the relative change in y in percents per one unit change in x. The
dx
interpretation of these three indicators is similar in a sense, but there are also important differences
that are shown on Figure A10.1.
dy y
derivative y = 1 unit y units
dx x
dy x y y
elasticity e= e %
dx y x x 1%
dy 1 y y
growth ry = 1 unit ry 100 %
rate dx y x
dy 1
It is worth noting that if the growth rate is multiplied by the value of the argument x , we get
dx y
dy x
the elasticity :
dx y
ry x =
dy 1 dy
and if the growth rate is multiplied by the value of the function y , we get the derivative :
dx y dx
ry y = y .
We know that the derivative gives us the marginal function MY and elasticity is equal to the
dy x dy y MY
quotient of the marginal and average functions: = = = . Continuing the same
dx y dx x AY
row, the growth rate is equal to the quotient of the marginal function and the function itself:
dy 1 dy MY
ry = = y= .
dx y dx y
131
EXPONENTIAL FUNCTIONS Anneli Kaasa
132
EXPONENTIAL FUNCTIONS Anneli Kaasa
r by 100, we get the interest rate in percents. For example, if r = 0,12 , it can be said that the interest
rate is 12% .
In the case of simple interest, in every period the sum r y 0 is added to the initial sum and the result
after t periods is:
y t = y 0 + tr y 0 .
However, compound interest is used more often. In that case the sum that is already accumulated
serves as a basis for interest calculations. Thus, if the sum increases by r y 0 in the first period:
y1 = y 0 + r y 0 = (1 + r ) y 0 ,
then the interest in the second period is calculated based on the final sum in the first period: r y1 .
The final sum of the second period is then y 2 = (1 + r ) y1 . Hence, the time path of the value of
money is described by a simple difference equation:
y t = (1 + r ) y t 1 .
The solution can be found as:
= 0 and y t = 0 + ( y 0 0)(1 + r ) = y 0 (1 + r ) .
0
y* =
t t
1 (1 + r )
If we denote the initial sum ( y 0 ) with A and its future value ( y t ) with V, we can write the formula
for finding the future value as:
V = A(1 + r ) .
t
Often time is measured in years and then t stands for the number of years, interest rate r is then
annual interest rate. Sometimes however, compounding can take place many times a year, for
example quarterly. If the interest is calculated m times a year, then the interest rate of one
r
calculation must be and there are m times t periods:
m
mt
r
V = A1 + .
m
If we increase the frequency of compounding to infinity ( m ), it is called continuous
compounding. The formula for this case can be derived as follows. First, we can multiply the
r
power mt by the number 1 in a form :
r
mtr
r r
V = A1 + .
m
r 1
After replacing = and knowing that when exponentitating an exponential function, the
m m
r
powers are multiplied, we can write:
rt
m
r
V = A1 +
1
.
m r
133
EXPONENTIAL FUNCTIONS Anneli Kaasa
In the case of continuous compounding m , hence, the future value in the case of continuous
compounding is the limit value of this expression when m :
rt
m
r
V = lim A1 +
1
.
m r
m
m
1
r n
m 1
If we replace = n , we can replace the expression 1 + with 1 + . As mentioned
r m n
r
before, the limit value of the latter as n is the number e. Here, the assumption is m , but
m
as r can be viewed as a positive constant, we can say that if m , then and thus
r
n . Hence, the formula can be written as:
rt
1 n
V = lim A1 + = Ae rt .
n
n
Thus, the formula for the future value in the case of continuous compounding is V = Ae rt , where r
is the growth rate of the value of money.
This formula can also be derived using the concept of derivatives (assuming infinitely small
changes) and a differential equation. Since in the case of continuous compounding the
compounding takes place not every period, but after every infinitely small change in time, we can
use the concept of derivatives. The derivative of the value of money with respect to time estimates
the change in the value of money brought about by a unitary change in time. In the case of
continuous compounding this change per time unit is equal to the interest (value of money times the
interest rate):
dy
= ry .
dt
This is a differential equation that describes the situation under consideration here. As a remark:
when rearranging the difference equation used in the case of discrete time, we can see the
dy dy
similarity: y t = y t y t 1 = r y t 1 . Here, estimates this change: y t .
dt dt
After transforming the differential equation to the general form:
dy
ry = 0 ,
dt
we can find the equilibrium value:
0
y* = =0
r
and the solution:
y (t ) = 0 + [ y (0 ) 0] e rt or y (t ) = y (0 ) e rt .
When replacing the notations, we get the formula obtained before:
V = Ae rt .
134
EXPONENTIAL FUNCTIONS Anneli Kaasa
For example, let the deposited sum be 10 000 . If the interest rate is 10% or r = 0,1 , then in the case
of simple interest, the future value after five years is:
V = A + tr A = A(1 + tr ) = 10 000(1 + 5 0,1) = 15 000 .
In the case of compound interest with compounding once in a year, the future value after five years
is:
V = A(1 + r ) = 10 000(1 + 0,1) 16 105 ,
t 5
= V (1 + r ) .
V t
A=
(1 + r )
t
Analogically, for example in the case of continuous compounding, the present value:
135
EXPONENTIAL FUNCTIONS Anneli Kaasa
A = Ve rt .
Sometimes compound interest is combined with regular (constant) payments, for example when
paying back a loan, while interests are calculated every period. When both the number of periods
that are needed to finish the payments, and the final sum with all the interests are known, it is
possible to find out a constant sum that has to be paid every period (that can be a year, but other
types of periods are also possible). This kind of series of equal payments are called annuity.
Let there be a fixed amount of payments B and the initial value y 0 = 0 ; one example can be a
savings deposit, where every period a fixed sum is added to the deposit. In addition to the fixed
sum, the interest is also added to the value of previous period:
y1 = (1 + r ) y 0 + B = B ,
y 2 = (1 + r ) y1 + B ,
y 3 = (1 + r ) y 2 + B etc.
In general:
y t = (1 + r ) y t 1 + B .
Solving this difference equation gives:
B B
= and y t = + y 0 + (1 + r ) .
B B
y* =
t
1 (1 + r ) r r r
Knowing that the initial value is 0, we can find the formula for the future value of annuity:
V =
B
r
[ ]
(1 + r )t 1 .
Sometimes the present value of annuity is of interest. For example one needs a loan and knows the
sum that can be paid every period. A formula for the present value of annuity can be derived
analogically, with the help of difference equation, but we can also discount the future value of
V
annuity. We know that A = , hence:
(1 + r )t
B (1 + r ) 1 B 1
t
A= or A = 1 .
r (1 + r )t r (1 + r )t
A formula for calculating the size of payments B can be found by solving one or another formula
for B:
1 (1 + r )t
B = Vr and B = Ar .
(1 + r ) 1 (1 + r ) 1
t t
For example, let us assume that a loan of 100 000 euros is desired. Bank gives the loan for 10 years
with an annual interest of 15%. If annuity is used, then the sum that has to be paid every year is
(1 + 0,15)10
B = 100 000 0,15 19900 euros.
(1 + 0,15) 1
10
When the interest is compounded continuously for the annuity, then analogically to the derivation
of the formula for continuous compounding, we can write
dy
= ry + B ,
dt
136
EXPONENTIAL FUNCTIONS Anneli Kaasa
V =
r
[
B rt
e 1 . ]
The value of wine depends on time according to the function v = 1000e t . The present value of the
possible gain from selling wine in future then depends on time as follows:
v n = 1000e t e 0,1t = 1000e t 0 ,1t
.
For finding the time, when this value is maximal, we have to take derivative and set it equal to 0:
dv n 1 t 0 ,1t
!
= 1000 0,1 e =0.
dt 2 t
Since (always) e x > 0 and 1000 0 , it has to be that:
1
0,1 = 0 ,
2 t
from here t = 25 . Hence, it is optimal to sell the wine after 25 years.
When looking at the problem from another side, we can say that it is useful to keep the goods if the
growth rate of their value is higher than the growth rate of money (bank interest rate). If they
become equal, it is optimal to sell goods and deposit the gains. Hence, in our example the condition
for the optimal time is:
1
rv = 0,1 or = 0,1 .
2 t
As expected, this leads to the same result as before.
137
LITERATURE
Anthony, M., Biggs, N. Mathematics for economics and finance. Methods and modelling.
Cambridge: Cambridge University Press, 1996.
Chiang, A. C. Fundamental methods of mathematical economics. 3rd ed. McGraw-Hill, 1984.
Dowling, E. T. Schaums outline of theory and problems of introduction to mathematical
economics. 3rd ed. Shaums Outline Series: McGraw-Hill, 2001.
Kaasik, ., Abel, M. Eesti-inglise-vene matemaatikasnastik. Tartu: T, 1995.
Jrime, E., Velsker, K. Matemaatika ksiraamat IXXI klassile. Tallinn: Valgus, 1987.
Soper, J. Mathematics for Economics and Business. An interactive Introduction. Oxford; Malden:
Blackwell Publishers, 1999.
138
APPENDICES Anneli Kaasa
APPENDICES
Appendix 1
The second-order differential d 2 z of the function z = f ( x, y ) can be rewritten as follows. First we
find the total differential of the first-order differential dz according to the formula of total
differential:
(dz ) (dz )
d 2 z = d (dz ) = dx + dy .
x y
Then we replace the first-order differential dz with the formula of total differential of dz :
( f x dx + f y dy ) ( f x dx + f y dy )
d 2z = dx + dy
x y
and take derivatives of the expressions in brackets with respect to x and y, respectively, ( dx and dy
are constants in this context):
d 2 z = ( f xx dx + f yx dy )dx + ( f xy dx + f yy dy )dy =
= f xx dx 2 + 2 f xy dx dy + f yy dy 2 .
f xy2 f xy2
= f xx dx + 2 f xy dx dy +
2
d y + f yy dy
2 2
d2y =
f xx f xx
f xy f xy2 f xy2 2
= f xx dx 2 + 2 dx dy + 2 d 2 y + f yy d y =
f f f
xx xx xx
f xy f yy f xx f xy2 2
2
= f xx dx + dy + d y .
f xx f xx
Since a square is always positive, for any values of dx and dy (unless both are equal to 0):
Appendix 2
Analogically to the optimization of a function with two arguments without constraint, in the case of
a constraint the function has to be increasing with respect to both arguments before the maximum
139
APPENDICES Anneli Kaasa
point and decreasing after that. Before the minimum point, the function should be decreasing and
increasing after that. Hence, in the maximum point the differential of the function has to change
from positive to negative, and thus, the second-order differential has to be negative: d 2 z < 0 . In the
minimum point it has to be positive d 2 z > 0 . At that it has to be taken into account that the value of
the constraint function has to be constant: dg = 0 .
Because of the constraint, dx and dy are related and thus d 2 z and the second-order conditions are
different. Let us make some transformations to see, how d 2 z is expressed now with the help of
partial derivatives.
In order the constraint to be satisfied, dx and dy have to be related in a following way:
dg = g x dx + g y dy = 0 .
While without any constraint dx and dy can have any values, now when choosing some value of
dx , there is a corresponding dy :
gx
dy = dx .
gy
Earlier, when looking at the case without constraint, the formula for the second-order differentials
was:
(dz ) (dz )
d 2 z = d (dz ) = dx + dy =
x y
( f x dx + f y dy ) ( f x dx + f y dy )
= dx + dy .
x y
Now, when taking derivatives with respect to x and y, dy also has to be viewed as a variable that
depends on g x and g y and hence, through them also on x and y. When using the rule of the
derivative of a product and regrouping we get:
dy
d 2 z = f xx dx + f yx dy + f y dx +
x
dy
+ f xy dx + f yy dy + f y dy =
y
dy dy
= f xx dx 2 + 2 f xy dx dy + f yy dy 2 + f y dx + dy .
x y
The difference from the case without constraint lies in the last term. The expression in brackets is
the second-order differential of y ( d 2 y ), hence:
d 2 z = f xx dx 2 + 2 f xy dx dy + f yy dy 2 + f y d 2 y .
With the help of analogical transformations we can get the second-order differential of the
constraint function (this second-order differential has to be equal to 0, as the second-order
differential of constant b is 0):
d 2 g = g xx dx 2 + 2 g xy dx dy + g yy dy 2 + g y d 2 y = 0 .
140
APPENDICES Anneli Kaasa
g xx 2 g xy g yy 2
d2y = dx 2 dx dy dy
gy gy gy
and substitute the result to the formula of d 2 z and regroup:
d 2 z = f xx dx 2 + 2 f xy dx dy + f yy dy 2 +
g g xy g yy 2
+ f y xx dx 2 2 dx dy dy =
g g g
y y y
fy fy
= f xx g xx dx 2 + 2 f xy g xy dx dy +
gy gy
fy
+ f yy g yy dy 2 .
gy
fy
The quotient is actually a Lagrange multiplier . Hence, the expressions in brackets appear to
gy
be the second-order derivatives of the Lagrangian function and we can write more briefly:
d 2 z = L xx dx 2 + 2 L xy dx dy + L yy dy 2 .
For a maximum point, this d 2 z has to be negative and for a minimum positive. Let us make some
transformations to get the conditions that determine the sign of d 2 z . If we replace dy with an
g
expression found before x dx and regroup, we get:
gy
2
g g
d 2 z = L xx dx 2 + 2 L xy dx x dx + L yy x dx =
g g
y y
gx 2 g2
= L xx dx 2 2 L xy dx + L yy x2 dx 2 =
gy gy
( ) dx
2
= L xx g y2 2 L xy g x g y + L yy g x2 .
g y2
dx 2
As is positive (a square is always positive), then the sign of d 2 z is determined by the sign of
g y2
L xx g y2 2 L xy g x g y + L yy g x2 .
It appears that this expression, when multiplied by 1 , is equal to a following determinant :
0 gx gy
gx L xx L xy = 2 L xy g x g y L xx g y2 L yy g x2 .
gy L yx L yy
Hence, if this determinant is positive, it is a maximum and if negative, then a minimum.
141
APPENDICES Anneli Kaasa
Appendix 3
Here, the formula for finding the time-dependent part of the general solution for a first-order linear
difference equation complementary function is derived. If we denote the complementary
function with z t , we can express the general solution of the difference equation as a sum of the
equilibrium value and the complementary function: y t = y * + z t . In the period t 1 the general
solution is then y t 1 = y * + z t 1 . Now, we can replace yt and y t 1 with these expressions in the
difference equation y t = ay t 1 + b :
y * + z t = a ( y * + z t 1 ) + b .
Next, we can subtract y * from both sides. At that, from the right side we subtract y * in the form
of ay * + b (we know that y* = ay * + b ). The result shows how the complementary function
depends on its value in previous period:
z t = a z t 1 .
When moving back we can also write: z t 1 = a z t 2 , z t 2 = a z t 3 and so on. Hence, the
complementary function can be expressed through its value in two periods ago:
z t = a z t 1 = a (a z t 2 ) = a 2 z t 2 ,
and thus:
z t = a z t 1 = a z t = = a t z t t
or
zt = a t z0 .
In the 0-period the general solution is y 0 = y * + z 0 , so z 0 can be found as a difference between the
initial value and the equilibrium value of y: z 0 = y 0 y * . Hence, the formula for the
complementary function is:
z t = ( y 0 y *) a t .
Appendix 4
Here, the formula for finding the complementary function of the general solution for a first-order
linear differential equation. If we denote the complementary function with z (t ) , we can express the
general solution of a differential equation as:
y (t ) = y * + z (t ) .
dz
The derivative that describes how the complementary function behaves over time, also
dt
describes how y behaves over time. That can be shown by differentiating both sides of the equation
y (t ) = y * + z (t ) (derivative of a constant y * is 0):
dy dz
= 0+ .
dt dt
In order to bring out the time-dependent part of the solution, we can take derivatives of both sides of the
dy
differential equation + a y = b with respect to time:
dt
142
APPENDICES Anneli Kaasa
d2y dy
2
+a = 0.
dt dt
dz dy
As we know that = , we can write:
dt dt
d 2z dz
2
+a = 0.
dt dt
To find z (t ) , we integrate this equation with respect to time:
d 2z dz
dt 2 dt + a dt dt = 0 dt .
The result is (as we want to find the time-dependent part of solution, the constants of integration are
left out):
dz
+ az = 0 .
dt
After transforming we get:
1 dz
= a .
z dt
1 dz
Next, we can integrate this equation with respect to time: z dt dt = a dt . The left-side integral
1
can also be written as z d z . According to the rules of integration:
ln z = at + C .
To find z, we can use the opposite operation of logarithming: e ( ) . In order the result to be more
clear, we can make some transformations. We can replace the constant of integration C with ln A
( A is constant as well). Without changing the essence of the equation, the expression at can be
multiplied by the number 1 in the form of ln e . The equation then gets a form:
ln z = ln A at ln e .
According to the rules of logarithms:
A
ln z = ln .
e at
Hence:
z (t ) =
A
at
= A e at .
e
If t = 0 , then the complementary function is z (0 ) = A e a0 = A 1 = A and the general solution is
then y (0 ) = y * + z (0 ) = y * + A . We can see that the constant A can be viewed as the difference
between the initial and equilibrium values: A = y (0 ) y * .
Hence, the formula for the complementary function is:
z (t ) = z (0) e at .
143