Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
J.M. SCHUMACHER∗
Abstract. The paper provides a review of the basics of financial engineering, with
a few examples. We emphasize connections with control theory in a broad sense rather
than with stochastic control theory in particular, and the reader is not assumed to be
versed in stochastic processes. After a discussion of the main methods of financial risk
management, a state-space framework for modeling financial markets is presented and
used to explain crucial concepts of financial engineering such as absence of arbitrage,
market completeness, hedging, and the Black-Scholes partial differential equation. Two
brief case studies are presented: the construction of an indexed bond, and the hedging
of long-term contracts for delivery of oil.
which real-valued variables follow continuous paths are typical for financial
engineering, although substantial work has also been done on discrete-time
models and on models that do allow jumps.
Two important methods of reducing risk are diversification and hedg-
ing. These two ideas are intertwined and it would be difficult to make a
sharp distinction, but nevertheless one may say that while diversification
focuses on the joint characteristics of assets, hedging concentrates on the
risk factors behind asset prices. Both methods may be developed in a
static as well as in a dynamic setting. Of course the dynamic settings are
more close to control theory; however, to illustrate the two notions, let us
consider some simple static examples.
First, let us consider a typical diversification scheme. Suppose that
capital can be invested in two assets, one of which is more risky than the
other, and suppose we try to minimize uncertainty. Is it then optimal to
invest all capital in the safest asset? To make this problem precise, suppose
that the values of the two assets at the end of the investment period (say,
X1 and X2 ) are jointly normally distributed stochastic variables. Let the
expected values of X1 and X2 be µ1 and µ2 respectively, and let their
variances be σ12 and σ22 , with σ1 < σ2 ; so the second asset is the more
risky one. Finally, let the correlation coefficient of X1 and X2 be ρ. In
this very simple setting, the investment problem may be formulated as the
problem of finding two real numbers α1 and α2 , with α1 ≥ 0, α2 ≥ 0, and
α1 + α2 = 1, such that the variance of α1 X1 + α2 X2 is minimized. This is
a quadratic optimization problem that can readily be solved. It turns out
that the optimal solution is to invest everything in the safest asset if
σ1
ρ ≥ .
σ2
If the above inequality is not satisfied, then the minimum is reached for
a nontrivial combination of the two assets. In this case there is a diversi-
fication benefit from investing in both assets. Note that such a benefit is
always present when the two assets are negatively correlated, and even, as
long as σ1 is positive, when they are uncorrelated. To illustrate the size
of the effect in a numerical example, assume σ1 = 1, σ2 = 2, and ρ = 0.
With these parameter values, the optimal solution is to invest 80% in the
safe asset and 20% in the risky asset. The variance obtained with this
allocation is 0.82 · 1 + 0.22 · 4 = 0.64 + 0.16 = 0.8. If everything would have
been invested in the safest asset, the variance of the portfolio value would
have been 1.
There are many ways to take this simple story further, such as: con-
sider more than two asset categories; do not only take variance into ac-
count, but also expected value; allow non-normal distributions and use risk
measures other than variance; consider multiple periods or even continuous-
time models, and allow reallocation through time; drop the restriction that
portfolio weights should be nonnegative, so allow “short” positions; take
4 J.M. SCHUMACHER
noise
dynamical
system
observa−
assets liabilities
tions
controlled
inputs portfolio −
outputs
1 This is an example of an arbitrage argument; the validity (to a high extent) of such
take place.
6 J.M. SCHUMACHER
where the dot denotes vector product. This entails a single linear constraint
on the vector of control inputs at each time t. One can rewrite the above
using the forward difference operator:
∆(ut · Yt ) = ut · (∆Yt ).
Vt := ut · Yt .
In discrete time, and under the condition (2.1), the change in portfolio
value between time t and time t + 1 is given by
Vt+1 − Vt = ut · (Yt+1 − Yt )
or in ∆ notation
∆Vt = ut · ∆Yt .
T
X −1
(2.3) VT = V 0 + ut · ∆Yt .
t=0
but this needs to be interpreted with care, in order to give due weight
to the observation that asset prices are highly irregular. The following
section presents some modifications to the usual calculus rules which serve
to express the effects of irregularity.
CONTROL AND FINANCIAL ENGINEERING 7
We evaluate the function φ0 (x(·)) here at the left end of the interval [ti , ti+1 ]
rather than at any other point, because the intended application is to sums
of the form (2.3) where it is essential that evaluation of the integrand takes
place at the beginning of the time increment. Under this rule, the formula
(3.2) may not provide a good approximation if x(·) is a highly irregular
function of time. This is illustrated in Fig. 2, where the first panel shows
a continuous but fairly irregular function, and the last panel shows the
approximation error when the square of the given function is approximated
by a sum of first-order differences on the grid with step size 1, following
the formula (3.2). Actually, in this case where the function φ in (3.2) is
given by φ(x) = x2 , the approximation error is easy to write down: the
approximation is
j−1
X
ỹ(tj ) = y(0) + 2x(ti )(x(ti+1 ) − x(ti ))
i=0
Asymptotically, if the function x(t) has finite total variation, the sum at the
right hand side of (3.3) must converge to zero when it is computed over finer
and finer grids. However, if the function x(t) remains irregular, in the sense
3 The letter π is used here as a mnemonic for “price.”
8 J.M. SCHUMACHER
Original function
1
−1
−2
0 10 20 30 40 50 60 70 80 90 100
Square
1.5
0.5
0
0 10 20 30 40 50 60 70 80 90 100
−1
0 10 20 30 40 50 60 70 80 90 100
Approximation error
0
−0.5
−1
−1.5
0 10 20 30 40 50 60 70 80 90 100
that the sum at the right hand side of (3.3) is not small, at scales at which
the approximation is computed (in the finance application: time scales
corresponding to trading frequencies), then a second-order term should be
included to get a good approximation.4 Taking this down to infinitesimal
scales, one is led to a new form of the chain rule: if y(t) = φ(x(t)), then
1
(3.4) dy = φ0 (x) dx + 2 φ00 (x) d[x, x]
5 As the reader may have surmised, this is the way in which the function in the top
4. Financial models.
4.1. State equations. Our basic model will be the following:
In this model, all stochasticity derives from the driving Brownian motion
process Wt , and we will consider the above equation typically on an interval
[0, T ]. The state process Xt takes values in an open subset D of Rn . The
symbols µX and σX denote functions from D to Rn and to Rn×k , respec-
tively. They should be such that the stochastic differential equation (4.1a)
has a unique solution in the domain D given an initial condition X0 ; these
properties hold under fairly mild conditions (see for instance [7]). Prices of
assets in the market are collected in the m-vector Yt , which is related to the
state vector Xt by the function πY (t, x) from [0, T ] × D to Rm . It will be
a standing assumption that πY (t, x) 6= 0 ∈ Rm for all (t, x) ∈ [0, T ] × D; in
other words, it cannot happen that all asset prices are zero simultaneously.
On the basis of Itô’s rule, we can write
where the functions µY (t, x) and σY (t, x) can be expressed in terms of the
data in (4.1). Specifically, each component Y` of the vector Y has associated
to it a “drift function” µ` (t, x) and a “volatility function” σ` (t, x) which
are given by
∂π` ∂π` ∂ 2 π`
(4.3a) µ` = + µX + 12 tr >
σX σX
∂t ∂x ∂x2
∂π`
(4.3b) σ` = σX .
∂x
Here, the arguments t and x have been suppressed to alleviate the notation;
this will also often been done below. The gradient vector ∂π` /∂x is defined
as a row vector, and superscript T denotes transposition. The formula in
(4.3a) may be written more explicitly as
n n X n
∂π` X ∂π` 1
X ∂ 2 π` >
µ` = + (µX )i + 2 (σX σX )ij .
∂t i=1
∂xi i=1 j=1
∂x i ∂x j
4.3. Examples.
4.3.1. The Black-Scholes model. Consider the following model
which has one state variable, one uncertainty factor, and two assets de-
noted by S and B:
Clearly, the matrix that appears here is invertible for all t ≥ 0 and x > 0,
and so the equation (4.5) has a unique solution which is given by
µ−r
ρ = r, λ= .
σ
The pricing equation for an asset C that is added to this model becomes
∂πC ∂πC ∂ 2 πC
(4.15) + rx + 12 σ 2 x2 = rπC .
∂t ∂x ∂x2
This is the standard Black-Scholes equation. In the market defined by
(4.13), the equation holds for any asset C whose value can be expressed
in terms of the variables t and x. The standard example is of course the
European call option with maturity T and strike price a, which satisfies
πC (T, x) = max(x − a, 0). Solving the PDE (4.15) with this boundary
condition leads to the Black-Scholes formula for a call option.
In general, if sufficient boundary conditions for a derivative are sup-
plied to determine the price function uniquely, a replication scheme can be
16 J.M. SCHUMACHER
set up as follows. The required portfolio weights are obtained from (4.11),
which in the case at hand becomes
∂πC σx x
(4.16) σx πC = [ φS φB ] .
∂x 0 cert
∂πC
(4.17) φS = .
∂x
The component φB is then determined by the requirement that the portfolio
should be self-financing, and it can be computed explicitly from (4.16).
4.3.2. An Asian option. The model of the previous subsection may
have to be extended for other types of options, in particular for path-
dependent options. As an example, consider a continuously sampled Asian
option. Since such an option depends on the integral of the asset price over
a certain period, a state variable has to be added to the model which will
provide the information needed to determine the option price at expiry.
The extended model may be written down as follows:
if (X2 )0 is taken to be zero. In terms of the model above, the Asian option
with time of expiry T and strike a is defined by
In our new model, we have the following data (letting Y consist of S and
B as before): D = {x ∈ R2 | x1 > 0}, n = 2, k = 1, m = 2, and
µx1 σx1
µX (t, x) = , σX (t, x) = ,
x1 0
S x1
Y = , πY (t, x) = .
B cert
CONTROL AND FINANCIAL ENGINEERING 17
We find
µx1 σx1 x1
(4.20) µY (t, x) = , [σY (t, x) πY (t, x)] =
crert 0 cert
This equation holds for any derivative whose price can be determined in
terms of x1 (the price of the underlying asset S), x2 (the continuous average
of the asset price), and t. For the Asian option with expiry T and strike
a one should apply the boundary condition (4.19). It is possible to solve
the resulting equations analytically, but the solution is quite a bit more
complicated than in the case of a standard call option.
4.3.3. The Vasicek model. It is a well-known fact that interest
rates depend on the period to which they apply—the ten-year rate is usu-
ally not the same as the six-month rate. This fact is expressed by saying
that interest rates have a term structure. Interest rates are not themselves
traded assets, but of course there is a one-to-one connection between the
interest rate for period T and the price of a contract that will pay one unit
of currency after period T (a zero-coupon bond ).
A relatively simple model for interest rates may be constructed as
follows. Consider the equations
∂πT ∂πT ∂ 2 πT 2
(4.23) + [a(b − x1 ) − λσ] + 12 σ = π T x1
∂t ∂x1 ∂x21
with the boundary condition
(4.24) πT (T, x) = 1.
Since it has been assumed that λ is constant and that ρ depends linearly
on the state variable x, it follows that for any given T the random variable
log KT follows a normal distribution. Let mT and vT denote the mean
and the variance, respectively, of log KT ; then, according to a well-known
formula,
EKT = exp mT + 21 vT .
(5.5)
!
Z T
> F̃ (T −t) > F̃ > (T −t)
(5.9) vT = h̃ e G̃G̃ e dt h̃.
0
20 J.M. SCHUMACHER
Therefore we find
!
Z T −t
(5.14) σT (t, x) = −πT (t, x)h> eF s ds G.
0
is invertible for each t ∈ [0, T ]. This in turn is true if and only if the matrix
above has no nonzero null vectors, that is, if the relation
R
T −t
1 −h> 0 1 eF s ds G
w0
.. .. ..
(5.15) . . . = 0
R
1 −h > Tk+1 −t F s
e ds G wk
0
CONTROL AND FINANCIAL ENGINEERING 21
(5.18a) dX1 = X2 dt
with
where for all x at least one of the inequalities is satisfied with equality. The
above equations may be solved to find the market price of the risk of oil as
an investment good:
µX (x) − rπX (x) − c
(6.4) λ(x) = max ,0 .
σX (x)
This expression is due to Bühler et al. [4], who obtained it from an equi-
librium argument.
On the basis of the above model we can now discuss the price of con-
tracts for oil delivery. In general, a futures contract with maturity T is
a contract to deliver a given commodity, for instance one barrel of oil, at
time T for a price FT to be paid when delivery is made. So, in our present
context where we assume a fixed interest rate, the market value at time t
of a contract to deliver a barrel of oil at time T is e−r(T −t) FT . This value
should satisfy the Black-Scholes equation. After some algebra, we find that
the Black-Scholes equation for the T -futures price FT (x) becomes:
where
∂FT ∂FT 1 ∂ 2 FT 2
(6.6a) µT = + µX + σ
∂t ∂x 2 ∂x2 X
∂FT
(6.6b) σT = σX
∂x
with the final condition FT (T, x) = x. This may be solved numerically in
a straightforward way, for instance using a finite-difference approximation.
Various alternative models for oil futures have been formulated in the
literature; see for instance [2, 9]. Depending on the model, one finds dif-
ferent pricing rules. Two alternatives to the rule implicit in (6.4–6.5) are
obtained by replacing (6.4) by either
(6.7) λ(x) = 0
or
µX (x) − rπX (x) − c
(6.8) λ(x) = .
σX (x)
8 The following parameter values were used: Θ = log 20.5, γ = 2.5, σ = 0.35,
r = 0.05, c = 4 (dollars per barrel per year). These values have been suggested in [4].
26 J.M. SCHUMACHER
30
25
20
futures price
15
10
5
two−regime
arbitrage−based
expectation−based
0
5 10 15 20 25 30 35
spot price
Fig. 3. Six-month futures price of oil as a function of the price for immediate
delivery according to three different models.
The idea of the hedging scheme used by MGRM was that the prices
of both long-term and short-term futures contracts are both sensitive to
the spot price of oil. The value of a portfolio consisting of both short-
term and long-term contracts can therefore be made insensitive to changes
in the oil price if the two types of contracts are combined in appropriate
proportions. The existing liquidity of the market for contracts for delivery
of oil in three or six months can in this way be transferred to contracts
for delivery in eight or ten years. The scheme developed by MGRM was
successful in the sense that many customers were happy to enter into long-
term contracts that guaranteed delivery of oil for a fixed price. Once the
contracts were agreed upon, MGRM hedged them by short-term futures.
Of course the short-term contracts used for hedging expire after some time,
and then they need to be “rolled over” (replaced by new contracts). If the
futures price is higher than the spot price, which is not usually the case
but does happen sometimes, it is expensive to roll over a futures contract.
Theoretically the required cash input is counterbalanced by the gain in
value of the long-term contracts. It requires a strong belief in the theory
however to maintain such a scheme when in the short run it drains the
liquid assets of a company. In any case, at the end of 1993 the central
CONTROL AND FINANCIAL ENGINEERING 27
2 2
1.5 1.5
1 1
delta
delta
0.5 0.5
0 0
5 10 15 20 25 30 35 5 10 15 20 25 30 35
spot price spot price
1.5 1.5
J.M. SCHUMACHER
1 1
delta
delta
0.5 0.5
0 0
5 10 15 20 25 30 35 5 10 15 20 25 30 35
spot price spot price
Fig. 4. Ratios for hedging futures of several maturities by means of six-month futures as a function of the current oil price, according to
three different models (dotted: arbitrage-based, drawn: two-regime, dashed: expectation-based).
CONTROL AND FINANCIAL ENGINEERING 29
REFERENCES
[1] F. Black and M. Scholes, The pricing of options and corporate liabilities, Jour-
nal of Political Economy, 81: 637–659 (1973).
[2] M.J. Brennan and E.S. Schwartz, Evaluating natural resource investments,
Journal of Business, 58: 135–157 (1985).
30 J.M. SCHUMACHER
[3] M.J. Brennan and Y. Xia, Dynamic asset allocation under inflation, Journal of
Finance, 57: 1201–1238 (2002).
[4] W. Bühler, O. Korn, and R. Schöbel, Pricing and hedging of oil futures. A
unifying approach, Working Paper 00-01, Lehrstuhl für Finanzierung, Univ.
Mannheim, 2000.
[5] H. Föllmer and A. Schied, Robust preferences and convex measures of risk, in
Advances in Finance and Stochastics. Essays in Honour of Dieter Sondermann,
K. Sandmann and P. Schönbucher, eds., Springer, Berlin, 2002, pp. 39–56.
[6] I. Karatzas and S.E. Shreve, Brownian Motion and Stochastic Calculus (2nd
ed.), Springer, New York, 1991.
[7] B. Øksendal, Stochastic Differential Equations: An Introduction with Applica-
tions (5th ed.), Springer, Berlin, 1998.
[8] P.E. Protter, Stochastic Integration and Differential Equations: A New Ap-
proach, Applications of Mathematics, 21, Springer, New York, 1999.
[9] E.S. Schwartz, The stochastic behavior of commodity prices: Implications for
valuation and hedging, Journal of Finance, 52: 923–973 (1997).
[10] O.A. Vasicek, An equilibrium characterization of the term structure, Journal of
Financial Economics, 5: 177–188 (1977).