Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Lectures
1 Richard R. Lindsey
Portfolio Choice
Individual:
1. Strictly prefers more to less (strictly increasing utility
function)
2. Risk averse
0
initial wealth
riskless interest rate
random return on j-th risky asset
dollar investment in j-th asset
uncertain end of period wealth
f
j
j
w
r
r
a
w
55 Richard R. Lindsey
Portfolio Choice
0
0
( )(1 ) (1 )
(1 ) ( )
j f j j
j j
f j j f
j
w w a r a r
w w r a r r
0
{ }
max [ ( (1 ) ( ))]
j
f j j f
a
j
EU w r a r r
56 Richard R. Lindsey
Portfolio Choice
2
F.O.C. [ ( )( )] 0
S.O.C. [ ( )( ) ] 0
j f
j f
EU w r r j
EU w r r j
() 0 more preferred to less
() 0 concave utility or risk averse
U
U
57 Richard R. Lindsey
Portfolio Choice
Theorem: An individual who is risk averse and strictly
prefers more to less will invest in risky assets iff the rate
of return on at least one asset > r
f
.
Consider the case with a single risky asset
F.O.C. [ ( )( )] 0
f
EU w r r
58 Richard R. Lindsey
Portfolio Choice
Claim:
Consider the no investment case
*
*
*
0 iff [ ] 0
0 iff [ ] 0
0 iff [ ] 0
f
f
f
a E r r
a E r r
a E r r
0 0
[ ( (1 ))( )] ( (1 ))( [ ] )
f f f f
EU w r r r U w r E r r
59 Richard R. Lindsey
Portfolio Choice
60 Richard R. Lindsey
() 0 sign is entirely determined by [ ]
f
U E r r
[ ] 0 can increase utility by adding some of the risky asset
[ ] 0 can increase utility by shorting some of the risky asset
[ ] 0 utility is maximized
f
f
f
E r r
E r r
E r r
Portfolio Choice
Richard R. Lindsey 61
In the multi-asset case, to hold no risky assets or to short
them
And again
Therefore, a risk averse individual with strictly increasing
utility avoids any positive investment in risky assets only
if none of the investments have a positive risk premium.
0
0
[ ( (1 ))( )] 0
( (1 ))( [ ] ) 0
f f
f f
EU w r r r j
U w r E r r j
0 only if [ ] 0
j j f
a j E r r j
Portfolio Choice
Richard R. Lindsey 62
When one or more of the risky assets has a positive risk
premium, the investor will have positive holdings in some
risky assets
Note that j and j are not necessarily the same because with
more than one risky asset, a positive risk premium on an
asset does not necessarily mean a positive investment (e.g.
2 assets w/ + risk premium but one stochastically
dominates the other).
0 if [ ] 0
j j f
j a j E r r
Risk Aversion
Richard R. Lindsey 63
Consider now the case with one risky asset and one riskless
asset.
For a monotonically increasing strictly concave (MISC)
individual to invest all her wealth in the risky asset:
1
st
order Taylor series expansion around
0
[ ( (1 ))( )] 0
f
EU w r r r
0
( (1 ))
f
U w r
Risk Aversion
Richard R. Lindsey 64
Note that this is for a small risk.
The minimum risk premium to induce full investment is
0 0
2 2
0 0
[ ( (1 ))( )] ( (1 )) [ ]
( (1 )) [( ) ] o( [( ) ]
f f f
f f f
EU w r r r U w r E r r
U w r E r r w E r r
0
2
0
0
2
0 0
( (1 ))
[ ] [( ) ]
( (1 ))
( (1 )) [( ) ]
f
f f
f
A f f
U w r
E r r w E r r
U w r
R w r w E r r
Risk Aversion
Richard R. Lindsey 65
This is known as the Arrow-Pratt measure of absolute risk
aversion (the inverse of R
A
is the risk tolerance).
For small risks (or small changes in risk) it is a measure of
the intensity of an individuals aversion to risk.
It is a measure of curvature (but since vonNeumann-
Morgenstern utility is unique up to affine transformations,
the 2
nd
derivative is not sufficient).
Risk Aversion
Richard R. Lindsey 66
Theorem:
( )
0 decreasing absolute risk aversion
( )
0 increasing absolute risk aversion
( )
0 constant absolute risk aversion
A
A
A
dR z
z
dz
dR z
z
dz
dR z
z
dz
0
0
0
0
0
0
( )
0 if 0
( )
0 if 0
( )
0 if 0
A
A
A
dR z
da
w z
dw dz
dR z
da
w z
dw dz
dR z
da
w z
dw dz
Risk Aversion
Richard R. Lindsey 67
Decreasing absolute risk aversion implies that the risky asset
is a normal good (i.e. the dollar demand increases as
wealth increases).
Increasing absolute risk aversion implies that the risky asset
is an inferior good (i.e. the dollar demand decreases as
wealth increases).
Constant absolute risk aversion implies that the dollar
demand is invariant with respect to wealth.
Risk Aversion
Richard R. Lindsey 68
Absolute risk aversion is therefore related to the dollar
demand for the risky asset.
But under decreasing absolute risk aversion, an individual
may actually increase, hold constant, or decrease the
proportion of wealth in the risky asset as wealth increases.
This brings us to the Arrow-Pratt measure of relative risk
aversion
( )
R A
R zR z
Risk Aversion
Richard R. Lindsey 69
Theorem:
Where
Is the wealth elasticity of demand.
( )
1 if 0 (relatively elastic)
( )
1 if 0
( )
1 if 0 (relatively inelastic)
R
R
R
dR z
dz
dR z
dz
dR z
dz
0
0
w
da
dw a
Risk Aversion
Richard R. Lindsey 70
<1: the proportion of agents initial wealth invested in the
risky asset decreases as wealth increases
=1: the proportion of agents initial wealth invested in the
risky asset is constant as wealth increases
>1: the proportion of agents initial wealth invested in the
risky asset increases as wealth increases
Linear Risk Tolerance Utility
Richard R. Lindsey 71
To get sharper results and closed form solution for securities
holdings, we need to specify the form of the utility
function. Most typically we use a class of utility function
known as linear risk tolerance (LRT) utilities or HARA
utilities (hyperbolic absolute risk aversion). These utility
functions satisfy state independence and time additivity.
Linear Risk Tolerance Utility
Richard R. Lindsey 72
Definition: Linear risk tolerance utility, the time additive
and state dependent utility function U( ) satisfies linear
risk tolerance if it solves the differential equation:
Where and are independent of z.
Note: every LRT utility function is identified by 2
parameters: the intercept and the slope .
( )
( )
U z
z
U z
Linear Risk Tolerance Utility
Richard R. Lindsey 73
This differential equation has three sets of solutions
depending on the value of
Where means that the solutions are unique up to a positive
linear transform.
1
1
(A) 0,1 : ( ) where 0; max , 0
1
U z z z
(B) 1 : ( ) ln U z z
(C) 0 : ( ) exp where 0
z
U z
Linear Risk Tolerance Utility
Richard R. Lindsey 74
These three classes are:
(A) Generalized Power Utility (when = 0)
1
( )
A
R z
z |
=
+
2
( )
0
( )
A
dR z
dz z
|
|
= <
+
Linear Risk Tolerance Utility
Richard R. Lindsey 75
( )
R
z
R z
z |
=
+
2
( )
( )
R
dR z
dz z
|
=
+
Which is
0 iff 0
0 iff 0
0 iff 0
> >
= =
< <
Recall from Risk Aversion
Richard R. Lindsey 76
Theorem:
Where
Is the wealth elasticity of demand.
( )
1 if 0 (relatively elastic)
( )
1 if 0
( )
1 if 0 (relatively inelastic)
R
R
R
dR z
dz
dR z
dz
dR z
dz
0
0
w
da
dw a
Linear Risk Tolerance Utility
Richard R. Lindsey 77
When = 0 we have power utility which is CPRA or
constant proportional (relative) risk aversion. Also known
as iso-elastic utility.
The proportion of wealth in the risky asset is invariant to
changes in wealth.
When | = -1 we have quadratic utility.
Linear Risk Tolerance Utility
Richard R. Lindsey 78
(B) Generalized Log Utility (when = 0)
1
( )
A
R z
z
=
+
2
( ) 1
0
( )
A
dR z
dz z
= <
+
Linear Risk Tolerance Utility
Richard R. Lindsey 79
( )
R
z
R z
z
=
+
2
( )
( )
R
dR z
dz z
=
+
Which is
0 iff 0
0 iff 0
0 iff 0
> >
= =
< <
Recall from Risk Aversion
Richard R. Lindsey 80
Theorem:
Where
Is the wealth elasticity of demand.
( )
1 if 0 (relatively elastic)
( )
1 if 0
( )
1 if 0 (relatively inelastic)
R
R
R
dR z
dz
dR z
dz
dR z
dz
0
0
w
da
dw a
Linear Risk Tolerance Utility
Richard R. Lindsey 81
When = 0 we have log utility which is CPRA or constant
proportional (relative) risk aversion. Also known as iso-
elastic utility.
The proportion of wealth in the risky asset is invariant to
changes in wealth.
Note when = 0 we have R
R
(z) = 1.
Linear Risk Tolerance Utility
Richard R. Lindsey 82
(C) Negative Exponential Utility
Constant absolute risk aversion (CARA)
Dollar demand for risky assets is unaffected by changes in
wealth (riskless borrowing or lending absorbs all
changes).
1
( )
A
R z
=
( )
0
A
dR z
dz
=
Stochastic Dominance
Empirical Observations Properties of U(z)
Investors prefer more to less U'(z) > 0
Investors are risk averse U(z) > 0
The risky asset is a normal good dR
A
(z)/dz < 0
Richard R. Lindsey 83
We now want to relate these three properties of utility
functions to the properties of payoff distributions.
For example, one question we can ask is: Under what
circumstances can we unambiguously say that an
individual will prefer one risky asset to another if all
we know is that he prefers more to less?
Stochastic Dominance
We can answer questions like this using stochastic
dominance.
Note that stochastic dominance is:
1. Always a pairwise comparison.
2. Only a partial ordering among risky assets.
3. Much richer than what we will cover here (e.g. you can
develop much of modern portfolio theory just using
stochastic dominance).
Richard R. Lindsey 84
Stochastic Dominance
Definition: First Order Stochastic Dominance
Then X
A
FSD X
B
.
Richard R. Lindsey 85
( ) Pr[ ] F x X x s
( ) and ( ) are different distributions
( ) 0
A B
F x F x
a F a - =
If ( ) ( ) 0
0 some
A B
F x F x x
x
s
<
Stochastic Dominance
Richard R. Lindsey 86
Stochastic Dominance
Definition: Second Order Stochastic Dominance
Then X
A
SSD X
B
.
Richard R. Lindsey 87
( )
If ( ) ( ) 0
0 some
t
A B
a
F x F x dx t
t
s
<
}
and [ ] [ ]
A B
E X E X =
Stochastic Dominance
Richard R. Lindsey 88
Stochastic Dominance
Definition: Third Order Stochastic Dominance
Then X
A
TSD X
B
.
Richard R. Lindsey 89
( )
If ( ) ( ) 0
0 some
y t
A B
a a
F x F x dxdt y
y
s
<
} }
[ ] [ ] and [ ] [ ]
A B A B
E X E X Var X Var X = s
Stochastic Dominance
Richard R. Lindsey 90
Stochastic Dominance
Richard R. Lindsey 91
Stochastic Dominance
Theorem: X
A
FSD X
B
X
A
SSD X
B
X
A
TSD X
B
(these are
progressively weaker tests).
Theorem: E[U(X
A
)] > E[U(X
B
)] for all U( ) (that are finite
for all finite x) such that U'(x) > 0 everywhere iff X
A
FSD
X
B
(i.e. prefers more to less).
Theorem: E[U(X
A
)] > E[U(X
B
)] for all U( ) (that are finite
for all finite x) such that U'(x) > 0 and U(x) < 0
everywhere iff X
A
SSD X
B
(i.e. risk averse).
Richard R. Lindsey 92
Stochastic Dominance
Theorem: E[U(X
A
)] > E[U(X
B
)] for all U( ) (that are finite
for all finite x) such that U'(x) > 0, U(x) < 0 and U'(x) >
0 everywhere iff X
A
TSD X
B
.
Theorem: E[U(X
A
)] > E[U(X
B
)] for all U( ) (that are finite
for all finite x) such that U'(x) > 0, U(x) < 0 and R
A
'(x) <
0 everywhere iff X
A
TSD X
B
(i.e. risky asset is a normal
good).
Richard R. Lindsey 93
Stochastic Dominance
Theorem: The following three statements are equivalent:
1. A FSD B
2. F
A
(x) F
B
(x) for all x
3. x
A
= x
B
+ where 0
Theorem: The following three statements are equivalent:
1. A SSD B
2. E[x
A
] = E[x
B
] and
3. x
A
= x
B
+ where E[ |A] = 0
Richard R. Lindsey 94
( )
if ( ) ( ) 0 and 0 some
t
A B
a
F x F x dx t t s <
}
Stochastic Dominance
Lets consider an example
Which investment do we choose?
Richard R. Lindsey 95
1
1 with probability 0.25
4 with probability 0.75
X
=
2
2 with probability 0.50
4 with probability 0.25
5 with probability 0.25
X
1
1
[ ] 3.25
[ ] 1.6875
E X
Var X
=
=
2
2
[ ] 3.25
[ ] 1.6875
E X
Var X
=
=
Stochastic Dominance
Richard R. Lindsey 96
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 1 2 3 4 5 6
X1
X2
Stochastic Dominance
Cannot have FSD because the cumulative distribution
functions cross.
No SSD because both distribution functions are admissible.
Definition: A distribution is admissible or efficient with
respect to a set of distribution functions, S, if it is not
dominated by a member of S.
Richard R. Lindsey 97
Stochastic Dominance
Richard R. Lindsey 98
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 1 2 3 4 5 6
X1
X2
-0.3
-0.2
-0.1
0
0.1
0.2
0.3
0 1 2 3 4 5 6
g(t)
Stochastic Dominance
X
2
TSD X
1
so we would choose X
2
.
Note that this choice reflects a preference for skewness.
If you must take a risky gamble, do you prefer to take it
when wealth is high or low?
Richard R. Lindsey 99
Riskiness of Distributions
This is a partial ordering of distributions.
Definition: Distribution Y is more risky than distribution X
if:
1. Y=X+Z where E[Z|X]=0 and non-degenerate.
2. Y is obtained from X by the addition of a mean
preserving spread.
3. X is preferred to Y by all risk averters providing
E[X]=E[Y].
4. Var[Y] > Var[X] provided E[X]=E[Y].
Richard R. Lindsey 100
Riskiness of Distributions
Theorem: The partial orderings given by 1, 2, and 3 are
equivalent.
Theorem: The partial orderings given by 1, 2, 3, and 4 are
equivalent for normal distributions. (Reason: normals are
stable under addition if variances are finite.)
Richard R. Lindsey 101
Bibliography
Huang, Chi-fu, and Robert Litzenberger, Foundations for
Financial Economics, North-Holland.
Levy, Haim, Stochastic Dominance: Investment Decision
Making under Uncertainty, Springer.
Ohlson, James, The Theory of Financial Markets and
Information, North-Holland.
Rothschild, M. and J. E. Stiglitz (1970). Increasing Risk: I.
A Definition. Journal of Economic Theory 2: 225-43.
Richard R. Lindsey 102
Optimization: Definitions
Richard R. Lindsey 104
Our optimization problems will take the form:
Where f is a function, x is an n-vector and S is a set of n-
vectors. We call f the objective function, x the choice
variable or control variable, and S the constraint set or
opportunity set.
max ( ) subject to
x
f x x S
Optimization: Definitions
Richard R. Lindsey 105
Definition: The value x* of the variable x solves the problem
if
In this case, we say that x* is a maximizer of the function f
subject to the constraint x an element of S, and that f(x*) is
the maximum (or maximum value) of the function f
subject to the constraint.
max ( ) subject to
x
f x x S
*
( ) ( ) f x f x x S
Optimization: Definitions
Richard R. Lindsey 106
A minimizer is defined analogously.
x
1
is a local maximizer x
2
is a minimizer
x
3
is a maximizer x
4
is a ?
x
5
is a ?
Optimization: Definitions
Richard R. Lindsey 107
Note that we can transform the objective function f with any
strictly increasing function g. In other words:
Is identical to the set of solutions to the problem:
This fact is sometimes useful since it may be easier to work
with a transform of the objective function rather than the
original function.
max ( ) subject to
x
f x x S
max ( ( )) subject to
x
g f x x S
Optimization: Definitions
Richard R. Lindsey 108
Minimization problems are just the maximization of the
negative of the objective function
Has the same set of solutions as
max ( ) subject to
x
f x x S
min ( ) subject to
x
f x x S
Optimization: Definitions
Richard R. Lindsey 109
Note that a continuous function on a compact set (closed
and bounded) attains both a minimum and a maximum on
that set (this is the Extreme Value Theorem). This is a
sufficient condition for a maximum (and a minimum) to
exist.
2 2 2 2 2
2 2
4 ( 8 4 ) 2 (8 2 (2 4 )) 8(2 (2 ) (4 ))
8(6 (4 ))
x x xy y x y y x y y x y
y x y
+ = + +
= +
Envelope Theorem
Richard R. Lindsey 145
Often we are interested in how the maximal value of a
function depends on its parameters.
Consider the unconstrained maximization problem:
Assume that for any a the problem has a unique solution;
denote this solution x*(a). Denote the maximum value
of f , for any given value of a, by M*(a): M *(a)
= f (x*(a), a). We call M* the value function.
max ( ( ), )
x
f x a a
Envelope Theorem
Richard R. Lindsey 146
Taking the derivative of Musing the chain rule
The first term is the indirect effect of how changing a affects the optimal
choice of x and how that change in x affects the value of f. The second term
is the direct effect of how changing a changes f holding x fixed at x(a). This
expression can be simplified by noticing that since x*(a) is the optimal
choice for x at each value of a,
* * *
( ) ( , ) ( ) ( ( ), ) dM a f x a dx a f x a a
da x da a
c c
= +
c c
*
( , )
0
f x a
x
c
=
c
Envelope Theorem
Richard R. Lindsey 147
This means
Or the change in the objective function adjusting optimally
is equal to the change in the objective function when one
doesnt adjust x.
In other words, the total derivative of f(x(a),a) with respect
to a is equal to the partial derivative of f(x(a),a) with
respect to a, evaluated at the optimal choice of x.
This is known as the Envelope Theorem.
* *
( ) ( ( ), ) dM a f x a a
da a
c
=
c
Envelope Theorem
Richard R. Lindsey 148
Note that to compute the effect of changing a on x(a), we
differentiate the FOC
*
( , )
0
f x a
x
a
(
c
c
(
c
=
c
2 * 2 *
2
( , ) ( ) ( ( ), )
0
f x a dx a f x a a
da x a
x
c c
+ =
c c
c
Envelope Theorem
Richard R. Lindsey 149
The sign of the denominator is negative by the SOC,
therefore the sign of the expression is determined by the
sign of the mixed partial in the numerator.
2 *
2 *
2
( ( ), )
( )
( , )
f x a a
dx a
x a
da
f x a
x
c
c c
=
c
c
Envelope Theorem
Richard R. Lindsey 150
Now consider
Then the Lagrangian is
The envelope theorem states
Again, we only have to take into account the change in y,
not the associated change in x.
,
max ( , ) subject to g( , ) 0
x y
f x y x y
( ,y) ( , ) ( , ) x f x y g x y L
* * *
*
( (y),y) ( ( ), ) ( ( ), ) x f x y y g x y y
y y y
c c c
=
c c c
L
Envelope Theorem
Richard R. Lindsey 151
Example: Consider a utility maximization problem: max
x
U(x) subject to p x = w. where x is a vector (a bundle of
goods), p is the price vector, and w is the consumer's
wealth (a real number). Denote the solution of the problem
by x*(p, w), and denote the value function by v, so that
The function v is known as the indirect utility function.
*
( , ) ( ( , )) for every ( , ) v p w U x p w p w =
Envelope Theorem
Richard R. Lindsey 152
By the envelope theorem
Thus
This result is known as Roy's identity.
* *
( , )
( , ) ( , )
i
i
i
v p w
p w x p w
p
c
=
c
*
( , )
( , )
v p w
p w
w
c
=
c
*
( , )
( , )
( , )
i
i
i
v p w
p
x p w
v p w
w
c
c
=
c
c
=
'
= +
''
+
+
=
''
= +
+
(
(
1 1
[ ]
N N
p i j ij
i j
Var r ww w w o
= =
'
= = E
c
' (
= =
c
L
( )
1 0 wi
c
'
= =
c
L
'
= E
1
C i i
'
= E
2
D BC A =
B A
M
A C
(
=
(
C A
D
p
E r
(
=
B A
D
p
E r
(
=
Mean-Variance Analysis: Basics
Richard R. Lindsey 171
And substituting into our expression for w
p
gives
Any frontier portfolio can be found this way since the
expected return was arbitrary and this equation is a
necessary and sufficient solution.
( ) ( )
1 1
C [ ] A B A [ ]
D D
p p
p
E r E r
w e i
= E + E
1 1 1 1
1 1
C A [ ] B A
D D
p p
w e E r e i i
( (
= E E + E E
h [ ] g
p p
w E r = +
| | ( )
| |
1 1
1
m m
i i i i
i i
m
i
i
w g hE r
g h E r
o o
= =
=
= +
= +
Bibliography
Richard R. Lindsey 178
Cornuejols and Ttnc, Optimization Methods in Finance,
Cambridge.
Huang and Litzenberger, Foundations for Financial Economics,
North-Holland.
Intriligator, Mathematical Optimization and Economic Theory,
Prentice-Hall.
Marsden and Tromba, Vector Calculus, Freeman.
Varian, Microeconomic Analysis, Norton.
Mean-Variance Analysis: Risk Free Rate
Richard R. Lindsey 187
Everything we have done so far did not have a riskless asset.
Now consider N+1 assets with equal to the portfolio
weights on risky assets is the solution to
p
w
p
w
{ }
1
min
2
s.t. (1 )
w
f p
w w
w e w r E r i
'
E
' ' (
+ =
Mean-Variance Analysis: Risk Free Rate
Richard R. Lindsey 188
Which has the solution
( )
1
2
B 2A C
p f
p f
f f
E r r
w e r
r r
i
(
= E
+
( )
2
2
2
( )
B 2A C
p f
p
f f
E r r
r
r r
o
(
=
+
Mean-Variance Analysis: Risk Free Rate
Richard R. Lindsey 189
There are three cases.
1. A/C>r
f
( )
q f qp p f
E r r E r r | ( ( = +
( )
1
q qp f qp p q
r r r | | c = + +
( )
, 0
p q q
Cov r E c c ( = =
Mean-Variance Analysis
Richard R. Lindsey 193
Lets return to our minimization problem:
There are alternative ways to pose this problem; for
example, we could rewrite the constraints as:
{ }
1
min
2
s.t. and 1
w
p
w w
w e E r wi
'
E
' ' (
= =
Aw b
Mean-Variance Analysis
Richard R. Lindsey 194
Where
Note: If we wanted to include a riskless asset, we could also have N+1 assets
with one of the assets return equal to the risk-free rate.
1 2
1 1 1
N
A
e e e
1
[ ]
p
b
E r
Mean-Variance Analysis
Richard R. Lindsey 195
Forming the Lagrangian
With FOC
( )
1
2
w w b Aw
'
= E L
0 w A
w
c
'
= E + =
c
L
0 A w b
c
'
= =
c
L
Mean-Variance Analysis
Richard R. Lindsey 196
Solving now, from the first FOC
Substituting into the second FOC and solving for the
optimal weights gives
1
w A
1 1 1
( ) w A A A b
Mean-Variance Analysis
Richard R. Lindsey 197
Example: Assume that we have three stocks with the
following characteristics (what do you expect?)
1
2
3
0.100162
0.164244
0.182082
e
e e
e
11 12 13
21 22 23
31 32 33
0.100162 0.045864 0.005712
0.210773 0.028283
0.066884
Mean-Variance Analysis
Richard R. Lindsey 198
And that we want a 15% return on the portfolio (is this
feasible?). The constraints can be written
1 1 1
0.100162 0.164244 0.182082
A
1
0.15
b
Mean-Variance Analysis
Richard R. Lindsey 199
Now we can use the solution to find the optimal weights
1 1 1
( ) w A A A b
0.3830
0.0397
0.5773
w
Mean-Variance Analysis
Richard R. Lindsey 200
Do you see any problems or issues associated with the
solution to our portfolio problem?
Mean-Variance Analysis
Richard R. Lindsey 201
Do you see any problems or issues associated with the
solution to our portfolio problem?
There may be other constraints which must be imposed:
Diversification constraints
max or min
Short-sale constraints
Borrowing constraints
Leverage constraints
Tracking error constraints
Etc.
Mean-Variance Analysis
Richard R. Lindsey 202
For example, the Investment Company Act of 1940
Rule 12-d3 imposes certain investment constraints on
mutual funds:
Mutual funds cannot own more than 5% of other investment
companies (firms which derive more than 15% of revenue from
securities related activity)
If a mutual fund advertises as a diversified fund, it cannot hold
more than 5% of its assets in any company or hold more than
10% of the voting stock for any company for 75% of the fund
Mean-Variance Analysis
Richard R. Lindsey 203
This means that we may want to (or need to) place
additional constraints on our optimization. Further, these
constraints may be inequality constraints (for example a
short-sale constraint would be expressed as w
i
0 for
all i.).
So, lets revisit optimization this time with inequality
constraints.
max ( ) subject to ( ) 0
x
f x g x
max ( ) subject to ( ) 0 and ( ) 0
x
f x g x g x
Optimization with Inequalities
Richard R. Lindsey 206
To start thinking about how to solve the general problem,
first consider the case with a single constraint
There are two possible solutions for this problem, one where
the constraint is binding and the other is where the
constraint does not bind. In the latter case, where the
constraint is not binding for small changes in the
constraint, we say that the constraint is slack.
max ( ) subject to ( )
x
f x g x c
Optimization with Inequalities
Richard R. Lindsey 207
( ) ( ) ( ( ) ) x f x g x c L
Optimization with Inequalities
Richard R. Lindsey 209
In the first case (that is, if g(x*) = c) we have 0.
Suppose, to the contrary, that < 0. Then we know that a
small decrease in c raises the maximal value of f . That is,
moving x* inside the constraint raises the value of f ,
contradicting the fact that x* is the solution of the
problem.
In the second case, the value of does not enter the
conditions, so we can choose any value for it. Given the
interpretation of , setting = 0 makes sense. Under this
assumption we have f
i
'(x) = L'
i
(x) for all x, so that
L'
i
(x*) = 0 for all i.
*
( )
0 for 1, ,
j
x
j n
x
L
* *
0, ( ) , and either 0 or ( ) 0 g x c g x c
Optimization with Inequalities
Richard R. Lindsey 211
Alternatively, since the product of two numbers is zero if at
least one of them is zero, we can write
Note that we have not ruled out the possibility that both = 0 and g(x*) = c.
The inequalities 0 and g(x*) c are called
complementary slackness conditions; at most one of these
conditions is slack (i.e. not an equality).
*
( )
0 for 1, ,
j
x
j n
x
L
* *
0, ( ) , and ( ( ) ) 0 g x c g x c
Optimization with Inequalities
Richard R. Lindsey 212
For a problem with many constraints, we introduce a
multiplier for each constraint and obtain the Kuhn-Tucker
conditions. For the problem
The Kuhn-Tucker conditions are
1
( ) ( ) ( ( ) )
m
j j j
j
x f x g x c
=
=
L
Optimization with Inequalities
Richard R. Lindsey 214
Example: Consider the problem
The Lagrangian is
1 2
2 2
1 2
,
max ( 4) ( 4)
x x
x x
2 2
1 2 1 2 1 1 2 2 1 2
( , ) ( 4) ( 4) ( 4) ( 3 9) x x x x x x x x = + + L
1 2 1 2
subject to 4 and 3 9 x x x x + s + s
Optimization with Inequalities
Richard R. Lindsey 215
And the Kuhn-Tucker conditions are
1 1 2
2 1 2
1 2 1 1 1 2
1 2 2 2 1 2
2( 4) 0
2( 4) 0
4, 0, and ( 4) 0
3 9, 0, and ( 3 9) 0
x
x
x x x x
x x x x
=
=
+ s > + =
+ s > + =
Optimization with Inequalities
Richard R. Lindsey 216
We have seen that a solution x* of an optimization problem
with equality constraints is a stationary point of the
Lagrangean if the constraints satisfy a regularity condition
(g(x*) 0 in the case of a single constraint g(x) = c)). In
an optimization problem with inequality constraints a
related regularity condition guarantees that a solution
satisfies the Kuhn-Tucker conditions. The weakest forms
of this regularity condition are difficult to verify. The next
result gives three alternative strong forms that are much
easier to verify.
1 2
2 2
1 2
,
max ( 4) ( 4)
x x
x x
1 2 1 2
subject to 4 and 3 9 x x x x + s + s
Optimization with Inequalities
Richard R. Lindsey 232
We previously found the Kuhn-Tucker conditions,
What are the solutions of these conditions? Start by looking
at the two conditions
1
(x
1
+ x
2
4) = 0 and
2
(x
1
+ 3x
2
9) = 0. These two conditions yield the
following four cases.
1 1 2
2 1 2
1 2 1 1 1 2
1 2 2 2 1 2
2( 4) 0
2( 4) 0
4, 0, and ( 4) 0
3 9, 0, and ( 3 9) 0
x
x
x x x x
x x x x
=
=
+ s > + =
+ s > + =
Optimization with Inequalities
Richard R. Lindsey 233
(1) x
1
+ x
2
= 4 and x
1
+ 3x
2
= 9:
In this case we have x
1
= 3/2 and x
2
= 5/2. Then the first two
equations are
5
1
2
= 0
3
1
3
2
= 0
which imply that
1
= 6 and
2
= 1, which violates the
condition
2
0. We can rule out this case.
1
0, x + y 6,
1
(x + y 6) = 0
2
0, x 0,
2
x = 0
3
0, y 0,
3
y = 0.
1 2 3
( , ) ( 6) x y xy x y x y = + + + L
Optimization Summary
Richard R. Lindsey 241
Conditions under which FOC are necessary and sufficient:
Unconstrained Maximization Problems
If x* solves max
x
f (x) then f '
i
(x*) = 0 for i = 1, ..., n.
If f '
i
(x*) = 0 for i = 1, ..., n and if f is concave then x*
solves max
x
f (x).
Optimization Summary
Richard R. Lindsey 242
Equality Constrained Maximization Problems (one constraint)
If x* solves max
x
f (x) subject to g(x) = c, and if
g(x*) (0,...,0), then there exists such that L'
i
(x*) = 0
for i = 1, ..., n and g(x*) = c.
If there exists such that L'
i
(x*) = 0 for i = 1, ..., n and
g(x*) = c and if f is concave and g is convex then x*
solves max
x
f (x) subject to g(x) = c.
Optimization Summary
Richard R. Lindsey 243
Inequality Constrained Maximization Problems
If x* solves max
x
f (x) subject to g
j
(x) c
j
for j = 1, ..., m
and if {g
j
is concave for j = 1, ..., m} or {g
j
is convex for
j = 1, ..., m and there exists x such that g
j
(x) < c
j
for
j = 1, ..., m} or {g
j
is quasi-convex for j = 1, ..., m,
g
j
(x*) (0,...,0) for j = 1, ..., m, and there exists x such
that g
j
(x) < c
j
for j = 1, ..., m} then there exists (
1
,...,
m
)
such that L'
i
(x*) = 0 for i = 1, ..., n and
j
0, g
j
(x*) c
j
,
and
j
(g
j
(x*) c
j
) = 0 for j = 1, ..., m.
Optimization Summary
Richard R. Lindsey 244
Inequality Constrained Maximization Problems
If there exists (
1
,...,
m
) such that L'
i
(x*) = 0 for i = 1, ..., n
and
j
0, g
j
(x*) c
j
, and
j
(g
j
(x*) c
j
) = 0 for j = 1, ..., m
and if g
j
is quasi-convex for j = 1, ..., m and either {f is
concave} or {f is quasi-concave and twice differentiable
and f (x*) (0,...,0) where L(x) = f (x)
j=1
m
j
(g
j
(x)
c
j
)} then x* solves max
x
f (x) subject to g
j
(x) c
j
for
j = 1, ..., m.
Bibliography
Richard R. Lindsey 245
Cornuejols and Ttnc, Optimization Methods in Finance,
Cambridge.
Huang and Litzenberger, Foundations for Financial Economics,
North-Holland.
Intriligator, Mathematical Optimization and Economic Theory,
Prentice-Hall.
Marsden and Tromba, Vector Calculus, Freeman.
Varian, Microeconomic Analysis, Norton.
(
= E
+
1
2
2
A
C
B 2A C
A
A C
C
B 2A C
0
p f
p
f f
p f
f f
E r r
w e
r r
E r r
r r
i i i
(
| |
' '
= E
|
+ \ .
(
| |
=
|
+ \ .
=
1
min
2
s.t.
w
w w
Aw b
'
E
s
M-V Analysis Inequalities: Example
Richard R. Lindsey 252
Where
1 1 1
0.100162 0.164244 0.182082
1 0 0
0 1 0
0 0 1
1 0 1
A
M-V Analysis Inequalities: Example
Richard R. Lindsey 253
And
Notice to express the constraint that w
2
0.10, we used w
1
+w
3
0.90. Sometimes
we need to reengineer our constraints to reach a solution.
1
0.15
0
0
0
0.90
b
(
(
(
(
=
(
(
(
(
M-V Analysis Inequalities: Example
Richard R. Lindsey 254
The solution is
(using quadprog this took 1 iteration)
0.3699
0.1000
0.5301
w
(
(
=
(
(
M-V Analysis
Richard R. Lindsey 255
Congratulations!
M-V Analysis
Richard R. Lindsey 256
Congratulations!
Now you know how to do everything in portfolio analysis
you just need to set up the appropriate problem.
M-V Analysis
Richard R. Lindsey 257
Congratulations!
Now you know how to do everything in portfolio analysis
you just need to set up the appropriate problem.
Lets consider a few alternatives
M-V Analysis: Diversification Constraint
Richard R. Lindsey 258
As discussed last time, there are sometimes regulatory
requirements for diversification. In addition, many portfolios
are required (by their managers/investors) to have minimum
and/or maximum investment limits in certain stocks, industries,
sectors, or asset classes. These types of problems can be
generally expressed:
Where the vectors w
l
and w
u
represent lower and upper bounds.
1
min
2
s.t.
and
w
l u
w w
Aw b
w w w
'
E
s
s s
M-V Analysis: Trading Volume
Richard R. Lindsey 259
A typical constraint is one on trading volume. This
constraint may be used for a large portfolio where you
want to avoid price impact or for any portfolio where you
want to control the liquidity risk of the portfolio.
Where x is a vector of ADV in dollar terms and c is a
constant for the threshold.
(e.g. $500 million portfolio; 10% of ADV (in millions) of stock i
w
i
(0.1/500)x
i
) Can you generalize this?
1
min
2
s.t.
and
w
w w
Aw b
w cx
'
E
s
s
M-V Analysis: Beta Exposure
Richard R. Lindsey 260
Sometimes it is desirable to match the beta of a benchmark
portfolio:
Where:
(note that this will not bound the tracking error or asset specific risk only the
factor risk)
benchmark
1
min
2
s.t.
and
w
w w
Aw b
w | |
'
E
s
'
=
1
N
M-V Analysis: Beta Exposure
Richard R. Lindsey 261
Or we can specify a range for the beta exposure:
lower limit upper limit
1
min
2
s.t.
and
w
w w
Aw b
w | | |
'
E
s
'
s s
M-V Analysis: Factor Exposure
Richard R. Lindsey 262
Or sometimes we are matching multiple factors:
Where:
(NB: tilting)
lower limit upper limit
1
min
2
s.t.
and B
w
w w
Aw b
w | |
'
E
s
s s
11 12 1
21 22 2
1 2
B
K
K
N N NK
M-V Analysis: Tracking Error
Richard R. Lindsey 263
Most professionals with a benchmark use a minimization of
tracking error when weighting stocks in the portfolio.
M-V Analysis: Tracking Error
Richard R. Lindsey 264
Most professionals with a benchmark use a minimization of
tracking error when weighting stocks in the portfolio.
Two methods:
1. Minimize the tracking error for a given expected excess
return over the benchmark.
2. Maximize the expected excess return over the benchmark
without exceeding a maximum tracking error constraint,
M-V Analysis: Tracking Error
Richard R. Lindsey 265
Tracking error is generally defined as the standard deviation
of the portfolio returns minus the benchmark returns:
Consider the components of the variance
The last term is beyond our control and the first term is what
we usually minimize.
benchmark
TE ( )
( )
p
p b
StdDev r r
Var r r
( ) ( ) 2 ( , ) ( )
p b p p b b
Var r r Var r Cov r r Var r
M-V Analysis: Tracking Error
Richard R. Lindsey 266
Define
And our problem becomes
1
( , )
( , )
b
N b
Cov r r
Cov r r
min 2
s.t.
and
w
p
w w w
Aw b
w
' '
E
s
'
=
M-V Analysis: Tracking Error (Factors)
Richard R. Lindsey 267
If we are dealing with multiple factors and want to minimize
tracking error, we note:
Where the vector f are the factors into which we have
decomposed returns and the residual terms for different
securities have covariance of zero.
( ) ( ) ( )
i i i i
Var r Var f Var
1 1 i i j j K K i
r f f f
M-V Analysis: Tracking Error (Factors)
Richard R. Lindsey 268
We can then write the variance-covariance matrix as
Or
1,1 1, 1 1 1,1 ,1
,1 , 1 1, ,
1
( ) ( , )
( , ) ( )
( ) 0
0 ( )
K K N
N N K K K K N K
N
Var f Cov f f
Cov f f Var f
Var
Var
B ( )B ( ) Var f Var
M-V Analysis: Tracking Error (Factors)
Richard R. Lindsey 269
B then represents the N by K matrix of factor exposures;
Var(f ) is a K by K matrix of factor premium variances and
Var() is an N by N diagonal matrix of error variances.
The squared tracking error is then
If we add any other relevant constraints, we can solve this
using our quadratic optimizer.
(note: we are now minimizing the tracking error)
2
TE ( ) B ( )B( ) ( ) ( )( )
p b p b p b p b
w w Var f w w w w Var w w c
' ' '
= +
M-V Analysis: Tracking Error (Tilting)
Richard R. Lindsey 270
When we actually have specific values or weights for our
factor exposure, we can tilt the portfolio to those weights
by applying a constraint
Where B is as defined earlier and d is the vector
representing the tilt. For example, if we have five factors:
market, size, growth, country, and sector and we wanted to
overweight size and growth, we could use
B( ) d
p b
w w
'
=
d (0 0.1 0.1 0 0)
M-V Analysis: Tracking Error (Tilting)
Richard R. Lindsey 271
The zeros in d make sure that the portfolios exposures to
the benchmark with respect to market, country and sector
are the same, and the values make sure that the exposure
to size and growth will by higher than the benchmark by
0.1.
With factor tilting, the optimization problem becomes
min( ) ( )( )
s.t. B ( ) d
and any other constraints
p
p b p b
w
p b
w w Var w w
w w
c
'
'
=
M-V Analysis: Tracking Error (Ghost)
Richard R. Lindsey 272
There may be cases in which you do not know what the
underlying securities in the benchmark are or their
weights. In this case, you would minimize the tracking
error with respect to the history of returns of the
benchmark. One possible approach is to minimize
Where
b
is the benchmarks factor exposure and
b
is the
benchmarks error term. Now that we have described the
tracking error, we continue as before.
2
B B ( ) 0
TE ( )
0 ( ) 1 1 1 1
p p p p
b b b
Var w w w w
Var f
Var
c
| | c
' ' '
| | | | | | | | | | | | | |
= +
| | | | | | |
\ . \ . \ . \ . \ . \ . \ .
M-V Analysis: Tracking Error (Risk-Adj)
Richard R. Lindsey 273
As indicated earlier, an alternative approach is have a
maximum tracking error constraint and maximize
expected return of the portfolio subject to that constraint.
We could write this as
And any other constraints. Alternatively, if we did not have
a target mean or tracking error, we could use a tracking
error risk aversion parameter A and write
2
max
s.t. ( )
w
p b x
w
Var r r
o
'
=
max ( )
p b
w
w AVar r r
'
Richard R. Lindsey 274
Note that these two formulations are related. The set of
maximum-return portfolios obtained as we vary the
tracking error constraint is identical to the set of optimal
portfolios obtained as we vary the tracking-error risk
aversion parameter. In other words, we can always choose
parameters so the two formulations are equivalent. This
property may be useful for solving the optimization
problem depending on how our optimizer wants the
problem to be set.
M-V Analysis: Tracking Error (Risk-Adj)
M-V Analysis
Richard R. Lindsey 275
Get the idea?
One we know how to solve the portfolio optimization
problem, everything else is just a wrinkle.
M-V Analysis
Richard R. Lindsey 276
Get the idea?
One we know how to solve the portfolio optimization
problem, everything else is just a wrinkle.
That doesnt mean that its easy what it means is that we
have to figure out how to pose the problem that we want
to solve in a manner in which we can solve it (with the
help of an optimizer).
M-V Analysis
Richard R. Lindsey 277
Get the idea?
One we know how to solve the portfolio optimization
problem, everything else is just a wrinkle.
That doesnt mean that its easy what it means is that we
have to figure out how to pose the problem that we want
to solve in a manner in which we can solve it (with the
help of an optimizer).
But, just for fun, lets see if there is anything else we can learn.
M-V Analysis Utility
Richard R. Lindsey 278
Notice that in the numerical example at the beginning of
class, we assumed that we wanted an expected return for
the portfolio of 15% and optimized to achieve that
objective. What makes this right?
2
1 1
max max max
2 2
p p
x x x
U w w w o
( (
' '
~ = E
( (
M-V Analysis Utility
Richard R. Lindsey 280
Where
The unconstrained optimum is found using the FOC
Under the normal regularity conditions.
1
0
dU
w
dw
= E =
* 1
w
= E
( , )
ij i
Cov R c R c o =
[ ]
i i
E R c =
M-V Analysis Utility
Richard R. Lindsey 281
Or with equality constraints
Forming the standard Lagrangian
1
max max subject to
2
w w
U w w w Aw b
(
' '
~ E =
(
1
( )
2
w w w Aw b
1
0 w A
w
c
'
= E =
c
L
0 Aw b
c
= =
c
L
* 1
( ) w A
'
= E
Aw b =
M-V Analysis Utility/2-Fund Separation
Richard R. Lindsey 283
Solving for the optimal weights
Notice that the optimal solution is split into a constrained
minimum-variance portfolio and a speculative portfolio.
This is known as two-fund separation. The first term does
not depend either on the expected returns or on the risk
tolerance it is the constrained minimum-variance
portfolio. The second term depends on the expected
returns and the investors risk tolerance.
* 1 1 1 1 1 1
( ) ( ( ) ) w A A A b A A A A
' ' ' '
= E E + E E E
M-V Analysis Efficiency of Solution
Richard R. Lindsey 284
A brief aside:
Note that constrained optimization reduces the efficiency of the
solution. A constrained solution must be less optimal than an
unconstrained solution (assuming that the constraint is
binding). The loss in efficiency can be measured as the
difference between a constrained and unconstrained solution.
But, not every difference between constrained and unconstrained
portfolios is statistically or economically significant. So we
might want to test whether there is a difference. One way to
test for significance is to use the Sharpe ratio (SR).
* *
* * *2 2
2 , ( 1)
( )( )( )
F
(1 )
k N k k
N k k k SR SR
SR
+ +
+
f
r r
SR
o
=
Asset-Liability Management
Richard R. Lindsey 286
Now consider the problem when we also have stochastic
liabilities. In this case, we focus on the difference between
assets and liabilities. This is known as surplus. The
change in surplus depends directly on the returns of the
asset portfolio (R
p
) as well as the liability returns (R
l
).
We will express surplus returns as a change in surplus
relative to assets
Surplus Assets Liabilities
p l
R R A =
Surplus Liabilities
Assets Assets
p l p l
R R R fR
A
= =
Asset-Liability Management
Richard R. Lindsey 287
Where f is the ratio of liabilities to assets. If we set f = 1 and
R
l
= c, we are back in the world without liabilities (or
where cash is our liability).
If we want to use the same optimizer, we need to transform
this problem into one of surplus i.e. we need to express
covariance in terms of surplus risk and expected returns in
terms of the relative return of assets verses liabilities.
S S
1
max subject to
2
w
w w w Aw b
(
' '
E =
(
Asset-Liability Management
Richard R. Lindsey 288
11 1 1
S
1
1
1 0 0 1 0 0
0 1 0 1
0 0 1 0 0 1
k l
k kk kl
l lk ll
f f
f f
f f
o o o
o o o
o o o
'
( ( (
( ( (
( ( (
E =
( ( (
( ( (
1
S
(1 )
l
k l
f
c f
f
(
(
= +
(
(
Asset-Liability Management
Richard R. Lindsey 289
Now our solution is
By varying the risk-tolerance parameter, we can trace out
the surplus-efficient frontier.
* 1 1
S S
1 1 1 1
S S S S S
( )
( ( ) )
w A A A b
A A A A
' '
= E E +
' '
E E E
Asset-Liability Management
Richard R. Lindsey 290
The unconstrained (asset-only) frontier and the surplus-
efficient frontier coincide if:
Liabilities are cash (or, equivalently, if assets have zero covariance
with liabilities)
All assets have the have the same covariance with liabilities
There exists a liability-mimicking asset and it lies on the efficient
frontier
The Investment Universe
Richard R. Lindsey 291
The choice of the investment universe has a significant
impact on the outcome of portfolio construction. If we
constrain ourselves to NYSE equities, it is likely that our
optimizer will produce a solution skewed toward smaller
cap stocks (why?). If we add Nasdaq equities and foreign
equities, this is likely to change as the variance-covariance
structure changes.
In general, to avoid the accumulation of estimation errors,
we would like to limit our portfolio optimization to groups
of assets with high intragroup and low intergroup
correlations.
The Investment Universe
Richard R. Lindsey 292
In the two asset case, our unconstrained optimization
produces
* 1
w
= E
* 1 1
1 11 12 1 *
* 1 1
2
2 21 22
w
w
w
( (
E E
(
= =
( (
(
E E
( (
*
1
1 22
11
1 11 22 12 21
2 2
11 11 11
1 1
(1 )
dw
d
o
o o o o
o o o
= E =
= =
The Investment Universe
Richard R. Lindsey 293
As the correlation between the two assets approaches 1, the
portfolio weights will react very sensitively to changes in
means (or expected return estimates). As assets become
more similar, any expected return becomes increasingly
important for the allocation decision. Portfolio
optimization with highly correlated assets will almost
certainly lead to extreme and undiversified results.
In the next homework set, I have you explore a method of reducing this
problem using cluster analysis.
Risk Decomposition
Richard R. Lindsey 294
It is often useful to understand the sources of risk in and
how those risks are spread through our portfolio. To get at
this, we can decompose risk in the following way.
Consider the standard deviation of portfolio returns
The first question we would like to address is how does
portfolio risk as we change the holdings of a particular
asset?
1/2
1/2 2
( )
p i ii i j ij
i i j i
w w w ww o o o
=
(
'
= E = +
(
(
Risk Decomposition
Richard R. Lindsey 295
What we need is the marginal contribution to risk MCTR
which can be easily calculated
Where the ith element in the k by 1 vector is
1
MCTR
p
k
p
d
w
dw
i ii j ij
p j i ip
i p
i p p
w w
d
dw
Risk Decomposition
Richard R. Lindsey 296
Note that if we add the weighted MCTRs of all securities in
the portfolio, we get the volatility of the portfolio
as we would expect. If we divide this expression by the
volatility of the portfolio, we get
p ip
i i p
i p
i i
d
w w
dw
2
1
p ip
i
i i i
p i
p i i i
d
w
w w
dw
Risk Decomposition
Richard R. Lindsey 297
Which shows that the percentage contributions to risk
(PCTR), which add up to 100%, are equal to the weighted
betas. This can be written as a vector
Where W is a k by k diagonal matrix with portfolio weights
on the diagonal. Each element of the vector PCTR is
given by
1
W
PCTR
p
k
p
d
dw
PCTR
p
i
i i i
p i
d
w
w
dw
Bibliography
Richard R. Lindsey 298
Huang and Litzenberger, Foundations for Financial
Economics, North-Holland.
Intriligator, Mathematical Optimization and Economic
Theory, Prentice-Hall.
0
+
1
+
2
+
3
Convertible 0.55 (7.66) 0.09 0.22 0.25
Distressed 0.52 (6.86) 0.18 0.44 0.49
Event-Driven 0.28 (3.56) 0.29 0.38 0.38
Macro 0.18 (2.10) 0.29 0.37 0.52
The betas from ordinary regressions appear to
underestimate the true market exposure and therefore
overstate the diversifying effects associated with the
hedge funds.
| |
|
E =
|
|
\ .
1 2 3
( , , ) (2.0,1.29, 0.3) e e e =
| |
|
E =
|
|
\ .
1 2 3
( , , ) (1.77,1.22, 0) e e e =
= E
i ij j i
j i
r a r | c
=
= + +
2
i
R
(
(
(
(
(
E =
(
(
(
(
(
o
=
(
(
=
(
o
=
(
(
=
(
Estimation Error
Richard R. Lindsey 335
We should be clear that everything that we have done so far
is predicated on a couple of things:
1. We are using expected returns in other words,
forecasted returns for our assets.
2. We are using an expected variance-covariance structure
in other words, forecasted for our universe of assets.
3. If the future deviates from our forecasts by a significant
amount, we will not have an optimal portfolio. (This is an
issue of performance measurement)
Estimation Error
Richard R. Lindsey 336
As I have said, generally you will want to forecast the mean
in some manner (if we have time we will talk more about
this later in the course). Your forecast could be a simple
forecast (like last periods return or the sample mean) or it
could be more complex (Delphi method; time series
forecast; multi-factor forecast).
Estimation Error
Richard R. Lindsey 337
For the variance-covariance structure, one typically uses
simple approaches like the estimated structure based upon
the sample history, a 250 day moving average, or an
exponentially weighted average. You can add complexity
to this by embedding Arch-Garch processes or other
generalizations, but remember that if you are not using a
factor decomposition (and thereby reducing the space),
you are now attempting to forecast a large number of
variables for a problem of any size.
2
2
n
n
Estimation Error
Richard R. Lindsey 338
To review what I discussed last time, assume that we have
an estimated mean of 10% and an estimated volatility of
20%.
Estimation error for the mean is given by
And the confidence interval is calculated as
T
, z z
T T
Estimation Error
Richard R. Lindsey 339
For the variance, Campbell, Lo and MacKinlay have shown
We can see from these expressions that the estimation error
for the mean is effected by the length of the time series T
and the estimation error for the variance is effected both
by the length and by the frequency of sampling (t).
We also see this in the following tables:
1
2 2
( ) 1 2
T
Var
t
Estimation Error
Richard R. Lindsey 340
Estimation Period (yrs) Estimation Error % 95% Confidence Interval %
1 20 78
5 9 35
10 6 25
20 4 18
50 3 11
Effect of Sample Period on Estimation Error for Mean Returns
Estimation Error
Richard R. Lindsey 341
Effect of Sample Period on Estimation Error (%) for Variance
Estimation Estimation Frequency
Period yrs Daily Weekly Monthly Quarterly
1 0.35 0.79 1.71 3.27
5 0.16 0.35 0.74 1.30
10 0.11 0.25 0.52 0.91
20 0.08 0.18 0.37 0.64
50 0.05 0.11 0.23 0.40
Bibliography
Richard R. Lindsey 361
Blundell and Ward, Property Portfolio Allocation: A Multifactor
Model, Land Development Studies, 1987.
Chan and Hussey, Marginal Contribution to the Sharpe Ratio,
Northwater Capital Management Inc., January 2009.
Chow, Jacquier, Kritzman, and Lowry, Optimal Portfolios in
Good Times and Bad, Financial Analysts Journal, 1999.
Scholes and Williams, Estimating Beta from Nonsynchronous
Data, Journal of Financial Economics, 1977.
Stevens, On the Inverse of the Covariance Matrix in Portfolio
Analysis, Journal of Finance, 1998.
Bibliography
Richard R. Lindsey 362
Campbell, Lo, and MacKinlay, The Econometrics of
Financial Markets, Princeton University Press, 1997.
Jorion, Mean Variance Analysis of Currency Overlays,
Financial Analysts Journal, 1994.
Risk Revisited
Richard R. Lindsey 386
So far we have often relied on an assumption (or
presumption) of normal returns. But we know that asset
returns are not normal and, therefore, the mean and
variance do not fully describe the characteristics of the
joint asset return distribution. Specifically, the risk and
the undesirable outcomes associated with the portfolio
cannot be adequately captured by the variance.
Lets spend a bit of time looking at alternative portfolio risk
measures that are sometimes used in practice.
Risk Revisited
Richard R. Lindsey 387
Generally speaking, there are two different types of risk
measures:
1. Dispersion Measures: consider both positive and
negative deviations from the mean, and treat those
deviations as equally risky.
2. Downside Measures: maximize the probability that the
portfolio return is above a certain minimal acceptable
level known as the benchmark or disaster level.
Dispersion: Standard Deviation
Richard R. Lindsey 388
Of course, the best known and most used dispersion
measure is (for historical reasons) the foundation of
modern portfolio theory standard deviation
1/2
1/2 2
( )
p i ii i j ij
i i j i
w w w ww o o o
=
(
'
= E = +
(
(
Dispersion: Mean-Absolute Deviation
Richard R. Lindsey 389
The mean-absolute deviation or MAD approach doesnt use
squared deviations, but absolute deviations
Where
And r
i
is the return on the asset and
i
is the expected return
on the asset.
( )
p i i i i
i i
MAD r E wr w
(
=
(
p i i
i
r wr =
c
>
(
= + (
(
}
( , ) F w
c
(1 )
VaR ( ) w
( , ) F w
c
( , ) F w
c
(1 )
CVaR ( ) w
Downside: Conditional Value at Risk
Richard R. Lindsey 413
So we can find the optimal value of by
solving the optimization problem
If we denote as the solution to this optimization
problem, then is the optimal CVaR.
The optimal portfolio is given by and the corresponding
VaR is given by .
In other words, we can compute the optimal CVaR without first calculating
VaR.
(1 )
CVaR ( ) w
,
min ( , )
w
F w
c
* *
( , ) w
* *
( , ) F w
c
*
w
*
c
=
(
+
`
(
)
c
=
(
+
`
(
)
0, 1, ,
i
z i T > =
( , ) , 1, ,
i i
z f w y i T > =
max(( ( , ) ), 0)
i
f w y
Downside: Conditional Value at Risk
Richard R. Lindsey 417
Under the assumption that f(w,y) is linear in w, the above
optimization is linear and can be solved using standard
linear programming techniques.
Downside: Conditional Value at Risk
Richard R. Lindsey 418
This representation of CVaR can also be used to construct
other portfolio optimization problems. For example, the
mean-CVaR optimization problem
Subject to
Along with other constraints on w written as
max
w
w
'
(1 ) 0
CVaR ( ) w c
c
s
w
w C e
Downside: Conditional Value at Risk
Richard R. Lindsey 419
Results in the following
Subject to
max
w
w
'
0
1
1
T
i
i
z c
T
c
=
(
+ s
(
0, 1, ,
i
z i T > =
( , ) , 1, ,
i i
z f w y i T > =
w
w C e
Downside: Conditional Value at Risk
Richard R. Lindsey 420
Palmquist, Uryasev, and Krokhmal provide us with an
example of the mean-CVaR approach.
They considered two-week returns for all of the stocks in the
S&P 100 from July 1, 1997 to July 8, 1999 for scenario
generation. Optimal portfolios were constructed solving
the mean-CVaR optimization approach for a two-week
horizon at different levels of confidence.
Downside: Conditional Value at Risk
Richard R. Lindsey 421
Note risk is the percent of the portfolio allowed to be put at risk.
Downside: Conditional Value at Risk
Richard R. Lindsey 422
It can be shown that for a normally distributed loss function,
the mean-variance and mean-CVaR frameworks generate
the same efficient frontier. However, when distributions
are non-normal, these two approaches can be significantly
different.
M-V optimization relies on deviations on both sides of the
mean, while M-CVaR relies only on the part of the
distribution which contributes to high losses.
Downside: Conditional Value at Risk
Richard R. Lindsey 423
Bibliography
Richard R. Lindsey 424
Artzner, Delbaen, Eber, and Heath, Coherent Measures of Risk,
Mathematical Finance, 1999.
Grootveld and Hallerbach, Variance Verses Downside Risk: Is
There Really That Much Difference?, European Journal of
Operational Research, 1999.
Krokhmal, Palmquist, and Uryasev, Portfolio Optimization with
Conditional Value-At-Risk Objective and Constraints, Journal
of Risk, 2002.
Markowitz, Portfolio Selection, Journal of Finance, 1952.
Rockafellar and Uryasev, Optimization of Conditional Value-At-
Risk, Journal of Risk, 2000.
Roy, Safety-First and the Holding of Assets, Econometrica,
1952.
Uryasev, Conditional Value-At-Risk: Optimization Algorithms
and Applications, Financial Engineering News, 2000.
Asset Allocation
Allocation between asset classes accounts for the major
portion of risk and return in a portfolio
Selection of specific instruments is a decision with smaller
influence on portfolio performance
Asset Allocation should consider all financial aspects
Current and future wealth, income, and financial needs
Financial goals
Taxes and tax advantaged investments
Liquidity (for unexpected needs)
Investors (all types) need customized strategies
426 Richard R. Lindsey
k
T
x
1
, 1, ,
k
T
W k K
=
Practical Utility
Richard R. Lindsey 451
Represent utility as a piecewise exponential function with K
pieces represents a certain absolute risk aversion
i
where
i = 1,, K
Let be discrete wealth levels representing
the borders of each piece i, such that below the risk
aversion is
i
and above (until ) the risk aversion is
i+1
for all i = 1,, K.
For each piece i represent utility by an exponential function
, 1, ,
i
W i K
i
W
i
W
1
i
W
i i
W
i i i i
U W a b e
Practical Utility
Richard R. Lindsey 452
With a first derivative with respect to wealth
The
i
are chosen to represent the desired function of risk
aversion verses wealth.
The coefficients of the exponential functions for each piece i
are found by matching both the function values and the
first derivatives at the intersections . In other words, we
fit an spline function.
i i
i i W
i i
i
U W
b e
W
i
W
Practical Utility
Richard R. Lindsey 453
Thus at each wealth level , representing the border
between risk aversion
i
and
i+1
, we have the following
two equations
From which we calculate the coefficients (setting a
1
= 0 and
b
1
= 1)
1
1 1
i i i i
W W
i i i i
a b e a b e
1
1 1
i i i i
W W
i i i i
b e b e
1
( )
1
1
i i i
i W
i i
i
b b e
1
1
1
i i
i W
i i i
i
a a b e
Practical Utility
Richard R. Lindsey 454
Example 1
Richard R. Lindsey 455
Current wealth $100,000
Cash contributions (savings) of $15,000 per year
20 year investment horizon
US Stocks, International Stocks, Corporate Bonds,
Government Bonds, and Cash
Example 1
Richard R. Lindsey 456
US Stocks Int Stocks Corp Bonds Gvt Bonds Cash
Mean 10.80 10.37 9.49 7.90 5.61
Std 15.72 16.75 6.57 4.89 0.70
Example 1
Richard R. Lindsey 457
Four utility functions
A: exponential, absolute risk aversion = 2
B: Increasing relative risk aversion and decreasing absolute risk
aversion
2.0 @ W of $0.25M and below, increasing to 3.5 @ Wof $3.5 and above
C: Decreasing relative risk aversion and decreasing absolute
risk aversion
8.0 @ W of $1.0M and below, decreasing to 1.01 @ Wof $1.5M and above
D: Quadratic (downside)
Quadratic with linear penalty of 1000 for underperforming $1.0M
Recall from Lecture 2
Richard R. Lindsey 458
Example 1
Richard R. Lindsey 459
Utility CEW Mean Std 99% 95%
Exponential 1.412 1.564 0.424 0.770 0.943
Increasing RRA 1.440 1.575 0.452 0.771 0.937
Decreasing RRA 1.339 1.498 0.436 0.865 0.998
Quadratic 0.982 1.339 0.347 0.911 1.006
Example 1
Richard R. Lindsey 460
Exponential Increasing RRA
Quadratic
Decreasing RRA
Example 1
Richard R. Lindsey 461
57.4
16.9
25.7
0
0
Exponential
US Stock
Int Stock
Corp Bonds
Gvmt Bonds
Cash
34
13.7
52.3
0
0
Increasing RRA
US Stock
Int Stock
Corp Bonds
Gvmt Bonds
Cash
10.6
10
67.2
12.2
0
Decreasing RRA
US Stock
Int Stock
Corp Bonds
Gvmt Bonds
Cash
53.2
16.4
30.4
0 0
Quadratic
US Stock
Int Stock
Corp Bonds
Gvmt Bonds
Cash
Example 1
Richard R. Lindsey 462
Exponential
Example 1
Richard R. Lindsey 463
Exponential
Example 1
Richard R. Lindsey 464
Exponential
Example 1
Richard R. Lindsey 465
Exponential: 1 to go
Example 1
Richard R. Lindsey 466
Exponential: 10 to go
Example 1
Richard R. Lindsey 467
Exponential: 19 to go
Example 1
Richard R. Lindsey 468
Increasing RRA
Example 1
Richard R. Lindsey 469
Increasing RRA
Example 1
Richard R. Lindsey 470
Increasing RRA
Example 1
Richard R. Lindsey 471
Increasing RRA: 1 to go
Example 1
Richard R. Lindsey 472
Increasing RRA: 10 to go
Example 1
Richard R. Lindsey 473
Increasing RRA: 19 to go
Example 1
Richard R. Lindsey 474
Decreasing RRA
Example 1
Richard R. Lindsey 475
Decreasing RRA
Example 1
Richard R. Lindsey 476
Decreasing RRA
Example 1
Richard R. Lindsey 477
Decreasing RRA: 1 to go
Example 1
Richard R. Lindsey 478
Decreasing RRA: 10 to go
Example 1
Richard R. Lindsey 479
Decreasing RRA: 19 to go
Example 1
Richard R. Lindsey 480
Quadratic
Example 1
Richard R. Lindsey 481
Quadratic
Example 1
Richard R. Lindsey 482
Quadratic
Example 1
Richard R. Lindsey 483
Quadratic: 1 to go
Example 1
Richard R. Lindsey 484
Quadratic: 10 to go
Example 1
Richard R. Lindsey 485
Quadratic: 19 to go
Example 2
Richard R. Lindsey 486
Now compare these dynamic strategies with six fixed-mix
strategies.
US stocks only
Cash only
All asset classes equally weighted
Risk averse (conservative)
Medium risk (dynamic)
Risk prone (aggressive)
With the exception of equally weighted asset classes, all
strategies are the solution of the single period Markowitz
optimization.
Example 2
Richard R. Lindsey 487
Example 2
Richard R. Lindsey 488
Strategy Mean Std 99% 95%
US stocks 1.825 1.065 0.469 0.660
Cash 0.868 0.019 0.822 0.834
Equally weighted 1.349 0.301 0.799 0.920
Risk Averse 1.098 0.110 0.869 0.930
Medium Risk 1.538 0.407 0.825 0.975
Risk Prone 1.663 0.639 0.677 0.852
Example 2 CEW Improvement
Richard R. Lindsey 489
Exponential Increasing
RRA
Decreasing
RRA
Quadratic
US stocks 9.61% 7.17% 96.12% 12.06%
Cash 62.79% 66.04% 56.08% 13.36%
Equally wtd 11.10% 12.30% 14.56% 2.03%
Risk averse 29.93% 32.42% 27.45% 1.03%
Medium risk 0.55% 0.76% 0.62% 1.19%
Risk Prone 1.63% 0.44% 23.72% 4.81%
Bibliography
Richard R. Lindsey 490
Hakansson, On Myopic Portfolio Policies, With and Without
Serial Correlation of Yields, Journal of Business, 1971.
Infanger, Dynamic Asset Allocation Strategies Using a
Stochastic Dynamic Programming Approach, in Handbook of
Asset and Liability Management, Volume 1, Zenios and
Ziemba eds., 2006.
Merton, Lifetime Portfolio Selection Under Uncertainty: the
Continuous-time Case, Review of Economics and Statistics,
1969.
Mossin, Optimal Multiperiod Portfolio Policies, Journal of
Business, 1968.
Samuelson, Lifetime Portfolio Selection by Dynamic Stochastic
Programming, Review of Economics and Statistics, 1969.
Characteristic Portfolios
Richard R. Lindsey 492
Consider a single period problem with no rebalancing within
the period with the underlying assumptions:
There is a riskless asset
All first and second moments exist
It is not possible to build a fully invested portfolio that has zero
risk
The expected excess return on the fully invested portfolio with
minimum risk is positive.
Characteristic Portfolios
Richard R. Lindsey 493
Define a vector of asset attributes or characteristics (these
could be betas, expected returns, earnings-to-price ratios,
capitalization, membership in a an economic sector, etc.)
The exposure of portfolio to the attribute is .
1
2
N
a
a
a
a
p
w a
p
w
Characteristic Portfolios
4/14/2009
Richard R. Lindsey 494
The characteristic portfolio uniquely captures the defining
attribute.
Characteristic portfolio machinery connects attributes and
portfolios and to identify a portfolios exposure to an
attribute in terms of its covariance with the characteristic
portfolio.
The process works both ways, we can start with a portfolio
and find the attribute that the portfolio expresses most
effectively.
Characteristic Portfolios
Richard R. Lindsey 495
Proposition 1
1. For any non-zero attribute there is a unique portfolio that
has minimum risk and unit exposure to the attribute.
The weights of the characteristic portfolio are:
Characteristic portfolios are not necessarily fully
invested; they can have long and short positions, and
may have significant leverage.
1
1
a
a
w
a a
Characteristic Portfolios
Richard R. Lindsey 496
2. The variance of the characteristic portfolio is given by:
3. The beta of all assets with respect to the characteristic
portfolio is equal to
a
w
2
1
1
a a a
w w
a a
a
w a
2
a
a
w
a
Characteristic Portfolios
Richard R. Lindsey 497
4. Consider two attributes and with characteristic
portfolios and Let and be, respectively, the
exposure of portfolio to characteristic and the
exposure of portfolio to characteristic . The
covariance of the characteristic portfolios satisfies
a
w
d
w
d
w
a
w
a d
d
a
a
d
a
d
2 2
, a d d a a d
a d
Characteristic Portfolios
Richard R. Lindsey 498
5. If is a positive scalar, then the characteristic portfolio
of is . Because characteristic portfolios have
unit exposure to the attribute, if we multiply the attribute
by we will need to divide the characteristic portfolio
by to preserve unit exposure.
a
a
w
Characteristic Portfolios
Richard R. Lindsey 499
6. If characteristic is a weighted combination of
characteristics and , then the characteristic portfolio
of is a weighted combination of the characteristic
portfolios of and ; in particular, if
then
where
d f
a d f
a
d f
a
d f
2
2
2 2
f a
d a
a d f
d f
w w w
2 2 2
1
f f
d d
a d f
a
a
Characteristic Portfolios
Richard R. Lindsey 500
Proof
The holdings of the characteristic portfolio can be
determined by solving for the portfolio with minimum risk
given the constraint that the exposure to characteristic
equals 1.
The first order conditions are
Where is the Lagrange multiplier.
a
min s.t. 1 w w w a
1
0
w a
w a
Characteristic Portfolios
Richard R. Lindsey 501
The results are
And
Which proves item 1. Item 2 can be verified using and
the definition of portfolio variance. Item 3 can be verified
using the definition of beta with respect to portfolio P as
1
1
a
a
w
a a
1
1
a a
a
w
2
P P
w
Characteristic Portfolios
Richard R. Lindsey 502
For item 4, note and
Items 5 and 6 are straightforward.
2
2
{ }
{ }
ad a d
a d
a d
d a
w w
w w
a w
a
2
2
{ }
{ }
ad a d
a d
a a
a d
w w
w w
w d
d
Characteristic Portfolios
Richard R. Lindsey 503
Example 1:
Suppose is the attribute. Every
portfolios exposure to measures the extent of its
investment if then the portfolio is fully invested.
Portfolio C, the characteristic portfolio for attribute , is
the minimum-risk fully invested portfolio:
1 1 1
1
P
w
Characteristic Portfolios
Richard R. Lindsey 504
Note every asset has a beta of 1 with this portfolio; and the
covariance of any fully invested portfolio with C is .
1
1
2
1
2
1
C
C C C
C
C
w
w w
w
2
C
Characteristic Portfolios
Richard R. Lindsey 505
Example 2
Suppose beta is the attribute, where beta is defined by some
benchmark portfolio B
Then the benchmark is the characteristic portfolio of beta
2
B
B
w
Characteristic Portfolios
Richard R. Lindsey 506
So the benchmark is the minimum-risk portfolio with a beta
of 1.
Note that the relationship between portfolios C and B is
1
1
2
1
1
B
B B B
w w
w w
2 2
BC B C C B
Characteristic Portfolios
Richard R. Lindsey 507
Proposition 2
Let q be the characteristic portfolio of the characteristic
(expected excess returns)
Then
a. The Sharpe ratio is
f
1
1
q
f
w
f f
1
1
2
max{ | }
q P
SR SR P f f
Characteristic Portfolios
Richard R. Lindsey 508
b.
c.
2
1
1
1
q q
q
f w f
f f
2
q
q
q
q
q
w
f
w
SR
Characteristic Portfolios
Richard R. Lindsey 509
d. If is the correlation between portfolios P and q, then
e. The fraction of q invested in risky assets is given by
Pq
P Pq q
SR SR
2
2
C q
q
C
f
Characteristic Portfolios
Richard R. Lindsey 510
Proof
For any portfolio , the Sharpe ratio is . For
any positive constant , the portfolio with holdings
will also have a Sharpe ratio equal to . Thus, to find
the maximum Sharpe ratio, we can set the expected excess
return to 1 and minimize risk. We can then minimize
subject to the constraint that . This is just the
problem we solved to get , the characteristic portfolio
of .
Items b and c are properties of the characteristic portfolio.
P
w
P P P
SR f
P
w
P
SR
q
w
f
B B
w w
1 w f
Characteristic Portfolios
Richard R. Lindsey 511
For d, we use c:
And e follows from Proposition 1, item 4.
P P
P
P P
q
P
q
P q
P q
q Pq q
P q
f w f
SR
w
w
SR
w w
SR SR
Characteristic Portfolios
Richard R. Lindsey 512
Proposition 3
Assume
1. Portfolio q is net long
Let portfolio Q be the characteristic portfolio of .
Portfolio Q is fully invested with holdings
In addition SR
Q
=SR
q
, and for any portfolio P with a
correlation with portfolio Q, we have
0
C
f >
q
f
0
q
i >
Q q q
w w i =
PQ
P PQ Q
SR SR =
Characteristic Portfolios
Richard R. Lindsey 513
2.
Note that this specifies exactly how Portfolio Q explains
expected returns.
3.
2 2
Q
C
C Q
f
f
o o
=
wrt
2
Q
Q Q Q
Q
w
f f f |
o
| |
E
= =
|
|
\ .
2
2
B Q
Q
Q B
f
f
o
|
o
=
Characteristic Portfolios
Richard R. Lindsey 514
4. If the benchmark is fully invested, , then
1
B
i =
C B
Q
C
f
f
|
| =
Characteristic Portfolios
Richard R. Lindsey 515
Portfolio A (characteristic portfolio for alpha)
Define alpha as . Let be the characteristic
portfolio for alpha, the minimum risk portfolio with alpha
of 100% (note that this portfolio will have significant
leverage). According to Proposition 1, item 6, we can
express in terms of and . From item 4, we see
that the relationship between alpha and beta is
However, by construction, so portfolios A and B are
uncorrelated and
B
f f
A
w
A
w
B
w
q
w
2 2
, B A B A A B
0
B
0
A
Characteristic Portfolio of Alpha
Richard R. Lindsey 516
Consider the characteristic portfolio for alpha where
Is the vector of forecasted expected residual returns, where
the residual is relative to the benchmark portfolio. Since
the alphas are forecasts of residual return, both the
benchmark and the riskless asset have alphas of zero.
The portfolio weights are
1 2 N
1
1
A
w
Characteristic Portfolio of Alpha
Richard R. Lindsey 517
Portfolio A has an alpha of 1, and it has minimum
risk among all portfolios with that property. The variance
of portfolio A is
In addition, we can define alpha in terms of Portfolio A
1
A
w
2
1
1
A A A
w w
2
A
A
w
Alpha
Richard R. Lindsey 518
Looking forward (ex ante), a is a forecast of residual return.
Looking backward (ex post), a is the average of the realized
residual returns.
The term alpha (just like beta) comes from the use of linear
regression
The residual returns from this regression are
Realized alphas are for keeping score the job of an active manager is to
score for that you need to forecast alpha
( ) ( ) ( )
P P P B P
r t r t t o | c = + +
( ) ( )
P P P
t t u o c = +
Alpha
Richard R. Lindsey 519
Looking into the future, alpha is a forecast of residual return
Note that by definition, the benchmark portfolio always has
a residual return of 0. Therefore the alpha of the
benchmark portfolio must also be 0.
Similarly, the residual returns for a riskless portfolio is also
0 and its alpha must be 0.
| |
n n
E o u =
Information Ratio
Richard R. Lindsey 520
While is the primary measure of a portfolios excess
return, another metric, the information ratio, is often used
by professionals.
The information ratio adjusts the for the portfolios
residual risk and is written:
P
is predicted alpha;
P
is the predicted standard deviation
of the residual.
Typically, we consider the ex-ante information ratio for making decisions and
the ex-post information ratio for performance evaluation.
P
P
IR
o
e
=
Information Ratio
Richard R. Lindsey 521
If
P
is 0, we set IR
P
equal to 0, and, in general, we define
the information ratio IR as the largest possible value of
IR
P
given alphas {
n
}
max |
p
IR IR o ( =
Information Ratio
Richard R. Lindsey 522
Now, returning to Portfolio A (the characteristic portfolio for
alpha), we note that it has several interesting properties
Proposition 4
1. Portfolio A has zero beta; therefore it typically has long
and short positions
2. Portfolio A has the maximum information ratio
0
A A
w | |
'
= =
1
for all
A P
IR IR IR P o o
'
= = E >
Information Ratio
Richard R. Lindsey 523
3. Portfolio A has total and residual risk equal the inverse of
IR.
4. Any portfolio P that can be written as
has IR
P
= IR.
1
A A
IR
e o = =
with 0
P P B P A P
w w w | o o = + >
Information Ratio
Richard R. Lindsey 524
5. Recall Portfolio Q the characteristic portfolio of ).
This portfolio is a mixture of the benchmark and portfolio
A:
With and
Therefore IR
Q
= IR. The information ratio of Portfolio Q
equals that of Portfolio A.
q
f
Q Q B Q A
w w w | o = +
2
2
B Q
Q
Q B
f
f
o
|
o
=
2
2
Q
Q
Q A
f
o
o
e
=
Information Ratio
Richard R. Lindsey 525
6. Total holdings in risky assets for Portfolio A are
7. Let be the residual return on any portfolio P. The
information ratio of portfolio P is
2
2
C A
A
C
o e
i
o
=
{ , }
P Q P Q
IR IR Corr u u =
P
u
Information Ratio
Richard R. Lindsey 526
8. The maximum information ratio is related to portfolio Qs
maximum Sharpe ratio
9. Alpha can be represented as
So alpha is directly related to the marginal contribution to
residual risk by the information ratio.
Q Q
Q Q
IR SR
o e
e o
| |
= =
|
\ .
MCRR
A
Q
A
w
IR IR o
e
| | E
= =
|
\ .
Information Ratio
Richard R. Lindsey 527
10. The Sharpe ratio of the benchmark is related to the
maximal information ratio and Sharpe ratio
2 2 2
B
SR SR IR =
Fundamental Law of Active
Management
Richard R. Lindsey 528
A portfolio manager applies quantitative analysis to market
data to find and exploit the opportunities for excess return
hidden in market inefficiencies.
Quantitative analysis opens up the possibility of statistical
arbitrage if the methods and models used combine all
available information efficiently.
This is illustrated within the framework of the fundamental
law of active management (Grinold 1989; Grinold & Kahn
1997).
Fundamental Law of Active
Management
Richard R. Lindsey 529
The fundamental law states that the information ratio (IR) is
the product of the information coefficient (IC) and the
square root of breadth (BR)
Breadth is defined as the number of independent forecasts of
exceptional return (think of breadth as the number of
independent factors for which you make forecasts).
The information coefficient is the correlation of each
forecast with the actual outcomes (here assumed to be the
same for all forecasts).
IR IC BR =
Fundamental Law of Active
Management
Richard R. Lindsey 530
This equation says that a higher information ratio can be
achieved by increasing the information coefficient or by
increasing the breadth.
IC can be increased by finding factors that are more
significant than those that are already in the model.
BR can be increased by finding more factors that are
uncorrelated (or relatively uncorrelated) with the existing
factors in the model.
Fundamental Law of Active
Management
Richard R. Lindsey 531
Generally, for quantitative portfolio management, we use a
model something like
The fundamental law basically assesses how well our model
explains stock-return process, and it expresses the
equations goodness of fit as the product of the number of
explanatory variables and each variables average
contribution.
1 1 2 2 it i i t i t iK Kt it
r f f f o | | | c = + + + + +
Fundamental Law of Active
Management
Richard R. Lindsey 532
While the fundamental law can be expressed in different ways,
there are certain general facts which always hold:
1. IR
2
approximately equals the goodness of fit (R
2
) of the
forecasting equations.
2. The breadth is the number of explanatory variables in the
forecasting equations.
3. IC
2
is the average contribution of each explanatory variable
in increasing R
2
4. When the benchmark is ignored and the risk-free rate is
subtracted from the portfolio returns, IR is essentially the
maximum Sharpe ratio one can achieve and the fundamental
law decomposes the maximum Sharpe ratio into the number
of explanatory variables and their average contribution.
Bibliography
Richard R. Lindsey 533
Chincarini and Kim, Quantitative Equity Portfolio
Management, 2006.
Grinold, The Fundamental Law of Active Management,
Journal of Portfolio Management, 1989.
Grinold and Kahn, Active Portfolio Management, 2000.