Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Cutting edge
Risk.net April 2016 Refining Markowitz
T
he concept of correlations between different assets is a cor- on volatilities, we standardise these returns as follows: (i) we remove
nerstone of Markowitzs optimal portfolio theory, especially the sample mean of each asset; (ii) we normalise each return by an
for risk management purposes (Markowitz 1968). In a nut- estimate O it of its daily volatility, rQit WD rit =O it . There are many pos-
shell, correlations measure the tendency of different assets sible choices for O it , based on, for example, Garch or Figarch models
to vary together, and it is well known that large losses at a portfolio historical returns, or simply implied volatilities from option markets,
level are indeed mostly due to correlated moves of its constituents (see, and the reader can choose his/her favourite estimator which can easily
for example, Bouchaud & Potters 2003). Efcient and robust diversi- be combined with the correlation matrix cleaning schemes discussed
cation is needed to alleviate such events. Markowitzs theory is the below. For simplicity, we have chosen here the cross-sectional daily
simplest and best known method for constructing a portfolio with a volatility, that is: sX
given level of risk, using the correlations between assets as inputs. O it D 2
rjt
However, the Markowitz solution results in over-allocation on low j
variance modes (ie, eigenvectors) of the correlation matrix. It is there-
to remove a substantial amount of non-stationarity in the volatilities.
fore of crucial importance to use a correlation matrix that faithfully
The nal standardised return matrix X D .Xit / 2 RN T is then
represents future, and not past, risks otherwise the over-allocation on given by Xit WD rQit =i , where i is the sample estimator of the
spurious low risk combinations of assets might prove disastrous (see, volatility of rQi , which is now, to a rst approximation, stationary.
for example, Bouchaud & Potters (2011) for a detailed discussion of The most common (and simplest) estimator of the true underlying
this point). correlation matrix (that we henceforth denote by C ) is to use the sample
The question of building reliable estimators of covariance or of estimator:
correlation matrices has a long history in nance, and more generally 1
E WD XX (1)
in multivariate statistical analysis. The problem comes from the high- T
dimensionality of these matrices, as in many applications. When the We will use the following notation throughout:
size of the time series T is very large, each of the coefcients of the N
X
covariance/correlation matrix can be estimated with negligible error (if ED k uk u (2)
k
is assumed not to vary with time). But if N is also large and of the order kD1
of T , as is often the case in nance, the large number of noisy variables
for the eigenvalues 1 > 2 > > N > 0 and the associated
creates important systematic errors in the computation of the inverse
eigenvectors u1 ; u2 ; : : : ; uN of E . When q D N=T ! 0, ie, when
of the matrix, which is a direct input of Markowitzs optimisation
the data set is very long, one expects that E ! C , whereas impor-
formula. These systematic errors lead to sub-optimal portfolios with
tant distortions survive when q D O.1/, even if T ! 1. In fact,
grossly underestimated out-of-sample risk (Bouchaud & Potters 2011).
since the work of Marcenko & Pastur (1967) decades ago, one can
The aim of this note is to provide the reader with a short review of
show that the spectrum of E is a broadened version of that of C ,
the different cleaning recipesthat have been proposed in the literature
with an explicit q-dependent formula relating the two. In particular,
to cope with the problem of in-sample noise in the determination of
small i s are too small and large i s are too large compared to the
correlation matrices. While some of these recipes are simple and well
true eigenvalues C . Thus, recalling that Markowitzs optimal strat-
known, recent progress has been made that allows one to derive an
egy overweight low variance modes (see, for example, Bouchaud &
optimal and fully observable estimator of the true underlying corre-
Potters 2011), we understand why using the sample estimator E can
lation matrix, valid in the limit of large matrices. We want to popularise
lead to disastrous results, and why some cleaning procedure should
these new results and test the corresponding estimators on real nan-
be applied to E before any application to portfolio construction. Note
cial data, with very satisfactory benets: both the realised risk and
that the Marcenko and Pastur results do not require multivariate nor-
risk-adjusted returns of synthetic portfolios are improved for a wide
mality of the returns, which can have fat-tailed distributions. In fact,
class of investment strategies. the above normalisation by the cross-sectional volatility can be seen
as a proxy for a robust estimator of the covariance matrix (see, for
Setting the stage example, Couillet, Kammoun & Pascal 2016).
We consider a universe made of N different nancial assets that we
observe at (say) the daily frequency, dening a vector of returns Five cleaning recipes
r t D .r1t ; r2t ; : : : ; rN t / for each day t D 1; : : : ; T . Because the We list here ve estimators that have been proposed in the literature,
aim of the present study is to focus specically on correlations and not the last one being the most recent and, as we shall see in the next
54Reprintedrisk.net
from RiskDecember
April 2016 2015
Cutting edge investments: Portfolio management
Cutting edge investments: Portfolio management
section, also the most efcient.1 In the following, denotes a cleaned for small eigenvalues. For large N , the optimal rotational invariant
estimator of the true correlation matrix C and 2 0; 1 is a parameter estimator (RIE) of C reads:
that must be determined through optimisation or analytical arguments. N
X
1. Basic linear shrinkage: a linear combination of the sample RIE WD kRIE uk u
k
estimate and the identity matrix: kD1
p
where zk D k i= N is a complex number and k is the correction
1 Comparison of the dressed RIE (20) with the proxy (12) using
factor given in (17) in Box 1. Note that Ok is always greater than or 500 US stocks from 1970 to 2012
equal to kRIE : this allows one to correct nearly perfectly a systematic
downward bias in the trace of the RIE correlation matrix RIE . 4 0.45
RIE q = 0.5
Oracle 0.40
Test of the RIE on nancial data RIE q = 0.55
3 0.35
Interestingly, the oracle estimator (8) can be estimated empirically and
used to directly test the accuracy of the debiased RIE (10). The trick is 0.30
to remark that the oracle eigenvalues (8) can be interpreted as the true 0.25
2
risk associated to a portfolio whose weights are given by the ith eigen- 0.20
vector. Hence, assuming that the data generating process is stationary,
0.15
we estimate the oracle estimator through the realised risk associated 1
0.10
to such eigen-portfolios (Pafka & Kondor 2003). More precisely, we
divide the total length of our time series Ttot into n consecutive, non- 0.05
56Reprintedrisk.net
from RiskDecember
April 2016 2015
Cutting edge investments: Portfolio management
Cutting edge investments: Portfolio management
been factored out). We are condent that this method will become a with:
N
standard in the coming years; it also suggests various routes for fur- 1 X 1
sk .zk / D (16)
N zk j
ther improvements, including modelling a genuine evolution of the j D1
j k
correlation matrix itself, beyond measurement noise. 2. We evaluate at the same time for any k 2 1; N :
Jol Bun is a PhD candidate at the Universit Paris-Saclay and j1 q C qzk gmp .zk /j2
k D 2 (17)
k
at the Lonard de Vinci Ple Universitaire, and an analyst at
Capital Fund Management in Paris. Jean-Philippe Bouchaud where gmp .z/ is the Stieltjes transform of the (rescaled) Marcenko-
is chairman and chief scientist and Marc Potters is co-CEO Pastur density given by:
p p
and head of research of Capital Fund Management. We are z C 2 .q 1/ z N z C
indebted to R. Benichou, A. Beveratos, R. Chicheportiche, A. gmp .z/ D (18)
2qz 2
Rej, E. Sri and G. Simon for many insightful discussions and with: p
help in the preparation of this work. We also acknowledge use- 1C q 2 N
C D N p ; 2 D p (19)
ful conversations with R. Couillet on robust estimators. Finally, 1 q .1 q/2
we warmly thank A. Knowles for his contribution to this topic where N is the smallest empirical eigenvalue.
that led to the paper Bun & Knowles (2016). 3. Our debiased optimal RIE for any k 2 1; N reads:
Email: joel.bun@gmail.com, (
k kRIE if k > 1
jean-philippe.bouchaud@cfm.fr, Ok D (20)
kRIE otherwise
marc.potters@cfm.fr.
REFERENCES
Bouchaud J-P and M Potters, Burda Z, J Jurkiewicz and Ledoit O and S Pch, 2011 Markowitz HM, 1968
2003 B Wacaw, 2005 Eigenvectors of some large Portfolio selection: efcient
Theory of nancial risk and Spectral moments of correlated sample covariance matrix diversication of investments,
derivative pricing: from statistical Wishart matrices ensembles volume 16
physics to risk management Physical Review E 71(2), 026111 Probability Theory and Related Yale University Press
Cambridge University Press Fields 151(1-2), pages 233264
Couillet R, A Kammoun and Marcenko
VA and LA Pastur,
Bouchaud J-P and M Potters, F Pascal, 2016 Ledoit O and M Wolf, 2003 1967
2011 Second order statistics of robust Improved estimation of the Distribution of eigenvalues for
Financial applications of random estimators of scatter: application covariance matrix of stock some sets of random matrices
matrix theory: a short review to GLRT detection for elliptical returns with an application to Matematicheskii Sbornik 114(4),
In The Oxford Handbook of signals portfolio selection pages 507536
Random Matrix Theory Journal of Multivariate Analysis Journal of Empirical Finance
Oxford University Press 143, pages 249274 10(5), pages 603621 Monasson R and D Villamaina,
2015
El Karoui N, 2008 Ledoit O and M Wolf, 2014
Bun J, J-P Bouchaud and Estimating the principal
Spectrum estimation for large Nonlinear shrinkage of the
M Potters, 2016 components of correlation
dimensional covariance matrices covariance matrix for portfolio
Cleaning large correlation matrices from all their empirical
using random matrix theory selection: Markowitz meets
matrices: tools from random eigenvectors
Annals of Statistics 36(6), goldilocks
matrix theory Europhysics Letters 112(5), 50001
pages 27572790 Available at SSRN 2383361
In preparation
Haff L, 1980 Pafka S and I Kondor, 2003
Bun J and A Knowles, 2016 Empirical Bayes estimation of Noisy covariance matrices and
An optimal rotational invariant the multivariate normal portfolio optimization II
estimator for general covariance covariance matrix Physica A: Statistical Mechanics
matrices Annals of Statistics 8, and its Applications 319,
In preparation pages 586597 pages 487494
58Reprintedrisk.net
from RiskDecember
April 2016 2015