Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
z z
2 1
1
1 2
0
1
0
1
( ) ( )
default risk
54: Credit : May 2000
Figure 5. Type I CAP curve
x
y
Perfect Model
(Ideal CAP)
Nave Model
(RandomCAP)
Performance
Differential Model being
evaluated
Figure 5: Type I CAP curve
Source: M oodys R isk M anagem ent Services
The dark curved line shows the performance of the model being evaluated.
It depicts the percentage of defaults captured by the model (vertical axis)
vs. the model score (horizontal axis). The heavy dotted line represents the
nave case (which is equivalent to a randomassignment of scores).
The grey dashed line represents the case in which the model is able to dis-
criminate perfectly and all defaults are caught at the lowest model output.
The grey region represents the performance differential between the nave
model and the model being evaluated.
p51p56.qxd 15/05/00 12:22 Page 54
m ost m odels
16
.
Table 1 show s A R values for the tested m odels for
in-sam ple
17
and validation tests. To confirm the validity of the
A R figures, w e also checked w hether a particular m odel differed
significantly from the one ranked im m ediately above it by cal-
culating KS statistics, using about 9,000 independent observa-
tions selected from the validation set. M ore precisely, KS tests
show ed that only the reduced Z-score and RO A w ere not sig-
nificantly different.
Conditional information entropy ratio
A different perform ance m easure is based on the inform ation
about defaults contained in the distribution of m odel scores, or
inform ation entropy (IE). Intuitively, the inform ation entropy
m easures the overall am ount of uncertaintyrepresented by
a probability distribution. In the sam e w ay w e reduced the C A P
plot to a single A R statistic, w e can reduce the inform ation
entropy m easures into another useful sum m ary statistic: the
C onditional Inform ation Entropy Ratio
18
(C IER ).
To calculate the C IER , w e first calculate the inform ation
entropyH
0
=H
1
(p) w ithout attem pting to control for any know l-
edge that w e m ight have about credit quality. H ere P is the
aggregate default rate of the sam ple and H
1
is the inform ation
entropy defined in the A ppendix
19
. This entropy reflects know l-
edge com m on to all m odels: the likelihood of the event given by
the probability of default. W e then calculate the inform ation
entropy H
1
(R) after having taken into account the risk scores
R ={R
1
,,R
N
}of the selected m odel. The C IER is defined as
20
If the m odel held no predictive pow er, the C IER w ould be 0.
In this case the m odel provides no additional inform ation on the
likelihood of default that is not already know n. If it w ere per-
fectly predictive, the C IER w ould be 1. In this case, there w ould
be no uncertainty about the default event and, therefore, per-
fect default prediction. Because C IER m easures the reduction
of uncertainty, a higher value indicates a better m odel. Table 2
show s the C IER results. C IER errors are of the order of 0.02 and
are obtained w ith a resam pling schem e sim ilar to the one
described for the A R statistic.
Mutual information entropy
To this point w e have been describing m ethods of com paring
m odels on the assum ption that the best perform ing m odel
w ould be adopted. H ow ever, it is not unreasonable to question
w hether a com bination of m odels m ight perform better than
any individual one. Tw o m odels m ay both predict 10 out of 20
defaulters in a sam ple of 1,000 obligors.
U nfortunately, this inform ation does not provide guidance on
w hich m odel to choose. If each m odel predicted a different set
of 10 defaulters, then using both m odels w ould have double the
predictive accuracy of either m odel individually
21
. In practice,
there is considerable overlap, or dependence, in w hat tw o m od-
els w ill predict for any given data sam ple.
To quantify the dependence betw een any tw o m odels A and
B , w e use a m easure called the m utual inform ation entropy
(M IE). The m utual inform ation entropy is a m easure of how
m uch inform ation can be predicted about m odel B given the
CIER R
H H R
H
( )
( )
=
0 1
0
default risk
55: Credit : May 2000
Figure 6. CAP curves for
Selected CAP curves
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
0% 20% 40% 60% 80% 100%
Population
Random
ROA
reduced Z'-score
Z'-score
Hazard Model
Merton Model Variant
Moody's Model
Figure 6: CAP curves for the tested models
Source: M oodys R isk M anagem ent Services
This composite figure shows the CAP curves for six models. All models
were tested on the same data set. The 45 dashed grey line represents
the nave (which is equivalent to a randomassignment of scores).
Note that Moody's model performs better than the Merton model variant
at discriminating defaults in the middle ranges of credits.
In-sample AR Validation AR
ROA
Reduced Z-Score
Z -Score
Hazard model
Merton Model Variant
Moodys Model
Source: M oodys R isk M anagem ent Services
Table 1: Selected Accuracy Ratios
0.53 0.53
0.56 0.53
0.48 0.43
0.59 0.58
0.67 0.67
0.76 0.73
In-sample CIER Validation CIER
ROA
Reduced Z-Score
Z -Score
Hazard model
Merton Model Variant
Moodys Model
Source: M oodys R isk M anagem ent Services
Table 2: Selected Entropy Ratios
0.06 0.06
0.10 0.09
0.07 0.06
0.11 0.11
0.14 0.14
0.21 0.19
p51p56.qxd 15/05/00 12:22 Page 55
output of m odel A . M IE is defined as
w here r and Rare the risk score sets of m odels A and B respec-
tively, and H
2
(r,R) is the joint entropy defined in the A ppendix.
Because the M IE is calculated w ith the joint conditional distri-
bution of m odels A and B, this m easure requires a large num -
ber of defaults to be accurate. W hen default data are not w idely
available, this requirem ent can be relaxed by including reliable
degrees of credit quality, such as agency ratings, instead of
defaults only.
If m odels A and B are independent, the m utual inform ation
entropy is zero, w hile if m odel B is com pletely dependent on
m odel A then M IE = 1-C IER (A ). The additional uncertainty gen-
erated by m odel B can be estim ated by com paring w ith the
uncertainty generated by m odel A alone. In this context, the
statistic serves m uch the sam e function as a correlation coeffi-
cient in a classic regression sense. H ow ever, the M IE statistic
is based on the inform ation content of the m odels.
Table 3 show s the difference D = M IE(A ,B ) M IE(A ,A ),
w here A is M oodys m odel and B is any of the other selected
m odels. In this exam ple, w e have com pared all the benchm ark
m odels to M oodys m odel to determ ine if they contain redun-
dant inform ation.
Summary
The benefits of im plem enting and using quantitative risk m od-
els cannot be fully realised w ithout an understanding of how
accurately any given m odel represents the dynam ics of credit
risk. This m akes reliable validation techniques crucial for both
com m ercial and regulatory purposes.
In the course of our research into quantitative credit m odel-
ling, w e have found that sim ple statistics
22
(such as the num ber
of defaults correctly predicted) are often inappropriate in the
dom ain of credit m odels. As a result, w e have developed sev-
eral useful m etrics that give a sense of the value added by a
quantitative risk m odel.
The four such m easures presented here perm it analysts to
assess the am ount of additional predictive inform ation con-
tained in one credit m odel versus another. In situations w here
a specific m odel contains no additional inform ation relative to
another, the less inform ative should be discarded in favor of the
m ore inform ative. In the special case w here both m odels con-
tribute inform ation to each other, users m ay w ish to com bine
the tw o to garner additional insight. I
Jorge Sobehart is vice president, senior analyst, risk m an-
agem ent services at M oodys Investors Service
e-m ail: sobeharj@ m oodys.com
Sean Keenan is vice president, senior analyst, risk m anage-
m ent services at M oodys. Roger Stein is vice president, senior
credit officer, and director of quantitative m odelling analytics
A full version of this paper, including a m athem atical descrip-
tion of inform ation entropy and a full list of references, is avail-
able from the authors. C ontact +44 (0)20 7772 5454.
MIE r R
H
H r H R H r R ( , ) ( ) ( ) ( , ) = +
1
0
1 1 2
b g
default risk
56: Credit : May 2000
FOOTNOTES
1
See, for exam ple, H errity, Keenan, Sobehart, C arty and Falkenstein (1999).
2
Basel, op. cit., p. 50.
3
Accuracy m ay be only one of m any m easures of m odel quality. See D har
and Stein (1997).
4
In-sam ple refers to observations used to build a m odel. O ut-of-sam ple
refers to observations that are not included in the in-sam ple set. O ut-of-uni-
verse refers to observations w hose distribution differs from the in-sam ple pop-
ulation. O ut-of-tim e refers to observations that are not contem porary w ith the
in-sam ple set.
5
This presentation follow s closely that of D har and Stein (1998), Stein
(1999), and Keenan and Sobehart (1999), w ith additional clarifications from
Sobehart, Keenan and Stein (2000).
6
This case is particularly im portant w hen one type of error is m ore serious
than another. To illustrate, an error of tw o notches for an A a-rated credit is gen-
erally less costly than a sim ilar error for a B -rated credit.
7
See Sobehart, Keenan and Stein (2000).
8
The bootstrap (e.g., Efron, B. and R . J. Tibshirani (1993)), random isation
testing (e.g., Sprent, P. (1998)), and cross-validation (ibid.) are all exam ples of
resam pling tests.
9
See, for exam ple, H errity, Keenan, Sobehart, C arty and Falkenstein (1999).
10
See Keenan and Sobehart (1999).
11
For the definition of the original Z score and its various revisions Zsee
C aouette, A ltm an, N arayanan (1998).
12
For sim plicity w e selected the m odel based on Zm ijw eskis variables
described in Shum w ay (1998).
13
For this research, M oodys has adapted the M erton m odel (1974) in a
sim ilar fashion to w hich KM V has m odified it to produce their public firm m odel.
M ore specifically, w e calculate a D istance to D efault based on equity prices and
firm s liabilities. See also Vasicek (1984) and M cQ uow n (1993). For an exact def-
inition of M oodys distance to default m easure see Sobehart, Stein, M ikityan-
skaya and Li (2000).
14
H ere in-sam ple refers to the data set used to build M oodys m odel.
15
In fact, A R based on panel data sets w ill provide aggregated inform ation
about the tim e correlation of the risk scores.
16
D ue to the high levels of correlation in the resam pling, the m axim um
absolute deviation gives a m ore robust estim ate of an error range than a cor-
rected standard error.
17
H ere in-sam ple refers to the data set used to build M oodys m odel.
18
This is sim ilar to m easures such as gain ratios used in the inform ation
theory and tim e series analysis literature (see, for exam ple, Prichard and Theiler
(1995)). H ow ever, our definition m easures explicitly the uncertainty to predict
defaults instead of the overall uncertainty in the distribution of m odel outputs.
19
For additional details see Keenan and Sobehart (1999).
20
C IER = 1- IER , w here IER is the inform ation entropy ratio defined in H er-
rity, Keenan, Sobehart, C arty and Falkenstein (1999). H ere w e introduce C IER
for consistency w ith the concept of conditional entropy in Inform ation Theory
and C om m unication Theory.
21
O f course com bining the m odels could also create ancillary trade-offs
w ith respect to increased Type II error.
22
For an exam ple of a m ore standard approach to validation see: C aouette,
A ltm an and N arayanan (1998).
In-sample MIE In-sample D Validation MIE Validation D
ROA
Reduced Z-Score
Z -Score
Hazard model
Merton Model Variant
Moodys Model
Source: M oodys R isk M anagem ent Services
Table 3: Difference of Mutual Information Entropy
0.96 0.17 0.97 0.16
0.93 0.14 0.96 0.15
0.95 0.16 0.98 0.17
0.91 0.12 0.92 0.11
0.87 0.08 0.87 0.06
0.79 0 0.81 0
The additional uncertainty generated by a model can be estimated by
comparing it with the uncertainty generated by Moody's model alone.
Table 3 shows the difference D =MIE(A,B) - MIE(A,A), where A is
Moody's model and B is any of the other selected models.
p51p56.qxd 15/05/00 12:22 Page 56