Professional Documents
Culture Documents
1
We shall in the
ourse
over the most Estimated density of Danish fire data
important problems of this type in nan
e
and insuran
e. The presentation will be an
0.8
integrated one where simple mathemati
al
formulations, the use of histori
al data
and
omputation are brought together.
0.6
Examples
ome from non-life insuran
e
and nan
ial time series from the sto
k
0.4
and ele
tri
ity markets. The theoreti
al
on
epts presented will then be employed
in pra
ti
e and used to dis
uss modeling
0.2
and quanti
ation of real wold phenomena,
taking the view that most of what we
an
a
hieve by formal mathemati
al methods
0.0
2
21.1 25.6 25.9 30.3 30.8 or in re-insuran
e where u is the lower limit,
30.8 30.8 32.8 51.1 55.6 above whi
h the re-insurer is obliged to
57.2 65.0 69.5 82.5 103.9
ome in.
119.3 174.1 175.6 514.8 855.0
1210.6 An alternative measure of extreme risk
is the mean ex
ess fun
tion. To dene this
Table 1: Damages (in million NKR)
aused quantity we must invoke the
on
ept of
by natural disasters in Norway 1980 to 1999.
onditional probability and
onditional dis-
tribution. Introdu
e again a xed threshold
2.2 Model quantities u and
onsider the probability distribution
of X given that X ex
eeds u. This is a per-
Simple measures of extremes will now be in- fe
tly legitimate distribution and we denote
trodu
ed. We shall deal with the right tail it by referring to the
ondition X > u on
of the distribution, that is with large val- the left of a verti
al bar. With this notation
ues3 . The most
ommon des
ription of the the mean ex
ess fun
tion be
omes
risk of su
h phenomena is in terms quantiles.
If X is a random variable, then the upper p- eu = E (X ujX > u); (3)
quantile, denoted xp , is the value for whi
h
the probability is exa
tly p that X will ex- signifying how mu
h X on average is above
eed it, i.e. xp is the solution of the equation the threshold u if it is known to be larger.
Su
h quantities go into the so-
alled pure re-
P (X xp) = p: (1) insuran
e premiums and
ould also be rele-
vant for the pri
ing of
ertain options.
Extreme value methodology deals with the
estimation and
omputation of xp for small
values of p, say p = 0:05, 0:01 or even 0:001 2.3 Data quantities
and smaller. Quantiles are in statisti
s often All three measures (1), (2) and (3) have
alled per
entiles and are in nan
e known simple empiri
al
ounterparts. Suppose
as Value-at-Risks (VaR). x1; : : :; xn are histori
al observations of X .
If X is the damage of some type of industrial
An equally important
on
ept is that installation, then histori
al re
ords of earlier
of a threshold. We are then
on
erned with a
idents would be available, or if X is the
the probability that X ex
eeds a
ertain monthly gain (or loss) of some sto
k index,
given value u, i.e. in mathemati
al terms we
ould rely on experien
e of the
u
tu-
pu = P (X u): (2) ations of the index a
ertain time ba
k to
judge the likelihood of future extreme values.
This quantity
an be regarded as the inverse
of (1), and the two of them
an be derived For the quantile (1) we must rank the
from ea
h other. If we know xp for all p, observations, say in des
ending order
then we
an also nd pu for all thresholds x(1) x(2) : : : x(n) . Then x(pn) is the
u and the other way around, xp
an be empiri
al p-quantile (the analogy to xp ) and
found if all xu are known. The form pu is will be denoted
relevant to measure the risk of a nan
ial
portfolio falling below some lower threshold4 x^p = x(np): (4)
3
All inequalities in (1), (2) and (3) below must For the other two measures we must intro-
be swit
hed if the interest is in terms of the small du
e nu as the number of observations larger
values of X . than the threshold u. Then
4
We must then swit
h arguments to the other
tail, see footnote 3. p^u = nu =n; (5)
3
as the share of the observations ex
eeding the density f (x) = 1 exp( x) for x > 05.
threshold, is the natural way to estimate pu . The result automati
ally makes the Pareto
Moreover, the empiri
al parallel to eu is model a
andidate for extreme phenomena.
Sophisti
ated te
hniques based on this idea
e^u = (x(1) + : : : + x(nu ) )=nu u: (6) will be dis
ussed during the
ourse.
All three of x^p, p^u and e^u are statisti
al esti-
mates of xp , pu and eu , and they are
onsis- 2.5 The log-normal distribu-
tent in the sense that tion
x^p ! xp ; p^u ! pu ; e^u ! eu ; A number of other distributions suitable to
des
ribe extreme phenomena is reviewed in
as the number of observations n ! 1. It an appendix, but due to its high importan
e
may require prohibitively large values of n for in empiri
al nan
e it seems proper to dis-
these limits to materialize. However, when
uss the log-normal distribution here. This
x^p , p^u or e^u are applied to simulations, these model is dened by its relationship to the
large values are pre
isely what is en
oun- normal; i.e. X is log-normal (; ) if log(X )
tered; see Se
tion 5 for an example where is normal (; ). The expression for the prob-
the distribution of X is too
ompli
ated to ability density fun
tion is
ompute and where xp is obtained from sim-
ulations. f (x) = f2 2g 1=2
x 1 expf Q(x)g;
where
2.4 The Pareto distribution Q(x) = (log x )2=(2 2):
More sophisti
ated ways of estimating the The mean and the varian
e are
measures of extreme values utilize the Pareto
distribution. The probability density fun
- E (X ) = exp( + 2=2);
tion of this model is and
f (x) = ( 1 )=f1 + x=(r )g1+r ; x > 0: var(X ) = exp(2 + 2 )fexp( 2) 1g:
for positive parameters r and . Expressions In nan
e is known as the drift and as
for the mean and varian
e are the volatility.
E (X ) = r=(r 1);
3 Several variables
r 3
var(X ) = 2 ;
(r 1)2(r 2) 3.1 Preliminaries
The
on
ept of joint distributions is a
entral
whi
h are nite only if r > 2 (r > 1 for the one in all areas of appli
ations of sto
has-
mean). ti
modeling. In its simplest bivariate form6
The Pareto distribution has the rather 5
The exponential density
an be regarded as
surprising property that it be
omes the the limit of a Pareto model as r ! 1. The re-
only possibility for the ex
ess over large
sult
ited is proved in the book by Embre
hts,
thresholds. Suppose X follows a general Mi
kosh and Kuppelberg (1997). Stri
tly speak-
ing there is third possibility in addition to the
density fun
tion f and let fu be the
on- Pareto and the exponential, but this third model
ditional density fun
tion of X u given has properties that usually rules it out in pra
-
that X > u. Then, as u ! 1, fu either ti
e.
be
omes a Pareto density or the exponential 6
The only one to be
onsidered here.
4
The most widely used bivariate model
TOTX against SP, lag=one day TOTX against SP, lag=one week is the normal one. There is then a spe
i
expression for the probability density fun
-
0.10
• •
•
•• • • •• •••••• •• ••••
0.1
•• • ••••••••••• • •••••••••••••••••••••
•
•••••••••••• •••••• •••••
•••••••••••••••••••••••••••••••••••••••••••
•••••••••••••••
••••••••••••••••••••••••••••
••••••••••••••••• ••••
•••• ••••••
•••••••
••••••••
••••••
••••••••••
••
•••••
•
••••••••
••••
••••••••••••••• •• •••
••••••••
••
•••••••••
•••
•••••••••••••••••• •
••
••
•••••••••••• • • ••
•••••
••••••
•••••••••
•••••• •••••••••••••
•••••
•••••
••••••••••••••••
•••••••
•••••••••••
tion7 , but that is of no
on
ern to us here.
0.0
•••••••••• •••
••••••••• ••••••••• ••••••••• •••••••••••
•••••••
It is mu
h more
onvenient to dene the
••••
0.0
••••••••••• •••••••••••••
•••••• ••
•• ••••••••••••••
••••••••••••• •• ••••••••••••••••••••••••••••••
•••••• ••••••• •••
••••••
•••••••
••••••••••••••••• ••••• ••••••••••••••••
•• •• ••••••••••••••••••
••••
••••
•••••••
•••
••••••••••••••• •••••••••••••••••••• •
•••• ••••••• ••••
••••••
•••••
••
••••• •••••••••• •••••••••••••••••••••• •• • •
• ••••••••••••••••••••••••••••• •••••••••••••••••
•••••••• ••••••
TOTX
TOTX
• • • • • •
•• ••• •••••••••••••••• • • • ••• ••••• ••• •••••••••••••• •
• •
-0.1
•
joint normal model sto
hasti
ally. Let "1
• • • • •• •
• •
-0.10
• • • • •• •• •
•
• • •
•• •
•
and "2 be two independent normal random
-0.20
-0.3
0.2
X1 = 1 + 1"1 ;
0.1
0.1
••••••••••• •
•••••••••••••••• X2 = 2 + 2("1 + (1 2)1=2"2 );
-0.1 0.0
-0.1 0.0
•••••••••••••••••• • ••••••••
• •••••••
TOTX
TOTX
• •
•• ••••••••
• • ••
•
••
where i and i are the mean and standard
deviation for variable i and is their
orre-
•
-0.3
-0.3
-0.3 -0.2 -0.1 0.0 0.1 0.2 -0.3 -0.2 -0.1 0.0 0.1 lation. This type of denition is parti
ularly
Standard and Poor Standard and Poor
suitable for our purpose, as it shows how
Figure 2: Upper row: S
atter plots of daily dependent, normal random variables
an be
and weekly relative growth (log-s
ale )of simulated. It also immediately
lear that the
TOTX against the Standard and Poor 500 normal model for pairs of random variable is
index. Lower row: Regression
urves for a linear one, the relationships dening X1
the data in the upper row. and X2 being linear. The
onstru
tion of
normal models for an arbitrary number of
variables is similar, but needs matrix alge-
bra.
there are two random variables X1 and X2 ,
their distribution fun
tion being dened by
3.2 An example
F2 ( 1 x ; x2) = P (X1 x2 ; X2 x2):
Correlation between dierent variables is of
Note that the probability that both variables high interest in e
onomi
and nan
e. An
X1 and X2 are smaller than the limits x1 example is shown in Figure 2 where the daily
and x2 is spe
ied. and weekly relative growth of the Standard
and Poor (SP) nan
ial index and the total
Joint distribution fun
tions will be de- index (TOTX) from the Sto
k Ex
hange of
noted by bold fa
e (for example F2) to Oslo have been plotted against ea
h other on
distinguish from the marginal probability log-s
ale. Ea
h dot relates to the same day
distributions F1 and F2 for the two random (or week) for the two indexes. The period in
variables individually. The latter
an be question is from 1983 up to April 2000 and
obtained from the former sin
e there are altogether 4240 points in ea
h plot8
F2 (x1; x2) ! F1 (x1); as x2 ! 1;
F2 (x1; x2) ! F2 (x2); as x1 ! 1: It is not so easy to gauge the stru
ture
This shows that the notion of joint sto
has-
7
The joint distribution fun
tion F2(x1 ; x2) is
ti
models is a
on
ept ri
her in
ontents obtained from the probability density fun
tion
f2 (x1 ; x2 ) by integrating the latter over the in-
than the models for the individual variables nite re
tangle that runs up to x1 in one dire
tion
added together. The degree of
ovariation and x2 in the other.
of the two variables
omes in addition in F2 . 8
Some days or weeks had to be ex
luded due
to la
k of data.
5
eters were taken from the real data10. The
Normal, Lag=one day Normal, Lag=one week same number of points have been generated,
and exa
tly the same statisti
al methods
0.10
• •
0.4
Normal
•••••••••••••••••••••••••••••• •••••••••••••••••••••••••••••••••••••• •• •• ••••••••••••••••••••••••••••••••••••••••••• • •
••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• • •••
•• ••••••••••••••••••••••••••••••••••• ••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• •
0.0
••• ••••••••••••••••••••••••••••••••••••••••
2
• •• • •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••
••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• • •• • • • •• ••••••••••••••••••••••••••••••••••••••••••••••••• •••••••••••••••••••••••••••••••••• • •
turn out to be linear, but no linearity was
• • • • • • • •• • • • • • • • • •
• • • • • • • • • • • • •
• • •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• •• • • • • • • • • •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• • • •
• • ••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• • •• ••••••••••• ••••••••••••••• •••••••• • •
-0.2
• • • • •• •
Normal 2
0.0
••
•••
••
••••••••••••••••••••••••••••• •••
••••••••••
•
•
• • •••• ••••••• •••••••••
•••••••••
• •••
One of the simplest ways to extend the
-0.2
-0.4 -0.2
Normal 1
0.0 0.2 0.4 -0.10 -0.05
Normal 1
0.0 0.05 0.10
empiri
al phenomena su
h as those in 3.2
is undoubtedly to introdu
e the
on
ept
Figure 3: Upper row: S
atter plots of sim- of a latent variable. A nan
ial market
ulated daily and weekly relative growth a
- may in a bear or bull
ondition, but in
ording to the normal model. Lower row: reality one might envisage dierent degrees
Regression
urves for the data in the upper of su
h states. This view may be formulated
row. mathemati
ally by introdu
ing a latent
variable b, whi
h we
ould refer to as the
amount of bullness in the market, with,
say negative values for high bearness. Of
from the s
atter plots in the upper row of
ourse, su
h a variable is not observable
Figure 2. The lower row shows smoothed dire
tly, but its presen
e would be felt by
non-linear regression
urves for the TOTX
the a
tions of the parti
ipants and by the
index against SP9 Noti
e the steeper line pri
e movements of the nan
ial variables.
for the weekly data, whi
h
orresponds to
a higher
orrelation than in the other
ase. The model for X and X is now de-
Also note that the
orrelation apparently 1 2
s
ribed
onditionally on b. One possibility
is higher at the lower extremes. This is
observation is important for the judging the
risk redu
ing ee
t of spreading investment. E (Xijb) = i +
i b; i = 1; 2
If it is true that the
orrelation tends to go var(Xijb) = i exp( i b); i = 1; 2 (7)
up as the sto
k market is falling, the ee
t
orr(X1; X2jb) = =f1 + exp(b)g:
of diversi
ation would be smaller.
Here
1 ,
2 , 1 ,2 and are additional pa-
Su
h a phenomenon
an not be explained rameters that allow volatility and
orrelation
by normal models. Simulations from su
h to depend on the market
ondition. We shall
models are shown in Figure 3. The param- study the kind of behavior we
an generate
with this tool during the
ourse. As example,
9
These
urves has been obtained by a so-
alled non-parametri
te
hnique, where the re- 10
For the daily data standard deviations were
gression line has been found by lo
al straight lines 0:010 and 0:012 for SP and TOTX respe
tively
tted in windows around ea
h point on the hor- and the
orrelation
oeÆ
ient 0:18. For the
izontal axis. The supsmu program of Splus was weekly ones the similar numbers were 0:02, 0:03
used. and 0:38.
6
suppose all of
1 ,
2 , 1,2 and are posi-
tive and that b varies around zero (say as a The SP and TOTX indexes from 1983 to 2000
standard normal random variable). Then a
small b diminishes the returns (rst line in
1400
(7)) and in
reases volatility and
orrelation
(se
ond and third line).
1200
1000
3.4 Copulas
Index
800
TOTX
A more general, but less transparent way
of modeling dependen
e is through
opulas.
600
Suppose V1 and V2 are dependent random
400
variables so that ea
h one is uniformly dis-
Standard and Poor
dened on the quadrate with verti
es at the Figure 4: The Standard and Poor and
origin and at (0; 1), (1; 0) and (1; 1). For C TOTX nan
ial indexes from 1983 to 2000.
to be a valid distribution fun
tion it must
satisfy the
onditions
v ; 0) = 0;
C( 1 C(0; v2) = 0; is attra
tive. We know how to model single
v ; 1) = v1;
C( 1 C(1; v2) = v2 : variables empiri
ally. On
e that has been
done mu
h more
ompli
ated and subtle
The simplest su
h fun
tion would be types of dependen
ies may
on
eivably be
C (v1; v2) = v1 v2 , whi
h
orresponds to des
ribed mathemati
ally by spe
ifying the
independent random variables V1 and V2 ,
opula fun
tion. One possibility
ould be
but the point here is to allow dependen
e.
C(v1; v2) = expfQ(v1; v2)g;
7
4.2 Sto
hasti
volatility
Estimated volatility
One of the issues that has been raised in
modern empiri
al nan
e is whether the
volatility is
onstant over time. This
0.04
TOTX
8
instead of the real data, yielding
omputer
Elprice on log-scale Elprice: Weekday effect approximations of the tail quantities xp,
pu and eu . They
an be made arbitrary
Daily elprice (log-scale)
4.85
4
Monday Sunday
0.5
January December
0 100 200 300 1995 1996 1997 1998 1999 2000 2001
month; see Se
tion 4 above. On average
ea
h index is up
Day number Years
year is the average of two months of pri
es around 5.2 The general idea
t, one month on either side.
13
All variables that are
omputer simulations The prin
iple involved in the pre
eding ex-
are marked with a . ample is that of a me
hanism M produ
ing
9
^k obtained from histori
al data, and the
Predi
tion True Data history dis
repan
ies ^k k and ^k k are the
time ahead estim 2 years 10 years only sour
es of errors in the Value-at-Risk
VaR SD SD evaluations.
One month 0.982 0.003 0.002
One year 0.988 0.034 0.017 The ee
t of this
an be studied through
Five years 1.191 0.235 0.102 nested simulation s
hemes. The situation is
as shown:
Table 2: 5% value-at-risk for nan
ial port-
folio as des
ribed in the text, true value and D ! M^ ! m
X
ee
t (standard error) due to estimation er- "
ror Histori
al data M
On the upper row real histori
al data D are
an output X (= X when emphasizing the used to determine the estimated model M^ ,
simulation) or s
hemati
ally from whi
h m simulations are drawn for the
Value-at-Risk approximations. The estima-
M ! X (M = real world): tion pro
ess leading to M^
an also be simu-
lated. We then use the drift parameters and
Although the mathemati
al treatment of volatilities ^ and ^ we already have, to gen-
k k
this relationship was
ompli
ated, it
ould erate simulated histori
al data D and from
be run in the
omputer, so that approxima- D reestimate M^ , now
alled M^ . Finally
tions to the distribution of X were obtained,
Value-at-Risk is
omputed from M^ . The
its tails in
luded. situation has now be
ome:
There is a problem with this pro
edure D ! M^ ! m
X
that has nothing to do with
omputations. "
Ideally M should have been the nan
ial Simulated data M^
system itself, but the working of this system Unlike in the real
ase we have now a
ess
is enormously
omplex and a
urate des
rip-
to as many repli
ations of the simulated
tion beyond rea
h. All we
an get hold is histori
al data D as we please. This means
a
rude approximation M^ and generate the that the estimated models M^ vary among
simulations a
ording to themselves. It is pre
isely the ee
t of this
M ! X
^ (in pra
ti
e M 6= M ): variation on the Value-at-Risk assessments
^
we want to examine.
What are the impli
ations of the fa
t
that M 6= M^ ? Clearly it means that our Sometimes su
h a study reveals that
Value-at-Risk assessments dier from those an estimate systemati
ally overestimate or
that would have been obtained under the underestimate the quantity under study.
real model. It is never possible to analyze Su
h was not the
ase in the present exam-
su
h issues
ompletely, but one aspe
t
an ple, and only the estimated standard error
be ta
kled. has been re
orded in Table 2 (
olumns 3
and 4). Note the ee
t of the length of the
Suppose the Gaussian random walk period of histori
al data. 10000 simulations
model used to des
ribe the
u
tuations were used for the Value-at-Risk evaluations
of the sto
k, was stri
tly true, ex
ept for and 100 repli
ations of the histori
al data
unknown drift and volatility, say k and k were generated, in both
ases suÆ
iently
for the k'th nan
ial variable. The model high numbers to limit the ee
t of Monte
M^ would then
orrespond to estimates ^k Carlo randomness.
10
6 Literature tail.
The reader will there nd the mathe- All parametri
distributions have unknown pa-
mati
s of extremes, mostly for single rameters that must be found from histori
al data
variables. A referen
e dealing with extremes x1; : : :; xn. An alternative is to use the so-
alled
for sto
hasti
pro
esses is empiri
al distribution
M
neil, A.J. (1999). Extreme value for r > 0 and where r and are positive param-
theory for risk managers. Departement eters . The mean and varian
e are
14
11
A.3 Shifting the distribu- where
p
tions f0 (x) = 1 p
exp( 2 p 2 + x)
Both the Pareto, the log-normal and the gamma K1 ( 1 + x2 )= 1 + x2;
distributions started at the origin. It is possible and where K is the so-
alled Bessel fun
tion of
to shift su
h a distribution to an arbitrary initial the rst kind,1 i.e.
point a by dening
Z1 p
Xa = X + a: K 1 (x ) = x exp( xt) 1 t2 dt:
1
The new random variable Xa has the same dis- Among the four parameters , , and the
tribution as the old one X , ex
ept that it
an former two represent lo
ation and s
ale, like the
not fall below a. Sometimes this parameter a is similar ones for the normal, whereas the remain-
added as a third parameter. ing two, whi
h must satisfy > 0 and j j < ,
aptures many dierent shapes, for example the
normal as ! 1, heavy-tailed distributions as
A.4 The t-distribution ! 0 and skewed ones as is raised from 0 in
The t-distribution extends over the whole real either dire
tion. The mean and varian
e are
line. Its density is =
E (X ) = + ;
f (x) = (
r =)(1 + r (x=) )
1 2 (r +1)=2
f1 (=)2 g1=2
1
var(X) = 2
where
r is a
onstant dened by the
ompli
ated f1 (=)2 g3=2
expression
It is of little importan
e that the expression for
((r + 1)=2) the density is a horrible mathemati
al expres-
r = p :
r (r=2) sion. What does matter are the properties of the
model, and that it
an be sampled
omparatively
Mean and varian
e is easy.
Suppose X1 and X2 are independently NIG-
E (X ) = 0 distributed with parameters 1 , 1, 1, 1 and
2, 2, 2, 2 . If 1=1 = 2=2, then the sum
and X1 + X2 is also NIG distributed, now with pa-
rameters 1 + 2, 1 + 2 , 1 + 2 and 1 + 2 .
var(X ) = r=(r 2) This is a useful property when using the model to
des
ribe
u
tuations in share pri
es on log-s
ale.
where the latter requires r > 2 to be valid. The Another point of interest is tail behavior, whi
h
t-model equals the normal in the limit as r ! 1. is
lose to, but not entirely equal to, the de
ay of
an ordinary exponential distribution.
A.5 Normal inverse Gaussian
The normal, inverse, Gaussian (NIG) family of
distributions runs, like the ordinary normal and
the t-distribution, over the whole axis. The
model is able to portray many dierent types of
shapes, but still, remarkably, possesses the
on-
volution property, i.e. that sums of independent
NIG variables remain NIG distributed. This does
not apply in full generality. It is true when the
NIG variables have identi
al distribution.
The probability density fun
tion is the
om-
pli
ated expression
f (x) = 1f0 f(x )=g
12