Professional Documents
Culture Documents
Dr Dale Roberts
Australian National University
Semester 2, 2014
FINM3007/MATH3015/MATH6015
1 / 561
Week 1
2 / 561
3 / 561
Vectors
A vector (or column) of length n is an n 1 array
x1
x2
x = (x 1 , x 2 , . . . , x n )0 =
... .
xn
The superscript 0 means transpose and converts the row
[x 1 x 2 x n ]
to a column.
4 / 561
Vector-Vector Product
Let x and y be vectors of length n.
We often write x Rn and y Rn .
The dot product or inner product of x and y is given by
0
x y := x y =
n
X
x i yi = x 1 y1 + x 2 y2 + + x n yn .
kxk :=
x0x =
v
uX
n
t
x i2
i=1
5 / 561
Matrices
An m n matrix is an array of objects consisting of m rows and n columns
6 / 561
Matrices
A n n diagonal matrix is a matrix of the form
d1 0
0
0
0 d2 0
.
..
.
.
D=
.
.
.
. 0 dn1 0
0
0
dn
I = diag(1, 1, . . . , 1).
Last Modified: 14/10/2014 08:19
7 / 561
Matrix-Vector Product
The (matrix-vector) product Ax is defined to be the column of length m
whose entry in the i th row is
(Ax)i =
n
X
(1)
j=1
Ax = [1 2 . . . n ] .. =
j x j.
.
j=1
xn
8 / 561
Matrix-Vector Product
then
(Ax)1
a1 x
(Ax)2 a2 x
Ax =
... = ...
(Ax)m
am x
9 / 561
Matrix-Vector Product
If A = [ai j ] and B = [bi j ] are two matrices of the same size. Then
A = B ai j = bi j ,
i, j
A B = [ai j bi j ]
A = [ai j ]
The following laws of matrix-vector multplication hold:
(A)x = (Ax) = A(x)
(Associative law)
A(x + y) = Ax + Ay (Distributive law)
(A + B)x = Ax + B x (Distributive law)
where A, B are n m matrices; , real numbers; and x, y vectors of
length n.
Dale Roberts (c) 2011-2014
10 / 561
Matrix-Matrix Product
Let A be an m n matrix. A j denotes the j th column of A and we write
A = [A1 A2 An ].
(A)i j denotes the element of A at row i and column j . Hence, if A = [ai j ]
then (A)i j = ai j .
Let A = [ai j ] be an m n matrix and B = [bi j ] be a p q matrix with
columns B j . The (matrix-matrix) product AB is defined if and only if n = p
and in this case
AB = [AB1 AB2 ABq ]
is a m q matrix.
11 / 561
Determinants
a b
det
= ad bc.
c d
Determinants of matrices of size n > 2 are computed in terms of
determinants of matrices of size n 1.
12 / 561
Determinants
The minor Mi j of ai j in A = [ai j ] is the determinant of the submatrix of A
obtained by deleting the i th row and j th column.
The cofactor of ai j is Ci j = (1)i+ j Mi j .
For each i, j the following hold:
|A| =
|A| =
n
X
k=1
n
X
k=1
13 / 561
Determinants
Example
Expanding along the first row:
1 2 3
5 6
4 6
4 5
2
det A = 4 5 6 = 1
7 9 + 3 7 8 = 0.
8 9
7 8 9
Expanding along the second column:
1 3
4 6
1 3
+ 5
|A| = 2
7 9 8 4 6 = 0.
7 9
14 / 561
Example
cos( ) sin( )
A=
.
sin( ) cos( )
cos( ) sin( )
=
.
sin( ) cos( )
15 / 561
16 / 561
Nondecreasing Functions
17 / 561
Setup
f : [a, b] R
P = {(x i1 , x i ) : i = 1, . . . , k} partition of [a, b]
P = set of all partitions of [a, b]
18 / 561
Variation
Definition
Take f : [a, b] R, P = {(x i1 , x i ) : i = 1, . . . , k} a partition of [a, b], and P
is the set of all partitions of [a, b]. The variation Vab ( f ) of f on [a, b] is
Vab ( f
) = sup
k
X
| f (x i ) f (x i1 )| : P P
i=1
19 / 561
Definition
If Vab ( f ) is finite then f is of bounded variation on [a, b].
Note: If Vab ( f ) = then we say that f has unbounded variation. For example,
Brownian motion has paths of unbounded variation.
20 / 561
Example 1
Example
If function f is non-decreasing on [a, b], then for any P P :
k
X
i=1
| f (x i ) f (x i1 )| =
k
X
f (x i ) f (x i1 ) = f (b) f (a)
i=1
Therefore,
Vab ( f ) = f (b) f (a).
21 / 561
Example 2
Example
The function
f (x) =
0,
x =0
sin( 1x ),
x > 0.
f
/ BV[0, 2 ].
1.0
0.5
0.1
0.2
0.3
0.4
0.5
0.6
-0.5
-1.0
22 / 561
Example 3
Example
The function
f (x) =
x =0
0,
x
sin( 1x ),
x > 0.
f
/ BV[0, 2 ].
0.6
0.4
0.2
0.1
0.2
0.3
0.4
0.5
0.6
-0.2
23 / 561
Stieltjes Integration
24 / 561
Stieltjes Integral
25 / 561
Stieltjes Sum
n
X
k=1
26 / 561
Riemann-Stieljes Integral
Definition
If lim|P|0 S( f , g; P) exists, then this limit is called the Riemann-Stieljes integral
of f with respect to g and is denoted
Z b
Z b
f (t) d g(t)
or
f d g.
a
The function f is called the integrand and the function g is called the
integrator.
Note: A Lebesgue-Stieljes integral can also be constructed.
27 / 561
Lemma
If
Rb
a
f1 d g and
Rb
a
( f1 + f2 ) d g =
a
f1 d g +
a
f2 d g.
a
28 / 561
Lemma
If
Rb
a
f d g1 and
Rb
a
b
f d(g1 + g2 ) =
a
f d g1 +
a
f d g2 .
a
29 / 561
Integration by Parts
Lemma
If f integrable with respect to g , then g is integrable with respect to f and
Z b
b Z b
g d f = f g
f d g.
a
30 / 561
Existence Criteria
Theorem
If f C[a, b] and g BV[a, b] then
Z b
f dg
exists.
Theorem
If f BV[a, b] and g C[a, b], then
Z b
f dg
exists.
31 / 561
f (t) dWt
a
32 / 561
33 / 561
(x) :=
2
p1 e u /2 du.
2
34 / 561
Characterisation
Lemma
A random variable X N(, 2 ) under a measure P if and only if
1
E[e X ] = e + 2
35 / 561
Gaussian Shifts
/2
E[h(X + c)]
36 / 561
Gaussian Shifts
Example
Take h(x) = 1 then
E[eZ ] = e
/2
Note: Z N(0, 2 )
37 / 561
Gaussian Shifts
Example
If Z N(0, 1) then X := + Z N(, 2 ) so
E[e X ] = E[e+Z ]
/2
e = e+
/2
38 / 561
39 / 561
Multivariate Gaussians
Definition
Assume det Q > 0, then the (random) vector X has a multivariate normal
distribution if it has the density
1
1
0 1
(x) = p
exp (x ) Q (x ) , x Rn .
n
2
(2) det Q
We write X N(, Q) if this is the case.
For example: x Rn ,
Z
P{X > x} =
(x 1 , x 2 , . . . , x n ) d x 1 d x 2 d x n .
x1
x2
xn
40 / 561
Multivariate Gaussians
If Z N(0, Q) and c = (c1 , c2 , . . . , cn )0 Rn then X := c T Z is Gaussian with
X N(0, c 0Qc).
If C Rmn , i.e. a m n matrix, then we have
X = C Z N(0, CQC 0 ),
1 0
0X
0
E[e ] = exp Q .
2
41 / 561
Lemma
Let X N(0, Q), let h be a measurable function of x Rn , and c Rn . Then
0
1 0
42 / 561
Week 2
Brownian Motion
Monte Carlo Method
Simulating Stochastic Processes
43 / 561
Brownian Motion
44 / 561
Probability Space
A probability space (, F , P) consists of three parts:
the sample space (the set of all possible outcomes),
the set of events F (each event is a set containing zero or more outcomes),
P({H}) = P({T }) =
1
2
P({H, T }) = 1
45 / 561
46 / 561
Stochastic Processes
Definition
A stochastic process X is a family (X t )0tT of random variables defined on the
same probability space (, F , P) taking values in Rn (the state space).
The process X can be viewed in terms of:
a (random) path t 7 X t ()
the probability distribution of X at some fixed time t , i.e., P{X t < x}.
47 / 561
Finite-dimensional Distributions
Definition
If X := (X t )0tT is a Rn -valued stochastic process, then for every finite
sequence of times t 1 , t 2 , . . . , t k the probability distribution of the random
variable
X := (X t 1 , X t 2 , . . . , X t k )0
is called the finite-dimensional distribution of X .
Note: Since X is Rn valued and we are evaluating at k times t 1 , . . . , t k then X is
Rnk -dimensional.
48 / 561
s<t
49 / 561
Brownian Motion
Definition
A stochastic process W := (Wt ) t0 is a called a Brownian motion (under P) if it
has all of the following properties:
1. the paths t 7 Wt () are continuous for P-almost all ;
2. W (0) = 0, P-almost surely;
3. for 0 s < t < , Wt Ws N(0, t s);
4. for k N and 0 t 1 < t 2 < . . . < t k , the increments Wt i+1 Wt i with i < k,
are independent.
We shall denote by F t = (F t ) t0 the natural filtration generated by the
process W : the event set F is generated by all values of Ws up to time s = t .
50 / 561
Existence
51 / 561
Simple Properties
E[e Wt ] = e 2 t ;
2
E[Wt4 ] = 3t 2 .
52 / 561
Invariances
53 / 561
Donskers Theorem
Theorem (Donsker 1951)
Let (n )nN be a sequence of independent and identically distributed random
variables with E[i ] = 0 and 0 < Var[i ] = 2 < . Let
S0 = 0,
Sn =
n
X
i .
i=1
1
1
p S[nt] () + (nt [nt]) p [nt]+1 ()
n
n
for t [0, 1], n N. Then this sequence converges (weakly) towards the
Brownian motion B := (B t )0t1 , i.e., we have X n B in probability dist.
Dale Roberts (c) 2011-2014
54 / 561
55 / 561
Main Idea
The main idea of the Monte Carlo method is to approximate an expected value
E[X ] by an arithmetic average of the results of a big number of independent
experiments which all have the same distribution as X .
Trivia: In 1946, physicists needed to perform complicated calculations for the
Manhattan project. They couldnt solve the problem using standard
mathematical methods. The Mathematician Stanislaw Ulam (1909 - 1984)
suggested using random experiments. Being a secret project, this required a
code name. Von Neumann chose the name Monte Carlo.
56 / 561
57 / 561
1X
X i ()
n i=1
The arithmetic mean of the (realisations of) X i tends to the theoretical mean
of every X i .
58 / 561
59 / 561
Unbiased Estimator
Theorem
Let (X n )nN be a sequence of integrable R-valued random varaibles that are i.i.d.
as X . Then the Monte Carlo estimator
N
1X
X N :=
Xi,
N i=1
N N,
Note: This ensures that the MC Estimator is correct in the mean, it does not
help us get a feel for the absolute value of the error.
Dale Roberts (c) 2011-2014
60 / 561
Increasing Accuracy
We look at the standard deviation of the difference between X N and
As
1 X
2
Var[X
]
=
i
N 2 i=1
N
p
the standard deviation is of order O(1/ N )
Var[X N ] = Var[X N ] =
This means that to increase the accuracy of the crude MC estimate by one
digit (i.e., reduce the std. dev by a factor of 0.1) requires increasing N by a
factor of 100.
To acheive higher accuracy, we need a significant effort.
61 / 561
X
1
D
X i N N(0, 1) as N .
p
N i=1
We can infer from the CLT that for large values of N the crude Monte Carlo
estimator is approximately N(, 2 /N )-distributed.
62 / 561
N
1X
X i z1/2 p ,
X i + z1/2 p
N i=1
N N i=1
N
N
1X
Xi 2p ,
Xi + 2p
N i=1
N N i=1
N
63 / 561
u
u
N
N
X
t 1 X
t
N
1
N =
(X i X N )2 =
X i2 X 2N
N 1 i=1
N 1 N i=1
Giving our approximation of 95% confidence
N
1 X
N
N 1 X
Xi 2p ,
Xi + 2pN
N i=1
N N i=1
N
64 / 561
Calculating
Example ()
The classic example of the Monte Carlo method is calculating the value of .
Consider the part of the unit circle C with center at the origin (0, 0) that
intersects with the positive unit square [0, 1]2 . Our experiment is to randomly
choose points p1 , p2 , . . . , pN of the unit square (i.e., uniformly distributed) and
consider
X i = 1{pi C} .
We have then P(pi C) = /4 as the probability of hitting C is its area r 2 /4
and r = 1. Since
E[1{pi C} ] = P(pi C) = /4
we can estimate by
N
4X
() =
1{pi C} ().
N i=1
Dale Roberts (c) 2011-2014
65 / 561
g(x) d x =
[0,1]n
66 / 561
where X 1 , . . . , X N are all i.i.d. uniformly distributed random variables on [0, 1]n
p
Remark: By the SLLN the rate of convergence is O(1/ N ) but is independent
of the dimension n. As opposed to deterministic quadrature formulas that
have a rate of convergence O(N 2/n ). The MC approximation outperforms
when n > 4.
67 / 561
68 / 561
69 / 561
70 / 561
Special Cases
If the function g only depends on the value of the stochastic process X at a
particular time, i.e., if we have
g(X ) = h(X T )
for a fixed time T and R-valued function h, then we only have to know the
distribution of X T . Either the distribution of X T is known, or we have to
simulate the whole path to get X T .
If g only depends on X at a finite number of times t 1 , t 2 , . . . , t n , i.e.,
g(X ) = h(X t 1 , . . . , X t n )
71 / 561
General Case
72 / 561
The fact that the random variables X t for all t s are related to each other
has the consequence that we cannot simply simulate independent
random variables that have the same distribution as the different X t
We have to take care of the relation among the X t which can be extremely
strong
Since the times t [0, T ] are uncountable, we cannot simulate the true
path of the process X . We need to approximate the path at finite times
0 = t1 < t2 < < t n = T
73 / 561
Algorithm
Let 0 = t 0 < t 1 < < t n = T be a partition of [0, T ]. Let Pk denote the
conditional distribution of X t k given X t k1 . We obtain the approximate path
X () by:
1. Set X 0 () = 0
2. For k = 1 to n do:
(a) Simulate a random number Yk () with Yk Pk
(b) Set X t k () = X t k1 () + Yk ()
(c) Between t k+1 and t k obtain X t via linear interpolation
X t () = X t k1 () +
t t k1
Yk (),
t k t k1
t (t k1 , t k ).
74 / 561
Brownian Motion
Let 0 = t 0 < t 1 < < t n = T be a partition of [0, T ]. We obtain the
approximate path t 7 B t () by:
1. Set B0 () = 0
2. For k = 1 to n do:
(a) Simulate a random number
p Zk N(0, 1)
(b) Set B t k () = B t k1 () + t k t k1 Zk
(c) Between t k=1 and t k obtain X t via linear interpolation
B t () = B t k1 () +
t t k1
(B t () B t k1 ()),
t k t k1 k
t (t k1 , t k ).
75 / 561
Week 3
Stochastic Integration
Stochastic Differential Equations
Simulating Solutions of SDE
76 / 561
Stochastic Integration
77 / 561
Simple Functions
Let F = (F t ) t0 and h an F-adapted process on [0, T ] taking values in R.
Definition
We call h a simple process if for a partition := {0 = t 0 < t 1 < < t n = T }
and F t i -measurable random variables (hi )ni=1 , the process h(t) := h(, t)
satisfies
h(, t) =
n1
X
for 0 t T and
i=0
Pn1
i=0
78 / 561
Properties
79 / 561
Definition
The stochastic integral (aka. It integral) of a simple process h is defined as
Z T
n1
X
hi ()(Wt i+1 Wt i )
h(t) dWt =
i=0
80 / 561
E[h2 ] d t < .
81 / 561
Theorem
2
For simple processes h H[0,T
we have
]
Z T
h(t) dWt = 0
1. E
0
2. E
h(t) dWt
E[h(t)2 ] d t
82 / 561
Proof of 1
Since,
Z
0
h(t) dWt =
n1
X
hi ()(Wt i+1 Wt i )
i=0
83 / 561
Proof of 2
From the definition, we have
Z T
h(t) dWt =
n1
X
hi ()(Wt i+1 Wt i ).
i=0
So
E
h(s) dWs
=E
n1
X
2
hi (Wt i+1 Wt i )
i=0
=E
n1
X
Wt i )
i=0
n1
+ 2E
hi h j (Wt i+1 Wt i )(Wt j+1 Wt j )
i, j=0
i6= j
84 / 561
Proof of 2
For i < j , by the tower property of conditional expectations
E[hi h j (Wt i+1 Wt i )(Wt j+1 Wt j )] = E[hi h j (Wt i+1 Wt i )E[(Wt j+1 Wt j )|F t j ]] = 0
Moreover,
E[h2i (Wt i+1 Wt i )2 ] = E[h2i E[(Wt i+1 Wt i )2 |F t i ]]
Substituting, we obtain
n1
Z T
Z
X
2
2
=
E
h(t) dWt
E[hi ](t i+1 t i ) =
0
i=0
E[h(t)2 ] d t
85 / 561
Martingales
Definition
A F-adapted stochastic process X := (X t ) t0 is a martingale if E[|X t |] < for
each t > 0 and E[X t |Fs ] = X s for all s t .
The following processes are examples of martingales:
the Brownian motion W ;
X t := Wt2 t ;
E t := exp(cWt 12 c 2 t), where c R.
86 / 561
Lemma
2
If h is a simple process in H[0,T
, then the process
]
Z
X t :=
h(s) dWs ,
tT
is a martingale.
87 / 561
Construction
88 / 561
Square-integrable Processes
2
Let h := (h t )0tT be a stochastic process. We say that h is in the set of H[0,T
]
processes if, for all t [0, T ], we have
Z t
|h(s)|2 ds < .
E
0
2
Note: Processes in H[0,T
can be approximated arbitrarily close by simple
]
2
processes in H[0,T ] .
89 / 561
Construction
Theorem
2
Let W be a BM. For every h H[0,T
the stochastic integral
]
I t (h) :=
h(s) dWs ,
tT
(2)
exists.
90 / 561
Properties
Let W be a BM. Then the stochastic integral given by (2) has the following
properties:
2
linearity: I t (h + g) = I t (h) + I t (g) for h, g H[0,T
and , R;
]
2
then I is a continuous martingale;
If h H[0,T
]
Z T
2
Z
h(t) dWt =
E
0
E[|h(t)|2 ] d t.
91 / 561
It Processes
An It process is the sum of an absolutely continuous drift plus a continuous
local martingale of the form
Z t
Z t
X t = X0 +
(s) ds +
(s) dWs ,
(3)
2
where H[0,T
and is a progressively measurable process satisfying
]
Rt
|(s)| ds < for all t > 0, such that the above integrals are defined.
0
We also write
dX t = (t) d t + (t) dWt .
92 / 561
93 / 561
Novikovs Condition
Z T
1
2 ds
< X is a martingale.
E exp
2 0 s
94 / 561
Quadratic covariation
In addition to (3), let
Yt = Y0 +
(s) ds +
(s) dWs ,
(s)(s) ds,
95 / 561
Product Rule
Given X t and Yt adapted to the same Brownian motion W ,
dX t = t dWt + t d t,
dYt = t dWt + t d t.
Then
d(X t Yt ) = X t dYt + Yt dX t + t t d t
In the other case, if X t and Yt are adapted to two different and independent
,
Brownian motions W and W
dX t = t dWt + t d t
t + t d t
dYt = t d W
Then
d(X t Yt ) = X t dYt + Yt d X t
Dale Roberts (c) 2011-2014
96 / 561
97 / 561
X a = R,
(4)
Xt = +
(s, X s ) ds +
(s, X s ) dWs ,
a t b.
(5)
98 / 561
Solutions
A stochastic process X := (X t ) t0 is a (strong) solution to (4) if it satisfies:
The stochastic process (t, X t ) H 2 so that
Z t
(s, X s ) dWs
|(s, X s )| ds < ;
99 / 561
100 / 561
a t b, x, y R.
a t b, x R.
101 / 561
Theorem
Suppose that (t, x) and (t, x) satisfy the Lipschitz and linear growth
conditions in x for all t 0. Then for every initial time a R+ and point R,
there exists a unique solution X of the SDE (4).
102 / 561
103 / 561
Overview
104 / 561
105 / 561
Goal of Approximation
Numerical schemes for SDE depend on our specific goal. Are we interested in
obtaining a path t 7 Xe (t, ) that is as close as possible to the (unknown)
solution path t 7 X (t, ),
or in computing an expectation of a functional E[g(X )] of the SDE?
The first is called strong approximation and the second is weak
approximation.
106 / 561
Exact Simulation
In some rare cases, we can find an explicit solution to the SDE. For example,
the SDE
dX t = aX t d t + bX t dWt , X 0 = x,
has the explicit solution
1 2
X t = x exp (a b )t + bWt = f (t, Wt )
2
107 / 561
Approximating Paths
The task becomes a lot more difficult when we want to calculate the
expectation of the function of the whole path t 7 X t
Even in the case of Brownian motion, we need to find a way to
approximate the path
We follow the same approximation idea: approximate the paths on an
partition of the interval [0, T ] by simulating the Brownian motion on that
partition and then constructing the values of X t on the partition
108 / 561
Approximating Paths
Theorem
Assume that the SDE
d X t = a(t, X t ) d t + (t, X t ) dWt ,
X0 = x
if t = iT /n for some i = 0, 1, . . . , n
and extended to all t [0, T ] by linear interpolation. Then, for each bounded
and continuous function g of the path of X , we have
E[g(Xen )] E[g(X )],
109 / 561
Strong Approximation
X0 = x
110 / 561
Euler-Maruyama Scheme
111 / 561
Week 4
Its Formula
Multidimensional Stochastic Calculus
Stochastic Exponential
Black-Scholes Market
112 / 561
Its Formula
113 / 561
Chain Rule
Remember the (calculus) chain rule states that
d
f (g(t)) = f 0 (g(t))g 0 (t)
dt
Or for a function f (t, x t ) in terms of total derivatives
df
f d t f d xt
=
+
dt
t dt x dt
d f (t, x t ) =
f
f
(t, x t )d t +
(t, x t )d x t
t
x
114 / 561
Its Formula
115 / 561
Theorem
For a function f : [0, T ] R such that the partial derivatives
f
2
we have
exist and are continuous and for which x H[0,T
]
f (t, Wt ) f (0, 0) =
Z
0
f
(s, Ws ) dWs +
x
Z
0
f
1
(s, Ws ) ds+
s
2
f
t
, x and x 2
2f
2f
(s, Ws ) ds
x2
Alternatively written:
d f (t, Wt ) =
f
f
12f
(t, Wt ) dWt +
(t, Wt ) d t +
(t, Wt ) d t
x
t
2 x2
116 / 561
Applications
E[Wtn ]
Solving SDEs
117 / 561
Theorem
For a function f (t, x) that is twice differentiable in x and once in t . Then for
Yt = f (t, X t ), we have
d Yt =
f
f
2f
1
(t, X t ) d t +
(t, X t ) dX t + (t, X t )2 2 (t, X t )d t
t
x
2
x
2
f
f
f
1
2 f
(t, X t ) + (t, X t )
(t, X t ) + (t, X t )
(t,
X
)
d
t+(t)
(t, X t ) dWt
t
t
x
2
x2
x
118 / 561
119 / 561
W1 (t)
W2 (t)
W (t) =
...
Wn (t)
120 / 561
Example
Let (W1 (t)) t0 and (W2 (t)) t0 be two independent Brownian motions. For
1 +1, let
121 / 561
or
122 / 561
such that
Z
kh(t)k d t < .
Recall: kxk =
x 12 + x 22 + + x n2 .
123 / 561
h(t) dW (t) =
I T :=
S
n Z
X
i=1
h j (t) dW j (t).
2
If h H[0,T
then the process (I t ) tS is a martingale and
]
2
h(t) dW (t)
=E
kh(t)k2 d t .
124 / 561
I =
Z
S
H(t) dW (t) =
h11 (t)
...
h1n (t)
dW1 (t)
.. ..
.
.
dWn (t)
j=1
125 / 561
X (0) = x R
where a(t) R, b(t) = [b1 (t) b2 (t) bn (t)] with W an n-dim BM.
Or of the form
d X t = a(t) d t + B(t) dW (t),
X (0) = x Rn
126 / 561
m
X
bi j (t)dW j (t),
j=1
ai (s)ds +
bi j (s)dWsj ,
j=1
127 / 561
X (0) = x Rn
a1 (t)
X 1 (t)
X (t) = ... , a(t) = ... and W (t) = [W1 (t), . . . , Wm (t)] T .
X n (t)
an (t)
b11 (t) . . .
..
...
b(t) =
.
bn1 (t) . . .
Last Modified: 14/10/2014 08:19
b1m (t)
.. .
.
bnm (t)
128 / 561
Quadratic Variation
Consider the n-dimensional It process
Z t
X (t) = X (0) +
a(s) ds +
B(s) dW (s).
X i t =
129 / 561
Quadratic Covariation
Consider the n-dimensional It process
Z t
a(s) ds +
X (t) = X (0) +
B(s) dW (s).
X i , X j (t) =
dX i , X j t = Bi (t)0 B j (t) d t.
130 / 561
Multidimensional It Formula
The multi-dimensional It formula gives the stochastic differential for the
process Y (t) = f (t, X (t)) for some transformation function
f (t, x) = ( f1 (t, x), . . . , f m (t, x))0
The function f is assumed to map from [0, ) Rn Rm and be of class
f (t,x)
2 f (t,x)
C 1,2 (R, Rn ): i.e., the functions (t, x) t and (t, x) x i x j are
continuous on (0, ) Rn for i, j = 1, . . . , n.
Its formula states that the stochastic differential for
Y (t) = (Y1 (t), Y2 (t), . . . , Ym (t))0 for t in [0, T ] is given by
X f (t, X (t))
f k (t, X (t))
k
dt +
d X i (t)
d Yk (t) =
t
x
i
i=1
n
2
X
f k (t, X (t))
1
+
dX i , X j (t).
2 i, j =1 x i x j
Last Modified: 14/10/2014 08:19
131 / 561
Multidimensional It Formula
The multi-dimensional It formula using drift and diffusion coefficients is
given by
n
f (t, X (t))
f (t, X (t)) X
d Y (t) = d f (t, X (t)) =
ai (t)
+
t
xi
i=1
!
n
2
f
(t,
X
(t))
1 X
bi (t) T b j (t)
+
dt
2 i, j =1
xi x j
n
m
X
X
f (t, X (t))
+
bi j (t)
dW j (t).
x
i
j=1 i=1
This
by substitution of the differentials for d X i (t) and
is obtained
d X i , X j (t) using the covariation form of Its formula.
Dale Roberts (c) 2011-2014
132 / 561
Change of Measure
133 / 561
Change of Measure
Let (W (t)) t0 be an m-dimensional Brownian motion under P .
Let ( (t)) t0 be an F-adapted n-dimensional diffusion process that
satisfies the Novikov condition.
Then there is a measure Q with the properties:
1. Q is equivalent to P .
f with
2. The process W
Z t
f(t) = W (t) +
(s) ds
W
0
is an m-dimensional BM under Q.
In differential form we have
f(t) = dW (t) + (t)d t.
dW
Dale Roberts (c) 2011-2014
134 / 561
Girsanovs Theorem
Theorem (Girsanov)
2
and X satisfy dX t = (t) dWt . If E t (X ) is a uniformly integrable
Let H[0,T
]
martingale with E (X ) > 0 then
n Z T
Z T
X
dQ
1
i (s) dWsi +
= E T (X ) = exp
2i (s) ds
dP
2
0
0
i=1
is a Q-Brownian motion.
Dale Roberts (c) 2011-2014
135 / 561
or
dYi (t) =
n
X
i j (t)d X j (t).
j=1
136 / 561
Stochastic Exponential
137 / 561
Stochastic Exponential
The stochastic exponential of an It process X is defined as
1
E t (X ) = e X t 2 X ,X t
E0 = e X 0 ;
138 / 561
Black-Scholes Market
139 / 561
Black-Scholes Market
The Black-Scholes market typically consists of a risky asset S whose price
dynamics is given by the solution of the SDE
dS t = S t ( d t + dWt ),
S0 = 1
B0 = 1.
Writing dS t = S t d X t with
dX t = d t + dWt ,
X 0 = 0,
140 / 561
Self-financing Portfolios
A portfolio or trading strategy is a pair of processes (, ) that give the amount
of holding at time t of the asset pair (S, B). The value process of this portfolio is
V (t) = (t)S(t) + (t)B(t)
and there is no inflow or outflow of capital during the trading. In other words,
the change in value of the portfolio over any time period is only due to the
changes in the value of B and S :
141 / 561
Arbitrage Portfolio
142 / 561
Martingale Measures
S(t)
B(t)
is a Q-martingale.
143 / 561
In particular, under the Black-Scholes model, the fair price at time zero for the
option is
V0 = EQ [er T X T ]
fT ) where W
f is a Q-BM.
where X T = f (S T ) and S T = S0 exp((r 21 2 )T + W
144 / 561
Week 5
Black-Scholes Model
Definition of Implied Volatility
Empirical Observations
Arbitrage Bounds
Asymptotics
Approximating Implied Volatility
Dale Roberts (c) 2011-2014
145 / 561
Black-Scholes Model
146 / 561
Black-Scholes Model
Black-Scholes Model
Under real-world measure P, we assume there exists deterministic r , , and
such that
B t = exp(r t)
S t = S exp(Wt + t),
where r is the risk-free interest rate, is the stock volatility and is the stock
drift, and S := S0 is the current stock price.
There are no transaction costs and both instruments are freely and
instantaneously tradable either long or short at the price quoted.
147 / 561
Black-Scholes Formula
Explicit solution was obtained by Black and Scholes (1973) relying heavily on
the notion of no-arbitrage in Merton (1973).
where
log( KS ) + (r 12 2 )T
d2 :=
,
p
T
p
d1 := d2 + T
148 / 561
Benchmark Model
Widely acknowledged that assumptions underlying the BSM model are far
from realistic
BSM model enjoys unrivalled popularity in practice due to its role as a
benchmark model
Provides a convenient mapping device from the space of option prices to a
single real number called implied volatility (IV)
149 / 561
150 / 561
Implied Volatility
151 / 561
Matthias R. Fengler
IV Surface
100 %
80 %
60 %
40 %
20 %
0.5
1
1.5
2
Moneyness [X/S]
152 / 561
The function resulting for a fixed expiry is frequently called the implied
volatility smile due to the U-shaped pattern
For a fixed strike across several expiries one speaks of the term structure of
implied volatility
Practioners like to think of them as stemming from a smooth and
well-behaved surface
153 / 561
Applications
Market makers use the IV surface to quote prices for strike-expiry pairs
which are illiquid or not listed
Pricing engines used to price exotic options are calibrated against the IV
surface
Risk managers use stress scenarios defined on IV surface to visualise and
quantify the risk inherent to option portfolios
154 / 561
Formal Definition
Concept of IV was first introduced by Latan and Rendelman (1976).
Definition
Given an observed market price of European option Ce with strike K and time
to maturity T , the implied volatility of this option is
e:
e ) Ce = 0
C (K, T,
e R+ .
By monotonicity of the BSM price in , there exists a unique solution
Put options can also be used (Put-Call Parity).
155 / 561
IV Surface
Definition
An implied volatility surface is the mapping
e : (t, K, T ) 7
e t (K, T ).
156 / 561
Empirical Observations
157 / 561
Empirical Observations
158 / 561
Definition
We define the implied volatility skew as
e 2
m m=0
The skew often increases during times of crisis (e.g., dot-com crash 2001-2003,
Sept. 11 2001, GFC 2008).
159 / 561
160 / 561
1Y ATMIV
DAX
8000
40 %
7000
35 %
6000
30 %
5000
25 %
4000
20 %
3000
15 %
2001
2003
2005
2007
2009
161 / 561
Matthias R. Fengler
0.2
50
0.6
1M skew
1Y skew
0.8
IV term structure
40
30
20
1.2
10
IV term structure
Skew
0.4
1.4
0
1.6
2001
2003
2005
Time
2007
2009
162 / 561
implied volatility
Empirical Quantiles ofBSMATM-IV
0
0.1
0.2
Skew
0.3
0.4
0.5
95%
75%
Median
25%
5%
0.6
0.7
0.8
0.9
1M 3M
1Y
Time to expiry
2Y
163 / 561
Arbitrage Bounds
164 / 561
Arbitrage Bounds
e must be such that the call price is bounded above and below:
The IV
e) S
(S er T K)+ C (K, T,
C
e) 0
(K, T,
K
and
2C
e) 0
(K, T,
K2
165 / 561
must hold.
166 / 561
Asymptotics
167 / 561
Asymptotics
Asymptotics are mathematical results obtained when you take one of the
variables to infinity or to zero
These types of results provide interesting theoretical information about
the model
They are often easier to establish and help reduce the complexity
168 / 561
169 / 561
IV Surface Flattens
The first theorem shows that the IV surface flattens for infinitely large expiries.
sup
T m ,m [M ,M ]
1
2
e (m1 , T )| = 0.
|e
(m2 , T )
Note: sup means supremum which is equivalent to the maximum if there are
finitely many elements in the set.
170 / 561
Flattening Rate
The rate of flattening of the IV skew can be made more precise by the following
result.
m2 m1
T
m2 m1
T
171 / 561
Representation Formula
The level of the IV for far expiry can be written in terms of the expected share
price.
172 / 561
At Short Expiry
Theorem (Roper and Rutkowski 2009)
If C (K, ) = (S K)+ for some > 0 then
e (K, T ) = 0.
lim
T 0+
Otherwise,
p
2C (K, T )
,
p
S T
e (K, T ) =
lim
T 0+
| log(S/K)|
,
p
2T log(C (K, T ) (S K)+ )
S=K
S 6= K
173 / 561
At Short Expiry
174 / 561
At Far Strike
Lee has established a one-to-one correspondance between the large-strike tail
and the number of moments of S T and the small-strike tail and the number of
moments of S T1 .
Define e
p := sup{p : E[S T ] < } and
R = lim sup
m
e2
|m|/T
R 1
1
+
.
2R
8
2
Equivalently,
R = 2 4( e
p2 + e
pe
p).
Dale Roberts (c) 2011-2014
175 / 561
At Far Strike
Theorem (Lee 2004)
q
Define e
q := sup{p : E[S T ] < } and
L = lim sup
m
e2
|m|/T
1
1
+ L .
2 L
8
2
Equivalently,
L = 2 4( e
q2 + e
qe
q).
176 / 561
177 / 561
Approximation Formulas
178 / 561
BSF Approximation
The simplest approximation to IV is due to Brenner and Subrahmanyam (1988)
and Feinstein (1988).
BSF Approximation
e
v
t 2 C
T S
179 / 561
Li Approximation
A more accurate formula was proposed by Li (2005) and hold for ITM (S K )
and OTM (S K ) options.
Li Approximation
q
2z T2 p1T 8z 2 p62z
1
4(KS)2
2
p
+ S(S+K)
2 T
p
2
S+K (2C
if 1.4
if > 1.4
+ Ker T S) and
180 / 561
Corrado-Miller Approximation
Other approximation formulas often lack a rigourous mathematical
foundation. The most prominent amongst these are those suggested by
Corrado and Miller (1996) and Bharadia et al. (1996)
Corrado-Miller Approximation
v
p
u
2
1
2
SX t
SX
(S X )2
ep
C
+
C
2
2
T S+X
where X = Ker T .
181 / 561
T S (S K)/2
182 / 561
Accuracy of Approximations
183 / 561
Week 5b
Bonds
184 / 561
Bonds
185 / 561
Types of Bonds
186 / 561
Types of Bonds
187 / 561
http://www.aofm.gov.au/content/_download/Treasury_Bond_
Information_Memo_18_December_2009.pdf
188 / 561
http://www.aofm.gov.au/content/_download/Treasury_
Indexed_Bond_Information_Memo_18_December_2009.pdf
189 / 561
http://www.rba.gov.au/fin-services/bond-facility/xls/
otc-prices.xls
190 / 561
Types of Bonds
Bonds are also issued by state government (NSW, QLD, etc), banks, and
companies
Bonds issued by companies are called corporate bonds
191 / 561
Types of Bonds
192 / 561
Types of Bonds
Bonds with identical characteristics but sold by different issuers may have
different prices
Example: 2 bonds of 20Y with 6% coupon, one issued by a company and
one issued by RBA
Bond issued by company will probably trade at a lower price because
market participants take into account the possibility of default on the
payments or the redemption process
193 / 561
Types of Bonds
194 / 561
195 / 561
http://www.aofm.gov.au/content/_download/Treasury_Bond_
Information_Memo_18_December_2009.pdf
196 / 561
Notation
197 / 561
198 / 561
first coupon
0
if bond has gone ex-dividend
c j = gt for j = 2, . . . , n 1
cn = 100 + gt
c1 =
gt
199 / 561
200 / 561
Dirty price
201 / 561
RBA Prices
Banks selling price includes coupon interest accrued since the last
coupon payment and up to the date of purchase
Purchasers of Bonds subsequently receive a full coupon payment when it
next falls due, irrespective of when the purchase takes place in the coupon
period.
202 / 561
RBA Prices
The only exception to this is when Bonds are purchased during the
ex-interest period, which commences seven days prior to a coupon
payment date. Investors who purchase Bonds during the ex-interest
period do not receive the impending coupon payment.
As a result, Bond prices in the ex-interest period do not include accrued
interest and are commensurately lower.
Question: Are RBA prices clean or dirty?
203 / 561
Zero-Coupon Bonds
204 / 561
P(t, T )
t
1
T
P(t, T )
t
1
T
205 / 561
Modelling Assumptions
206 / 561
Term structure
Price P(t, T )
1
0
Years
T 7 P(t, T )
Last Modified: 14/10/2014 08:19
207 / 561
0
Years
t 7 P(t, T )
Last Modified: 14/10/2014 08:19
208 / 561
Modelling Assumptions
209 / 561
eR = 1 + R + o(R)
t =0
1
1
1
..
.
t =1
1+R
(1 + R/2)2
(1 + R/m)m
..
.
eR
for small R
210 / 561
Week 6
Interest Rates
Arbitrage-Free Pricing
Fundamental Theorem of Asset Pricing
The Interest Rate Setting
Blacks model
211 / 561
Interest Rates
212 / 561
Spot Rates
The spot rate at time t for maturity at time T is defined as the yield to
maturity of the T -bond:
R(t, T ) =
In other words,
log P(t, T )
Tt
P(t, T ) = exp (T t)R(t, T )
213 / 561
Forward Rates
1
P(t, T )
log
ST
P(t, S)
The forward rate arises from a contract: agree at time t that we will invest
$1 at time T in return for e(ST )F (t,T,S) at time S
214 / 561
No-arbitrage argument
By definition, contract has value zero at time t provided F (t, T, S) is the fair
forward rate between T and S
We argue that:
F (t, T, S) =
P(t, T )
1
log
ST
P(t, S)
215 / 561
No-arbitrage argument
Suppose this is not true and that
F (t, T, S) >
1
P(t, T )
log
ST
P(t, S)
Units
Value
Forward
+1
0
T -Bond
+1
P(t, T )
S -Bond P(t, T )/P(t, S) P(t, T )
0
216 / 561
No-arbitrage argument
P(t, T )
>0
P(t, S)
217 / 561
No-arbitrage argument
Hence, we cannot have
F (t, T, S) >
1
P(t, T )
log
ST
P(t, S)
1
P(t, T )
log
ST
P(t, S)
F (t, T, S) =
P(t, T )
1
log
ST
P(t, S)
We conclude that
218 / 561
Forward-rate curve
..
.
219 / 561
Forward-rate curve
lim F (t, T, S) =
ST
log P(t, T )
T
T P(t, T )
P(t, T )
Therefore,
P(t, T ) = exp
f (t, u) du
220 / 561
Forward-rate curve
f (t, u)
A=
RT
t
f (t, u) du
221 / 561
Forward-rate curve
F (t, u, u + )
u u+
222 / 561
Forward Rates
223 / 561
Short Rate
R(t, T ) is the risk-free rate of interest over the fixed period from t to T
When we talk about the risk-free rate of interest we mean the
instantaneous risk-free rate:
r(t) = lim R(t, T ) = R(t, t) = f (t, t)
T t
Think of r(t) as the rate of interest on a bank account: this can change on
a daily basis
r(t) is often called the short rate
224 / 561
Short Rate
R(t, T )
r (t)
t
tT
225 / 561
226 / 561
Par Yields
227 / 561
Par Yields
Par yield for a maturity T (annual coupons, = 1), solve following
equation for coupon rates (t, T ):
100 = 100(t, T )
T
X
P(t, s) + 100P(t, T )
s=t+1
Solution given by
1 P(t, T )
(t, T ) = P T
s=t+1 P(t, s)
228 / 561
Yield to Maturity
find :
P(t, T ) =
n
X
c j e(t j t)
j=1
is the YTM
229 / 561
Relationships
For a given t , each of the curves P(t, T ), f (t, T ) and R(t, T ) uniquely
determines the other two
Recall that we have
= exp
f (t, u) du
230 / 561
Relationships
f (t, u)
R(t, T )
A = R(t, T )(T t) =
RT
t
f (t, u) du
P(t, T ) = exp(A)
231 / 561
Example
T
F (0, T, T + 1)
0
1
2
3
0.0420 0.0500 0.0550 0.0560
P(0, T ) = exp
T 1
X
F (0, t, t + 1)
t=0
T
P(0, T )
0
1
2
3
0.95887 0.91211 0.86239 0.81628
232 / 561
Example
R(0, T ) =
T
F (0, T, T + 1)
P(0, T )
R(0, T )
log P(0, T )
T +1
0
1
2
3
0.0420 0.0500 0.0550 0.0560
0.95887 0.91211 0.86239 0.81628
0.0420 0.0460 0.0490 0.05075
233 / 561
Arbitrage-Free Pricing
234 / 561
Arbitrage
We are able to construct at time 0 some portfolio which has a NPV of zero
At some fixed time T > 0, the portfolio gives us a sure profit
This is sometimes called a free lunch
235 / 561
Arbitrage
236 / 561
V (t) =
n
X
x i Pi (t)
i=1
237 / 561
At time t = 0:
V (0) =
n
X
x i Pi (0) = 0
i=1
At time T > 0:
P(V (T ) 0) = 1
P(V (T ) > 0) > 0
238 / 561
Principle of No Arbitrage
239 / 561
240 / 561
P(0, T ) = exp
f (0, u) du
241 / 561
Probability space (, F , P)
Our model is defined so that at time 1 the forward rate curve will be
f (1, u) = f (0, u) + ,
for u > 1
: R is a random variable
242 / 561
u 7 f (0, u)
243 / 561
u 7 f (1, u)
{
u 7 f (0, u)
244 / 561
u 7 f (1, u)
u 7 f (0, u)
245 / 561
{
246 / 561
T1
$1
T2
$1
T3
247 / 561
P(1, T ) = exp
f (1, u) du
( f (0, u) + ) du
= exp
= exp
f (0, u) du e(T 1)
P(0, T ) (T 1)
e
P(0, 1)
248 / 561
P(1, T ) =
P(0, T ) (T 1)
e
P(0, 1)
(6)
249 / 561
3
X
x i P(0, Ti ) = 0
i=1
3
X
i=1
3
X
i=1
250 / 561
V1 () =
=
=
3
X
i=1
3
X
x i P(1, Ti )
xi
P(0, Ti ) (Ti 1)
e
P(0, 1)
by (6)
i=1
(T2 1)
e
g()
P(0, 1)
251 / 561
Where
g() =
3
X
x i P(0, Ti )e(Ti T2 )
i=1
252 / 561
x i P(0, Ti ) = 0
i=1
then g(0) = 0
Also, at time 1 and for a shift ,
253 / 561
As
g() =
3
X
x i P(0, Ti )e(Ti T2 )
i=1
254 / 561
x i (T2 Ti )P(0, Ti ) = 0
i=1
x i Ti P(0, Ti ) = 0
i=1
since T2
3
X
x i P(0, Ti ) = 0
i=1
255 / 561
Finally, it is sufficient that g 00 () > 0 for all to ensure that g() > 0 for all
6= 0
As
00
g () =
3
X
i=1
g () > 0 for all if and only if x 1 and x 3 are both greater than or equal to
0
00
256 / 561
3
X
x i Ti P(0, Ti ) = 0
i=1
257 / 561
3
X
x i P(1, Ti ) > 0
i=1
for all 6= 0
Arbitrage!
258 / 561
Example 2
P(1, t + 1) =
e0.1t
e
0.06t
if = 1
if = 0
259 / 561
Example 2
260 / 561
Example 2
Solving the system of equations:
1
= 1.173511
P(0, 2)
1
x3 =
= 0.635624
2P(0, 3)
1
x1 =
= 0.541644
2P(0, 1)
x2 =
261 / 561
Example 2
262 / 561
Conclusion
Parallel shifts of the yield curve cannot occur at any time in the future
263 / 561
264 / 561
Z
r(s) ds
265 / 561
Theorem (FTAP)
(i) Bond prices evolve in arbitrage-free manner if and only if exists measure Q
)
equivalent to P under which P(t,T
B(t) is a martingale for 0 < t < T
(ii) If (i) holds, then the market is complete if and only if Q is the unique
)
measure under which P(t,T
B(t) are martingales
266 / 561
267 / 561
Corollary
Hence
P(t, T ) = EQ exp
r(s) ds F t
268 / 561
V (t) = EQ exp
r(u) du X F t
269 / 561
270 / 561
V (t) = EQ exp
= EQ e
RT
r(u) du
r(u) du X F t
h
RS
i i
F T F t
i
r(u) du
EQ e
h RT
t r(u) du
KEQ e
F t
i
i
h RS
h RT
= EQ e t r(u) du F t KEQ e t r(u) du F t
t
= P(t, S) K P(t, T )
271 / 561
P(t, S)
P(t, T )
272 / 561
273 / 561
274 / 561
Factors
275 / 561
Factors
276 / 561
A Bond is a Derivative
277 / 561
Types Of Model
278 / 561
Types Of Model
279 / 561
Types Of Model
280 / 561
Types Of Model
No-Arbitrage Models use the observed term structure at the current time
as the starting point
Future prices evolve in a way which is consistent with this initial price
structure and which is arbitrage free
These models are used to price short-term derivatives
281 / 561
Types Of Model
282 / 561
Blacks model
283 / 561
Types Of Model
284 / 561
where
log(F0 /K) + 2 T /2
d1 =
,
p
T
p
log(F0 /K) 2 T /2
d2 =
= d1 T .
p
T
285 / 561
Example
European put future option on crude oil. TTM is 4 months, current futures
price is $20, exercise price is $20, risk-free rate is 5% per annum, and the
volatility of the futures price is 25% per annum.
F0 = 20,
286 / 561
287 / 561
288 / 561
289 / 561
p
d2 = d1 T .
This gives
C = P(0, T ) (F0 (d1 ) K(d2 )) ,
290 / 561
Blacks model can be extended to allow for the situation where the payoff
is calculated at time T but the payoff is actually made at some time S > T
This basically changes the discounting:
C = P(0, S) (F0 (d1 ) K(d2 )) ,
291 / 561
292 / 561
293 / 561
When interest rates are stochastic Blacks model involves two approximations:
E(VT ) = F0 . In a risk-neutral world, E(VT ) is equal to its futures price. The
forward price and futures price are not the same when interest rates are
stochastic
The stochastic behaviour of interest rates is not taken into account for the
discounting
294 / 561
295 / 561
Week 7
296 / 561
297 / 561
Continuous-time setting
One-factor models for the term structure of interest rates
Bond pricing given a one-factor diffusion model for the risk-free rate r(t)
298 / 561
299 / 561
300 / 561
Model
a(r)
Merton (1973)
Dothan (1978)
r
Vasicek (1977)
( r)
CIR (1985)
( r)
Pearson-Sun (1994)
( r)
Brenann-Schwartz (1979)
( r)
Black-Karasinski (1991)
r r log r
b(r)
p r
r
r
r
301 / 561
Model r(t) 0?
AR?
Simple Formula?
M
N
N
Y
D
Y
N
N
V
N
Y
Y
CIR
Y
Y
Y
PS
Y if > 0
Y
N
BS
Y
Y
N
BK
Y
Y ( > 0)
N
302 / 561
All models are approximation to reality but some are better than others
One must consider:
Desired characteristics
Ease of implementation
How well does model approximate reality
Fast vs. slow calculation
303 / 561
304 / 561
Other Characteristics
305 / 561
Other Characteristics
Does the model fit historical data well? (e.g., mean and variance)
If the model keeps rates positive, does it allow forward rates, spot rates
and par yields to take values close to zero?
Does the model have equilibrium derivation?
306 / 561
Other Characteristics
307 / 561
Other Characteristics
308 / 561
309 / 561
310 / 561
311 / 561
dB(t) = r(t)B(t)d t
The solution is
Z
r(u) du
312 / 561
The risk premium at time t on the risky bond P(t, T ) is defined as the
excess expected rate of return on the bond m(t, T ) over the risk-free rate
of interest r(t)
The risk premium represents the extra reward we get for investing in the
risky asset rather than the risk-free cash account
313 / 561
m(t, T ) r(t)
S(t, T )
The market price of risk represents the excess expected return per unit of
volatility
314 / 561
315 / 561
Theorem
There exists a measure Q P with
V (t) = EQ exp
r(u) du X F t
f(t) and W
f(t) is a BM under Q
where d r(t) = (a(t) (t)b(t))d t + b(t)d W
316 / 561
317 / 561
318 / 561
1.
2.
3.
4.
5.
319 / 561
P(t, T )
= B(t)1 P(t, T )
B(t)
to get
320 / 561
d(B(t)1 ) =
321 / 561
d Z(t, T ) =
as
dB 1 , P(t) = 0 d t
322 / 561
m(t, T ) r(t)
S(t, T )
so defining
f(t) = W (t) +
W
(u) du
323 / 561
Z T
1
EP exp
(u)2 du <
2 0
324 / 561
Z T
Z T
1
dQ
(u)dW (u)
(u)2 du
= exp
dP
2 0
0
Under which
f(t) = W (t) +
W
(u) du
325 / 561
Note that under the same change of measure from P to Q we have that
d P(t, T ) = P(t, T )[m(t, T ) d t + S(t, T ) dW (t)],
under P
becomes
f(t)],
d P(t, T ) = P(t, T )[r(t) d t + S(t, T ) d W
under Q
Under Q, all T -bond prices have drift equal to the risk-free rate r(t)
326 / 561
Z T
1
2
EQ exp
S(t, T ) d t
<
2 0
327 / 561
328 / 561
(u)d Z(u, T )
329 / 561
330 / 561
331 / 561
d V (t) = d[B(t)D(t)]
= B(t)d D(t) + D(t)dB(t) + dB(t)d D(t)
= B(t)(t)d Z(t, T ) + D(t)r(t)B(t)d t + 0d t
f(t) + ((t)Z(t, T ) + (t))r(t)B(t)d t
= (t)B(t)S(t, T )Z(t, T )d W
f(t)) + (t)r(t)B(t) d t
= (t)P(t, T )(r(t)d t + S(t, T )d W
= (t)d P(t, T ) + (t)dB(t)
The portfolio process is self-financing
332 / 561
We also have
333 / 561
V (t) = B(t)D(t)
B(t)
= EQ
X S F t
B(S)
ZS
= EQ exp
r(u) du X S F t
334 / 561
335 / 561
P(t, S) = EQ exp
r(u) du F t
Proof.
Take X S = 1 in previous theorem.
336 / 561
Under Q the prices of all tradable assets have the risk-free rate of interest
as the expected growth rate
Under P,
d V (t) = V (t)[(r(t) + (t)V (t)) d t + V (t)dW (t)]
(t)V (t) is called the market risk premium or risk premium
337 / 561
Notice that (t)V (t) depends on the market price of risk (t)
This implies that the risk premiums on different assets are closely linked
They can differ only through the volatility in the tradable asset (V (t) or
S(t, T ) for a ZCB)
338 / 561
We anticipate that ZCB will have a positive risk premium ((t)S(t, T ) > 0
for all T > t ) to reward for the extra risk
Derivatives V (t) for which V (t) has the same sign as S(t, T ) also have a
positive risk premium (e.g., call option on P(t, T ))
Opposite sign for V (t) leads to a negative risk premium (e.g., Put option
on P(t, T ))
339 / 561
340 / 561
341 / 561
r(t) is Markov, i.e., the future dynamics of r(t) given its current value is
independent of past behaviour
The market is efficient: no transactions costs and investors are rational
342 / 561
343 / 561
f
f
1 2f
dt +
d r(t) + b2 2 d t
t
x
2 x
f
f
1 2f
=
dt +
(ad t + bdW (t)) + b2 2 d t
x
2 x
t
2
f
f
f
1 f
=
+a
+ b2 2 d t + b
dW (t)
t
x 2 x
x
d f (t, r(t)) =
344 / 561
S(t, T ) =
1 P
b
P r
345 / 561
We can now consider the market price of risk and use it to derive an
equation for price of a T -bond
Construct a portfolio of:
V1 (t) of a T1 -bond (i.e., short)
V2 (t) of a T2 -bond (i.e., long) with T2 > T1
Total portfolio worth is V (t) = V2 (t) V1 (t)
We now vary V1 (t) and V2 (t) so that V (t) remains risk-free
346 / 561
V1 (t)
Si = S(t, Ti )
347 / 561
Then we get
V1 (t) S(t, T2 ) S2
=
=
V2 (t) S(t, T1 ) S1
V1 (t)S1 V2 (t)S2 = 0
and
V2 m2 V1 m1 =
S1 V
S V
m2 2
m1
S1 S2
S1 S2
m2 S1 m1 S2
V (t)
dt
S1 S2
348 / 561
m2 S1 m1 S2
dV (t) = V (t)
dt
S1 S2
349 / 561
m2 S1 m1 S2
m1 r
m r
= r(t) or
= 2
S1 S2
S1
S2
This must be true for all maturities. Thus for all T > t
m(t, T ) r(t)
= (t)
S(t, T )
(t) is the market price of risk and cannot depend on the maturity date T
350 / 561
P 1 2 2P
1 P
+a
+ b
m(t, T ) =
P t
r 2 r2
1 P
S(t, T ) = b
P r
351 / 561
P
P 1 2 2P
rP = 0
+ (a b)
+ b
t
r 2 r2
352 / 561
1 2
2
t F (t, r) + f (t, r) r F (t, r) + (t, r) r F (t, r) r F (t, r) = 0
2
F (T, r) = (r)
Then1 M (t) = exp(
Rt
0
r (u) du (r(T ))F t
f
where r (t) = r(t) and under Q satisfies dr (t) = f (t, r )d t + (t, r )d W
1
353 / 561
(t, r) = b(t, r)
P(t, T ) = EQ exp
r (u) du F t
354 / 561
Under P we have
Under Q we have
f(t)
dr (t) = f (t, r (t))d t + (t, r (t))d W
We need to satisfy ourselves that these measures P and Q are equivalent
355 / 561
f is a
Then by Girsanovs theorem, there exists Q equiv. to P under which W
BM with Radon-Nikodym derivative
Z T
Z T
dQ
1
= exp
(t)dW (t)
(t)2 d t
dP
2 0
0
356 / 561
357 / 561
358 / 561
Additional Comments
359 / 561
360 / 561
Z T
P(t, T ) = EQ exp
r(u) du
t
Some people back out the market price of risk by statistical methods
from historical observations of price movements
This is actually wrong... statistical observations are under P
361 / 561
Week 8
Particular Models
Affine Short-Rate Models
362 / 561
Particular Models
363 / 561
364 / 561
and variance
1 e2s
2
This implies that the long-term standard deviation of r(t) is
p
2
365 / 561
where
B(t, T ) =
1 e(T t)
2
2
A(t, T ) = (B(t, T ) (T t)) 2
B(t, T )2
2
4
366 / 561
where
p
1
P(t, S)
+
, d2 = d1 p
log
p
K P(t, T )
2
v
t 1 e2(T t)
p = (1 e(ST ) )
d1 =
367 / 561
The Vasicek model has a major problem: the risk-free rate can become
negative
Further, empirical evidence shows that the volatility of r(t) is not constant
but is an increasing function of r(t)
The first tractable model that fixed these problems was the CIR model:
f(t)
d r(t) = ( r(t)) d t + r(t)d W
f is BM under EMM Q
where , , > 0 and W
368 / 561
where
2(et 1)
B(t, T ) =
( + )(et 1) + 2
2
2e(+)t/2
A(t, T ) = 2 log
( + )(et 1) + 2
p
= 2 + 22
369 / 561
370 / 561
371 / 561
372 / 561
2.8%
2.4%
2.0%
Yield
3.2%
3M
6M
1Y
2Y
5Y
10Y
20Y
373 / 561
1
R(0, T ) = (A(0, T ) B(0, T )r(0))
T
374 / 561
i=1
375 / 561
2.8%
2.4%
2.0%
Yield
3.2%
3M
6M
1Y
2Y
5Y
10Y
20Y
376 / 561
377 / 561
A hedge fund might view the difference between the model prices and the
market prices as the market is mispricing certain assets and build a
trading strategy to capture the mispricings
378 / 561
There exist models that fit the current market perfectly: they are called
no-arbitrage models (e.g., Ho & Lee, Hull & White)
In an equilibrium model, todays term structure of interest rates is an
output
In a no-arbitrage model, todays term structure of interest is an input
379 / 561
380 / 561
Affine Models
We have seen that both the Vasicek and CIR models have T -bond prices
that have the affine form
P(t, T ) = exp(A(t, T ) B(t, T )r(t))
An affine transformation in mathematics is a linear transformation
followed by a translation (i.e., x 7 B x + a).
The form of A and B depend on the model
Q: Are there any other models that have an affine form for P(t, T )?
381 / 561
Affine Models
Consider the abstract SDE for r(t)
f(t)
d r(t) = m(t, r(t))d t + s(t, r(t))d W
f is a BM under EMM Q
where W
Suppose that
P(t, T ) = exp(A(t, T ) B(t, T )r(t))
Apply Its formula with f (t, x) = exp(A(t, T ) B(t, T )x) to get
A B
1 2
f(t)
d P(t, T ) = P(t, T )
r(t) Bm + Bs d t Bsd W
t
t
2
382 / 561
Affine Models
But under Q, we know that
f(t)]
d P(t, T ) = P(t, T )[r(t)d t + S(t, T )d W
t, r
383 / 561
Affine Models
Now, to ensure this holds we need
2
2
2g
2 m(t, r) 1
2 (s(t, r) )
= B(t, T )
+ B(t, T )
=0
r2
r2
2
r2
+ B(t, T )
=0
r2
2
r2
Hence we conclude
2 (s(t, r)2 )
=0
r2
and
2 m(t, r)
=0
r2
384 / 561
Affine Models
Theorem
If the drift and volatility of r(t) take the form
m(t, r(t)) = a(t) + b(t)r(t)
and
s(t, r(t)) =
(t)r(t) + (r)
385 / 561
Affine Models
386 / 561
Affine Models
f(t).
d r(t) = ( r(t))d t + r(t)d W
387 / 561
Affine Models
Example (Merton Model)
= 0, = 2 , b = 0 which gives
f(t)
d r(t) = ad t + d W
388 / 561
Affine Models
Example (Pearson and Sun Model)
= 2 , = /2 , b = and a = ( + ) which gives
f(t).
d r(t) = ( r(t))d t + r(t) d W
Note:
This model is similar to CIR
Now, the minimum value for r(t) is
One can derive A(t, T ) and B(t, T ) in terms of the A and B from the CIR
389 / 561
Week 9
No-Arbitrage Models
The Heath-Jarrow-Morton (HJM) Framework
390 / 561
No-Arbitrage Models
391 / 561
Introduction
392 / 561
Introduction
Market practitioners prefer no-arbitrage models over equilibrium models
Market makers need to both buy and sell at quoted prices. If their pricing
model gives different prices than the rest of the market then this will give
rise to arbitrage opportunities against the market maker
No-arbitrage models allow market makers to check market scenarios (e.g.,
what happens to other prices if the short-term rate increases?)
Market makers are typically considered with the pricing and hedging of
short-term claims. No-arbitrage models work well in this situation
Banks typically want to price OTC derivatives in a way which is consistent
with the closest-matching traded derivatives
393 / 561
Markov Models
394 / 561
Recall that a model is called affine if the T -bond prices are given by
P(t, T ) = exp (A(t, T ) B(t, T )r(t))
395 / 561
f(t)
a(t) + (t)r(t)d W
and
P(t, T ) = exp (A(t, T ) B(t, T )r(t))
396 / 561
397 / 561
(t) = 0,
a(t) = 2 ,
(t) = 0
1
t A(t, T ) = 2 B 2 (t, T ) + (t)B(t, T ), A(T, T ) = 0,
2
t B(t, T ) = 1,
B(T, T ) = 0.
398 / 561
1 ds
t
g(T ) g(t) = (T t)
B(t, T ) = T t
=0
399 / 561
400 / 561
with
B(t, T ) = T t
2
A(t, T ) =
(T t)3
6
(s)(T s) ds
t
401 / 561
log P(t, T )
T
=
[A(t, T ) B(t, T )r(t)]
T
f (t, T ) =
402 / 561
2
f (t, T ) = (T t)2 +
2
(s) ds
t
(s) ds + r(t)
t
403 / 561
(s) ds + r(0)
404 / 561
(s) ds =
Z
0
fobs (0, s) + 2 s
s
fobs (0, s) ds +
s
2 s ds
Our theoretical forward curve matches the market forward curve exactly!
405 / 561
P(t, T ) = exp
f (t, u) du
P(t, T ) = exp
2
2
t(T t) (T t)r(t)
2
406 / 561
is easily found to be
r(t) = r(0) +
f(t)
(s) ds + W
2
s f obs (0, s) + s
we get
2 t 2
f(t)
r(t) = fobs (0, t) +
+ W
2
407 / 561
This means that r(t) fluctuates along a modified initial forward curve
408 / 561
d r(t) = ((t)
r(t)) d t + d W
f(t) is a BM under Q and (t) is a deterministic function of time
where W
In the Vasicek model, (t) = is constant
409 / 561
(t) = ,
a(t) = 2 ,
(t) = 0
1
1 e(T t)
410 / 561
t B(t, T ) = e(T t)
T B(t, T ) = T
Hence,
T B(s, T ) = s B(s, T )
411 / 561
412 / 561
1
2
s fobs (0, s) + fobs (0, t) + 2 (1 et )2
then we get
413 / 561
As
2
(1 et )2
22
we get
2
r(t) = fobs (0, t) + 2 (1 et )2 +
2
Last Modified: 14/10/2014 08:19
f(s)
e(ts) d W
414 / 561
d1 =
p
1
P(t, S)
+
,
log
p
K P(t, T )
2
d2 = d1 p
v
t 1 e2(T t)
p := (1 e(ST ) )
2
Last Modified: 14/10/2014 08:19
415 / 561
Dothan Model
In 1978, Dothan proposed the model
d r(t) = r(t) dW (t)
416 / 561
Dothan Model
The solution to the SDE is
r(t) = r(s) exp
2
f(t) W
f(s)) ,
(t s) + (W
2
st
417 / 561
Dothan Model
No closed-form solution for T -bond prices exists
Recall that the cash-bond is given by
Z t
B(t) = exp
r(u) du
0
EQ [B()] = EQ exp
Z
r(u) ds
r(0) + r()
EQ exp
2
X
EQ e e , X Normal
=
Last Modified: 14/10/2014 08:19
418 / 561
Dothan Model
This means that the Dothan model cannot be used to price Eurodollar
futures
This is why Market Models have been introduced
419 / 561
Black-Karasinski Model
In 1991, Black and Karasinski introduced the following model.
Start with Y (t) = log(r(t)), whereby
f(t)
d Y (t) = (t)(log (t) Y (t)) d t + (t) d W
f(t) is a Q-BM
where W
Applying Its formula gives
(t)2
f(t)
log r(t) d t + (t)r(t) d W
d r(t) = (t)r(t) log (t) +
2(t)
420 / 561
Black-Karasinski Model
Under this model, r(t) has a local mean-reversion level of
2
(t)
(t) exp
2(t)
Like the Dothan model, there is no closed-form solution for T -bond prices
It is a lognormal model
Similar to the Dothan model, we get
EQ [B()]
as 0
421 / 561
422 / 561
The HJM approach gives a very general framework that we can work in to
develop a no-arbitrage model
It describes how a general model for the term-structure should evolve in a
way that is arbitrage-free and which the initial forward-rate curve is part of
the input
423 / 561
424 / 561
(s, T ) ds +
(s, T ) dW (s)
The coefficients and may depend upon f (t, T ) or the whole forward
curve at time t
Or even more generally, upon F t = ({W (s) : s t})
As all f (t, T ) depend on the same BM, the changes over the whole
forward rate curve are perfectly related to each other (in a nonlinear way)
425 / 561
For all T , (t, T ) and (t, T ) are previsible and depend only upon the
history of W (s) up to time t
Z TZ T
|(s, t)| dsd t < for all T
426 / 561
(s, T ) ds +
(s, T ) dW (s)
The key insight of HJM (1980) is: if we assume that the dynamics of f (t, T )
lead to an arbitrage-free model then there must be a drift condition
Our aim is now to determine what this drift condition is
427 / 561
428 / 561
Z
r(u) du
429 / 561
(s, T )ds +
we get
r(T ) = lim f (t, T ) = f (0, T ) +
t%T
(s, T ) dW (s) +
(s, T ) ds
430 / 561
Z
0
f (0, u) du +
Z tZ
0
(s, u) duds
Z t Z
0
431 / 561
Tradable Assets
Our tradable assets will be all the zero-coupon bonds with prices P(t, T )
for T > t
Our T -bond prices are given by
T
P(t, T ) = exp
f (t, u) du
Z t Z
P(t, T ) = exp
(s, u) du dW (s)
f (0, u) du
t
Z tZ
(s, u) duds
432 / 561
Tradable Assets
Define the discounted asset price
Z(t, T ) :=
P(t, T )
B(t)
f (0, u) du
with
S(s, T ) :=
Z tZ
(s, u) duds
(s, u) du
433 / 561
Tradable Assets
434 / 561
Tradable Assets
d Z(t, T ) = Z(t, T )
Z T
1 2
(t, u)du d t + S(t, T )dW (t)
S (t, T )
2
t
435 / 561
Change of Measure
436 / 561
Change of Measure
By Girsanovs theorem, there exists a measure Q equivalent to P such that
Z t
f(t) = W (t) +
W
(s) ds
0
is a Q-BM
Under Q we have
f(t)
d Z(t, T ) = Z(t, T )S(t, T )d W
Therefore, Z(t, T ) is a martingale under Q (assuming S(t, T ) satisfies the
technical condition)
437 / 561
Change of Measure
It follows that the dynamics of the T -bond prices under Q are given by
f(t)
d P(t, T ) = P(t, T ) r(t) d t + S(t, T )d W
So all T -bonds have expected return equal to the risk-free rate r(t) under
Q
We now need to determine the relationship between (t, T ), (t, T ), and
(t) so that the model is arbitrage-free
438 / 561
Replicating Strategies
439 / 561
Replicating Strategies
Find the equivalent measure Q under which Z(t, T ) is a martingale
Define the Q-martingale D(t) = EQ [B(S)1 X |F t ]
Find the previsible process (t) such that
Z t
D(t) = D(0) +
(s) d Z(s, T )
440 / 561
Replicating Strategies
441 / 561
Replicating Strategies
(s) d Z(s, T )
442 / 561
Replicating Strategies
Suppose we have a trading strategy which holds (t) units of the T -bond
with price P(t, T )
Set (t) = D(t) (t)Z(t, T ) to be the amount of cash B(t) that we hold
at time t
The value of the portfolio at time t is given by
443 / 561
Replicating Strategies
444 / 561
Replicating Strategies
We have
d P(t, T ) = d(B(t)Z(t, T )) = r(t)B(t)Z(t, T )d t + B(t)d Z(t, T )
The instantaneous investment gain is
445 / 561
Replicating Strategies
446 / 561
ZS
r(u) du F t
P(t, S) = B(t)EQ B(S)1 |F t = EQ exp
447 / 561
P(t, S)
= EQ B(S)1 |F t
B(t)
448 / 561
(t, u) du = (t)
t
Or equivalently,
Z
t
1
(t, u) du = S(t, T )2 (t)S(t, T )
2
449 / 561
(s, u) du
so that
And differentiating
S(t, T ) = (t, T )
T
RT
t
450 / 561
f(t)
= (t, T )S(t, T )d t + (t, T )d W
451 / 561
r(t) = f (0, t)
(s, T )S(s, t) ds +
f(s)
(s, t)d W
452 / 561
Week 10
Multifactor Models
453 / 561
Multifactor Models
454 / 561
Introduction
We shall now look at models which include more than one source of
randomness
When you look at historical interest rate data to see that changes in
interest rates with different maturities are not perfectly correlated as
predicted by one-factor model
We can look at some data from
http://www.rba.gov.au/statistics/tables/
455 / 561
Yield (%)
5
6
2000
2005
Year
2010
456 / 561
5
6
Short Yield (%)
457 / 561
Introduction
458 / 561
Introduction
459 / 561
460 / 561
X 1 (t)
X 2 (t)
X (t) =
...
X n (t)
The model is called affine if the T -bond prices are of the form
n
X
P(t, T ) = exp A(t, T ) +
B j (t, T )X j (t)
j=1
461 / 561
where
B1 (t, T )
B2 (t, T )
B(t, T ) =
..
Bn (t, T )
B(t, T ) T is the transpose of B(t, T )
462 / 561
463 / 561
f(t)
d r(t) = (a + br(t))d t + r(t) + d W
Q: What form must the SDE of X (t) have in the n-dimensional case if X (t)
is affine?
464 / 561
q
1T X (t) + 1
0
0
0
2T X (t) + 2 0
0
..
.
.
..
..
D(X (t)) =
..
..
...
.
.
0
nT X (t) + n
where each i R and each i = (i1 , , in ) T is a constant vector.
Dale Roberts (c) 2011-2014
465 / 561
466 / 561
1
[A( j ) + B() T X (t)]
j
467 / 561
P(t, t + ) = exp A() + B() T BR1 (R(t) aR )
()R(t)
= exp A()
+B
468 / 561
()R(t)
P(t, t + ) = exp A()
+B
469 / 561
is a (n-dimensional) P-BM
470 / 561
471 / 561
472 / 561
473 / 561
where = (1 , . . . , n ) T Rn
If all i 6= 0 we can scale X (t) and assume i = 1
474 / 561
475 / 561
476 / 561
x2
2!
x3
3!
+ . . . so we have
exp(B) = BR exp()B L
exp() = I + +
=
2 3
+
+ ...
2!
3!
X
k
k=0
k!
477 / 561
Y (t) := et B L X (t)
Y (t) = Y (0) +
f(u)
eu B L K d W
478 / 561
f(u)
e(tu)B K d W
X (t) = e t B X (0) +
479 / 561
and
exp(t B) 0
as t
Think of the 1-dimensional case where a R, then
et a 0
as
480 / 561
r(t) d t
0
= T +
T X (t) d t
VarQ [R(T )] =
481 / 561
1 0
B = BR B L
B=
0 1
then
BR = B L = I, = I
482 / 561
1
VarQ [R(T )|X (0)]
2
Z
1 2 2
E e
= exp +
2
483 / 561
where
f1 (t),
d X 1 (t) = 1 X 2 (t)d t + 11 d W
f1 (t) + 22 d W
f2 (t).
d X 2 (t) = 2 X 1 (t)d t + 21 d W
484 / 561
where
f1 (t),
d X 1 (t) = 1 (X 2 (t) X 1 (t))d t + 11 d W
f1 (t) + 22 d W
f2 (t).
d X 2 (t) = 2 X 2 (t)d t + 21 d W
In this model, + X 2 (t) behaves like a stochastic (local) mean reversion level
for r(t).
485 / 561
r(t) =
n
X
X i (t)
i=1
486 / 561
r(u) du F t
n
X
P(t, T ) = EQ exp
= EQ exp
n
Y
EQ exp
t
i=1
= exp
i=1
T
n
X
X i (u) du F t
X i (u) du F t
Ai (T t)
i=1
n
X
Bi (T t)X i (t)
i=1
487 / 561
2i i
2i
2i e(i +i )/2
log
(i + i )(ei 1) + 2i
2(ei 1)
,
(i + i )(ei 1) + 2i
2i + 22i
If we have 2i i > 2i then the probability that X i (t) hits zero is zero
488 / 561
fi (t)
Yi (t)d W
f1 and W
f2 are independent Q-Brownian motions
where W
For constants c1 and c2 , define
489 / 561
f1 (t)
= c1 1 (1 Y1 (t))d t + Y1 (t)d W
f2 (t)
+ c2 2 (2 Y2 (t))d t + Y2 (t)d W
490 / 561
491 / 561
492 / 561
where
i (t, T ) =
Si (t, T )
T
493 / 561
494 / 561
Week 11
495 / 561
496 / 561
497 / 561
A New Numeraire
For a T -bond, we have the dynamics
f(t)]
d P(t, T ) = P(t, T )[r(t) d t + S(t, T )d W
f(t) is a Q-BM
where W
Previously, we considered the discounted price process
Z(t, U) =
R t
0
P(t, U)
B(t)
The discounted price was a martingale under the same measure Q for all
maturity dates U
498 / 561
A New Numeraire
Lets change the numeraire from B(t) to P(t, T ). Thus we consider
Y (t, U) =
P(t, U)
P(t, T )
Then we get
1
1
1
d Y (t, U) =
d P(t, U)+P(t, U)d
+d P(t, U)d
P(t, T )
P(t, T )
P(t, T )
499 / 561
A New Numeraire
Then
f(t)
d Y (t, U) = Y (t, U) (S(t, U) S(t, T )) d W
+ Y (t, U)S 2 (t, T ) d t Y (t, U)S(t, U)S(t, T ) d t
c(t) with
Now suppose we define a new process W
c(0) = 0
W
and
c(t) = d W
f(t) S(t, T ) d t
dW
Then
c(t) + S(t, T )d t]
d Y (t, U) = Y (t, U) (S(t, U) S(t, T )) [d W
Y (t, U)S(t, T )[S(t, U) S(t, T )]d t
c(t)
= Y (t, U)[S(t, U) S(t, T )]d W
Last Modified: 14/10/2014 08:19
500 / 561
A New Numeraire
So we have
c(t)
dY (t, U) = Y (t, U)[S(t, U) S(t, T )]d W
It appears that we can find a suitable measure under which Y (t, U) for all
U > T are martingales
501 / 561
Change of Measure
Define (t) = S(t, T )
By Girsanovs theorem, there exists a measure P T which is equivalent to Q
such that
Z
t
c(t) = W
f(t) +
W
(s) ds
ZU
1
EQ exp(
(t)2 d t) <
2
0
P T is called a forward measure
502 / 561
Change of Measure
The forward measure depends on the forward maturity date T but not
upon the maturity date U of the bond under consideration
The Y (t, U) for U > T are all martingales under the measure P T
503 / 561
Derivative Payments
Let X be a claim contingent upon F T payable at time T
Define
D(t) = EPT
X
F t = EPT [X |F t ]
P(T, T )
504 / 561
A Replicating Strategy
Let (t) = D(t) (t)Y (t, U)
Consider the investment strategy which holds at time t , (t) units of the
numeraire P(t, T ) and (t) units of U -bond
The value of this portfolio at time t is
V (t) = P(t, T )D(t)
Now
f(t)]
d P(t, T ) = P(t, T )[r(t)d t + S(t, T )d W
c(t) + S(t, T )d t)]
= P(t, T )[r(t)d t + S(t, T )(d W
c(t))
= P(t, T )([r(t) + S 2 (t, T )]d t + S(t, T )d W
505 / 561
A Replicating Strategy
And we have
506 / 561
A Replicating Strategy
Thus
d V (t) = D(t)d P(t, T ) + P(t, T )d D(t) + d P(t, T )d D(t)
c(t))
= P(t, T )D(t)((r(t) + S 2 (t, T ))d t + S(t, T )d W
c(t)
+ (t)P(t, T )Y (t, U)(S(t, U) S(t, T ))d W
c(t)]
+ (t)P(t, T )Y (t, U)[(r(t) + S 2 (t, T ))d t + S(t, T )d W
c(t)]
[(S(t, U) S(t, T ))d W
c(t)]
= V (t)[(r(t) + S 2 (t, T ))d t + S(t, T )d W
c(t)
+ (t)P(t, U)(S(t, U) S(t, T ))d W
+ (t)P(t, U)S(t, T )(S(t, U) S(t, T )) d t
507 / 561
A Replicating Strategy
The instantaneous investment gain is thus
(t)d P(t, T ) + (t)d P(t, U)
c(t))
= (D(t) (t)Y (t, U))P(t, T )((r(t) + S 2 (t, T ))d t + S(t, T )d W
c(t) + S(t, T )d t))
+ (t)P(t, U)(r(t)d t + S(t, U)(d W
= d V (t)
Hence ((t), (t)) is a self-financing strategy
V (T ) = X so it is also a replicating strategy
508 / 561
A Replicating Strategy
Theorem
The price at time t for the claim X payable at time T is
V (t) = P(t, T )EPT [X |F t ]
This price is independent of the choice of the hedging asset P(t, U)
But the hedging strategy depends upon the choice of U
509 / 561
Z T
V (t) = EQ exp
r(s) ds X F t
510 / 561
511 / 561
Therefore,
EPT
P(T, U)
P(t, U)
,
F t = EPT [P(T, U)|F t ] =
P(T, T )
P(t, T )
512 / 561
and
EPT [log P(T, U)|F t ] = a = log
P(t, U) 1 2
b
P(t, T ) 2
513 / 561
Vasicek Model
Example
Under the Vasicek model,
S(v, U) =
(1 e(Uv) )
Which gives
2
2
b = 2
(e(T v) e(Uv) )2 d v
t
(1 e(UT ) )2 (1 e2(T t) )
3
2
2
514 / 561
where
log(E[Y ]/K) + 12 b2
a + b2 log K
h=
=
b
b
and is the CDF of a standard Normal distribution.
515 / 561
where
1 2
1
P(t, U)
log K + b
h=
log
b
P(t, T )
2
1
P(t, U)
1
= log
+ b
b
K P(t, T ) 2
Recall
b =
516 / 561
517 / 561
518 / 561
P(t, U)
,
P(t, T )
b(t) =
R
R(t)
P(t, T )
b (t, U) + R (t)d R
b(t)
d D(t) = P (t)d P
519 / 561
b (t, U) R (t)R
b(t)
(t) = D(t) P (t) P
of P(t, U)
R (t) of R(t)
(t) of P(t, T )
520 / 561
where
h=
b2 =
log(F (t)/K) + 21 b2
b
T
(1 S(v, T ))2 d v + 22 (T t)
t
F (t) =
R(t)
P(t, T )
521 / 561
Week 12
Market Models
522 / 561
Market Models
523 / 561
Introduction
524 / 561
Introduction
A key modelling assumption was that relevant market rates are lognormal
This was an assumption already used by traders to price some interest
rate derivatives
The market practice was/is to use Blacks formula
525 / 561
Blacks Formula
Definition
A caplet with reset date T and settlement date T + pays the holder the
difference between a simple market rate F (T, T + ) and the strike rate
Its cash flow at time T + is
(F (T, T + ) )+
We write Cpl(t; T, T + ) for the price at time t of the caplet with reset
date T and settlement date T +
526 / 561
Blacks Formula
Definition
A Cap is a strip of caplets. It consists of:
A number of future dates T0 < T1 < < Tn with Ti Ti1 =
A cap rate
We write Cp(t) for its price at time t
We have
Cp(t) =
n
X
Cpl(t; Ti1 , Ti )
i=1
527 / 561
Blacks Formula
It is market practice to price a cap/floor according to Blacks formula
Let t T0 . Blacks formula for the value of the i th caplet is
Cpl(t; Ti1 , Ti ) = P(t, Ti )[F (t; Ti1 , Ti )(d1 (i; t)) (d2 (i; t))]
where
d1,2 (i; t) =
log
F (t;Ti1 ,Ti )
12 (t)2 (Ti1 t)
p
(t) Ti1 t
Here (t) is the cap implied volatility (it is the same for all caplets
belonging to a cap)
528 / 561
Blacks Formula
529 / 561
Blacks Formula
0
Dale Roberts (c) 2011-2014
10
15
20
Maturity (in years)
Last Modified: 14/10/2014 08:19
25
30
530 / 561
Blacks Formula
531 / 561
Market Models
532 / 561
LIBOR
There are two types of market interest rates: LIBOR and forward LIBOR
LIBOR is an annualized, simple rate of interest that will be delivered at the
end of a defined period
The -LIBOR denoted by F (T, T + ) means that an investment of 1 at
time T will grow to 1 + F (T, T + ) at time T +
LIBOR is always quoted as an annual rate of interest even though it
typically applies over shorter periods
533 / 561
LIBOR
For a fixed > 0, the forward -period LIBOR for the future date T
prevailing at time t is the simple forward rate
1
P(t, T )
L(t, T ) := F (t; T, T + ) =
1
P(t, T + )
With this notation, L(T, T ) = F (T ; T, T + ) is the LIBOR rate
534 / 561
P(t, T )
P(t, T + )
is a martingale under P T +
535 / 561
P(t, T )
P(t, T )
d
=
T,T + (t)dW T + (t)
P(t, T + )
P(t, T + )
R T +
where T,T + (t) = T (t, u) du
1 P(t, T )
1
P(t, T )
d L(t, T ) = d
=
T,T + (t)dW T + (t)
P(t, T + )
P(t, T + )
1
= (L(t, T ) + 1) T,T + (t)dW T + (t)
536 / 561
1
(L(t, T ) + 1) T,T + (t)dW T + (t)
537 / 561
(u, T )dW
T +
1
(u)
2
t
2
k(u, T )k du
and variance
k(s, T )k2 ds
t
538 / 561
where
d1,2 (t, T ) :=
log
L(t,T )
21
RT
k(s, T )k2 ds
R T
1/2
2 ds
k(s,
T
)k
t
539 / 561
L(t, T )
(t, T )
L(t, T ) + 1
for some deterministic (t, T ) yields Blacks formula for caplet prices
540 / 561
541 / 561
542 / 561
543 / 561
544 / 561
H(u)dS(u)
E t ((, T ) W ) = exp
T
Z
0
1
(u, T )dW (u)
2
T
t
2
|(u, T )| du
0
545 / 561
t [0, TM 1 ]
P(0, TM 1 )
1
P(0, TM )
The solution is
546 / 561
1 P(0, TM 1 )
1 E t ((, TM 1 ) W TM )
L(t, TM 1 ) =
P(0, TM )
Now we assume that
TM 1 ,TM (t) =
L(t, TM 1 )
(t, TM 1 ),
L(t, TM 1 ) + 1
t [0, TM 1 ]
547 / 561
Z T
under P
f(t) under Q
(t, u)du d t + (t, T )d W
d f (t, T ) = (t, T )
548 / 561
549 / 561
1 P(0, TM 2 )
1
L(0, TM 2 ) =
P(0, TM 1 )
Define
TM 2 ,TM 1 (t) =
L(t, TM 2 )
(t, TM 2 )
L(t, TM 2 ) + 1
L(t, T0 ), . . . , L(t, TM 1 ),
t [0, TM ]
550 / 561
n
X
d L(t, Tm ) = L(t, Tm ) (t, Tm )
Tr ,Tr+1 (t) T d t + (t, Tm )dW Tn+1
r=m+1
For m = n,
m
X
r=n+1
551 / 561
552 / 561
t [0, Tm1 ]
P(t, Tm1 )
d
= d L(t, Tm1 )
P(t, Tm )
= L(t, Tm1 )(t, Tm1 )dW Tm (t)
P(t, Tm1 )
Tm1 ,Tm (t)dW Tm (t)
=
P(t, Tm )
553 / 561
t [0, Tm1 ]
which is a P Tm martingale
This is sometimes call the Tm -forward price process of the Tm1 -bond
554 / 561
P(t, Tk ) Y P(t, Tr )
=
P(t, Tm ) r=k P(t, Tr+1 )
=
P(t, Tm1 )
P(t, Tk )
P(t, Tk+1 )
P(t, Tm )
P(t, Tk )
=
P(t, Tm )
P(t, Tm )
P(t, Tm+1 )
P(t, Tk1 )
P(t, Tk )
555 / 561
P(t,Tk )
P(t,Tm )
t [0, Tk Tm ]
k<m
k>m
556 / 561
at dates t = Tm
We have derived an arbitrage-free market model for the bonds with
maturities T0 , . . . , TM since we have shown that P Tm is a martingale
measure for the Tm -bond as numeraire
557 / 561
Any Tm -contingent claim X with EPTm [|X |] < can now be priced at time
t Tm in terms of the Tm -bond numeraire via
V (t) = P(t, Tm )EPTm [X |F t ]
We can also express this price relative to any future Tn -bond
558 / 561
Lemma
The Tm -bond discounted Tm -claim price satisfies
P(t, Tn )
V (t)
X
=
EPTn
F t
P(t, Tm )
P(t, Tm )
P(Tm , Tn )
for all m < n M .
559 / 561
where
d1,2 (n; Tm ) =
log
L(Tm ,Tn1 )
rR
Tn1
Tm
12
R Tn1
Tm
560 / 561
561 / 561