Professional Documents
Culture Documents
Economic Models
and Mean-field Games Theory
Diogo A. Gomes
KAUST
Levon Nurbekyan
KAUST
Edgard A. Pimentel
UFC
Distribuio: IMPA
Estrada Dona Castorina, 110
22460-320 Rio de Janeiro, RJ
E-mail: ddic@impa.br
ISBN: 978-85-244-0404-7 http://www.impa.br
i i
Contents
1 Introduction 1
1.1 Mean-field games . . . . . . . . . . . . . . . . . . . . . 3
1.2 Mean-field games and economic theory . . . . . . . . . 5
1.3 Outline of the book . . . . . . . . . . . . . . . . . . . . 7
I Deterministic models 11
2 First-order mean-field games 13
2.1 Deterministic optimal control problems . . . . . . . . 13
2.2 Transport equation . . . . . . . . . . . . . . . . . . . . 20
2.3 Reduced mean-field games . . . . . . . . . . . . . . . . 23
i i
i i
i i
ii CONTENTS
II Stochastic models 75
6 Second order MFG 77
6.1 Hamilton-Jacobi and Fokker-Plank
equations . . . . . . . . . . . . . . . . . . . . . . . . . 77
6.1.1 Hamilton-Jacobi equation . . . . . . . . . . . . 77
6.1.2 Fokker-Planck equation . . . . . . . . . . . . . 80
6.2 Second-order mean-field games . . . . . . . . . . . . . 81
6.3 The non-linear adjoint method . . . . . . . . . . . . . 82
6.3.1 The adjoint structure of the MFG systems . . . 86
i i
i i
i i
CONTENTS iii
i i
i i
i i
i i
i i
i i
Introduction
i i
i i
i i
2 [CHAP. 1: INTRODUCTION
i i
i i
i i
i i
i i
i i
4 [CHAP. 1: INTRODUCTION
i i
i i
i i
to [73, 74, 70, 95, 94, 64] for the general theory, the reference [26] for
applications to socio-economic sciences (also see [157]), as well as to
[88, 89, 86] for numerical methods.
Applications with mixed populations or with a major player were
examined in [101, 102, 133]. Linear quadratic problems have been
considered from diverse perspectives, see [103], [104], [16], [105], [96],
[132], [19], [24], [23], [123], [54], [155], and [62]. In the context of
MFG, crowd and population dynamics were investigated in [56, 31,
32, 33]. The consensus problem was studied in [134].
The rigorous derivation of mean-field models was considered in
particular cases in the original papers by Lions and Lasry. Further
developments, using the theory of nonlinear Markov processes were
obtained in [114], [111], and [112] (see also the monograph [113]),
and using PDE methods in [18, 62]. For finite state problems, the
N -player problem was studied in [74] where a convergence result was
established. For earlier works in the context of statistical physics
and interacting particle systems, see [150]. Recent results in this
field include [65] and [117].
A related problem is mean-field control. For that, we refer the
reader to the monograph [21], as well as the papers [50, 134], com-
paring these different approaches.
i i
i i
i i
6 [CHAP. 1: INTRODUCTION
i i
i i
i i
formation was considered in [120, 30, 29, 35, 34, 25]. In [4], the au-
thors develop MFG methods for a wide range of economic problems
modeled by partial differential equations. In particular, the mate-
rial on that paper was the inspiration for part of our discussion in
Chapter 3. Further results are reported in [7], [135] and [145].
i i
i i
i i
8 [CHAP. 1: INTRODUCTION
i i
i i
i i
i i
i i
i i
i i
i i
i i
Part I
Deterministic models
11
i i
i i
i i
i i
i i
i i
First-order mean-field
games
x t = f (vt , xt ), (2.1)
13
i i
i i
i i
i i
i i
i i
Hp (x , Dx V (x , t)) = f (v , x ). (2.7)
i i
i i
i i
Then,
Z T
V (x, t) e(st) u( s )ds + e(T t) (
vs , x xT )
t
Z r Z T
= e(st) u( s )ds +
vs , x e(st) u(vs , xs )ds
t r
+ e(T t) (xT )
Z r
e(st) u( s )ds + e(rt) (V (y, r) ) .
vs , x
t
i i
i i
i i
i i
i i
i i
e(st) u(vs , xs ),
xj
with
q(T ) = D(xT ). (2.17)
To proceed, we also need the notion of strongly approximately
continuous function; we say that a function vs : [t, T ] Rm is
strongly approximately continuous at s [t, T ] if
1 s+
Z
g(vs ) = lim g(vr )dr,
0 s
i i
i i
i i
for all g C(Rm ). In what follows we assume the controls, and their
respective trajectories, to be strongly approximately continuous a.e.
in [t, T ].
Theorem 2.1.2 (Pontryagin maximum principle). Let v be an op-
timal control for the optimal control problem (2.1)-(2.3). Assume that
q is a solution to (2.16) equipped with the terminal condition (2.17).
Then,
q(s) f (vs , xs ) + u(vs , xs ) = H(x , q(s)).
Proof. Fix s [t, T ) such that v is strongly approximately continu-
ous in s, and choose > 0 such that s + [t, T ]. Define
(
r [s, s + ],
v (r) =
vr otherwise.
1 s+ (rt)
Z
0 e (u(v , x ) u(, x )) dr (2.18)
s
Z T
1
+ e(rt) (u(v , x ) u(v , x )) dr
s+
1
+ e(T t) ((x (T )) (x (T ))) .
We proceed by taking the limit 0 in (2.18). Because v is
strongly approximately continuous, the first term on the right-hand
side of (2.18) becomes
where
. x x
(r) = .
i i
i i
i i
Consider now
d h (rt) i d (rt)
e q(r)(r) = e q(t) (t) + e(rt) q(t)(t);
dr dr
by combining (2.19) with (2.16), we obtain
d h (rt) i
e q(r)(r) = e(rt) ux (v , x )(r).
dr
This, along with (2.20) and the definition of the Hamiltonian H,
concludes the proof.
In the literature, (2.17) is called a transversality condition. Other
related transversality conditions are typical in optimal control prob-
lems. For a more detailed discussion, see [66].
Next, we briefly present the very basics concerning the transport
equation and discuss a few facts about its solutions.
i i
i i
i i
i i
i i
i i
Therefore,
Z Z
(x)t (x, t)dx = (b(x, t)x (x)) (x, t)dx,
Rd Rd
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
27
i i
i i
i i
i i
i i
i i
3.1.3 Preferences
The assumption of rationality presumes that agents choose controls
in order to optimize some functional. This functional encodes the
preferences of the agents with respect to the variables of the model.
For example, it is reasonable to suppose that a microeconomic
agent in the model prefers more consumption to less, whereas a cen-
tral bank aims at controlling price levels while promoting growth
and capital accumulation. In the case of microeconomic agents,
these ideas are formalized through an instantaneous utility function.
Monotonicity and concavity encode many qualitative properties of
the model. For instance, with respect to consumption, the utility
function is increasing (agents are assumed to prefer more consump-
tion to less) and concave, for the satisfaction, due to increments in
consumption, is decreasing. In case of a macroeconomic agent, pref-
erences are represented by an instantaneous welfare function.
Because agents make choices over a particular time span, we in-
troduce the intertemporal counterparts of the utility and the welfare
functions.
i i
i i
i i
f (kt ) = ct + it , (3.2)
where ct , it 0.
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
Vt V + H(a, z, r, Va , Vz , t) = 0,
i i
i i
i i
Moreover,
Z Z
d
(z 1)2 dadz = 2 (z 1)2 dadz.
dt
Thus, we obtain Z
c (a, z, t)dadz , (3.14)
i i
i i
i i
i i
i i
i i
38
i i
i i
i i
for all t 0.
i i
i i
i i
The functions and take into account that agents are able to
change, at least partially, their production from consumer to capital
goods and vice-versa. The precise way in which this choice is made
is not relevant in this model.
Finally, the depreciation of the capital stock is governed by a
function g, depending on both the capital level k and the price p, i.e.,
g = g(k, p).
a t = ct pt it + F (kt , pt ). (4.1)
k t = g(kt , pt ) + it . (4.2)
i i
i i
i i
i i
i i
i i
for every t > 0. This equilibrium condition determines the price level
pt .
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
for every t 0.
In this setting, given the interest rates of the two countries, the
economy is completely characterized by two mean-field game systems,
which we describe next.
The price levels in both countries may be identical, in which case
it is determined by (4.23). Alternatively, one of the prices may be
i i
i i
i i
i i
i i
i i
Vt V + H(a, k, pt , E, Va , Vk ) = 0,
where
H(a, k, p, E, Va , Vk ) = sup Va a + Vk k + u . (4.26)
c,i
The optimal controls, given in feedback form, for both the macroe-
conomic and the microeconomic agents are jointly determined by
(
Hqa = ct (pt + (Et ))it + F (kt , pt ),
(4.27)
Hqk = g(kt , pt ) + it ,
i i
i i
i i
where H is as in (4.26).
i i
i i
i i
Mathematical analysis I
- deterministic models
52
i i
i i
i i
+ e(T t) VT (aT , kT ),
with
H(a, k, p, Ua , Uk ) = sup Ua a + Uk k + u , (5.2)
c,i
Us U + Ua a + Uk k u. (5.4)
i i
i i
i i
hence
U (a, k, t) V (a, k, t).
i i
i i
i i
with (
ait0 = ai0 ,
kti0 = k0i .
Then a1t a2t and kt1 kt2 , for all t t0 .
Proof. Because k01 k02 , the uniqueness of solutions of ordinary dif-
ferential equations implies that kt1 kt2 , for all t 0. Then, by the
monotonicity of F , we have F (kt1 , pt ) F (kt2 , pt ). Therefore, a 1t a 2t
and so a1t a2t , for all t t0 .
The first main result of this section is the monotonicity of the
value function in the wealth variables (a, k).
Proposition 5.2.1. Suppose that:
i the instantaneous utility function u is non-decreasing in the wealth
variables (a, k);
ii the production function F (k, p) is non-decreasing in k;
iii the depreciation function g(k, p) is locally Lipschitz in k.
Then, the value function is non-decreasing in the (a, k) variables.
Proof. To simplify the proof, we suppose the existence of maximizers
for (4.3). This assumption is not essential, since the argument can
be modified recurring to -optimal trajectories.
Fix initial states (a1 , k 1 ) and (a2 , k 2 ) such that a2 a1 and k 2
1
k . Let the optimal consumption and investment level for the first
state be (ct , it ). Then, we have
ZT
V (a1 , k1 , t) = e(st) u(cs , is , as , ks )ds.
t
If we choose for the second state the same investment and con-
sumption levels, this choice is clearly suboptimal. Hence, we have
ZT
V (a2 , k2 , t) e(st) u(cs , is , as , ks )ds,
t
i i
i i
i i
ZT
V (a2 , k2 , t) e(st) u(cs , is , as , ks )ds
t
ZT
e(st) u(cs , is , as , ks )ds
t
= V (a1 , k1 , t).
Proof. Consider two initial states (a1 , k 1 ) and (a2 , k 2 ). Suppose that
the optimal consumption and investment levels for these two states
are, respectively, (c1t , i1t ) and (c2t , i2t ). Consider the state (at , kt ) given
by
at = (1 )a1t + a2t ,
kt = (1 )kt1 + kt2 ,
i i
i i
i i
Define
ct = (1 )c1t + c2t ,
it = (1 )i1t + i2t .
t = ct pt it + F (kt , pt ),
a
k t = it + g(kt , pt ).
We claim that
(
at at ,
(5.5)
kt kt .
k t g(kt , pt ) k t g(kt , pt ).
i i
i i
i i
i i
i i
i i
ks ks kkk
, for all s [t, T ].
Similarly, if (
k s = C + g(k s , ps ),
k 0 = k0 ,
we have
ks k s kkk , for all s [t, T ]. (5.9)
This shows that the capital level has uniform bounds. Next, we
address the consumer goods level. For that, let a s be such that
(
, ps ),
s = C + ps C + F (kkk
a
a
0 = a0 .
as a
s k
ak , for all s [t, T ]. (5.10)
i i
i i
i i
we have
as as kak , for all s [t, T ]. (5.11)
Combining (5.3), (5.9), (5.10) and (5.11), we get (5.8), where
, kkk , k
M = max{kkk ak , kak }.
Accordingly, for any control (c, i) such that (5.6) holds, we have:
ZT ZT
(st)
e u(cs , is , as , ks , ps )ds e(st) ds < .
t t
ZT ZT
(st)
e u(cns , ins , ans , ksn , ps ) sup e(st) u(cs , is , as , ks , ps )
c,i
t t
kcn k , kin k C.
ZT ZT
cns (s) + ans (s)ds cs (s) + as (s)ds, (5.12)
t t
i i
i i
i i
Zs Zs
ksn = in d + g(kn , p )d
t t
ZT Zs
= [t,s] ( )in d + g(kn , p )d.
t t
Now, using the uniform convergence, in (5.13), and the weak-* con-
vergence, in (5.12), and passing to the limit on both sides of the
former equality, we get
ZT Zs
ks = [t,s] ( )i d + g(k , p )d
t t
Zs Zs
= i d + g(k , p )d.
t t
Therefore,
k s = is + g(ks , ps ).
Similarly, we can prove that
a s = cs ps is + F (ks , ps ).
Hence, (cs , is , as , ks ) satisfy (4.1) and (4.2). This means that the
quadruple (cs , is , as , ks ) is a candidate for a maximizing trajectory.
Note that the weak-* convergence and uniform convergence yield
(
|cs |, |is | C
(5.14)
|as |, |ks | M
i i
i i
i i
for some constant L > 0. This, combined with (5.15), (5.12) and
(5.13), yields
ZT
lim e(st) u(cns , ins , ans , ksn , ps )ds
n
t
ZT
e(st) u(cs , is , as , ks , ps )ds.
t
i i
i i
i i
i i
i i
i i
u(c, i, a, k, p) H(a, k, p, qa , qk )
qa (c pi + F (k, p)) qk (i + g(k, p)),
or, equivalently,
(
Dc u(c, i, a, k, p) = qa ,
(5.20)
Di u(c, i, a, k, p) = qa p qk .
i i
i i
i i
Regarding the variable qks , because of (5.17) and (5.21), we have that
qks 0 , for s [0, T ].
Z Z
(st)
e u(cs , is , as , ks , ps )ds e(st) u(cs , is , as , ks , ps )ds
t t
Z
e(st) (Dc u(. . . )(cs cs ) + Di u(. . . )(is is )
t
+Da u(. . . )(as as ) + Dk u(. . . )(ks ks ))ds
Z
= e(st) (qas
(cs cs ) + (qas
ps qks
)(is is )
t
+Ha (. . . )(as as )
+(Hk (. . . ) qas
Dk F (ks , ps ) qks
Dk g(ks , ps ))(ks ks ))ds
Z
(st)
= e qas (cs + is ps cs is ps Dk F (ks , ps )(ks ks ))
t
qks (is is Dk g(ks , ps ))
+Ha (. . . )(as as ) + Hk (. . . )(ks ks ) ds
i i
i i
i i
Z
d (st)
= e(st) qas
(a s a s ) + e qas (as as )
ds
t
d (st)
e(st) qks
(is is ) + e qks (ks ks )ds
ds
Z
+ e(st) qas
(F (ks , ps ) F (ks , ps ) Dk F (ks , ps )(ks ks ))
t
+e(st) qks
(g(ks , ps ) g(ks , ps ) Dk g(ks , ps )(ks ks ))ds
0.
The last inequality follows from a few elementary observations. Be-
cause of (5.17), and the matching initial and terminal conditions, the
first integral vanishes. Moreover, the second integral is non-positive,
due to the concavity of the production function F and the deprecia-
tion function g, combined with the inequalities qas , qks 0.
Remark 5.4.1. The variables (qa , qk ) are called adjoint variables.
i i
i i
i i
qa a s + Hqa (as , ks , ps , qas , qks )
qk k s + Hq (as , ks , ps , qas , qks )
a = qas Ha (as , ks , ps , qas , qks ) .
A k (5.22)
k qks Hk (as , ks , ps , qas , qks )
. i i
We claim that this operator is monotone; let i = (qas , qks , ais , ksi ),
for i = 1, 2. An operator A : H H is said to be monotone if:
hA1 A2 , 1 2 iH 0,
1 1
for all trajectories (qas , qks 2
, a1s , ks1 ) and (qas 2
, qks , a2s , ks2 ) that satisfy
1 1 2 2
the appropriate initial and terminal conditions and qas , qks , qas , qks
0, where H is a Hilbert space and h, iH is the inner product in H.
For ease of notation, we drop the subscript H in what follows.
1,2 1,2
Proof. Firstly, from Proposition 5.4.1 we have that qas , qks 0.
i i
i i
i i
Furthermore,
hA1 A2 , 1 2 i (5.23)
ZT
= (a 1s + a 2s )(qas
1 2
qas ) + (k s1 + k s1 )(qks
1 2
qks )
0
1 2
+ (qas + qas )(a1s a2s ) + (qks
1 2
+ qks )(ks1 ks2 )ds
ZT
Hqa (a1s , ks1 , ps , qas
1 2
) Hqa (a2s , ks2 , ps , qas
2 2
1 2
+ , qks , qks ) (qas qas )
0
Hqk (a1s , ks1 , ps , qas
1 2
) Hqk (a2s , ks2 , ps , qas
2 2
1 2
+ , qks , qks ) (qks qks )
Da H(as , ks , ps , qas , qks ) + Da H(as , ks , ps , qas , qks ) (as a2s )
1 1 1 2 2 2 2 2 1
+
Dk H(a1s , ks1 , ps , qas 1 2
) + Dk H(a2s , ks2 , ps , qas2 2
) (ks1 ks2 )ds.
+ , qks , qks
ZT
(a 1s + a 2s )(qas
1 2
qas ) + (k s1 + k s1 )(qks
1 2
qks )
0
1 2
+ (qas + qas )(a1s a2s ) + (qks
1 2
+ qks )(ks1 ks2 )ds
ZT
d
(a1s + a2s )(qas
1 2
) + (ks1 + ks2 )(qks
1 2
= qas qks )
ds
0
= 0,
i i
i i
i i
variables. For that, we use (5.19) and (5.21) in (5.23) and get that
hA1 A2 , 1 2 i
Z T
c1s ps i1s + F (ks1 , ps ) + c2s + ps i2s F (ks2 , ps ) (qas
1 2
= qas )
0
+ i1s + g(ks1 , ps ) i2s g(ks2 , ps ) (qks 1 2
qks )
+ ua (cs , is , as , ks , ps ) + ua (cs , is , as , ks , ps ) (a1s a2s )
1 1 1 1 2 2 2 2
+ uk (c1s , i1s , a1s , ks1 , ps ) + uk (c2s , i2s , a2s , ks2 , ps ) (ks1 ks2 )
1
Fk (ks1 , ps ) + qks 1
gk (ks1 , ps ) (ks1 ks2 )
qas
2
Fk (ks2 , ps ) + qks 2
gk (ks2 , ps ) (ks1 ks2 )ds.
+ qas
After rearranging the terms, and using the concavity of the functions
1,2 1,2
F and g together with the positivity of qas and qks , we obtain the
following inequality:
Z T
1
F (ks2 , ps ) F (ks1 , ps ) Dk F (ks1 , ps )(ks2 ks1 )
qas
0
2
F (ks2 , ps ) F (ks2 , ps ) Dk F (ks2 , ps )(ks1 ks2 )
qas
1
g(ks2 , ps ) g(ks1 , ps ) Dk g(ks1 , ps )(ks2 ks1 )
qks
2
g(ks1 , ps ) g(ks2 , ps ) Dk g(ks2 , ps )(ks1 ks2 )
qks
1 2
(qas qas )(c1s c2s ) (ps qas 1
qks1
ps qas 2 2
+ qks )(i1s i2s )
Da u(c1s , i1s , a1s , ks1 , ps ) Da u(c2s , i2s , a2s , ks2 , ps ) (a1s a2s )
Dk u(c1s , i1s , a1s , ks1 , ps ) Dk u(c2s , i2s , a2s , ks2 , ps ) (ks1 ks2 )ds
Z T
1 2
(qas qas )(c1s c2s ) (ps qas
1
qks 1
ps qas2 2
+ qks )(i1s i2s )
0
Da u(c1s , i1s , a1s , ks1 , ps ) Da u(c2s , i2s , a2s , ks2 , ps ) (a1s a2s )
Dk u(c1s , i1s , a1s , ks1 , ps ) Dk u(c2s , i2s , a2s , ks2 , ps ) (ks1 ks2 )ds.
i i
i i
i i
Da u(c1s , i1s , a1s , ks1 , ps ) Da u(c2s , i2s , a2s , ks2 , ps ) (a1s a2s )
Dk u(c1s , i1s , a1s , ks1 , ps ) Dk u(c2s , i2s , a2s , ks2 , ps ) (ks1 ks2 )ds
0,
hA1 A2 , 1 2 i 0,
u(c1 , i1 , a1 , k 1 , p) u(c2 , i2 , a2 , k 2 , p)
Du(c2 , i2 , a2 , k2 , p) (c1 c2 , i1 i2 , a1 a2 , k 1 k 2 )
Ck(c1 c2 , i1 i2 , a1 a2 , k 1 k 2 )k2 ,
holds for all c1,2 , i1,2 , a1,2 , k 1,2 , where C > 0 is some constant. Then,
we obtain a stronger inequality:
hA1 A2 , 1 2 i (5.24)
ZT
C (c1s c2s )2 + (i1s i2s )2 + (a1s a2s )2 + (ks1 ks2 )2 ds.
0
hA1 A2 , 1 2 i > 0,
if 1 6= 2 .
As we show in the next corollary, this inequality allows us to prove
the uniqueness of the optimal trajectories.
i i
i i
i i
i i
i i
i i
where (al , k l ) are the initial consumer and capital goods levels for
each of the agents, l = 1, 2, , N . According to our earlier anal-
ysis, if the price ps is given, then under suitable conditions on the
utility function u, production function F and depreciation function g,
there exists a unique evolution s that satisfies the mean-field game
equation (4.10).
If the initial distribution 0 of is an average of N point masses,
it is not clear whether or not s preserves this configuration for the
future times, s > t0 . Heuristically, this means that the agents do
not split. This is natural because an agent cannot use two different
strategies at the same time.
Our aim, in this section, is to suggest a way to construct such a
solution. Fix a price evolution ps . For every agent l {1, 2, , N },
consider the Hamiltonian system
l l l l l
a s = Hqa (as , ks , ps , qas , qks ),
k l = H (al , k l , p , q l , q l ),
s qk s s s as ks
(st) l
(5.26)
d
qas = e(st) Ha (als , ksl , ps , qas
l l
ds e , qks ),
d (st) l
qks = e(st) Hk (als , ksl , ps , qas
l l
ds e , qks ),
together with initial-terminal conditions
l
at = al ,
k l = k l ,
t
l
qaT = 0,
l
qaT = 0.
As we have shown earlier, this system possesses a unique solution.
Moreover, we have the following identities:
l
qas = Va (als , ksl , s), l
qks = Vk (als , ksl , s).
i i
i i
i i
N
1 X
s = (als ,ksl ) .
N
l=1
ls = (als ,ksl ) ,
Hence, ls satisfies the transport equation (see Section 2.2) and the
proof is complete.
All our previous analysis was based on the assumption that the
price evolution ps is given. To find the price ps , we proceed as fol-
lows. We fix an arbitrary smooth evolution ps of the price. Let the
trajectories (als , ksl ) be defined as in Proposition 5.6.1. Suppose ps is
such that
XN N
X
ils = (ksl , ps ), (5.27)
l=1 l=1
i i
i i
i i
ps ps .
Note that
ils = k sl g(ksl , ps )
= Hqk (als , ksl , Va (als , ksl , s), Vk (als , ksl , s), ps ) g(ksl , ps ).
Hence,
N
X
Hqk (als , ksl , Va (als , ksl , s), Vk (als , ksl , s), ps )
l=1
XN
g(ksl , ps ) = (ksl , ps ).
l=1
i i
i i
i i
Part II
Stochastic models
75
i i
i i
i i
i i
i i
i i
77
i i
i i
i i
Tr T D2 u(x, t)
ut (x, t) + H(x, Du(x, t); m) + = 0, (6.4)
2
equipped with the terminal condition
i i
i i
i i
where h and are given functions satisfying some growth and reg-
ularity conditions. Assume that vs is a Markovian control. Then,
(6.7) is called a Markov diffusion. In this case, Av is given by
Tr T D2 f (x, t)
Av f (x, t) = f (x, t) + h fx (x, t) + .
t 2
Before we state and prove a Verification Theorem, we present the
Dynkins formula.
Proposition 6.1.1 (Dynkins formula). Let xs be a Markov diffusion
with infinitesimal generator A. Assume that xt0 = x. If f D(A),
then,
Z t
E(x,t0 ) (f (xt , t)) f (x, t0 ) = E(x,t0 ) Af (xs , s)ds ,
t0
for every t t0 .
For a proof of Proposition 6.1.1, we refer the reader to [66]. Now,
we proceed to the Verification Theorem.
i i
i i
i i
we have w = u.
Therefore:
!
T
Tr T Dx2 w
Z
(x,t)
w(x, t) = E wt + v Dx w + ds + u(xT , T )
t 2
(6.8)
!
T T
Dx2 w
Z
Tr
E(x,t) wt + H + Lds + u(xT , T ) ,
t 2
i i
i i
i i
i i
i i
i i
G[u](x) = 0. (6.11)
i i
i i
i i
The equation
G [m](x) = 0
is called the adjoint equation to (6.11), and m is called the adjoint
variable. In what follows, we consider specific operators G and pro-
duce a few elementary results that illustrate the adjoint methods.
Set
.
G[u](x, t) = ut + H(x, Du) + u,
where H = H(x, p); i.e., G is the operator associated with the Hamilton-
Jacobi equation (6.4). The adjoint equation is
Observe that the adjoint equation has maximum principle and pre-
serves mass.
In the sequel, we equip (6.12) with appropriate initial conditions.
This choice is arbitrary and motivated by the amount of information
we can extract. We start by fixing an initial time [t0 , T ) and a
point x0 in Rd . Then, set
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
i i
87
i i
i i
i i
and
dkt = (g(kt , pt ) + it ) dt + k dBtk , (7.2)
i i
i i
i i
a2 2
Vt V + H(a, k, p, Va , Vk ) + a V + k k V = 0, (7.4)
2 2
where the Hamiltonian H is defined by:
i i
i i
i i
i i
i i
i i
A mean field game model The coupling of (7.4) and (7.8) leads
to the system:
2 2
(
Vt V + H(a, k, p, Va , Vk ) + 2a a V + 2k k V = 0,
2 2
a k
t + (Hqa )a + (Hqk )k = 2 a + 2 k ,
i i
i i
i i
i i
i i
i i
a2 2
Vt V + H(a, k, p, r, Va , Vk ) + a V + k k V = 0,
2 2
where the Hamiltonian H is:
i i
i i
i i
a2 2
t + ((rt a c pi + F ) )a + ((g + i ) )k = a + k k .
2 2
(7.13)
a2 2
t + (Hqa )a + (Hqk )k = a + k k .
2 2
i i
i i
i i
i i
i i
i i
Mathematical analysis
II - stochastic models
96
i i
i i
i i
i i
i i
i i
Therefore,
Z T
W (a, k, t) E (a,k,t)
e(st) u(cs , is , as , ks , ps )ds (8.4)
t
(T t)
e E(a,k,t) W (aT , kT , T ).
To complete the proof, we notice that, because u is strictly concave,
there exists a unique pair (ct , it ), such that
Act ,it W (a, k, t) + u(ct , it , at , kt , pt ) = 0. (8.5)
By using (8.5) in (8.3), one gets equality in (8.4). This establishes
the Theorem.
i i
i i
i i
also has a unique pathwise solution ait with probability 1. From the
non-decreasing property of F in k, we infer that
F (kt1 , pt ) F (kt2 , pt ).
da1t da2t .
i i
i i
i i
are optimal for (a1 , k 1 ), they are suboptimal for (a2 , k 2 ). Therefore,
"Z #
T
2
,k2 ,t)
V (a2 , k 2 , t) E(a e(st) u(a2,1 2,1 1 1
s , ks , cs , is )ds .
t
i i
i i
i i
function, with respect to the variable k. Assume further that the in-
stantaneous utility function u is non-decreasing in the wealth variables
a and k and concave. Then, the value function V is concave in a and
k.
Now, let (cis , iis ) be the optimal controls associated with the initial
conditions (ai , k i ). Consider the convex combination
( .
cs = (1 )c1s + c2s ,
.
is = (1 )i1s + i2s .
k0 = (1 )k 1 + k 2 ,
i i
i i
i i
a0 = (1 )a1 + a2 .
i i
i i
i i
V s V d.
and
i i
i i
i i
i i
i i
i i
105
i i
i i
i i
i i
i i
i i
i i
i i
i i
Vt V + DX V b + H(x, X, Dx V ) + LV = 0, (9.5)
This is called the master equation. Notice that (9.7) encodes the
information regarding the optimization problem faced by the agents
and the evolution of the populations states.
i i
i i
i i
i i
i i
i i
i i
i i
i i
Because agents are rational, they act optimally, i.e., the vector
field driving the evolution of the random variable Xs is
i i
i i
i i
i i
i i
i i
Bibliography
[7] Y. Achdou, J. Han, J.-M. Lasry, P.-L. Lions, and B. Moll. Het-
erogeneous agent models in continuous time. Preprint.
113
i i
i i
i i
114 BIBLIOGRAPHY
i i
i i
i i
BIBLIOGRAPHY 115
i i
i i
i i
116 BIBLIOGRAPHY
i i
i i
i i
BIBLIOGRAPHY 117
i i
i i
i i
118 BIBLIOGRAPHY
i i
i i
i i
BIBLIOGRAPHY 119
i i
i i
i i
120 BIBLIOGRAPHY
i i
i i
i i
BIBLIOGRAPHY 121
[84] D. Gomes and R. Ribeiro. Mean field games with logistic popu-
lation dynamics. 52nd IEEE Conference on Decision and Con-
trol (Florence, December 2013), 2013.
[92] J. Graber. Weak solutions for mean field games with congestion.
Preprint.
i i
i i
i i
122 BIBLIOGRAPHY
i i
i i
i i
BIBLIOGRAPHY 123
i i
i i
i i
124 BIBLIOGRAPHY
[119] J.-M. Lasry and P.-L. Lions. Jeux `a champ moyen. II. Hori-
zon fini et contr
ole optimal. C. R. Math. Acad. Sci. Paris,
343(10):679684, 2006.
[120] J.-M. Lasry and P.-L. Lions. Mean field games. Jpn. J. Math.,
2(1):229260, 2007.
[122] J.-M. Lasry, P.-L. Lions, and O. Gueant. Mean field games
and applications. Paris-Princeton lectures on Mathematical Fi-
nance, 2010.
i i
i i
i i
BIBLIOGRAPHY 125
[133] M. Nourian and P. E. Caines. -Nash mean field game theory for
nonlinear stochastic dynamical systems with major and minor
agents. SIAM J. Control Optim., 51(4):33023331, 2013.
i i
i i
i i
126 BIBLIOGRAPHY
[143] J.-P. Fouque R. Carmona and L.-H. Sun. Mean field games and
systemic risk. Communications in Mathematical Sciences, Vol.
13(4), 2015.
[145] Jr. R. Lucas and B. Moll. Knowledge growth and the allocation
of time. Journal of Political Economy, 122(1), 2014.
i i
i i
i i
BIBLIOGRAPHY 127
i i
i i