Professional Documents
Culture Documents
Cointegrated Assets
_______________
Bahman Angoshtari
Oxford Man Institute of Quantitative Finance, Eagle House, Walton Well Road, Oxford, OX2 6ED
Tel: +44 1865 616600 Email: institute@oxford-man.ox.ac.uk www.oxford-man.ox.ac.uk
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
BAHMAN ANGOSHTARI
The Oxford-Man Institute of Quantitative Finance and the Mathematical Institute,
University of Oxford
Walton Well Road, OX2 6ED Oxford, UK
bahman.angoshtari@oxford-man.ox.ac.uk
24 Jan. 2011
In portfolio management, there are specific strategies for trading between two assets
that are cointegrated. These are commonly referred to as pairs-trading or spread-trading
strategies. In this paper, we provide a theoretical framework for portfolio choice that jus-
tifies the choice of such strategies. For this, we consider a continuous-time error correction
model to model the cointegrated price processes and analyze the problem of maximizing
the expected utility of terminal wealth, for logarithmic and power utilities. We obtain
and justify an extra no-arbitrage condition on the market parameters with which one
obtains decomposition results for the optimal pairs-trading portfolio strategies.
1. Introduction
This paper is a contribution to portfolio management using assets whose price pro-
cesses are cointegrated. Such processes have the property that a linear combination
of them is stationary. Intuitively speaking, two cointegrated processes are tied to-
gether, will never go too far from each other and have a long-run equilibrium with
respect to each other. Many economic and financial data series are known to exhibit
these properties. Examples include interest rates, foreign exchange rates, stock price
indices, stock prices, future and spot prices, and commodities (see, among others,
[2], [4], [5], [8], [10], [21] and [27]).
In portfolio management, there are specific strategies for trading among assets
which have co-movement in their prices. These strategies are commonly referred
to as pairs-trading or spread-trading. Generally speaking, these strategies try to
exploit the relative mispricing of the stocks by taking a long position in the over-
priced asset and a short position in the under-priced one, while maintaining market-
neutrality by taking offsetting short/long positions. They have been around in one
form or another since the beginning of listed markets, but the hedge fund industry
has given a new face to these strategies as well as the specific vehicle needed to
demonstrate their successes and failures. We refer the reader to [7], [28] and [29]
1
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
2 B. Angoshtari
an extra condition which amends the deficiencies and provides justification for pairs-
trading. In section 6, we provide a brief review of the cointegration property and of
error correction models. Finally, in section 7 we present numerical examples, and
conclude in section 8.
Proposition 2.1. Let St1 and St2 satisfy (2.1) and (2.2) and the process zt be given
by (2.4). Assume that the constants δ1 , δ2 , and c satisfy
δ1 + cδ2 < 0. (2.5)
Then zt , t ≥ 0, is a stationary Ornstein-Uhlenbeck process, solving
dzt = κ (z̄ − zt ) dt + σz dWtz (2.6)
with
z0 = ln S01 + c ln S02 .
The constants κ, σz , and z̄ are
2
κ = − (δ1 + cδ2 ) , σz2 = k(1, c) σk , (2.7)
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
4 B. Angoshtari
and
1 1 2
σ1 + cσ22 ,
z̄ = µ1 + cµ2 − (2.8)
κ 2
and the process Wtz , t ≥ 0, is given by
1
Wtz = σz (1, c) σWt
n o (2.9)
1
p
= σz (σ1 + cσ2 ρ) Wt1 + cσ2 1 − ρ2 Wt2 .
the main ideas behind pairs-trading and, specifically, the approach taken by practi-
tioners. To make the arguments more precise, we assume that, in accordance to the
model introduced earlier, we have only one pair of assets, say S 1 and S 2 . Moreover,
to keep the concepts intuitive, the arguments will be presented in an informal way.
We refer the reader to [7], [28] and [29] for a detailed exposition on pairs-trading.
The key step in pairs-(or spread-)trading strategies is to find a way to quantify
the relative price of the pair. Note that co-movement in prices implies that there
should be a way to combine the two prices to obtain a mean-reverting process in a
way highlighted in Proposition 2.1. It is this mean-reverting process that is used to
quantify the relative price of the pair.
There are two common assumptions regarding this relative price indicator:
i) A linear combination of the asset prices St1 and St2 is mean-reverting. In other
words, there is a constant c < 0 such that the process st , t ≥ 0, given by
st = St1 + cSt2 ,
is mean-reverting.
ii) A linear combination of the logarithm of the prices is mean-reverting. In other
words, there is a constant c < 0 such that the process zt , t ≥ 0, defined as
is mean-reverting.
with z0 = ln S01 + c ln S02 . Here (with a slight abuse of notation) κ, z̄, and σz are
constants, κ > 0, and Wtz is a standard Brownian motion.
We clarify the difference between the partial pricing model (3.2) and the seem-
ingly identical model (2.6). The former is an assumption about the market per se,
while the latter is a direct consequence of the price equations (2.1) and (2.2).
As mentioned earlier, a pairs-trader does not model St1 and St2 separately. In-
stead, he takes (3.2) as the partial market model.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
6 B. Angoshtari
After assuming such a model, the pairs-trader restricts the candidate market
strategies to the so-called pairs-trading strategies. These are any strategies with the
following two properties:
In section 4, we denote the portfolio weights of the risky assets by αt = α1t , α2t .
αt = αMN
t (1, c) , (3.3)
Remark 3.1. As it can be easily seen, the main idea behind pairs-trading is that
the trader tries to exploit the relative mispricing in the pair, while being protected
from the overall market movements by following a market-neutral strategy.
Although this approach is intuitively appealing and there are various good rea-
sons that support it, there is, from a theoretical point of view, an unanswered
fundamental question. How can one justify this investment practice in a theoretical
portfolio choice framework? In other words, can one identify a market model and a
preference criterion for the investor which support pairs-trading?
This idea will be the theme for the rest of the paper. Specifically, we try to
justify pairs-trading in the sense mentioned above, without restricting the portfolio
strategies per se. This requires us to assume a full pricing model which implies
the partial pricing model (3.2). Note that according to Proposition 2.1, the market
setting of section 2 fulfills this requirement.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
The goal is to find V (0, x, z) and the associated optimal portfolio strategy.
For tractability, we consider two specific choices for the utility function U (x) ,
the logarithmic and power utilities.
Proposition 4.1. Assume that the cointegration condition (2.5) holds. Let zt , t ≥
0, be the residual process introduced in (2.4). Then, the value function, denoted by
a For
rigorous results on the existence and uniqueness of log-optimal strategies under more general
market setting, see [9], [16], and [19].
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
8 B. Angoshtari
αlog
t = αC,log + αR.V,log (zt ) . (4.4)
b The optimal investment problem with power utility has been studied in market settings that are
more general than what considered herein. Specifically, Propositions 4.2 and 4.3 can be seen as a
special case of the results obtained in [20] or [25].
c The terminology is borrowed from [17].
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
10 B. Angoshtari
and
2
4σz2
σ −1 δ
2
∆ = b̃ − 4ãc̃ = (γ − γ0 ) . (4.12)
γ2
Then, the functions h and g appearing in (4.9) and (4.10) are given by
−2c̃
√ √ , if γ > γ0 ,
b̃+ ∆ coth 2∆ (T −t)
h (t) = (4.13)
b̃ 1 − 2
, if γ = γ ,
−2ã b̃(T −t)+2 0
and
with
γκz̄ + (1 − γ) (1, c) (µ − 1r)
C1 = , (4.15)
σz2
n
κ 1−γ
C2 = σz2 κz̄ + γ (1, c) (µ − 1r)
o (4.16)
⊤ −1
+ 1−γ ⊤
γ δ σσ (µ − 1r) ,
√ √
∆ (T −t)
2 ∆ e 2
b̃+√∆ − 1, if γ > γ0 ,
√
√
∆(T −t) − b̃−√ ∆
e b̃+ ∆
G1 (t) = (4.17)
2
− 1, if γ = γ0
b̃(T −t)+2
and
√ √ !
∆ (T −t)
√∆
√∆ e 2 − b̃−
√2 2 (T −t) − 1
b̃+ ∆
e , if γ > γ0 ,
√
√
∆ e ∆(T −t) − b̃−√∆
b̃+ ∆
G2 (t) = (4.18)
(T − t) b̃(T −t)+4 ,
if γ = γ0 .
2b̃(T −t)+4
f (t)
R T n σz2
1−γ
o
= t 2γ g 2
(s) + κz̄ + γ (1, c) (µ − 1r) g (s) + 12 σz2 h (s) ds (4.19)
+ 1−γ
σ (µ − 1r)
2 (T − t) .
−1
2γ
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
Proposition 4.3. (Nirvana case) Suppose that the cointegration condition (2.5)
holds and that γ < γ0 , with γ0 as in (4.8).
i) The value function is finite if and only if
T < Tmax , (4.20)
with
γ κ π
Tmax = −1
√ arctan −1
√ + . (4.21)
σz kσ δk γ0 − γ σz kσ δk γ0 − γ 2
ii) If (4.20) holds, then the value function V p (x, z, t) is given by (4.9), the optimal
portfolio weights αpt is given by (4.10), and f (t) is given by (4.19). However,
h (t) and g (t) are now given by
√ ! √ !
−∆ b̃ −∆ b̃
h (t) = tan arctan √ − (T − t) − (4.22)
2ã −∆ 2 2ã
and
g (t) = C1 G3 (t) + C2 G4 (t) . (4.23)
Here, ã, b̃, and ∆ are as in (4.11) and (4.12), and C1 and C2 are as in (4.15)
and (4.16). The functions G3 (t) and G4 (t) are given by
b̃
cos arctan √−∆
G3 (t) = √ −1 (4.24)
cos arctan √−∆b̃
− −∆ 2 (T − t)
and
2
G4 (t) = √ ×
−∆
√
b̃
sin arctan √−∆ b̃
− sin arctan √−∆ − −∆ 2 (T − t)
× √ . (4.25)
cos arctan √−∆b̃
− −∆2 (T − t)
Equality (4.10) gives the form of the optimal allocation for both the well-behaved
and the nirvana strategies. It shows that moving from the logarithmic preference to
power utility will change the optimal portfolio in two ways:
i) the log optimal portfolio is divided by γ, and
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
12 B. Angoshtari
ii) the investor also invests in a market-neutral strategy, denoted by αMN (t, zt ),
and given by
1
αMN (t, zt ) = (g (t) + h (t) zt ) (1, c) . (4.26)
γ
The market-neutral strategy, αMN (t, zt ), acts against the change in the pairs-
trading part of the portfolio when the risk aversion changes. Specifically, it can be
shown that:
a) if γ > 1, then h (t) < 0, t ≥ 0. This implies that the market-neutral term is a
pairs-trade (i.e. it buys the under-priced stock and sells the over-priced one).
Hence, the market-neutral component mitigates the decrease in the pairs-trade
which comes from dividing the positions in the stocks by γ.
b) if γ < 1, then h (t) > 0, t ≥ 0. This implies that the market-neutral term is the
opposite of a pairs-trade (i.e. it sells the under-priced stock and buys the over-
priced one). Therefore, the market-neutral component mitigates the increase in
the pairs-trade resulting from dividing the positions in the stocks by γ.
Note that the optimal allocation for power utility is not myopic, as the functions
h (t) and g (t) depend explicitly on the time horizon T .
From a practical point of view, these results are not satisfactory. By the same
argument as in the logarithmic case, we deduce that they are not consistent with
practice of pairs-trading. Furthermore, the optimal policies might have unpleasant
properties (i.e. blow-ups for nirvana solutions). In the next section, we provide a
way to amend these deficiencies and find theoretical ground for pairs-trading.
Remark 5.1. The facts that (5.1) and (5.2) are equivalent, and that (5.3) implies
(5.1), are straightforward. To see that (5.3) follows from (5.1), left-multiply (5.1)
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
Theorem 5.1. Suppose that both the cointegration condition (2.5) and Condition
5.1 hold. Then, the log-optimal portfolio weights of Proposition 4.1 can be decom-
posed as
αlog
t = αC,log + αP.T,log (zt ) , (5.8)
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
14 B. Angoshtari
We continue with a discussion on the above policies. The constant weight com-
ponent of the log-optimal portfolio, αC,log , is the log-optimal portfolio if there was
no cointegration (i.e. δ1 = δ2 = 0 in (2.1) and (2.2)).
The relative value component of the log-optimal portfolio, αP.T,log (zt ), is a
market-neutral strategy (cf. 3.3), with αMN t = κzt /σz2 . Furthermore, the inequality
2
−κ/σz < 0 implies that this market-neutral strategy always shorts the over-priced
stock and longs the under-priced one. Therefore, this is a genuine pairs-trading
strategy. Note that the pairs-trading component depends solely on the parameters
c, κ, and σz2 . This, in turn, yields that in order to identify the log-optimal pairs-
trade, we only need to specify the partial pricing model (3.2).
We conclude that Theorem 5.1 gives a solid ground for choosing a pairs-trading
strategy. Indeed, the pairs-trading component (5.9) only depends on parameters
of the partial pricing model, while the constant weight portfolio component (4.5)
depends on the drift parameters, µ1 and µ2 , which are quite hard to estimate in
practice. Hence, if we only know a partial pricing model, then the pairs-trading
component is the only part of the optimal strategy which we can fully identify.
Finally, the form of the log-optimal pairs-trade (5.9) is quite intuitive, and de-
serves attention of its own. It tells us that the long-short positions should be bigger
if the mean-reversion rate κ is bigger, and they should be smaller if the variance
rate of the residual, σz2 , is larger.
Next, we consider the case of power utility. As in the logarithmic case, much
more can be said if we, also, consider the market to be arbitrage-free. In particular
the optimal strategies will be well-behaved for all values of γ, and we would obtain
a decomposition result like (5.8).
We state these results next.
Theorem 5.2. Suppose that both the cointegration condition (2.5) and Condition
5.1 hold. Then, the value function V p (t, x, z), introduced in section 4.2, is finite for
all T > 0 and γ > 0.
Furthermore, the optimal strategy αpt (cf. (4.10)) can be decomposed as
αpt = αC + αP.T. (zt ) + αT.V. (t) , (5.10)
where αC is a constant portfolio given by
1 ⊤
αC = (µ − 1r) (σσ ⊤ )−1 , (5.11)
γ
αP.T. (zt ) is a pairs-trading strategy given by
−κ
αP.T. (zt ) = h̃ (t) zt (1, c) , (5.12)
σz2
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
with
1+ √1 coth √κ (T − t)
γ γ
h̃ (t) = √ , (5.13)
1+ γ coth √κ (T − t)
γ
with
σ12 + σ22 /2 − (1 + c) r
β= . (5.16)
κz̄
Decomposition (5.10) says that it is optimal for the investor to divide her wealth
into a constant-weight portfolio αC , a pairs-trade αP.T. (zt ), and a time-varying (but
deterministic) portfolio αT.V. (t).
As in the logarithmic case, the constant weight portfolio, i.e. αC of (5.11), is the
optimal portfolio if there was no cointegration (i.e. δ1 = δ2 = 0 in (2.1) and (2.2)).
The constant weight portfolio is the constant part of the log-optimal portfolio (i.e.
αC,log in (4.5)) divided by γ.
Moreover, as in the logarithmic case, the pairs-trading component, αP.T. (zt ), has
the factor −κ/σz2 . In other words, the long-short positions are directly proportional
to the mean-reversion rate κ and inversely proportional to the variance rate σz2 .
Remark 5.2. The only difference between the pairs-trade for power utility (i.e.
αP.T. (zt ) of (5.12)) and the pairs-trade for logarithmic utility (i.e. αP.T,log (zt ) of
(5.9)) is the time-varying factor h (t) in (5.13). This adjustment factor has the
following properties:
i) lim h (t) = 1. Hence, the power-optimal pairs-trade becomes the log-optimal
γ→1
pairs-trade when γ → 1, as it is expected.
ii) For γ > 1, the function h (t) is decreasing in t and satisfies
1 1
√ ≤ h (t) < < 1, for t ∈ (−∞, T ] . (5.17)
γ γ
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
16 B. Angoshtari
Therefore, more risk averse investors take smaller long-short positions if com-
pared to a log-utility investor. Furthermore, they tend to reduce the size of their
pairs-trade as time increases.
iii) For γ < 1, the function h (t) is increasing in t and satisfies
1 1
1< < h (t) ≤ √ , for t ∈ (−∞, T ] . (5.18)
γ γ
Therefore, less risk averse investors take larger long-short positions if compared
to a log-utility investor. Furthermore, they tend to increase the size of their
pairs-trade as time increases.
Remark 5.3. The time varying component αT.V. (t) is a market-neutral strategy.
Note that it does not depend on the residual zt , and it is fully known at the initial
time. It can be shown that the function g (t) of (5.15) vanishes as γ → 1. This fact
explains the absence of αT.V. (t) in the logarithmic case. Moreover, g (t) vanishes as
(T − t) → 0, and it is finite as (T − t) → +∞, with the limit given by
1−β 1
lim g (t) = + √ − (2 − β) .
(T −t)→+∞ γ γ
We end this section by commenting on the practicality of the results. As in the
logarithmic case, we only need to know c, κ, and σz2 to identify the pairs-trading
component αP.T. (zt ). On the other hand, to estimate the time varying component
αT.V. (t), we also need to know the parameters r, σ12 , σ22 , and z̄, and to identify the
constant-weight component αC , we need to estimate µ1 , µ2 , and ρ. Hence, in the
power utility case, as in the logarithmic case, if we only know a partial pricing model,
then the pairs-trading component is the only part of the optimal strategy which we
can fully identify. Therefore, the decomposition result (5.10) justifies pairs-trading
for an investor with power utility.
For any process yt , we introduce the difference operator ∆yti = yti − yti−1 .
Then, a stochastic process yt is integrated of order 1, or simply yt ∈ I (1), if yt is
non-stationary and ∆yt is a stationary process.
18 B. Angoshtari
Stock Prices
$
140
120
100
80
60 IBM
MSFT
40
20
0
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
The Residual
0.2
−0.2
−0.4
z = ln(MSFT) − 0.73 ln(IBM)
−0.6
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Fig. 1. Stock prices of Microsoft and IBM (top), and the associated residual (bottom) from Jan
2001 to Dec 2009.
ii) If the tests did not reject cointegration, then one estimates the ECM by re-
gressing ∆xt and ∆yt over zt−1 .
In the Johansen approach, one considers the ECM directly and simultaneously
tests for cointegration and estimates the ECM. For more information on the Jo-
hansen approach, we refer the reader to [13] and [15].
7. Numerical example
We provide an illustration using real market data in order to give some insights
on the ideas presented in the previous sections. We used the daily stock prices of
Microsoft and IBM for a period of nine years (from Jan. 2, 2001 to Dec. 31, 2009).
The data series are extracted from the CRSPd database and are adjusted for splits
and cash dividends. The top part of figure 1 shows these time series.
We propose a methodology to estimate the ECM of equations (6.1) and (6.2)
while imposing the no-arbitrage condition (5.3).
We take the Engle-Granger two-step procedure, as discussed in section 6. After
d Source:c
201010 CRSP
,R Center for Research in Security Prices. Booth School of Business,
The University of Chicago. Used with permission. All rights reserved. www.crsp.chicagobooth.edu
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
establishing that the processes are I(1) using the ADF test, we use the Phillips-
Ouliaris variance ratio (or Pu ) and trace statistic (or Pz ) tests for testing for coin-
tegration. If the tests imply cointegration, then c and the residual process zt can
be obtained by regressing ln St1 over ln St2 . To find κ and σz , we fit a first order
autoregressive model (i.e. AR(1)) to the time series zt obtained by regression in the
previous step. To obtain σ1 , σ2 and ρ, we regress ∆ ln St1 and ∆ ln St2 over zt−1 .
Then, δ1 and δ2 are calculated by inserting the estimates of σ1 , σ2 , ρ, κ and σz
in equation (5.3). Finally µi , i = 1, 2, are estimated by regressing ∆ ln Sti − δi zt−1 ,
i = 1, 2, over a constant.
To check the performance of the estimation method, we try it on simulated data,
generated by an Euler scheme from the SDE given by the price equations (2.1) and
(2.2), with the market parameters given in Table 1. The top part of figure 2 shows
ten simulated sample paths for each stock.
We calculate out-of-sample test statistics and estimates as follows. On each day
form Dec. 12, 2004 to Dec. 12, 2009, we take the last four years of data and run
the estimation process discussed above. The results are shown in figures 3 to 6. The
results suggest that the estimates for c, σz , σ1 , σ2 , and ρ are acceptable while the
estimates for κ, δ1 and δ2 are occasionally far off and the ones for µi , i = 1, 2, are
not robust at all (as is expected).
Next, we evaluate the portfolio value for the log-optimal strategy of equation
(4.4) using simulated data. We assume an initial wealth of $100. Figure 7 shows the
log-optimal portfolio value for each simulation using the real parameters (i.e. from
Table 1). In all scenarios except one, the portfolio does not lose more than half of its
initial value, while all scenarios end up with the terminal wealth of at least $1000.
Figure 8 shows the log-optimal portfolio value, for each simulation, by using the
out-of-sample estimates. The results are quite different from the case of using real
values of parameters. In five out of ten scenarios, it is observed that the portfolio
ends up losing more than 90% of its initial value at some point during the trading
horizon. This observation highlights the importance of having good estimates.
We have conducted the same procedure for the real data series of figure 1. The
results are shown in figures 9 to 12. Note that, as expected, the estimator for c is
quite robust, but the estimators for µi , i = 1, 2, are not robust at all. Moreover,
the test statistics imply that the cointegration relation ceased to exist somewhere
during the estimation period. The estimates for σz , σ1 , σ2 , and ρ suggest that these
parameters vary significantly during the estimation period.
Figure 13 shows the performance of the log-optimal strategy using real data of
figure 1. The portfolio weights are calculated by using the out-of-sample estimates
discussed above. We consider three scenarios with different assumptions on trans-
action costs and frequency of trade. In the first scenario, associated with the top
solid line, it is assumed that there are no transaction costs and that the investor
is adjusting his/her portfolio daily. In the second scenario, showed in the bottom
line, it is assumed that the investor is buying with the daily high price and selling
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
20 B. Angoshtari
with the daily low price. As it can be seen, this strategy is not profitable due to the
high transaction cost. Finally, the third scenario (the middle line) refers to the case
that there are transaction costs, but the investor adjusts his/her portfolio every two
weeks.
Stock Prices
250
200
150
$
100
50
0
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
The Residual
0.4
0.2
−0.2
z = ln(MSFT) − 0.73 ln(IBM)
−0.4
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010
Fig. 2. Ten sample paths of simulated stock prices of Microsoft/IBM pair (top), and the associated
residual (bottom).
P Test
u
100
10% critical level
80
60
40
20
0
2005 2006 2007 2008 2009 2010
P Test
z
150
10% critical level
100
50
0
2005 2006 2007 2008 2009 2010
ĉ
−0.7
−0.73
−0.75
2005 2006 2007 2008 2009 2010
Fig. 3. Phillips-Ouliaris Pu and Pz cointegration tests, run for ten simulated sample paths, and
the estimated cointegration coefficient ĉ.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
22 B. Angoshtari
κ̂
11
5.46
σ̂z2
0.07
0.0494
0.03
Fig. 4. Estimation of mean reversion rate κ and variance rate σz2 of the residual, for ten sample
paths.
0.29
0.27
0.25
0.24
0.22
0.2
ρ̂
0.6
0.5
0.4
Fig. 5. Estimation of volatilities σ1 and σ2 and correlation ρ, for ten sample paths.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
3
1.332
0
−2
−4.448
−10
0.3
0.2
0.1
0.05
0
−0.1
−0.2
1,000,000
100,000
10,000
1,000
$
100
50
10
Fig. 7. Portfolio value of the log-optimal strategy for ten sample paths, using the real values of
the parameters.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
24 B. Angoshtari
1,000,000
100,000
10,000
1,000
$
100
50
10
Fig. 8. Portfolio value of the log-optimal strategy for ten sample paths, using out of sample
estimates of the parameters.
P Test
u
80
10% critical level
60 test statistic
40
20
0
2005 2006 2007 2008 2009 2010
P Test
z
80
10% critical level
60 test statistic
40
20
0
2005 2006 2007 2008 2009 2010
ĉ
−0.7
−0.72
−0.74
Fig. 9. Phillips-Ouliaris Pu and Pz cointegration tests, run on a period from Jan 2005 to Dec 2009
(at each day the last four years of data is considered), and the estimated cointegration coefficient
ĉ.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
κ̂
12
10
0
2005 2006 2007 2008 2009 2010
σ̂z2
0.08
0.07
0.06
0.05
0.04
0.03
2005 2006 2007 2008 2009 2010
Fig. 10. Estimation of mean reversion rate κ and variance rate σz2 of the residual.
0.25
0.2
ρ̂
0.7
0.65
0.6
0.55
0.5
0.45
0.4
0.35
2005 2006 2007 2008 2009 2010
Fig. 11. Estimation of volatilities σ1 (for MSFT) and σ2 (for IBM) and correlation ρ.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
26 B. Angoshtari
−2
δ
1
−4
δ
2
−6
−8
−10
2005 2006 2007 2008 2009 2010
0.15
0.1
0.05
−0.05
−0.1
2005 2006 2007 2008 2009 2010
Portfolio value
No Transaction cost
Bid/Ask, biweekly trade
Bid/Ask, daily trade
1,000,000
100,000
10,000
1,000
$
100
50
10
Fig. 13. Portfolio value of the log-optimal strategy for the MSFT/IBM pair, assuming that: a)
there is no transaction cost (black line). b) buying with the daily high price and selling with the
daily low price (red line). c) same as (b), but trading biweekly.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
Appendix A. Proofs
A.1. Proof of Proposition 4.1
The following argument is a direct adaptation of the one used in [12]. From (4.1),
for any t ≤ T , we have
R
T RT
XTα = Xtα exp − 21 αu σσ ⊤ α⊤
t u + αu (µ + δzu − 1r) du + t αu σdWu .
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
28 B. Angoshtari
Then,
V log (t, x, z) = E (ln XTα |Xtα = x, zt = z)
R (A.1)
T
− 12 αu σσ ⊤ α⊤
= ln x + E t u + αu (µ + δz u − 1r) du|z t = z .
It, then, follows that the log-optimal portfolio weights are given by equation (4.4).
Substituting back into (A.1) yields the value function as in (4.3).
Finally, the finiteness of the value function follows from Propositions 4.2. Note
1
x1−γ − 1 . Therefore,
that, for any constant γ ∈ (0, 1), we have that ln x < 1−γ
−1 (A.3)
(µ − 1r + δz)⊤ σσ ⊤ Vx
− xVxx
+ (1, c) − Vxz
xVxx .
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
2
Vt − 12
σ −1 (µ − 1r + δz)
Vx2 /Vxx
1 2 2
− 2 σz Vxz /Vxx + (κz − (1, c) (µ − 1r)) Vx Vxz /Vxx
(A.4)
+κ (z̄ − z) Vz + 12 σz2 Vzz = 0,
1−γ
V (x, z, T ) = x1−γ .
σz2 h−κ
1−γ
g′ + γ g + κz̄ + γ (1, c) (µ − 1r) h
−1
+ 1−γ ⊤
σσ ⊤ (A.6)
γ δ (µ − 1r) = 0,
g (T ) = 0,
and
n 2
′ σz 2 1−γ
f + 2γ g + κz̄ + γ (1, c) (µ − 1r) g
2 o
1 2 1−γ
−1 (A.7)
+ 2 σz h + 2γ σ (µ − 1r)
=0
f (T ) = 0.
We need the following preliminary result. Its proof is immediate and, thus,
omitted.
where ã, b̃, and c̃ are constants. Also, define the constant
∆ = b̃2 − 4ãc̃. (A.9)
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
30 B. Angoshtari
and given by
√ ! √ !
−∆ b̃ −∆ b̃
h (t) = tan arctan √ − (T − t) − . (A.12)
2ã −∆ 2 2ã
We continue with the proof Propositions 4.2 and 4.3. Note that ODE (A.5) is
the Riccati equation introduced in (A.8), with the constants ã, b̃, c̃, and ∆ as in
(4.11) and (4.12).
We identify the following two cases, which correspond to Propositions 4.2 and
4.3, respectively.
To finish the proof, we need to verify that the candidate optimal control is indeed
optimal. For the related regularity and verification results we refer the reader to [3]
and [18].
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
Define
R 2 −1
1 T
k σ−1 δ k zs2 +2(µ−1r)⊤ (σσ⊤ ) δzs ds
v (t, z) = E e 2 t
|zt = z ,
and recall that the process zt follows (2.6) and (2.7). Then, the Novikov condition
(5.6) becomes
2
1 RT
E e 2 0 kλs k ds = e 2 T kσ (µ−1r)k v (0, z0 ) < ∞, ∀T ∈ (0, ∞) .
2 1 −1
(A.14)
Hence, we only need to show that v (0, z) is finite for all z and T > 0. Using the
Feynman-Kac formula, we deduce that the function v satisfies
vt + κ (z̄ − z) vz + 12 σz2 vzz
2 ⊤ −1
+ 21
σ −1 δ
z 2 + 2 (µ − 1r) σσ ⊤ δz v = 0, (A.15)
v (T, z) = 1.
To solve the above terminal value problem, we use the ansatz v (t, z) =
1 2
ef (t)+g(t)z+ 2 h(t)z , for some appropriate functions f , g, and h. Substituting back
into (A.15) results in three ODE for these functions. The equations for f and g are
quite similar to (A.7) and (A.6) above. It can be easily shown that f and g are
finite if and only if h is finite. Therefore, one only needs to find conditions under
which h is finite for all T > 0.
The function h solves the Riccati equation (A.8), with the coefficients ã = −σz2 ,
2
b̃ = 2κ, and c̃ = −
σ −1 δ
. On the other hand,
2
∆ = 4 κ2 − σz2
σ −1 δ
,
32 B. Angoshtari
only possibility is that ∆ = 0, and we deduce that the Novikov condition holds if
and only if Condition 5.1 holds. This concludes the proof of part (i).
Part (ii):
The sufficiency follows from Theorems 5.1 and 5.2.
For the necessity, note that the results of section 4 supports pairs-trading, only
if, the relative-value policy αR.V,log , introduced in (4.6), is a market-neutral strategy
(cf. (3.3)). This, in turn, implies that there should be a constant η such that
−1
δ ⊤ σσ ⊤ = η (1, c) . (A.17)
It follows that
⊤
δ = η σσ ⊤ (1, c) , (A.18)
Part (iii): If Condition 5.1 holds, then there cannot be any nirvana solutions,
since the value function (4.2) is finite for any utility function U (x) and all T > 0.
To see this, note (5.3) implies that in equation (4.8) we have γ0 = 0. Therefore,
Proposition 4.2 holds for γ = 0, i.e. supE (XTα ) < ∞ for all T > 0. Finally, for any
A
increasing and concave function U (x), Jensen’s inequality yields
supE (U (XTα )) ≤ supU (E (XTα )) ≤ U supE (XTα ) < ∞. (A.19)
A A A
Conversely, if Condition 5.1 fails, then nirvana solutions exist. To see this, note
that if (5.3) does not hold, then
(see (A.16)). Therefore, equation (4.8) yields that γ0 > 0, and Proposition 4.3 yields
nirvana solutions for any constant γ ∈ (0, γ0 ). This concludes the proof of part (iii).
Acknowledgments
This work is part of the author’s D.Phil. dissertation under advising of T. Za-
riphopoulou. The author would like to thank her as well as T. Schoeneborn for
their invaluable suggestions and comments. Financial support from the Oxford-Man
Institute of Quantitative Finance is acknowledged.
References
[1] M. Avellaneda and J.-H. Lee, Statistical arbitrage in the US equities market, Quan-
titative Finance, 10(7) (2010) 761–78.
[2] R. T. Baillie and T. Bollerslev, Common stochastic trends in a system of exchange
rates, The Journal of Finance 44(1) (1989) 167–181.
[3] F. E. Benth and K. H. Karlsen, A note on Merton’s portfolio selection problem for
the Schwartz mean-reversion model, Stochastic Analysis and Applications 23 (2005)
687–704.
[4] R. J. Brenner and K. F. Kroner, Arbitrage, cointegration, and testing the unbiased-
ness hypothesis in financial markets, Journal of Financial and Quantitative Analysis
30(1) (1995) 23–42.
[5] M. Cerchi and A. Havenner, Cointegration and stock prices: the random walk on Wall
Street revisited, Journal of Economic Dynamics and Control 12 (1988) 333–346.
[6] J-C. Duan and S. R. Pliska, Option valuation with co-integrated asset prices, Journal
of Economic Dynamics and Control 28 (2004) 727–754.
[7] D.S. Ehrman, The handbook of pairs trading, John Wiley & Sons 2006.
[8] R. Engle and C. Granger, Co-integration and error correction, representation, esti-
mation and testing, Econometrica 55 (1987) 251–276.
[9] T. Goll and J. Kallsen, A complete explicit solution to the log-optimal portfolio
problem, The Annals of Applied Probability 13.2 (2003) 774–799.
[10] A. Hall, H. Anderson and C. Granger, A cointegration analysis of treasury bill yields,
Review of Economics and Statistics 74 (1992) 116–126.
[11] J. D. Hamilton, Time Series Analysis, Princeton University Press 1994.
[12] F. Jamshidian, On utility of wealth maximization, Unpublished note 2009.
[13] S. Johansen, Likelihood-based inference in cointegrated vector autoregressive models,
Advanced Texts in Econometrics, Oxford University Press Oxford 1995.
[14] J. W. Jurek and H. Yang, Dynamic portfolio selection in arbitrage, European Finance
Association 2006 meetings paper,
Available at SSRN: http://ssrn.com/abstract=882536
[15] K. Juselius, The cointegrated VAR model: methodology and applications, Advanced
Texts in Econometrics, Oxford University Press Oxford 2006.
[16] I. Karatzas and C. Kardaras, The numéraire portfolio in semimartingale financial
models, Finance Stoch. 11 (2007) 447–493.
[17] S. K. Kim and E. Omberg, Dynamic nonmyopic portfolio behavior, The Review of
Financial Studies 9(1) (1996) 141–161.
[18] R. Korn and H. Kraft, A stochastic control approach to portfolio problem with
stochastic interest rates, SIAM J. Control Optim. 40(4) (2001) 1250–1269.
[19] D. Kramkov and W. Schachermayer, The asymptotic elasticity of utility functions
and optimal investment in incomplete markets, The Annals of Applied Probability
9(3) (1999) 904–950.
[20] J. Liu, Portfolio selection in stochastic environments, The Review of Financial Studies
20(1) (2007) 1–39.
January 24, 2011 14:58 WSPC/INSTRUCTION FILE PortfolioChoice-
WithCoIntegratedAssets˙LaTex
34 B. Angoshtari
[21] R. MacDonald and M. Taylor, Metals prices, efficiency and cointegration: some ev-
idence from the London metal exchange, Bulletin of Economic Research 40 (1988)
235-239.
[22] J. Mossin, Optimal multiperiod portfolio policies, The Journal of Business 41(2)
(1968) 215–229.
[23] S. Mudchanatongsuk, J. A. Primbs and W. Wong, Optimal pairs trading: a stochastic
control approach, Proceedings of the American Control Conference, Seattle, Washing-
ton (2008) 1035–1039.
[24] P. C. B. Phillips and S. Ouliaris, Asymptotic properties of residual based tests for
cointegration, Econometrica 58 (1990) 165–193.
[25] M. Schroder and C. Skiadas, Optimal consumption and portfolio selection with
stochastic differential utility, Journal of Economic Theory 89 (1999) 68–126.
[26] E. S. Schwartz, The stochastic behaviour of commodity prices: Implications for val-
uation and hedging, The Journal of Finance 52(3) (1997) 923–973.
[27] M. P. Taylor and I. Tonks, The internationalisation of stock markets and the abolition
of U.K. exchange control, The Review of Economics and Statistics 71(2) (1989) 332–
336.
[28] G. Vidyamurthy, Pairs trading: quantitative methods and analysis, John Wiley &
Sons Inc. 2004.
[29] M. Whistler, Trading pairs: capturing profits and hedging risk with statistical arbi-
trage strategies, John Wiley & Sons Inc. 2004.
Founded in 2007, the Oxford-Man Institute of Quantitative Finance is
the home of interdisciplinary research in quantitative aspects of
finance at the University of Oxford. Drawing researchers and
students from many Departments in the University, it has particular
strengths in computational finance, financial econometrics, hedge
fund research, and quantitative aspects of law and finance.
The core funding for the Institute and its activities is provided by Man
Group plc, which is a leading global provider of alternative
investment products and solutions. In addition individual faculty and
students at the Institute are supported by a variety of grants from
research councils and charities. Further, the Institute benefits from
the support companies provide to Oxford Finance.
Oxford Man Institute of Quantitative Finance, Eagle House, Walton Well Road, Oxford, OX2 6ED
Tel: +44 1865 616600 Email: institute@oxford-man.ox.ac.uk www.oxford-man.ox.ac.uk