Professional Documents
Culture Documents
In this essay, I review the new book by Torsten Persson and Guido Tabellini, The
Economic Effects of Constitutions, which investigates the policy and economic con-
sequences of different forms of government and electoral rules. I also take advantage
of this opportunity to discuss the advantages and disadvantages of a number of popu-
lar empirical strategies in the newly emerging field of comparative political economy.
1025
de05_Article3 11/16/05 3:59 PM Page 1026
Economic Constitutional
Outcomes Rules
Markets Political
Outcomes
Policy Decisions
book. Section 3 provides a detailed discus- the rental market works. The major advance
sion of econometric issues and identification that came with political economy is encapsu-
problems that Persson and Tabellini have lated in a new question: why do policies dif-
rightly emphasized in their work. Section 4 fer over time, across countries and more
discusses the major empirical results of the broadly across polities? Why, for example, do
book. Sections 58 discuss a number of some governments impose rent control
problems and challenges facing empirical while others do not intervene in rental mar-
work in political economy, and reevaluate kets. Before, we had no better answer than
Persson and Tabellinis contribution in this (some) politicians just dont get it. Political
light. Sections 9 and 10 discuss two sets of economy is about developing better answers.
topics that I view as important areas for This task requires a framework for think-
future research in political economy. ing about why policies differ across coun-
tries. This is what much of political economy
2. Background and Objectives does. Starting with Kenneth J. Arrows cele-
brated (im)possibility theorem (Arrow 1963,
Persson and Tabellini (henceforth PT) set Duncan Black 1948), Anthony Downss
the scene with figure 1.1 in their book, median voter result (Downs 1957, Harold
which I replicate here. The box on the left is Hotelling 1929), and George J. Stiglers work
where much of economics falls. We have a on regulation (Stigler 1970, 1972), the cen-
good, though far from perfect, understand- tral working hypothesis has been that
ing of how different organizations of markets agentsas voters, lobbyists, revolutionaries,
lead to different economic outcomes. But politicianshave induced preferences over
why are markets organized differently in dif- policies. This means that they understand
ferent societies? Economists and other that different policies will map into different
social scientists have long realized the outcomes, and consequently their prefer-
importance of policies. If the government ences over policies are shaped by their pref-
imposes rent control, this will impact how erences over the outcomes that will be
de05_Article3 11/16/05 3:59 PM Page 1027
induced by the policies.2 Political economy comparative political economy more specif-
is then about understanding how these ically. What makes this work even more
induced preferences over policies are aggre- impressive is the fact that the authors are
gated. The recent work in political economy investigating predictions of their earlier
is also part of this research program, but models.
more explicitly investigates why collective In reality, PT have an even more ambi-
decisions and policies differ across societies. tious goal. Many scholars attempt to docu-
Much work in political science, on the ment a set of correlations in the data that are
other hand, as succinctly captured by the box related to predictions of existing models.
on the right in Persson and Tabellinis figure Instead, PT are interested in more than cor-
is about how different political procedures relations. They undertake the ambitious task
lead to different political outcomes. Let us of identifying the causal effects of the form
refer to political procedures as political of political institutions on economic and
institutions and make the link between political outcomes. They write on page 7:
political outcomes and policies. This gives us Our ultimate goal is to draw conclusions
a natural framework for thinking about why about the causal effect of constitutions on
different societies choose different policies specific policy outcomes. We would like to
they have different political institutions. answer questions like the following: if the
PTs first book, Political Economics, devel- United Kingdom were to switch its electoral
oped various models of collective decision- rule from majoritarian to proportional, how
making and derived predictions on how would this affect the size of its welfare state
differences in political institutions translate or its budget deficits? If Argentina were to
into differences of policies and economic abandon its presidential regime in favor of a
outcomes. This new book is the next step to parliamentary form of government, would
push this framework further by confronting this facilitate the adoption of sound policy
its predictions with data. towards economic development?
Although this type of comparative analysis How would we get to the causal effects of
has been vibrant in political science, there a set of political institutions on outcomes?
are no systematic theoretical analyses deriv- PT have a well-specified strategy. First,
ing precise predictions from micro-founded they focus on a subset of political institu-
models to confront with data. Here PT are tions; the constitutionally determined form
much more ambitious. They take the pre- of electoral rules and the form of govern-
dictions of a set of theoretical models, in ment among democracies. This leaves out a
particular, the probabilistic voting model of large part of political institutions, for exam-
Assar Lindbeck and Jrgen Weibull (1987), ple, those that relate to whether a country
extensively studied in their first book, and is a democracy or not, and thus a lot of
an extended version of John Ferejohns interesting questions related to what
(1986) model of politician accountability, makes constitutions credible in the first
studied in Persson, Gerard Roland, and place.3 Nevertheless, narrowing the scope
Tabellini (1997, 2000), seriously. Although
one can have qualms about the specific 3
See, for example, Avinash Dixit, Gene M. Grossman,
assumptions that are important to derive and Faruk Gul (2000). This question perhaps is not first-
order for PT because they focus on democracies that are
predictions in these models, PTs is the right fully consolidated and do not switch to dictatorship, and
approach in social science in general, and in investigate how electoral rules affect policies within the
democratic framework. Nevertheless, whether a society is
a fully consolidated democracy is endogenous and poten-
2
This approach is referred to as rational choice in tially also depends on the form of government and elec-
political science. To economists, it is the natural approach toral rules in democracy (see Acemoglu and Robinson
without apologies. 2005 for a discussion).
de05_Article3 11/16/05 3:59 PM Page 1028
of the investigation is the right approach to on a model that is linear in the covariates
be able to estimate causal effects. Second, and features constant (marginal) effects.
they use a variety of state-of-the-art tech- Let Xi be a K 1 vector of (observable)
niques developed in the micro-economet- exogenous covariates and suppose that the
rics literature, in particular, matching structural relationship of interest is
estimators, propensity score methods, E(Yi|Xi) = Si + Xi , or:
instrumental variables estimation, and
parametric selection corrections. (1) Yi = Si + Xi + ui ,
Since the identification of causal effects
is the major objective of this book, and also where measures the effect of Si on the out-
because the econometric issues that arise come of interest, is a K 1 vector of coef-
here are becoming increasingly important ficients associated with X, and ui captures
in the political economy literature, in the the effect of all unobserved determinants as
next section I discuss PTs econometric well as random influences on Yi.5
methodology in detail. PT use four different empirical strategies
to estimate (1) or variants: ordinary least-
squares (OLS), matching and propensity
3. Econometric Methodology score estimators to relax the linearity
assumption in (1), instrumental variables
3.1 The Basic Framework (IV), and parametric selection corrections
due to Heckman.
PTs major interest is to identify the These strategies deal with the identifica-
causal effect of two political institutions, tion problem in (1) in two different ways,
presidentialism (versus parliamentary sys- which can be classified as selection on
tems) and majoritarian electoral rules (ver- observables and exclusion restrictions
sus proportional representation), on the (see Angrist and Krueger 1999).
amount and the composition of government 3.2 Selection on Observables
spending, and on politicians rents and
other political outcomes. Ultimately, part of The identifying assumption for strategy 1
the interest is also to estimate the effect of is that conditional on Xi, Si and ui are orthog-
these different constitutional forms on onal. This last condition implies that the
aggregate productivity and income per OLS estimator is consistent. To see this
capita levels. briefly, let variables with the tildes denote
Following PTs exposition in chapter 5, I the original variable after the effect of the
simplify the problem by focusing only on vector Xi has been partialed out.6 Then, (1)
one binary constitutional feature, denoted can be rewritten as:
by Si {0,1}, and a generic outcome vari-
able, Yi, with i denoting country in a cross- (2) Yi = S i + u i ,
sectional dataset.4 PT consider a relatively
general econometric framework, incorpo- which implies that
rating potentially heterogeneous effects.
For the purposes of this review, let us focus 5
It is worth noting that equation (1) can be estimated
without the constant marginal effects and linearity
assumptions to obtain average treatment effects with
weights depending on the estimation strategy, see Angrist
4
See, for example, Joshua D. Angrist and Guido W. and Krueger (1999).
Imbens (1995), Angrist, Imbens, and Donald B. Rubin 6 In words, these are the residuals from the original
(1996), Angrist and Alan B. Krueger (1999), James J variables linear projection on the vector Xi, given by
Heckman (1997), and Jeffrey M. Wooldridge (2002), for yt = (I X(XX)1X)y, where X denotes the K N matrix
more detailed treatments. obtained from the X is.
de05_Article3 11/16/05 3:59 PM Page 1029
instrument also removes the attenuation bias first-stage relationship (while the entire Xi is
arising from classical measurement error. included in the second stage). In practice, the
In all of this, the set of covariates included in procedure is the following: first, Si is predict-
Xi is important because one can imagine that Zi ed on the basis of X1i and Zi (using linear
could be correlated with the error term in the regression). Denote the predicted values by Si.
second stage, but this correlation disappears Then the parameters of interest are estimated
once we condition on a set of covariates (i.e., using an OLS regression with Si instead of Si.
we may have that while Cov(Zi,ui|X1i) 0, It can be shown that for this estimator, PT, to
Cov(Zi,ui|X2i) = 0 for some (not necessarily be consistent the Xis that are omitted from the
mutually exclusive) different sets of covariates first stage should have no predictive power for
X1i and X2i). Interestingly, this issue does not the endogenous regressors (conditional on the
arise when there is random assignment due to other covariates).8 Otherwise, the residuals
real experiments or because of natural experi- from the first stage, which mechanically go
ments. This makes the identification problem into the second stage, are no longer orthogo-
more difficult and subtle in political economy nal to the covariates, and this biases the esti-
models than in many other contexts, and mate of the vector , and the bias from
makes the distinction between the selection on carries over to the estimate of the parameter
observables and exclusion restriction strategies of interest, .9 Therefore, the estimator used
I tried to highlight above less clear-cut.
PT include income per capita, democracy
8
score, age of democracy, measures of trade, In particular, let the true (statistical) first-stage rela-
measures of age composition of the popula- tionship be SiZi(X1i) 1(X2i) 2i, with SiZi
(X1i) 1(X2i) 2 denoting the predicted component of Si.
tion, a dummy for federal structure and a The second-stage equation of the traditional 2SLS proce-
dummy for OECD in Xi in their main spec- dure is YiSi(X1i) 1(X2i) 2ui. The assumption
ifications. The instruments are various com- E(Ziui)0 ensures consistent estimation of (, ) with
the standard IV procedure. In contrast, the predicted
binations of the timing of adoption of the value from the PT procedure is SbibZi(X1i)b 1
countrys constitution, fraction of the popu- S i(b)Zi(X1i)(b 1b 1)(X2i) 2, where the second
lations that speak English or another equality substitutes for the definition of Sbi. So the second
stage in this case becomes: YiSi(X1i) 1 (X2i) 2ub i,
European language, and latitude. where ub iui((b)Zi(X1i)(b 1 1)(X2i) 2). Defining
I will discuss the validity of these instru- Wb i(Xi,Sbi), PTs estimator is
ments below. Before doing this, however, an
N 1
N
important estimation issue needs to be PT = N 1 Wi Wi N 1 Wi Yi
i =1
i =1
raised. The above derivation of the consis-
tency of the instrumental variable estimator
N 1
N
made it clear that the same set of covariates, = + N 1 Wi Wi N 1 Wi ui .
i =1
i =1
Xi, needs to be included both in the first and
For this estimator to be consistent, we need E(W b i ub i)0.
the second stage. This was made explicit by
However, when 20, ub i will be no longer be orthogonal to
the tilde notation above, where the regres- X2i, and this will imply that the entire vector of coefficients
sions include the variables after the vector Xi is not estimated consistently. In other words, we will typ-
ically have E(W b i ub i)0 and plim PT, even if E(Zi ui)0.
has been partialed out. Equations (5) and (6) 9
For example, Wooldridge (2002, p. 91) notes: In
show immediately why the IV estimator, practice, it is best to use a software package with a 2SLS
given the identification assumption rather than explicitly carry out the two-step procedure.
Cov(Zt i, ut i) = 0, is consistent in this case. Carrying out the two-step procedure explicitly makes one
susceptible to harmful mistakes. For example, the follow-
PT, instead, choose a nonstandard econo- ing seemingly sensible, two-step procedure is generally
metric method, and exclude some of the Xis inconsistent: (1) regress xK on 1, z1,,zM [the set of exclud-
from the first stage. In particular, suppose that ed instruments for xK] and obtain the fitted values, say xtK ;
(2) run the regression [of the dependent variable on the
the vector of covariates is partitioned as other covariates and on xtK]... This example is equivalent to
Xi = (X1i,X2i), and only X1i is included in the the procedure that PT implement.
de05_Article3 11/16/05 3:59 PM Page 1032
same factors that make policies unappealing elite prefer presidential systems to parlia-
variables to treat as exogenous. mentary systems. In this case, when the elite
This point has featured in one way or are more powerful, the equilibrium form of
another in the comparative politics litera- government is more likely to be presidential.
ture. It is emphasized in its most general If, in addition, the strength of the elite is
form by the Charles A. Beards classic 1913 persistent, there will be an empirical corre-
study, An Economic Interpretation of the lation between presidentialism and smaller
U.S. Constitution, where he argues that the governments, not only at the time the con-
primary objective of the government and the stitution is written, but also in the subse-
constitution is to ensure favorable economic quent decades. But this correlation would
conditions for those holding political power, not necessarily reflect the causal effect of
and the form of government should be seen presidentialism on equilibrium policies.11 In
as a (secondary) feature serving the same terms of the econometric discussion above,
objectiveby secondary Beard does not the requirement that Cov(St i, ut i) = 0 in equa-
mean less important, but chosen in the same tion (2), is violated because there is an omit-
way and for the same objectives as the over- ted factor, like Lt i in equation (4), in this case
all structure of political power. He writes: the political strength of the elite, which will
(1913, p. 13) affect both the form of government and the
Inasmuch as the primary object of a govern- outcomes of interest. The same concerns
ment, beyond the mere repression of physical generalize to the other potential outcome
violence, is the making of the rules that deter- variables.
mine the property relations of society, the In this light, the OLS (non-IV) estimates
dominant classes whose rights are thus to be
determined must perforce obtain from the of the causal effects of the form of govern-
government such rules as are consonant with ment and electoral systems should not be
the larger interests necessary to the continu- interpreted as causal. Instead, they likely
ance of their economic processes, or they correspond to interesting robust correlations
must themselves control the organs of gov- in the data. For causal estimates, we have to
ernment. In a stable despotism the former
takes place; under any other system of gov- rely on IV approaches.
ernment, where political power is shared by 5.2 The Instrumental Variables Approach in
any portion of the population, the methods
and nature of this control become the prob- Practice
lem of prime importancein fact, the funda- PT attempt to estimate the causal effects
mental problem in constitutional law. The
social structure by which one type of legisla- of the form of government and electoral sys-
tion is secured and another preventedis a tems by using a simple IV approach. Their
secondary or derivative feature arising from potential instruments are three constitution-
the nature of the economic groups seeking al dating variables, indicating whether a
positive action and negative restraint.
If Beard is correct in his assessment, then 11
A similar argument, though with the opposite impli-
many features of constitutions will be influ- cation, is suggested by another major figure in comparative
enced by factors that also have a direct political science, Stein Rokkan. Rokkan argued in his 1970
book that the emergence of proportional representation in
impact on policy and economic outcomes. Continental Europe was a result of the previous elites
To illustrate this possibility, imagine a world attempts to manage the transition to democracy in a man-
consisting of a group of politically powerful ner that was consistent with their interests. He suggests
that a key objective of the proportional representation sys-
elite and citizens. The elite oppose redistrib- tem was to make the emergence of a socialist majority
ution, while the citizens favor it. When the more difficult. If Rokkan is correct, then because propor-
elite are more powerful, they will be able to tional representation emerged in societies where the elites
were more powerful, we may expect a negative correlation
limit the amount of redistribution and the between proportional representation and redistribution, as
size of government. Imagine also that the opposed to the positive correlation documented by PT.
de05_Article3 11/16/05 3:59 PM Page 1035
country adopted its current form of govern- countries fell into different waves depend-
ment and current electoral rule after 1981, ing on when they adopted their constitution
between 195180, between 192150, with or declared their independence. The two
before 1921 as the omitted category; two lan- language variables and latitude are included
guage variables, indicating the fraction of the as proxies for European influence follow-
population in the country speaking one of ing the arguments in Hall and Charles I.
the major European languages, and the frac- Jones (1999). Hall and Jones use these vari-
tion speaking English as a native language; ables as instruments for the overall quality
and latitude (distance from the equator). of institutions (social infrastructure), and
Although the IV approach adopted by PT PT argue that these could also have affect-
is important on methodological grounds, I ed the form of government and electoral
will argue that it is unlikely to be estimating rule. The reasoning for these HallJones
the causal effect of the form of government instruments is discussed in the next section.
and electoral rules on policy and economic These potential instruments are highly
outcomes for two reasons: first, because the correlated with the form of democratic insti-
three constitutional timing variables have tutions. Typically, countries farther away
little effect on the endogenous regressors, from the equator and those with a greater
identification ultimately relies on the frac- fraction of the population speaking English
tion of the population with European lan- as a native language (but not those with other
guage as native tongue and latitude, which European languages) are less likely to have
are not convincing instruments for institu- presidential systems. For majoritarian sys-
tional features in general; second, even if tems, the effects of English and other
they were valid for some general institu- European languages are reversed. There is
tional features, by implication they could some effect from the constitutional dating
not be valid instruments for specific institu- variables, but these are typically weaker, and
tional features such as the form of govern- less consistent (depending on what covariates
ment or electoral rules. 12 In this are included in the first and second stages).
subsection, I show the importance of the The general pattern suggests that with
language and latitude instruments for iden- only the constitutional variables, there is no
tification, and also briefly look at the impli- first stage for the instrumental variable strat-
cations of not including second stage egy.13 This is illustrated in table 1 where I use
covariates in the first stage. PTs data to replicate and further investigate
Let us begin with the reasoning for these their first-stage relationships. Columns 1a and
instruments. The three timing variables 1b are identical to the first two columns of
could be relevant for the form of govern- PTs table 5.1, and separately estimate OLS
ment and for the electoral system because first-stage relationships with dummy variables
there may have been waves (or fads) in for presidential and majoritarian systems on
the type of constitutions, and different the right-hand side (respectively PRES and
MAJ). Both columns include seven variables;
three dummies for the dates of the adoption
12
Another potential problem is that, as discussed in of the constitution, latitude, fraction of the
section 3, a successful IV strategy also requires E(X iui)0, population speaking English and fraction of
i.e., the covariates that are treated as exogenous to be the population speaking another European
orthogonal to the error term. These covariates here
include income per capita and trade, which are jointly
determined with the size and composition of government
spending, and their inclusion in the equation may lead to
13
biased estimates. Nevertheless, reestimating PTs IV mod- Lack of a first-stage relationship even at the conven-
els excluding income per capita and trade leads to results tional significance levels is an extreme form of the weak
similar to their baseline IV estimates. instruments problem mentioned in footnote 7.
de05_Article3 11/16/05 3:59 PM Page 1036
language as native languages, and age of are essential in the first-stage relationships
democracy (the latter is included as a covari- reported by PT. Columns 3a,b and 4a,b show
ate in the second stage equations as well, that they are, and that once these variables
while the others are excluded instruments). are left out, there is a very weak or no first-
They also report F-tests for joint signifi- stage relationship left. Without the second-
cance of all the excluded instruments and stage covariates, the constitution dummies
the constitutional variables at the bottom. are significant at 3 percent and 5 percent
These two columns show a reasonably respectively in columns 3a and 3b, and when
strong first-stage relationships, which are the second-stage covariates are included,
then used by PT in their 2SLS estimation they are highly insignificant.
(the R2 is 0.48 in the first column and 0.40 This implies that an instrumental-vari-
in the second column). However, we can ables strategy relying only on the constitu-
see that the major determinants of the tional timing dummies would not achieve
endogenous regressors are the HallJones identification.
instruments, not the constitution timing Table 2 shows the corresponding 2SLS
dummies. In both columns 1a and 1b, the estimates of the form of government and the
six instruments are jointly significant at less electoral system on the size of government.
than 1 percent, but the three constitutional Column 1 replicates PTs results (the corre-
dummies are only significant at 2 percent in sponding first stages are in columns 1a,b of
column 1a (for the presidential dummy) table 1). Column 2 shows that the estimates
and very far from statistically significant in are slightly less precise when the second-
column 1b (for the majoritarian dummy). stage covariates are included in the first stage,
Columns 2a and 2b show that the joint sig- though the dummy for a presidential system
nificance of the excluded instruments, and continues to be statistically significant. This
especially of the constitutional variables, is shows that leaving the second-stage covari-
significantly reduced when the second-stage ates out of the first stage is unlikely to have
covariates are also included in the first stage. caused a major bias in this case.
Recall that these second-stage covariates are Columns 3 and 4 exclude the HallJones
log income per capita, the proportions of the instruments and show that there is no longer
population between the ages of fifteen and a significant effect of the form of govern-
sixty-four, and over the age of sixty-five, the ment or the electoral system on the size of
sum of exports plus imports divided by GDP, government. For example, the coefficient on
the average of political and civil liberties the presidential dummy in column 3 is 2.01
indices of Freedom House, and dummies for (standard error = 5.11) as compared to 8.65
a federal structure and OECD (see PT, espe- (standard error = 3.61) in column 1.
cially table 6.2 and the data appendix for Evidently, the HallJones instruments are
details). As discussed above, PT include essential for PTs instrumentation strategy.
these covariates in the second stage, but not Table 3 shows the 2SLS results for the
in the first-stage relationship. Once they are other major outcome variable that PT look at
also included in the first stage, the constitu- in their chapter 6, the composition of spend-
tional dummies are no longer jointly signifi- ing, proxied by the share of social spending
cant for either presidential or majoritarian and welfare in GDP. The structure of this
dummies. table is identical to that of table 2. It also
I will argue in greater detail in sections 6 shows that the IV estimates of the effect of
and 7 that latitude and European languages constitutional features on policy outcomes
may not be valid instruments for the form of are sensitive to excluding the HallJones
government and electoral rules. In this light, instruments. Furthermore, in this case,
it is important to understand whether they including the second-stage covariates in
TABLE 1
CONSTITUTION SELECTION: FIRST STAGE ESTIMATES
TABLE 2
SIZE OF GOVERNMENT AND CONSTITUTIONS: SECOND STAGE INSTRUMENTAL VARIABLE ESTIMATES
the first stage also matters to some extent, touches upon the importance of including
and the effect of a majoritarian electoral second-stage covariates in the first stage of
system is no longer significant at 5 percent the IV estimation). The implication is that
in column 2. the IV strategy in PTs book heavily relies
This brief discussion illustrates the on the validity of the HallJones instru-
importance of the HallJones instruments ments. Could these instruments be valid in
for PTs identification strategy,14 (and also the current context?
I believe there are two fundamental rea-
sons for these instruments not to be valid.
14
PT are upfront about this issue, and argue that The first is that the HallJones instruments
Admittedly, these [HallJones] variables could be corre- are unlikely to be valid for the overall qual-
lated with other unobserved historical determinants of fis-
cal policy or corruption . . . [As] we are confident about the ity of institutions. The second is that even
exogeneity of the time dummies for constitutional adop- if these instruments were valid for the
tion, we can test the validity of the additional instruments overall quality of institutions, they would
by exploiting the overidentifying restrictions (p. 131). In
addition, the parametric selection models that they report by implication be invalid for a specific fea-
also yield very similar estimates to the linear IV estimates. ture of the institutional structure such as
Nevertheless, these checks are unlikely to be sufficient to presidentialism or a majoritarian electoral
validate the IV estimates. The overidentifying restrictions
have very little power, since, as we saw above, the consti- system. I discuss these two issues in the
tutional timing variables have little explanatory power in next two sections. This discussion is not
the first stage. The parametric selection models, on the only relevant to PTs work, but also to a
other hand, also use the same exclusion restrictions in
addition to the parametric restrictions, so the similarity in number of recent papers adopting similar
the results is not too surprising. strategies.
de05_Article3 11/16/05 3:59 PM Page 1039
TABLE 3
COMPOSITION OF GOVERNMENT SPENDING AND CONSTITUTIONS:
SECOND STAGE INSTRUMENTAL VARIABLE ESTIMATES
6. Are the HallJones Variables Valid This makes it useful to reconsider the the-
Instruments for Institutions? oretical justification of these instruments.
What is the theory underlying the
Hall and Jones wrote a very important
HallJones instruments? In broad strokes,
and influential article emphasizing the
Hall and Joness theory is as follows. Good
importance of social infrastructure, or insti-
institutions originated in Western Europe
tutions on aggregate productivity and eco-
and spread from there to other countries.
nomic growth. Hall and Jones suggested
Therefore, a potential determinant of the
that the origins of good institutions around
quality of institutions is the extent to which a
the world lie in Europe, so proximity to
country has been influenced by Europes
Europe, as measured by the fraction of
culture, values and institutions. Hall and
those speaking European languages and lat-
Jones isolate two channels of European
itude, can be used as instruments for the
influence. The first is through a shared lan-
quality of institutions.
guage, and the second is through geography.
Hall and Joness paper has already
Consequently, they argue, countries with a
become a classic. It is not only cited exten-
greater fraction of the population speaking
sively, but their arguments are often invoked
European languages and those farther from
to use latitude and the fraction of the popu-
the equator, which were less densely popu-
lation speaking European languages as
lated and geographically more similar to
native tongues as instrument for various
Europe hence more conducive to European
institutional features in the recent empirical
migration, have benefited from a
literature.
de05_Article3 11/16/05 3:59 PM Page 1040
entire world, and consequently, there are and wealth cannot easily survive with a set of
good reasons to suspect that they may not be political institutions that distribute political
excludable from the regressions of interest. power equally. Those with political power
would be greatly tempted to use their power
to redistribute income and change the eco-
7. Cluster of Institutions Versus Specific
nomic institutions in line with their interests.
Institutions
Economic institutions that lead to a very
The above discussion centered around the unequal distribution of income and wealth
question of whether the HallJones variables are only consistent with a similarly unequal
are valid instruments for the overall quality distribution of political power, i.e., with dic-
of institutions. I now turn to a discussion of tatorships and other repressive regimes. In
the problems involved in using these vari- this case, sources of variation that affect a
ables as instruments for specific institutions. broad cluster of institutions (e.g., economic
This has become a common practice in the and political institutions together) would not
newly flourishing empirical political econo- be useful in identifying the role of specific
my literature. PT also follow this practice institutional features.
and use these variables as instruments for As an example of the difficulty of this type
the form of government and electoral rules of strategy to estimate the effect of specific
(indicators of presidential or majoritarian institutions, consider the quasi-natural
systems). I will argue that there are serious experiment due to international politics, the
problems in this procedure because of inher- division of Korea into North and South. The
ent complementarities between different two parts of Korea before the division were
types of institutions. ethnically, culturally, economically, and
To develop this argument, let us put aside socially very similar. But because of the
the concerns raised in the previous section geopolitical balance between the United
and suppose that the HallJones instruments States and Soviet Union, the South ended up
(or perhaps settler mortality rates in the for- largely capitalist, while the North became
mer European colonies sample) are valid for communist. In the following forty years, we
the overall quality of institutions (or the witnessed a large divergence between these
broad cluster of institutions). This means we two countries. This is a good source of varia-
now suppose that they are excludable from a tion to understand the effect of the broad
second-stage regression of economic out- cluster of institutions at the level of quasi-
comes (such as aggregate output or produc- capitalist versus communist systems.
tivity) with the broad measure of institutions Suppose now that we try to use this source of
as the endogenous regressor. variation to understand the effect of some
Here the distinction between a broad specific institutional feature, say financial
cluster of institutions and specific institu- development, on economic growth. It
tions is crucial. In AJR (2001), we defined a should be clear that this strategy will lead to
broad cluster of institutions as a combination a highly biased estimate. It is true that South
of economic, political, social and legal insti- Korea is financially more developed than
tutions that are mutually reinforcing. For North Korea. It is also true that the reason
example, it is impossible to think of a system for this is the division in 1946 (had it not
like the plantation economies in the been for the division, the North and the
Caribbean Islands until the nineteenth cen- South would probably have similar levels of
tury together with democratic political insti- financial development). But this does not
tutions. This is because a set of economic make the division a good experiment to
institutions, like the plantation system, that understand the effect of financial develop-
lead to a very unequal distribution of income ment, because this division also caused many
de05_Article3 11/16/05 3:59 PM Page 1042
other institutional changes. It is a good labo- Next, suppose that we have an instrument
ratory for the study of broad institutions, but Zi, in particular a country-specific variable,
not for a study of the specific institutions. which is potentially related to each of the
Another example directly related to PTs specific institutions, i.e.,
empirical work may also be useful. In the Sik = k Z i + ik ,
(9)
first-stage relationships shown in table 1, the
fraction of the population speaking English is and is a valid instrument for the cluster of
a strong predictor of a majoritarian system, institutions, Gi, i.e., Cov(Zi,ui) = 0. So if we
and this fraction may be a predetermined have a good measure of Gi, equation (7) can
variable, shaped, for example, by colonial his- be estimated consistently by IV.
tory. Nevertheless, this is not sufficient for it However, the objective here is to estimate
to be an excludable instrument in estimating the effect of specific institutions. To simplify
the causal effect of majoritarian systems, the discussion, also assume that Cov(Zi,i)
even if as Hall and Jones claim, English influ- = 0, and Cov(
i,vki) = Cov(Zi,vki) = 0 for all k
ence is conducive to the development of and Cov(vki,v ji) = 0 for all j k. Note that S kis
good institutions overall. For example, as we are correlated even if vkis are independent,
know from Rafael La Porta, Florencio since they are all affected by Zi. This corre-
Lopez-de-Silanes, Andrei Shleifer, and lation is at the root of the identification
Robert W. Vishny (1998), countries more problems facing IV estimates of the impact
influenced by the English heritage also have of specific institutions.
more developed financial markets. So the In the last section, I questioned whether
correlation between majoritarian electoral latitude could play the role of Zi to estimate
systems (instrumented by fraction of the the structural relationship in (7). Now let us
population speaking English) and the size of put aside the issues raised there, and sup-
government may reflect the effect of finan- pose that indeed Cov(Zi,ui) = 0. Can we then
cial development, which is omitted from the use Zi as an instrument for a specific institu-
regressions. tion, say S1? The answer is no. If we were to
To make these issues a little more precise, do this, all of the Sks would also load onto S1.
consider the following structural model: More explicitly, we are now estimating
Yi = 1S1i + Xi 1 + u1i
(7) Yi = Gi + Xi + ui ,
with IV using Zi as the excluded instru-
where Yi denotes the outcome of interest, ment. Given (7) above, the true value of 1
is again a vector of coefficients associated is 1 = 1. Moreover, from equations (8)
with X, which is now assumed to stand for a and (9),
set of noninstitutional covariates. Gi is a
K
measure of a broad cluster of institutions.
(10) u1i = ui + kSik + i .
Moreover, suppose that there are K > 1 specif- k= 2
ic institutions, each denoted by Sk, and
We can now see that since, by construction,
K
Kk2 kSki is correlated with Zt i, plim1,IV
(8) Gi = S + i
k k
k =1
i 1,IV. In particular
in the Dominican Republic, the Duvaliers in owing to the absence of strong institutions,
Haiti, the Somozas in Nicaragua, Charles rulers can deploy strategies, in particular
Taylor in Liberia, and Ferdinand Marcos in divide-and-rule, to defuse opposition to
the Philippines. their regime. The logic of the divide-and-rule
Much historical evidence suggests that a strategy is to enable a ruler to bribe political-
systematic study of the political economy of ly pivotal groups off the equilibrium path,
such regimes must depart from some of the ensuring that he can remain in power against
modeling approaches of politics in strongly challenges. By providing selective incentives
institutionalized polities. While in strongly and punishments, the divide-and-rule strate-
institutionalized polities, formal political insti- gy exploits the fragility of social cooperation
tutions, such as the constitution, the structure in weakly-institutionalized polities: when
of the legislature, or electoral rules, place faced with the threat of being ousted, the
constraints on the behavior of politicians and kleptocratic ruler intensifies the collective
political elites, and directly influence political action problem and destroys the coalition
outcomes, the same does not appear to be the against him by bribing the pivotal groups.
case in weakly institutionalized polities. A different and innovative answer is given
Instead, the nature of politics appears to be by Gerard Padro-i-Miquel (2004), who also
different between strongly and weakly institu- constructs a dynamic model of politics.
tionalized polities. Most importantly, when Ethnic divisions are the key feature in his
institutions are strong, citizens have the model. Each ethnic group is afraid of replac-
power to punish politicians by voting them ing their own leader when in power, because
out of power; when institutions are weak, this increases the probability of a switch of
politicians pursue clientelistic policies that power from their own ethnic group to a rival
punish citizens who fail to support them (see group. This makes the standard method of
Robert H. Jackson and Carl G. Rosberg 1982 controlling political elites in strongly-institu-
and Acemoglu, Robinson, and Thierry tionalized polities ineffective, and enables
Verdier 2004). leaders to not only exploit other ethnic
In modeling politics in weakly institutional- groups but also their own ethnic group.
ized polities, the first question that emerges is Padro-i-Miquel shows how this framework
theoretical: how can we understand policy can account for a puzzling feature of African
making and collective decisions in such soci- politics first highlighted in Robert H. Bates
eties? There are only a few papers that classic study Markets and States in Tropical
attempt to develop answers this question. In Africa (1981): the simultaneous use of ineffi-
Acemoglu, Robinson, and Verdier (2004), we cient transfers to and taxes on the same
construct a dynamic model of politics in group.16 In the logic of Padro-i-Miquels
weakly institutionalized polities based on the model, this strategy makes sense because
idea of divide and rule. The key focus is to leaders need to keep their own group happy
understand how kleptocratic regimes that to remain in power. This sets a limit on the
impoverish their citizens can remain in power amount of net taxes they can impose on their
for so long.15 The answer we suggest is that,
16
Bates described the web of inefficient transfers and
15
Models of strongly institutionalized polities would policies in effect in many parts of Africa, but most notably
suggest that poorly performing leaders should be replaced in Ghana and Zambia. For example, the Ghanaian govern-
more often. In contrast, many disastrous kleptocracies last ment heavily taxed cocoa producers, while at the same
for long periods; Mobutu ruled for thirty-two years, time subsidizing their inputs of seeds and fertilizers. While
Trujillo for thirty-one, and the Somozas for forty-two the simultaneous use of taxes and subsidies may be to alle-
years. This longevity is made even more surprising by the viate the negative effects of taxes on investments, the
fact that many kleptocratic regimes lack both a core interpretation in the literature is that the extent of these
constituency of supporters and a firm command of the practices in sub-Saharan Africa is beyond what can be jus-
military. tified as an optimal mix of taxes and subsidies.
de05_Article3 11/16/05 3:59 PM Page 1047
own group. However, taxes across groups are and those documented by PT appear to be
linked, since much lower taxes on one group highly robust and of central importance for
will encourage other groups to switch eco- our theoretical understanding. Few political
nomic activity. This then motivates leaders to economy papers can be written from now
inefficiently subsidize their own ethnic group on that do not take this book, both its
so that they can increase the tax rate on their methodological and empirical contributions,
own ethnic group, and consequently tax seriously.
other groups more intensively. Equally important, with the addition of
A second crucial area is to construct this book, (comparative) political economy
models to understand how weak institu- has taken one more step toward establishing
tions can be strengthened. Although there itself as a major field of economics, and it
is now a number of formal models of the offers exciting and important research areas
creation and consolidation of democracy for further inquiry.
(see the literature review in Acemoglu and REFERENCES
Robinson 2005), many of these models are Acemoglu, Daron, and Joshua D. Angrist. 2000. How
still motivated by the practice of strongly Large Are Human-Capital Externalities? Evidence
from Compulsory Schooling Laws, in NBER
institutionalized polities. Macroeconomics Annual 2000. Ben S. Bernanke and
Finally, there is almost no work on how Kenneth Rogoff, eds. Cambridge: MIT Press, 959.
changes in formal political institutions affect Acemoglu, Daron, and Simon Johnson. 2003.
Unbundling Institutions. NBER Working Paper
policy and economic outcomes in weakly 9934.
institutionalized polities. These are all fruitful Acemoglu, Daron, Simon Johnson, and James A.
areas for future research. Robinson. 2001. The Colonial Origins of
Comparative Development: An Empirical
11. Conclusion Investigation. American Economic Review, 91(5):
13691401.
This book is an important landmark in Acemoglu, Daron, Simon Johnson, and James A.
Robinson. 2002. Reversal of Fortune: Geography
political economy. It takes a body of theoreti- and Institutions in the Making of the Modern World
cal literature seriously and carefully confronts Income Distribution. Quarterly Journal of
their predictions with data. PT set themselves Economics, 117(4): 123194.
Acemoglu, Daron, and James A. Robinson. 2005.
the very ambitious goal of estimating the Economic Origins of Dictatorship and Democracy.
causal effects of constitutions, in particular, Cambridge: Cambridge University Press.
the form of government and electoral rules. Acemoglu, Daron, James A. Robinson, and Thierry
Verdier. 2004. Kleptocracy and Divide-and-Rule: A
Bearing in mind the difficulties of estimat- Model of Personal Rule. Journal of the European
ing causal effects in social sciences, especial- Economic Association, 2(2): 16292.
ly in cross-country data, the book is a Angrist, Joshua D., and Guido W. Imbens. 1995. Two-
Stage Least Squares Estimation of Average Causal
tremendous success. It documents impor- Effects in Models with Variable Treatment
tant and robust correlations between the Intensity. Journal of the American Statistical
form of government and electoral systems, Association, 90(430): 43142.
Angrist, Joshua D., Guido W. Imbens, and Donald B.
on the one hand, and various policy and eco- Rubin. 1996. Identification of Causal Effects Using
nomic outcomes, on the other. It also uses Instrumental Variables. Journal of the American
state-of-the-art econometric techniques to Statistical Association, 91(434): 44455.
Angrist, Joshua D., and Alan B. Krueger. 1991. Does
estimate causal effects. Compulsory School Attendance Affect Schooling
I argued above that these methods may and Earnings? Quarterly Journal of Economics,
not have been sufficient to arrive at causal 106(4): 9791014.
Angrist, Joshua D., and Alan B. Krueger. 1999.
effects, both because of certain conceptual Empirical Strategies in Labor Economics, in
and practical problems. Nevertheless, the Handbook of Labor Economics Volume 3A. Orley
achievement here should not be underesti- Ashenfelter and David Card, eds. Amsterdam:
North-Holland, 12771366.
mated. Even noncausal robust relationships Arrow, Kenneth J. 1951. Social Choice and Individual
are rare in comparative political economy, Values. New York: Wiley.
de05_Article3 11/16/05 3:59 PM Page 1048
Austen-Smith, David, and Jeffrey S. Banks. 1999. Holmes, Thomas J. 1998. The Effect of State Policies
Positive Political Theory I: Collective Preferences. on the Location of Manufacturing: Evidence from
Ann Arbor: University of Michigan Press. State Borders. Journal of Political Economy, 106(4):
Bates, Robert H. 1981. Markets and States in Tropical 667705.
Africa. Berkeley: University of California Press. Hotelling, Harold. 1929. Stability in Competition.
Barro, Robert. 1997. Determinants of Economic Economic Journal, 39: 4157.
Growth. Cambridge: MIT Press. Jackson, Robert H., and Carl G. Rosberg. 1982.
Bertrand, Marianne, and Francis Kramarz. 2002. Personal Rule in Black Africa. Berkeley: University
Does Entry Regulation Hinder Job Creation? of California Press.
Evidence from the French Retail Industry. La Porta, Rafael, Florencio Lopez-de-Silanes, Andrei
Quarterly Journal of Economics, 117(4): 13691413. Shleifer, and Robert W. Vishny. 1998. Law and
Beard, Charles A. 1913. An Economic Interpretation of Finance. Journal of Political Economy, 106(6):
the Constitution of the United States. New York: The 111355.
Free Press. Levine, Ross. 1997. Financial Development and
Besley, Timothy, and Robin Burgess. 2004. Can Labor Economic Growth: Views and Agenda. Journal of
Regulation Hinder Economic Performance? Economic Literature, 35(2): 688726.
Evidence from India. Quarterly Journal of Levine, Ross. Forthcoming. Finance and Growth:
Economics, 119(1): 91134. Theory, Evidence, and Mechanisms, in Handbook
Black, Duncan. 1948. On the Rationale of Group of Economic Growth. Philippe Aghion and Steven
Decision-Making. Journal of Political Economy, Durlauf, eds. Amsterdam: Elsevier.
56(1): 2334. Lindbeck, Assar, and Jrgen Weibull. 1987. Balanced-
Bourguignon, Franois, and Thierry Verdier. 2000. Budget Redistribution as the Outcome of Political
Oligarchy, Democracy, Inequality and Growth. Competition. Public Choice, 52(3): 27297.
Journal of Development Economics, 62(2): 285313. Padro-i-Miquel, Gerard. 2004. The Control of
Dixit, Avinash, Gene M. Grossman, and Faruk Gul. Politicians in Divided Societies: Politics of Fear.
2000. The Dynamics of Political Compromise. MIT. Mimeo.
Journal of Political Economy, 108(3): 53168. Persson, Torsten, and Guido Tabellini. 2000. Political
Downs, Anthony. 1957. An Economic Theory of Economics: Explaining Economic Policy. Cambridge:
Democracy. New York: Harper and Row. MIT Press.
Engerman, Stanley L., and Kenneth L. Sokoloff. 1997. Persson, Torsten, and Guido Tabellini. 2003. The
Factor Endowments, Institutions, and Differential Economic Effects of Constitutions: What do the Data
Paths of Growth among New World Economies: A Say? Cambridge: MIT Press.
View from Economic Historians of the United Persson, Torsten, Gerard Roland, and Guido Tabellini.
States, in How Latin America Fell Behind: Essays 1997. Separation of Powers and Political
on the Economic Histories of Brazil and Mexico, Accountability. Quarterly Journal of Economics,
1800-1914. Stephen Haber, ed. Stanford: Stanford 112(4): 11631202.
University Press, 260304. Persson, Torsten, Gerard Roland, and Guido Tabellini.
Ferejohn, John. 1986. Incumbent Performance and 2000. Comparative Politics and Public Finance.
Electoral Control. Public Choice, 50(13): 525. Journal of Political Economy, 108(6): 112161.
Hall, Robert E. 1978. Stochastic Implications of the Rokkan, Stein. 1970. Citizens, Elections, Parties:
Life Cycle-Permanent Income Hypothesis: Theory Approaches to the Comparative Study of the
and Evidence. Journal of Political Economy, 86(6): Processes of Development. New York: McKay.
97187. Staiger, Douglas, and James H. Stock. 1997.
Hall, Robert E., and Charles I. Jones. 1999. Why Do Instrumental Variables Regression with Weak
Some Countries Produce So Much More Output Per Instruments. Econometrica, 65(3): 55786.
Worker Than Others? Quarterly Journal of Stigler, George J. 1970. Directors Law of Public
Economics, 114(1): 83116. Income Redistribution. Journal of Law and
Hansen, Christian, Jerry Hausman, and Whitney Economics, 13(1): 110.
Newey. 2004. Weak Instruments, Many Instruments Stigler, George J. 1972. Economic Competition and
and Microeconomic Practice. MIT. Mimeo. Political Competition. Public Choice, 13: 91106.
Hansen, Lars Peter, and Kenneth J. Singleton. 1982. Stock, James H., Jonathan H. Wright, and Motohiro
Generalized Instrumental Variables Estimation Yogo. 2002. A Survey of Weak Instruments and
of Nonlinear Rational Expectations Models. Weak Identification in Generalized Method of
Econometrica, 50(5): 126986. Moments. Journal of Business and Economic
Heckman, James J. 1976. The Common Structure of Statistics, 20(4): 51829.
Statistical Models of Truncation, Sample Selection Svensson, Jakob. 2003. Who Must Pay Bribes and How
and Limited Dependent Variables and a Simple Much? Evidence from a Cross Section of Firms.
Estimator for Such Models. Annals of Economic Quarterly Journal of Economics, 118(1): 20730.
and Social Measurement, 5(4): 47592. Wooldridge, Jeffrey M. 2002. Econometric Analysis of
Heckman, James J. 1979. Sample Selection Bias as a Cross Section and Panel Data. Cambridge and
Specification Error. Econometrica, 47(1): 15361. London: MIT Press.