The Building Blocks of Economic Complexity

Cesar A. Hidalgo¹ and Ricardo Hausmann

Center for International Development and Harvard Kennedy School, Harvard University, Cambridge, MA 02138
Edited by Partha Sarathi Dasgupta, University of Cambridge, Cambridge, United Kingdom, and approved May 1, 2009 (received for review January 28, 2009)
For Adam Smith, wealth was related to the division of labor. As people and firms specialize in different activities, economic efficiency increases, suggesting that development is associated with an increase in the number of individual activities and with the complexity that emerges from the interactions between them. Here we develop a view of economic growth and development that gives a central role to the complexity of a country's economy by interpreting trade data as a bipartite network in which countries are connected to the products they export, and show that it is possible to quantify the complexity of a country's economy by characterizing the structure of this network. Furthermore, we show that the measures of complexity we derive are correlated with a country's level of income, and that deviations from this relationship are predictive of future growth. This suggests that countries tend to converge to the level of income dictated by the complexity of their productive structures, indicating that development efforts should focus on generating the conditions that would allow complexity to emerge to generate sustained growth and prosperity.
economic development | networks
For Adam Smith, the secret to the wealth of nations was related to the division of labor. As people and firms specialize in different activities, economic efficiency increases. This division of labor, however, is limited by the extent of the market: the bigger the market, the more its participants can specialize and the deeper the division of labor that can be achieved. This suggests that wealth and development are related to the complexity that emerges from the interactions between the increasing number of individual activities that make up an economy (1–3). Now, if all countries are connected to each other through a global market for inputs and outputs so that they can exploit a division of labor at the global scale, why have differences in Gross Domestic Product (GDP) per capita exploded over the past 2 centuries (4, 5, *)? One possible answer is that some of the individual activities that arise from the division of labor described above cannot be imported, such as property rights, regulation, infrastructure, and specific labor skills, and so countries need to have them locally available to produce. Hence, the productivity of a country resides in the diversity of its available nontradable capabilities, and therefore, cross-country differences in income can be explained by differences in economic complexity, as measured by the diversity of capabilities present in a country and their interactions. During the last 20 years, models of economic growth have often included the assumption that the variety of inputs that go into the production of the goods produced by a country affects that country's overall productivity (3, 6). There have been very few attempts, however, to bring this intuition to the data. In fact, the most frequently cited surveys of the empirical literature do not incorporate a single reference to any measure of diversity of inputs or complexity (7).
We can create indirect measures of the capabilities available in a country by thinking of each capability as a building block or Lego piece. In this analogy, a product is equivalent to a Lego model, and a country is equivalent to a bucket of Legos. Countries will be able to make products for which they have all of the necessary capabilities, just as a child is able to produce a Lego model if the child's bucket contains all of the necessary Lego pieces. Using this analogy,
10570 10575 PNAS June 30, 2009 vol. 106 no. 26
the question of economic complexity is equivalent to asking whether we can infer properties such as the diversity and exclusivity of the Lego pieces inside a child's bucket by looking only at the models that a group of children, each with a different bucket of Legos, can make. Here we show that this is possible if we interpret data connecting countries to the products they export as a bipartite network and assume that this network is the result of a larger, tripartite network, connecting countries to the capabilities they have and products to the capabilities they require (Fig. 1A). Hence, connections between countries and products signal the availability of capabilities in a country just as the creation of a model by a child signals the availability of a specific set of Lego pieces. Note that this interpretation says nothing of the processes whereby countries accumulate capabilities and the characteristics of an economy that might affect them. It just attempts to develop measures of the complexity of a country's economy at a point in time. However, the approach presented here can be seen as a building block of a theory that accounts for the process by which countries accumulate capabilities. A detailed analysis of capability accumulation is beyond the scope of this article, but the implications of our approach will be discussed briefly in Discussion. In this article we develop a method to characterize the structure of bipartite networks, which we call the Method of Reflections, and apply it to trade data to illustrate how it can be used to extract relevant information about the availability of capabilities in a country.
We interpret the variables produced by the Method of Reflections as indicators of economic complexity and show that the complexity of a country's economy is correlated with income and that deviations from this relationship are predictive of future growth, suggesting that countries tend to approach the level of income associated with the capability set available in them. We validate our measures of the capabilities available in a country by introducing a model and by showing empirically that our metrics are strongly correlated with the diversity of the labor inputs used in the production of a country's goods, approximated by using data on the use of labor inputs in the United States. Finally, we show that the level of complexity of a country's economy predicts the types of products that countries will be able to develop in the future, suggesting that the new products that a country develops depend substantially on the capabilities already available in that country.

Author contributions: C.A.H. and R.H. designed research, performed research, contributed new reagents/analytic tools, analyzed data, and wrote the paper. The authors declare no conflict of interest. This article is a PNAS Direct Submission.
¹To whom correspondence should be addressed. E-mail: cesar_hidalgo@ksg.harvard.edu.
*In ref. 4, Maddison presents GDP per capita measures for 60 countries since 1820. In that year, the ratio of the 95th to the 5th percentile was 3.18 but it increased to 17.82 by the year 2000. Today, the U.S. GDP per capita is 60 times higher than Malawi's.
This article contains supporting information online at www.pnas.org/cgi/content/full/0900943106/DCSupplemental.
www.pnas.org/cgi/doi/10.1073/pnas.0900943106

[Fig. 1 graphic not reproduced. Panel A: tripartite network linking countries (c1–c3) to capabilities (a1–a3) and capabilities to products (p1–p3), next to the resulting bipartite network of countries (c1–c3) and products (p1–p3). Panel B: network of MYS, PAK, PHL, and JPN and their exports, with node color indicating SITC-4 category: 0–999 Food & live animals; 1000–1999 Beverages & tobacco; 2000–2999 Raw materials; 3000–3999 Mineral fuels, lubricants & related materials; 4000–4999 Animal & vegetable oils, fats & waxes; 5000–5999 Chemicals; 6000–6999 Manufactured goods by material; 7000–7999 Machinery & transport equipment; 8000–8999 Miscellaneous manufactured articles; 9000–9999 Miscellaneous. Panel C: scatter of countries (ISO codes) with kc,0 and kc,1 as axes; quadrant labels recovered here: "Non-Diversified Countries Producing Exclusive Products" and "Diversified Countries Producing Exclusive Products".]

Methods
We look at country–product associations by using international trade data with products disaggregated according to 3 alternative data sources and classifications: first, the Standard International Trade Classification (SITC) revision 4 at the 4-digit level (see ref. 8; the data are available at www.nber.org/data, http://cid.econ.udavis.edu/data/undata/undata.html, and www.chidalgo.com/productspace/data.html); second, the COMTRADE Harmonized System at the 4-digit level; and third, the North American Industry Classification System (NAICS) at the 6-digit level (SI Appendix, Section 1). We interpret these data as bipartite networks in which countries are connected to the products they export (Fig. 1B). Mathematically, we represent this network using the adjacency matrix Mcp, where Mcp = 1 if country c is a significant exporter of product p and 0 otherwise. We consider country c to be a significant exporter of product p if its Revealed Comparative Advantage (RCA) (the ratio of the share of product p in the export basket of country c to the share of product p in world trade) is at least some threshold value, which we take as 1 in this exercise (RCAcp ≥ 1) (see SI Appendix, Section 2).
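The construction of Mcp from export values described in Methods can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the toy export table, and the use of NumPy are assumptions, and the sketch assumes that every country and every product has nonzero total exports.

```python
import numpy as np

def rca_matrix(exports):
    """RCA[c, p]: share of product p in country c's export basket,
    divided by the share of product p in world trade."""
    exports = np.asarray(exports, dtype=float)
    country_totals = exports.sum(axis=1, keepdims=True)   # total exports of each country
    product_totals = exports.sum(axis=0, keepdims=True)   # world exports of each product
    world_total = exports.sum()
    share_in_basket = exports / country_totals
    share_in_world = product_totals / world_total
    return share_in_basket / share_in_world

def adjacency_from_exports(exports, threshold=1.0):
    """Mcp = 1 where RCA reaches the threshold (1 in the text), else 0."""
    return (rca_matrix(exports) >= threshold).astype(int)

# Hypothetical export values: rows are countries, columns are products.
toy_exports = np.array([
    [10.0, 0.0, 5.0],
    [2.0, 8.0, 0.0],
    [1.0, 1.0, 1.0],
])
M = adjacency_from_exports(toy_exports)
print(M)
```

Working with shares rather than raw export values is what keeps large countries from being linked to every product simply because of their size.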
Method of Reflections. We characterize countries and products by introducing a family of variables capturing the structure of the network defined by Mcp (SI Appendix, Section 3). Because of the symmetry of the bipartite network, we refer to this technique as the Method of Reflections, as the method produces a symmetric set of variables for the 2 types of nodes in the network (countries and products). The Method of Reflections consists of iteratively calculating the average value of the previous-level properties of a node's neighbors and is defined as the set of observables:

k_{c,N} = \frac{1}{k_{c,0}} \sum_{p} M_{cp}\, k_{p,N-1},   [1]

k_{p,N} = \frac{1}{k_{p,0}} \sum_{c} M_{cp}\, k_{c,N-1},   [2]

for N ≥ 1. With initial conditions given by the degree, or number of links, of countries and products:

k_{c,0} = \sum_{p} M_{cp},   [3]

k_{p,0} = \sum_{c} M_{cp}.   [4]

kc,0 and kp,0 represent, respectively, the observed levels of diversification of a country (the number of products exported by that country), and the ubiquity of a product (the number of countries exporting that product). Hence, we characterize each country through the vector kc = (kc,0, kc,1, kc,2, . . . , kc,N) and each product by the vector kp = (kp,0, kp,1, kp,2, . . . , kp,N). For countries, even variables (kc,0, kc,2, kc,4, . . . ) are generalized measures of diversification, whereas odd variables (kc,1, kc,3, kc,5, . . . ) are generalized measures of the ubiquity of their exports. For products, even variables are related to their ubiquity and the ubiquity of other related products, whereas odd variables are related to the diversification of countries exporting those products. In network terms, kc,1 and kp,1 are known as the average nearest neighbor degree (9, 10). Higher order variables (N > 1), however, can be interpreted as a linear combination of the properties of all of the nodes in the network with coefficients given by the probability that a random walker that started at a given node ends up at another node after N steps (see SI Appendix, Section 4).

Results
We can begin understanding the type of information about countries captured by the Method of Reflections by looking at where countries are located in the space defined by the first two sets of variables produced by our method: kc,0 and kc,1. Fig. 1C shows that there is a strong negative correlation between kc,0 and kc,1 (10, 11), meaning that diversified countries tend to export less ubiquitous products. Deviations from this behavior, however, are informative. For example, whereas Malaysia and Pakistan export the same
Hidalgo and Hausmann
ECONOMIC SCIENCES
STATISTICS
Fig. 1. Quantifying countries' economic complexity. (A) A country will be able to produce a product if it has all of the required capabilities; hence the bipartite network connecting countries to products is a result of the tripartite network connecting countries to their available capabilities and products to the capabilities they require. (B) Network visualization of a subset of Mcp in which we show Malaysia (MYS), Pakistan (PAK), Philippines (PHL), Japan (JPN), and all of the products exported by them in the year 2000 (colored circles), illustrating how countries and products are connected in Mcp. (C) kc,0–kc,1 diagram divided into 4 quadrants defined by the empirically observed averages ⟨kc,0⟩ and ⟨kc,1⟩; quadrant labels recovered alongside this caption: "Non-Diversified Countries Producing Standard Products" and "Diversified Countries Producing Standard Products".
[Fig. 2 graphic not reproduced. Recoverable panel labels: matrices Cca (countries × capabilities), Πpa (products × capabilities), and Mcp (countries × products) in panel A; kc,0–kc,1 diagrams in panel B, annotated with r = 0.55 or 0.7, q = 0.05 or 0.1, and Na = 50 or 200; Na as an axis label, with kc,0 and kc,1, in panel C; "Average Number of Labor Inputs" plotted against kc,0, kc,1, and kc,2, with countries shown by ISO code, in panel D.]

Fig. 2. Capabilities and bipartite network structure. (A) We model the structure of Mcp by taking 2 random matrices representing the availability of capabilities in a country and the requirement of capabilities by products and consider that countries are able to produce products if they have all of the required capabilities. (B) The kc,0–kc,1 diagrams that emerge from 4 implementations of the model described in A. (C) kc,0 and kc,1 as a function of the number of capabilities (Nc) available in countries for 2 implementations of the model. (D) Average number of labor inputs required by products produced in a country as a function of the first 3 components of kc.
number of products, the products exported by Malaysia (kMYS,0 = 104, kMYS,1 = 18) are exported by fewer countries than those exported by Pakistan (kPAK,0 = 104, kPAK,1 = 27.5). Combining this fact with our third level of analysis, we see that Malaysian products are exported by more diversified countries than the exports of Pakistan (kMYS,2 = 163 > kPAK,2 = 142; SI Appendix, Section 8). This suggests that the productive structure of Malaysia is more complex than that of Pakistan, due, as we will show shortly, to a larger number of capabilities available in Malaysia than in Pakistan. In SI Appendix we show that the negative relationship presented in the kc,0–kc,1 diagram is not a consequence of variations in the level of diversification of countries and in the ubiquity of products. We show this by creating 4 null models (11) that control, with increasing stringency, for the diversification of countries and the ubiquity of products and show that these distributions, per se, are not responsible for the negative relationship observed in the data (see SI Appendix, Section 6).
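The iteration defined by Eqs. 1–4 can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not the authors' code: the function name, the toy matrix, and the choice of NumPy are hypothetical, and every country and product is assumed to have at least one link so that the divisions are defined.

```python
import numpy as np

def method_of_reflections(M, n_iter):
    """Return arrays kc (n_iter+1, countries) and kp (n_iter+1, products)."""
    M = np.asarray(M, dtype=float)
    kc0 = M.sum(axis=1)                      # Eq. 3: diversification of each country
    kp0 = M.sum(axis=0)                      # Eq. 4: ubiquity of each product
    kc, kp = [kc0], [kp0]
    for _ in range(n_iter):
        kc_next = (M @ kp[-1]) / kc0         # Eq. 1: average kp,N-1 over a country's products
        kp_next = (M.T @ kc[-1]) / kp0       # Eq. 2: average kc,N-1 over a product's exporters
        kc.append(kc_next)
        kp.append(kp_next)
    return np.array(kc), np.array(kp)

# Hypothetical nested network: country 0 exports everything, country 2 only product 0.
M_toy = np.array([
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
])
kc, kp = method_of_reflections(M_toy, n_iter=2)
print(kc)   # row N holds kc,N for every country
```

In this toy network the most diversified country also has the lowest kc,1, which mirrors the negative relationship between kc,0 and kc,1 reported for Fig. 1C.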
Minimalistic Model. We show that the location of countries in the kc,0–kc,1 diagram is informative about the capabilities available in a country by introducing a simple model based on the assumption that country c will be able to produce product p if it has all of the required capabilities (Fig. 2A). We implement this model by considering a fixed number of capabilities in each country and represent this by using a matrix Cca that is equal to 1 if country c has capability a and 0 otherwise. We represent the relationship between capabilities and the products that require them by a matrix Πpa whose elements are equal to 1 if product p requires capability a and 0 otherwise. Using the notation introduced above, together with our only assumption, we can model the structure of the Mcp matrix as:

M_{cp} = \begin{cases} 1 & \text{if } \sum_{a} \Pi_{pa} = \sum_{a} \Pi_{pa} C_{ca}, \\ 0 & \text{otherwise.} \end{cases}   [5]

The simplest implementation of this model is to consider Cca = 1 with probability r and 0 with probability 1 − r, and Πpa = 1 with probability q and 0 with probability 1 − q. An emergent property of the matrix resulting from this model is that the average ubiquity of a country's products tends to decrease with its level of diversification for a wide range of parameters (Fig. 2B). We interpret this negative relationship by considering that countries with many capabilities will be more diversified, because they can produce a wider set of products, and that because they can make products requiring many capabilities, few other countries will have all of the requisite capabilities to make them; hence diversified countries will be able to make less ubiquitous products. The model allows us to test directly whether, given this set of assumptions, we should expect countries with more capabilities to be more diversified and produce less ubiquitous products. Fig. 2C shows that, in the model, the diversity of a country increases with the number of capabilities it possesses, whereas the ubiquity of a country's products is a decreasing function of the number of capabilities available in that country, providing further theoretical evidence that kc captures information on the availability of capabilities in a country, and therefore, about the complexity of its economy.
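The random-matrix implementation of Eq. 5 can be simulated as in the sketch below. The values r = 0.55, q = 0.1, and 50 capabilities are taken from the parameter labels of Fig. 2; the numbers of countries and products, the seed, and all function and variable names are assumptions made for illustration.

```python
import numpy as np

def simulate_capability_model(n_countries, n_products, n_capabilities, r, q, seed=0):
    """Draw random capability matrices and apply Eq. 5 to obtain Mcp."""
    rng = np.random.default_rng(seed)
    # C[c, a] = 1 with probability r: country c has capability a.
    C = (rng.random((n_countries, n_capabilities)) < r).astype(int)
    # P[p, a] = 1 with probability q: product p requires capability a.
    P = (rng.random((n_products, n_capabilities)) < q).astype(int)
    required = P.sum(axis=1)        # number of capabilities each product requires
    held = C @ P.T                  # required capabilities that each country has
    # Eq. 5: country c makes product p only if it holds every required capability.
    M = (held == required[None, :]).astype(int)
    return C, P, M

# Country and product counts are hypothetical.
C, P, M = simulate_capability_model(100, 200, 50, r=0.55, q=0.1, seed=0)
n_capabilities_held = C.sum(axis=1)   # capabilities available in each country
diversification = M.sum(axis=1)       # kc,0 of each simulated country
```

Plotting `diversification` against `n_capabilities_held` for such a draw is one way to reproduce the qualitative check described for Fig. 2C. Note that a product that happens to require no capabilities satisfies Eq. 5 trivially and is made by every country.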
Fig. 3. Bipartite network structure and income (all GDPs have been adjusted by Purchasing Power Parity, PPP). A–E were constructed with data from the year 2000. (A–C) GDP per capita adjusted by purchasing power parity as a function of our first 3 measures of diversification (kc,0, kc,2, kc,4), normalized by subtracting their respective means (⟨kc,N⟩) and dividing them by their standard deviations (stdev(kc,N)). (A) kc,0. (B) kc,2. (C) kc,4. (D) Comparison between the ranking of countries based on successive measures of diversification (kc,2N). (E) Absolute value of the Pearson correlation between the log GDP per capita at PPP of countries and their local network structure characterized by kc,N. (F) Growth in GDP per capita at PPP observed between 1985 and 2005 as a function of growth predicted from kc,18 and kc,19 measured in 1985 and controlling for GDP per capita at PPP in 1985.
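The normalization used in panels A–C and the correlation profile of panel E can be sketched as follows. The function names and the toy numbers are hypothetical, and the population standard deviation (NumPy's default) is an assumption, since the caption does not say which estimator was used.

```python
import numpy as np

def standardize(x):
    """Subtract the mean and divide by the standard deviation (population form)."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()

def correlation_profile(kc, log_gdp):
    """Absolute Pearson correlation between log GDP per capita and kc,N, for each N."""
    log_gdp = np.asarray(log_gdp, dtype=float)
    return np.array([abs(np.corrcoef(k_n, log_gdp)[0, 1]) for k_n in kc])

# Hypothetical values for 3 countries: rows of kc_toy are kc,0 and kc,1.
kc_toy = np.array([
    [3.0, 2.0, 1.0],
    [2.0, 2.5, 3.0],
])
log_gdp_toy = np.array([10.0, 9.0, 8.0])

z0 = standardize(kc_toy[0])
profile = correlation_profile(kc_toy, log_gdp_toy)
```

Taking the absolute value puts even variables (diversification, positively related to income) and odd variables (ubiquity, negatively related) on the same scale, which is how panel E reports them.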
Direct Measurement of a Subset of Capabilities. We provide empirical evidence that the Method of Reflections extracts information that is related to the capabilities available in a country by looking at a measurable subset of the capabilities required by products. Fig. 2D shows the average number of different employment categories required by products exported by countries versus kc,0, kc,1, and kc,2. We measure the number of employment categories that go into a product by using the data of the U.S. Bureau of Labor Statistics (see SI Appendix, Section 1). These data should play against us, because we are disregarding the fact that other countries may use different technologies to produce goods that are similarly classified.† Despite this, we find a strong positive correlation between the average number of employment categories going into the export basket of countries and our family of measures of diversification (kc,0, kc,2, kc,4, . . . , kc,2N). We also find a negative correlation between the average number of employment categories and measures of the ubiquity of products made by a country (kc,1, kc,3, kc,5, . . . , kc,2N+1) (Fig. 2D). This shows that more diversified countries indeed produce more complex products, in the sense that they require a wider combination of human capabilities, and that kc is able to capture this information.

†Indeed, it is common for poorer countries to exchange labor for capital. For example, building a road in the U.S. is done by a relatively small team of workers, each of them specialized to operate a different machine or technique, whereas more modest economies will tend to use more workers, yet less specialized ones, because the relative cost of machines to labor is larger in poorer economies. Hence we should expect poor countries to use fewer labor inputs in the production of products than what would be reported from U.S. labor data, accentuating the effect presented in Fig. 2D.

[Fig. 4 graphic not reproduced. Each panel plots countries by ISO code. Fit annotations as printed in the panels: (A) fitted line −0.051 kc,0 + 21.82, Pearson correlation = −0.73, t-test = 11.8, p-value = 6 × 10^−22; (B) fitted line 0.83 kc,1 − 1.83, Pearson correlation = 0.63, t-test = 9.17, p-value < 2 × 10^−15; (C) fitted line 0.178 kc,0 + 146.2, Pearson correlation = 0.59, t-test = 8.21, p-value < 3 × 10^−13; (D) fitted line −2.99 kc,1 + 230, Pearson correlation = −0.54, t-test = 7.2, p-value < 6 × 10^−11.]

Fig. 4. Path dependent development. Average network properties (⟨kp,0⟩, ⟨kp,1⟩; measured in 1992) of the new exports developed by a country between 1992 and 2000 as a function of the diversification of a country (kc,0) and the average ubiquity of its products (kc,1) measured in 1992. (A) kc,0 vs. ⟨kp,0⟩. (B) kc,1 vs. ⟨kp,0⟩. (C) kc,0 vs. ⟨kp,1⟩. (D) kc,1 vs. ⟨kp,1⟩.

Complexity of the Productive Structure, Income and Growth. We show that the information extracted by the Method of Reflections is connected to income by looking at the first 3 measures of diversification of a country (kc,0, kc,2, kc,4) versus GDP per capita adjusted for Purchasing Power Parity (PPP) (Fig. 3 A–C). To make these 3 different measures comparable we have normalized them by subtracting their respective means (⟨kN⟩) and dividing them by their respective standard deviations (stdev(kN)). As we iterate the method, the relative ranking of countries defined by these variables shifts (Fig. 3D and SI Appendix, Fig. S14), making our measures of diversification and ubiquity increasingly correlated with income (Fig. 3E and SI Appendix, Section 11). This can be illustrated by looking at the position, in the kc,N–GDP diagrams, of 3 countries that exported a similar number of products in the year 2000 despite large differences in income: Pakistan (PAK), Chile (CHL), and Singapore (SGP) (Fig. 3 A–C). Higher reflections of our method are able to correctly differentiate the income level of these countries because they incorporate information about the ubiquity of the products they export and about the diversification of other countries connected indirectly to them in Mcp, altering their relative rankings (Fig. 3D and SI Appendix, Fig. S14). For example, kc,2 is able to correctly separate Singapore, Chile, and Pakistan, because it considers that in the bipartite network Singapore is connected to diversified countries mainly through nonubiquitous products, signaling the availability in Singapore of capabilities that are required to produce goods in diversified countries. In contrast, Pakistan is connected mostly to poorly diversified countries, and most of its connections are through ubiquitous products, indicating that Pakistan has capabilities that are available in most countries and that its relatively high level of diversification is probably due to its relatively large population, rather than to the complexity of its productive structure. Indeed, we find the Method of Reflections to be an accurate way to control for a country's population, as correlations between kc and population decrease rapidly as we iterate the method (see SI Appendix, Section 11), whereas correlations between kc and GDP increase as we iterate the method. This is another piece of evidence suggesting that the information captured by our method is related to factors that affect the ability to generate per capita income.

Deviations from the correlation between kc and income are good predictors of future growth, indicating that countries tend to approach the levels of income that correspond to their measured complexity. We show this by regressing the rate of growth of income per capita on successive generations of our measures of economic complexity (i.e., kc,0, kc,1 or kc,10, kc,11) and on a country's initial level of income

\log\left(\frac{\mathrm{GDP}(t+\Delta t)}{\mathrm{GDP}(t)}\right) = a + b_1\,\mathrm{GDP}(t) + b_2\,k_{c,N}(t) + b_3\,k_{c,N+1}(t),   [6]
finding that successive generations of the variables constructed in the previous section are increasingly good predictors of growth. In SI Appendix, Section 13, we present regression tables showing that these results are valid for a 20-year period (1985–2005), two 10-year periods, or four 5-year periods, that they are robust to the inclusion of other control variables such as individual country dummies (to capture any time-invariant country characteristic), and that our measures outperform other indicators used to measure the productive structure of a country, such as the Hirschman-Herfindahl index (12, 13) and entropy measures (14). A graphical example of this relationship is presented in Fig. 3F, which compares the growth predicted from the linear regression described by Eq. 6 and that observed empirically for the 1985–2005 period and N = 18.

Finally, we show that the evolution of Mcp exhibits strong path dependence, meaning that we can anticipate some of the properties of a country's future new exports based on its current productive structure. This observation is consistent with the existence of an unobservable capability space that evolves gradually, because the ability of a country to produce a new product is limited to combinations of the capabilities it initially possesses plus any new capabilities it will accumulate. Countries with many capabilities will be able to combine new capabilities with a wide set of existing capabilities, resulting in new products of higher complexity than those of countries with few capabilities, which will be limited by this fact. We show this using data collected between 1992 and 2000 (we choose 1992 as our starting point because the end of the Soviet Union and the unification of Germany introduce large discontinuities in the number and identity of countries) and consider as a country's new exports those items for which that country had an RCAcp < 0.1 in the year 1992 and an RCAcp ≥ 1 by the year 2000. Fig. 4 shows that the level of diversification of a country (kc,0) and the ubiquity of its exports (kc,1) predict the average ubiquity (⟨kp,0⟩) of a country's new exports and the average level of diversification (⟨kp,1⟩) of the countries that were hitherto exporting those products.
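The selection of new exports and the per-country averages plotted in Fig. 4 can be sketched as below. The thresholds 0.1 and 1 come from the text; whether each inequality is strict, the function names, and the toy numbers are assumptions made for illustration.

```python
import numpy as np

def new_export_mask(rca_start, rca_end, low=0.1, high=1.0):
    """True where a product was far from being exported at the start
    (RCA below `low`) and is a significant export at the end (RCA at least `high`)."""
    return (np.asarray(rca_start) < low) & (np.asarray(rca_end) >= high)

def mean_over_new_exports(mask, product_property):
    """Average a product-level property (e.g., kp,0) over each country's new exports.
    Countries with no new exports get NaN."""
    mask = np.asarray(mask, dtype=float)
    counts = mask.sum(axis=1)
    totals = mask @ np.asarray(product_property, dtype=float)
    return np.divide(totals, counts, out=np.full_like(totals, np.nan), where=counts > 0)

# Hypothetical RCA values for 2 countries and 3 products in the two years.
rca_1992 = np.array([[0.05, 1.20, 0.00],
                     [0.50, 0.02, 0.01]])
rca_2000 = np.array([[1.50, 1.10, 0.30],
                     [0.90, 2.00, 1.00]])
kp0_1992 = np.array([10.0, 20.0, 30.0])   # hypothetical product ubiquities in 1992

mask = new_export_mask(rca_1992, rca_2000)
avg_kp0_new = mean_over_new_exports(mask, kp0_1992)
```

Passing kp,1 instead of kp,0 as `product_property` gives the second quantity shown in Fig. 4, the average diversification of the countries previously exporting those products.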
This result is related to the idea that the productive structure of countries evolves by spreading to nearby products in The Product Space (15–17), which is a projection of the bipartite network studied here in which pairs of products are connected based on the probability that they are exported by the same countries. This last set of results suggests that the proximity between products in The Product Space is related to the similarity of the requisite capabilities that go into a product, because countries tend to jump into products that require capabilities that are similar to those required by the products they already export.

Discussion
Understanding the increasingly large gaps in income per capita across countries is one of the eternal puzzles of development economics. Our view is that complexity is at the root of the explanation, as argued by both Adam Smith (1) and the recent endogenous growth theories (2, 3), yet empirical research has not advanced along these dimensions because of the absence of adequate measures of complexity. Instead, it has emphasized the accumulation of a few highly aggregated factors of production, such as physical and human capital, or general institutional measures, such as rule of law, disregarding their specificity and complementarity. In this article we have presented a technique that uses available economic data to develop measures of the complexity of products and of countries, and showed that these measures (i) capture information about the complexity of the set of capabilities available in a country; (ii) are strongly correlated with income per capita; (iii) are predictive of future growth; and (iv) are predictive of the complexity of a country's future exports, making a strong empirical case that the level of development is indeed associated with the complexity of a country's economy. This article has not emphasized the process through which countries accumulate capabilities, but has instead focused on their measurement and consequences. However, the results presented here suggest that changes in a country's productive structure can be understood as a combination of 2 processes: (i) that by which countries find new products as yet-unexplored combinations of the capabilities they already have, and (ii) the process by which countries accumulate new capabilities and combine them with other previously available capabilities to develop yet more products. A possible explanation for the connection between economic complexity and growth is that countries that are below the income expected from their capability endowment have yet to develop all of the products that are feasible with their existing capabilities. We can expect such countries to be able to grow more quickly, relative to those countries that can only grow by accumulating new capabilities.

This perspective also suggests that the incentive to accumulate capabilities would depend, among other things, on the expected demand that new capabilities would face, and this would depend on how new capabilities can complement existing ones to create new products. This opens up an avenue for further research on the dynamics of product and capability accumulation. Development economics has tended to disregard the search for detailed capabilities and their patterns of complementarity, hoping that aggregate measures of physical capital (e.g., measured in dollars) or human capital (e.g., measured in years of schooling) would provide enough guidance for policy. Our line of research would justify and provide guidance to development strategies that look to promote products (or capabilities) as a way to create incentives to accumulate capabilities (or develop new products) that could themselves encourage the further coevolution of new products and capabilities, echoing ideas put forward by Albert Hirschman (18) more than 50 years ago, but adding the capacity to analyze them in practice.

1. Smith A (1776) An Inquiry into the Nature and Causes of the Wealth of Nations (W. Strahan and T. Cadell, London).
2. Romer P (1990) Endogenous technological change. J Pol Econ 98:S71–S102.
3. Grossman GM, Helpman E (1991) Quality ladders in the theory of growth. Rev Econ Stud 58:43–61.
4. Maddison A (2001) The World Economy: A Millennial Perspective (Development Centre of the OECD, Paris).
5. Pritchett L (1997) Divergence, big time. J Econ Perspec 11:3–18.
6. Aghion P, Howitt PW (1998) Endogenous Growth Theory (MIT Press, Cambridge, MA).
7. Barro RJ, Sala-i-Martin X (2003) Economic Growth (MIT Press, Cambridge, MA).
8. Feenstra RC, Lipsey RE, Deng H, Ma AC, Ma H (2005) World Trade Flows: 1962–2000. NBER Working Paper 11040. Available at www.nber.org/papers/w11040.
9. Pastor-Satorras R, Vazquez A, Vespignani A (2001) Dynamical and correlation properties of the internet. Phys Rev Lett 87:258701.
10. Maslov S, Sneppen K (2002) Specificity and stability in topology of protein networks. Science 296:910–913.
ACKNOWLEDGMENTS. We thank M. Andrews, A.-L. Barabasi, B. Klinger, M. Kremer, N. Nunn, L. Pritchett, R. Rigobon, D. Rodrik, M. Yildirim, R. Zeckhauser, participants at the Center for International Developments Seminar on Economic Policy and the Harvard Kennedy School Faculty Seminar, members of the Center for Complex Network Research at Northeastern University, and the Ratatouille Seminar Series. We acknowledge support from the Growth Lab and the Empowerment Lab at the Center for International Development.
11. Newman MEJ (2002) Assortative mixing in networks. Phys Rev Lett 89:208701.
12. Hirschman AO (1945) National power and structure of foreign trade (University of California Press, Berkeley, CA).
13. Herfindahl OC (1950) Concentration in the steel industry (PhD Dissertation, Columbia University, New York).
14. Saviotti PP, Frenken K (2008) Export variety and the economic performance of countries. J Evol Econ 18:201-218.
15. Hidalgo CA, Klinger B, Barabasi A-L, Hausmann R (2007) The product space conditions the development of nations. Science 317:482-487.
16. Hausmann R, Klinger B (2006) The structure of the product space and the evolution of comparative advantage. CID Working Paper No. 128. Available at www.cid.harvard.edu/cidwp/128.htm.
17. Hidalgo CA, Hausmann R (2008) A network view of economic development. Developing Alternatives 12(1):5-10.
18. Hirschman AO (1958) The Strategy of Economic Development (Yale Univ Press, New Haven, CT).
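
Illustrative note. The projection of the country-product network onto products, in which pairs of products are linked by the probability that they are exported by the same countries, can be sketched in a few lines. This is only a minimal sketch under stated assumptions, not the authors' implementation: the toy matrix `M_TOY` and the function name `product_proximity` are hypothetical, and the link weight used here (co-exporting countries divided by the larger of the two products' exporter counts, i.e., the smaller of the two conditional probabilities) is one plausible way to turn "probability of being exported by the same countries" into a number.

```python
def product_proximity(M):
    """Project a binary country x product matrix onto products.

    M[c][p] is 1 if country c exports product p, else 0.
    Returns a symmetric matrix phi where phi[p][q] is the number of
    countries exporting both p and q, divided by the larger of the two
    products' exporter counts (an assumed definition, see lead-in).
    """
    n_products = len(M[0])
    # Number of countries exporting each product.
    exporters = [sum(row[p] for row in M) for p in range(n_products)]
    phi = [[0.0] * n_products for _ in range(n_products)]
    for p in range(n_products):
        for q in range(p + 1, n_products):
            both = sum(1 for row in M if row[p] and row[q])
            denom = max(exporters[p], exporters[q])
            if denom > 0:
                phi[p][q] = phi[q][p] = both / denom
    return phi


# Hypothetical toy data: 4 countries (rows) x 4 products (columns).
M_TOY = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
]

phi = product_proximity(M_TOY)
for row in phi:
    print(["%.2f" % value for value in row])
```

In this toy example, products 0 and 1 share two exporters and are therefore closer to each other than products 0 and 3, which share none. Under the interpretation given in the text, a country exporting product 0 would be expected to move into product 1 more readily than into product 3.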
PNAS | June 30, 2009 | vol. 106 | no. 26 | 10575
ECONOMIC SCIENCES, STATISTICS

1To whom correspondence should be addressed. E-mail: cesar_hidalgo@ksg.harvard.edu.

www.pnas.org/cgi/doi/10.1073/pnas.0900943106

[Figure residue. Panel B legend: node color by SITC-4 category name. Quadrant labels: Non-Diversified Countries Producing Exclusive Products; Diversified Countries Producing Exclusive Products; Non-Diversified Countries Producing Standard Products; Diversified Countries Producing Standard Products. Model panels: r = 0.55, q = 0.1, Na = 50 and r = 0.7, q = 0.05, Na = 200. Axis label: Average Number of Labor Inputs. Country labels: SWE, JPN, AUT, ITA, CZE, DEU, POL, ESP.]


[Figure residue. Fitted line: k1 = 0.178k + 146.2. Axis label: <kp,1> (new exports). Pearson correlation = -0.54, t-test = 7.2, p-value < 6x10^-11.]
