Professional Documents
Culture Documents
+ c
t
Where:
B = Health indicators such as life expectancy, infant mortality or under-5 mortality
e = Error term
t = Time interval
i = Country index
At this stage, a few other potential health determinants will not be included in the
regression and they are number of physicians (PHY), health expenditure per capita (EXP),
urbanization (URB), and governance quality (GOV). There are two reasons for their exclusion
in this phase of estimation. First, these variables are relatively less popular in the empirical
literature and hence serve better instead as variables of interest in the extreme bounds
analysis (EBA). Second, there exists a degree of collinearity between the variables included
and not included. For example, both GDP and EXP are highly correlated (r=0.93) and hence
the estimation results may be biased if the two are entered simultaneously in the equation.
Consequently, it is sensible to exclude certain variables from the basic health regression
model for the purpose of this chapter.
Another equally important point to address here is that the above model does not explicitly
account for causality. It is fairly understood that any conclusion derived from the estimation
of this model will be subjected to the problem of endogeneity if no proper controls or
instruments are used. This is especially true when one considers for example the possible
two-way relationship between income and health where the former is determining and
being determined by the other. However, as mentioned earlier in the introduction chapter,
the thesis is more focused on examining the associative relationship between a particular
variable and population health rather than establishing any causal effect between them. For
this reason, even though the endogeneity problem is acknowledged, it will not be dealt with
extensively because it is not a major concern in the context of this research.
31
4.2 Explanation of Dependent and Independent Variables
The three health indicators which will be used are taken from the World Banks World
Development Indicator (WDI) database: (i) life expectancy (LFE), defined as the number of
years a newborn infant is expected to live based on the pattern of mortality (which is
assumed to be constant throughout its lifetime) at the time of its birth, (ii) infant mortality
(IFM) defined as the number of infant deaths occurring before they reach one year of age for
every 1,000 live births per year and (iii) under-5 mortality measured as the probability per
1,000 that a newborn baby will die before the age of five based on the current age-specific
mortality rates.
The independent variables (also taken from the WDI unless otherwise indicated) are defined
as follows. Gross domestic product per capita (GDP) is the total gross value produced by the
economy divided by the total population of the country and is denoted in constant 2000 US
dollars. Generally, it implies the level of income or purchasing power possessed by
individuals and households. The Gini coefficient (GIN) is an index developed to measure the
pattern of income distribution and its deviation from a perfectly equal distribution. A value
of zero signifies complete equality while a value of 100 represents complete inequality. The
data for GIN is derived from the World Income Inequality Database (WIID), Version 2.0c,
May 2008 which is maintained by the World Institute for Development Economics Research
of the United Nations University (UNU-WIDER).
EDU is the proxy for the level of education (primary, secondary and tertiary included). It
specifies the average years of total schooling attained by the female population aged 25
years and above (Barro and Lee, 2010). The level of access to safe water (WAT) is defined as
the percentage of the population with reasonable access to a sufficient amount of water
from an improved source such as a household connection, protected well or spring and
rainwater collection. The African dummy variable (AFR) is based on the World Banks
classification of sub-Saharan African countries.
32
PHY is defined as the number of physicians (including general practitioners and specialists)
for every 1000 persons in a country. Health expenditure per capita (EXP) is the ratio of public
and private health expenditure to the total population. Expenditure includes the provision of
health services, family planning and nutrition activities by individuals and households, and
emergency aid allocated for population health. The data for this variable is denoted in
constant 2005 international dollars after correcting for purchasing power parity.
Urbanization (URB) measures the percentage of the total population living in urban areas.
The definition of urban areas is determined by the respective national statistical offices in
each country.
Governance quality (GOV) is an index of which covers six dimensions of governance namely
voice and accountability, political stability & the absence of violence/terrorism, government
effectiveness, regulatory quality, rule of law and control of corruption. These governance
indicators are essentially the statistical compilation of survey results on the quality of
governance gathered from various types of respondent which include enterprises,
institutional bodies and citizens of both developed and developing countries. The data for
this variable is measured in numerical units ranging from the lowest (-2.5) to the highest
(2.5). A higher value denotes a better governance outcome. The source for GOV originates
from the Worldwide Governance Indicators (WGI) project under the auspices of The World
Bank.
4.3 Data Averaging
The sample used for estimation consists of observations on 95 countries over the period
1960 and 2008. The sample period is split into two subperiods. The period from 1960 to
1985 is treated as a lag interval and hence will be denoted with the subscript 0. The period
from 1986 to 2008 is referred to as the current interval and will be denoted with the
subscript 1. For instance, GDP
0
is the past value of the gross domestic product per capita
while GDP
1
is the equivalent value of the present. Therefore, provided that sufficient data is
available, there will be two observations for a particular variable in the sample set. The
33
reason for splitting the data into two subperiods is because it allows the extreme bounds
analysis (EBA) to produce estimations for two different periods so that a comparative
analysis over time can be made. Chapter 5 will accordingly present the result for this
exercise. However, to economize space in this chapter, the focus here is on the current
interval only.
For each country, data are averaged over the two subperiods, as illustrated in Figure 4.1. As
shown in the calculation of GDP in the diagram, the value of the denominator is dependent
upon the total observations that are available in each subperiod. For example, in the case of
Japan (shown here), 26 observations and 23 observations were obtained for the lag interval
and current interval respectively. This method of averaging GDP applies for other variables
of interest as well. There are several reasons as to why averaging is preferred to other
methods of presenting data. First, averaging is a standard procedure used in the literature.
Second, it removes any short-term effect that might be present in the data. Third, due to
unbalanced observations in the annual data among countries, averaging can help to
construct a more balanced dataset.
A special mention should be given to the procedure used to obtain values of governance
quality (GOV) as it is slightly different from the calculation of other variables. The six
dimensions of governance, as discussed earlier, contain observations over the period 1996
and 2008 and therefore require two averaging processes in order for them be incorporated
in the basic health regression model. Figure 4.2 shows the procedure to determine the value
of GOV for Malaysia. The overall aim of this exercise is to create a fairly representative
aggregate indicator of governing quality that can be used to compare between countries.
Table 4.1 shows the summary statistics of all the variables and their respective measurement
unit.
34
Figure 4.2: GOV Calculation for Malaysia Figure 4.1: The GDP Intervals and Calculations for Japan
35
Table 4.1: Summary Statistics of All Variables
Note: The summary statistics above are for the period between 1986 and 2008 (current interval).
36
4.4 Expected Relationships between Variables
One would generally expect any increases in GDP per capita (GDP), health expenditure per
capita (EXP) and number of physicians (PHY) to improve the ability of individuals and
households to acquire goods and services which are beneficial for their health. Thus, these
variables contribute favourably to life expectancy and reduce mortality. Investments in
female education (EDU) meanwhile is predicted to have a positive effect on future income
and this in turn will enable a greater command over basic necessities such as food, water
and shelter. EDU also assist in the diffusion of basic health knowledge which can help to
encourage the adoption of good dietary habits and preventive measures against diseases.
Overall, EDU will have a positive relationship with population health.
With water being an important source of life, access to clean water (WAT) is most likely a
crucial element in ensuring population survival and hence is important in predicting
mortality. In addition, the provision of clean water helps to create hygienic and sanitary
living conditions which can reduce the virulence of disease-producing pathogens such as
bacteria and viruses. Meanwhile, a high level of urbanization (URB) is frequently linked with
the ease of access to medical treatments, hospitals and other health amenities. The
affluence of these facilities in a highly populated area enables the sick to be treated early
and effectively.
Governance quality (GOV) reflects the efficiency of public goods distribution within a country
and is often crucial in making public health programs readily accessible to those in need.
Therefore, a good governance system in place ensures policies and public institutions are
able to achieve their objectives and this in turn will improve population health interventions
and programs. A higher value of GOV denotes a better governance outcome.
Gini index (GIN) is a measure of the income gap between the rich and the poor. A high state
of inequality will often imply a dominant control of the countrys resources by a small group
of the population while leaving the rest with few social benefits. In many cases, the rich has
37
little incentive to provide public goods (including health) to the poor if the former can
comfortably afford private goods and services. As a result, the rich would naturally resist any
form of public health measure that is not within their interest. In the absence of an effective
interventionist policy by the government, a high value of GIN is often associated with an
increase in poverty and poor population health. However, there are also situations when
income inequality can lead to better health. For example, if the poor in a democratic society
forms the majority of the population (implying high income inequality), they are more likely
to vote for the government that will provide the largest amount of public good. Assuming
that this effect dominates, income inequality will then be associated with higher public
health expenditure (e.g construction of hospitals) and perhaps better health outcomes. For
the purpose of this thesis, however, GIN is expected to be negatively associated with health.
Some preliminary stylized facts on the relationship between the three health indicators and
the independent variables can be determined from simple bivariate scatter plots. Figure 4.3
shows scatter diagrams depicting the relationship between life expectancy (LFE
1
) and each
of the independent variable. It is apparent that GDP
1
, EDU
1
, WAT
1
, PHY
1
, EXP
1
, URB
1
and
GOV
1
share a positive relationship with life expectancy while the inverse relationship is true
for GIN
1
. This corresponds with the population health theories and regressions discussed
earlier in Chapter 2 and Chapter 3.
Figure 4.4 presents scatter plots with infant mortality (IFM
1
) on the y-axis and the respective
independent variables on the x-axis. Similarly, Figure 4.5 displays under-5 mortality (UFM
1
)
on the y-axis and the respective independent variables on the x-axis. Since both infant
mortality and under-5 mortality measure the same dimension of population health, they
share identical relationships with respect to other independent variables. As expected, GDP
1
,
EDU
1
, WAT
1
, PHY
1
, EXP
1
, URB
1
and GOV
1
are negatively correlated with both mortality
indicators while GIN
1
is positively correlated. Again, this is line with expectations. Table 4.2
shows a brief summary of the relationships between the population health indicators and
the independent variables.
38
30
40
50
60
70
80
90
4 5 6 7 8 9 10 11
GDP PER CAPITA (LOG)
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
20 30 40 50 60 70 80
GINI INDEX
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
30 40 50 60 70 80 90 100
ACCESS TO SAFE WATER
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
0 1 2 3 4
NUMBER OF PHYSICIANS
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
0 2 4 6 8 10 12 14
FEMALE EDUCATION
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
0 1 2 3 4 5 6 7 8 9
HEALTH EXPENDITURE PER CAPITA (LOG)
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
Figure 4.3: Scatter Plots of Life Expectancy and Independent Variables
(a) Correlation = 0.8481
(b) Correlation = -0.4816
(c) Correlation = 0.7768
(d) Correlation = 0.8195
(e) Correlation = 0.7370 (f) Correlation = 0.8300
39
30
40
50
60
70
80
90
0 20 40 60 80 100
URBANIZATION
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
30
40
50
60
70
80
90
-2 -1 0 1 2
GOVERNANCE
L
I
F
E
E
X
P
E
C
T
A
N
C
Y
0
20
40
60
80
100
120
140
160
4 5 6 7 8 9 10 11
GDP PER CAPITA (LOG)
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
20 30 40 50 60 70 80
GINI INDEX
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
0 2 4 6 8 10 12 14
FEMALE EDUCATION
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
(g) Correlation = 0.7079 (h) Correlation = 0.7014
Figure 4.4: Scatter Plots of Infant Mortality and Independent Variables
(a) Correlation = -0.8406 (b) Correlation = 0.4017 (c) Correlation = -0.8019
40
0
20
40
60
80
100
120
140
160
30 40 50 60 70 80 90 100
ACCESS TO SAFE WATER
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
0.0 0.2 0.4 0.6 0.8 1.0
ETHNIC FRACTIONALIZATION
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
0 1 2 3 4
NUMBER OF PHYSICIANS
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
0 20 40 60 80 100
URBANIZATION
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
-2 -1 0 1 2
GOVERNANCE
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
0
20
40
60
80
100
120
140
160
0 1 2 3 4 5 6 7 8 9
HEALTH EXPENDITURE PER CAPITA (LOG)
I
N
F
A
N
T
M
O
R
T
A
L
I
T
Y
(d) Correlation = -0.8291 (e) Correlation = 0.6010 (f) Correlation = -0.7116
(g) Correlation = -0.8188 (h) Correlation = -0.655 (i) Correlation = -0.7199
41
0
40
80
120
160
200
240
4 5 6 7 8 9 10 11
GDP PER CAPITA (LOG)
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
20 30 40 50 60 70 80
GINI INDEX
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
30 40 50 60 70 80 90 100
ACCESS TO SAFE WATER
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
0 1 2 3 4
NUMBER OF PHYSICIANS
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
0 2 4 6 8 10 12 14
FEMALE EDUCATION
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
Figure 4.5: Scatter Plots of Under-5 Mortality and Independent Variables
(b) Correlation = 0.4114 (a) Correlation = -0.8106 (c) Correlation = -0.7858
(d) Correlation = -0.8400 (e) Correlation = -0.6489
42
0
40
80
120
160
200
240
0 20 40 60 80 100
URBANIZATION
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
-2 -1 0 1 2
GOVERNANCE
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
0
40
80
120
160
200
240
0 1 2 3 4 5 6 7 8 9
HEALTH EXPENDITURE PER CAPITA (LOG)
U
N
D
E
R
-
5
M
O
R
T
A
L
I
T
Y
Table 4.2: Relationships between Population Health Indicators and Independent Variables
VARIABLE
CODE
INDEPENDENT VARIABLE
DEPENDENT VARIABLE
Life expectancy
(LFE)
Infant mortality
(IFM)
Under-5 mortality
(UFM)
GDP Gross domestic product per capita Positive Negative Negative
GIN Gini index Negative Positive Positive
EDU Female education Positive Negative Negative
WAT Access to safe water Positive Negative Negative
PHY Number of physicians Positive Negative Negative
EXP Health expenditure per capita Positive 3Negative Negative
URB Urbanization Positive Negative Negative
GOV Governance quality Positive Negative Negative
(f) Correlation = -0.7922 (g) Correlation = -0.6382 (h) Correlation = -0.6658
43
4.5 Some Preliminary Estimations and Diagnostic Results
Table 4.3 reports the regression results of equation (1) for the dependent variables life
expectancy (LFE
1
), infant mortality (IFM
1
) and under-5 mortality (UFM
1
). All of the
independent variables and dummy variable were of the expected sign except for the Gini
index (GIN
1
)
20
. The log of GDP per capita (GDP
1
) is consistently significant at the 1% level
across all three health indicators and implies the importance of income in explaining
population health.
Female education (EDU
1
) and access to safe water (WAT
1
) are significant in the model and
evidence is especially stronger for both infant and under-5 mortality at the 1% level of
significance. One possible reason is because maternal education plays an important role in
the development of newborns and young infants. Educated mothers tend to be more
conscious in promoting a healthy growing environment for their children to live in and this
has a beneficial effect of reducing mortality. While both EDU
1
and WAT
1
are significant when
the dependent variable is life expectancy, they are however only significant at 5% and 10%
level respectively.
The strong significance of the dummy variable for sub-Saharan African countries (AFR) seems
to suggest a regional effect on population health. The effect is especially large as indicated
by the value of its coefficient where life expectancy, on average, being approximately 11
years lower if a country originates from the sub-Saharan African region. Additionally, these
countries experience higher rates of mortality with the difference being on average as high
as 62 deaths for under-5 mortality. This could be attributed to Africas extreme cases of
poverty and high prevalence of the HIV/AIDS disease.
The R
2
for the model with LFE
1
as the dependent variable is equal to 0.91 implying that
variations in the independent variables can account for 91% of the cross-country variation in
life expectancy. Similarly, the IFM
1
and UFM
1
models recorded high values of R
2
at 0.88 and
20
Nevertheless, as discussed previously in section 4.4, there can be a positive relationship between the Gini
index and population health and this in turn implies a priori the sign of GIN
is ambiguous.
44
Table 4.3: Regression Result of Equation (1)
Dependent variable
Life expectancy (LFE
1
) Infant mortality (IFM
1
) Under-5 mortality (UFM
1
)
Constant term (C) 39.60 (2.79)*** 144.96 (13.43)*** 215.24 (20.52)***
GDP per capita (log) (GDP
1
) 2.68 (0.41)*** -6.69 (1.54)*** -7.51 (2.40)***
Gini index (GIN
1
) 0.02 (0.03) -0.30 (0.13)** -0.50 (0.22)**
Female education (EDU
1
) 0.41 (0.16)** -2.67 (0.74)*** -4.17 (1.00)***
Access to safe water (WAT
1
) 0.07 (0.04)* -0.39 (0.12)*** -0.83 (0.23)***
African dummy variable (AFR) -11.04 (1.24)*** 31.76 (4.33)*** 62.94 (7.62)***
Observations 88 88 88
R-squared 0.91 0.88 0.89
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
All standard errors are White-heteroskedasticity-consistent.
45
0.89 respectively indicating a good fit between the estimated regressions and the data.
Hence, the inclusion of these independent variables in the basic health regression model is
justified based on their explanatory power.
Since the measurement units are different, the relative strength of each independent
variable could not be determined directly from Table 4.3. In order to investigate their
relative importance, these variables have to be standardized so all of their variances have
the value of 1. Accordingly, Table 4.4 shows the standardized beta coefficients for the five
independent variables based on equation (1).
Table 4.4: Standardized Beta Coefficients
Dependent variable
Life expectancy
(LFE
1
)
Infant mortality
(IFM
1
)
Under-5 mortality
(UFM
1
)
GDP per capita (log) (GDP
1
) 0.40 -0.30 -0.20
Gini index (GIN
1
) 0.02 -0.11 -0.12
Female education (EDU
1
) 0.13 -0.24 -0.23
Access to safe water (WAT
1
) 0.11 -0.20 -0.25
African dummy variable (AFR) -0.48 0.41 0.49
It appears that the African dummy variable (AFR) has the strongest effect while the Gini
index (GIN
1
) has the lowest among all independent variables. The log of GDP per capita
(GDP
1
) features more prominently for the life expectancy and infant mortality regressions in
comparison to female education (EDU
1
) and access to safe water (WAT
1
). The regression
with under-5 mortality as the dependent variable sees a generally equal share of magnitude
between GDP
1
, EDU
1
and WAT
1
. Notwithstanding the role of AFR and GIN
1
, the standardized
beta coefficients match the common expectations regarding the relative importance of each
explanatory variable with respect to the three health indicators.
46
4.5.1 Possible Collinearity between Gini Index and other Explanatory Variables
A correlation matrix is constructed to investigate whether the incorrect sign for the Gini
index (GIN
1
) in all three regressions is possibly related to the effects of collinearity between
GIN
1
and other explanatory variables. Table 4.5 shows the sample correlation coefficient
matrix for all explanatory variables except the African dummy variable (AFR).
Table 4.5: Correlation Matrix
Variable
GDP per capita
(log) (GDP
1
)
Gini index
(GIN
1
)
Female
education
(EDU
1
)
Access to
safe water
(WAT
1
)
GDP per capita (log) (GDP
1
)
Gini index (GIN
1
) -0.46
Female education (EDU
1
) 0.62 -0.42
Access to safe water (WAT
1
) 0.54 -0.42 0.70
From the table, the Gini index does not seem to be highly correlated with the log of GDP per
capita, female education or access to safe water. Although this examination failed to provide
a solution to the problem of GIN
1
having the wrong sign for its coefficient, its modest
correlation with others implies a variation in GIN
1
that could potentially explain the
dependent variables without succumbing to the collinearity effect. In other words, the Gini
index could still be a potential population health determinant in this set of independent
variables despite its coefficient sign being inversed.
To further investigate the problem, an auxiliary regression is performed with the Gini index
as the dependent variable and the rest of the explanatory variables as the independent
variables. Auxiliary regressions are preferred to correlation matrix in detecting collinearity if
there are more than one explanatory variable in the regression model. Table 4.6 reports the
results of this estimation. The value of R
2
obtained is quite low at 0.29 implying a low
probability of multicollinearity. All variables are insignificant with the exception of AFR.
47
Table 4.6: Auxiliary Regression (Dependent Variable: Gini Index)
Constant term (C)
51.93 (8.98)***
GDP per capita (log) (GDP
1
) -0.18 (1.56)
Female education (EDU
1
) -0.69 (0.63)
Access to safe water (WAT
1
) -0.05 (0.13)
African dummy variable (AFR) 10.89 (3.71)***
Observations 88
R-squared 0.29
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
All standard errors are White-heteroskedasticity-consistent.
The significance of AFR at the 1% level in the auxiliary regression yields an interesting
observation. One plausible interpretation of this is that both GIN
1
and AFR are commonly
related to certain underlying factor(s) which could potentially obscure their respective
marginal effect on population health. For example, the extreme poverty and generally weak
economic conditions found in many sub-Saharan African countries tend to further aggravate
the existing level of income inequality especially if the quality of institutions is inadequate.
Therefore, it is quite possible to argue that the inclusion of AFR in the basic regression model
could alter the effect of GIN
2
on life expectancy (LFE
1
), infant mortality (IFM
1
) and under-5
mortality (UFM
1
). For this reason, equation (1) is estimated again for the three health
indicators without the presence of AFR.
4.5.2 Excluding the African Dummy Variable (AFR)
Table 4.7 reports the regression results for equation (1) without including the African
dummy variable (AFR). The model with life expectancy (LFE
1
) as the dependent variable
improved as shown inside the circled area with the Gini index (GIN
1
) having the correct
theoretical sign and being significant at the 10% level. In addition, both female education
and (EDU
1
) and access to safe water (WAT
1
) are now statistically significant at the 1% level.
48
Table 4.7: Regression Result of Equation (1) excluding the African Dummy Variable (AFR)
Dependent variable
Life expectancy (LFE
1
) Infant mortality (IFM
1
) Under-5 mortality (UFM
1
)
Constant term (C) 30.83 (4.60)*** 170.20 (16.70)*** 265.26 (26.07)***
GDP per capita (log) (GDP
1
)
2.61 (0.54)***
-6.49 (2.07)*** -7.11 (4.01)*
Gini index (GIN
1
) -0.08 (0.05)* -0.00 (0.15) 0.08 (0.26)
Female education (EDU
1
) 0.66 (0.24)*** -3.36 (0.95)*** -5.55 (1.69)***
Access to safe water (WAT
1
) 0.18 (0.04)*** -0.73 (0.17)*** -1.49 (0.31)***
Observations 88 88 88
R-squared 0.79 0.79 0.76
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
All standard errors are White-heteroskedasticity-consistent.
49
However, the exclusion of AFR fails to improve the other two models. The infant mortality
model experienced little changes to its coefficients with the GIN
1
coefficient still displaying
the inaccurate sign. Even though the under-5 mortality model shows a correct sign for GIN
1
,
the variable is not significant and the statistical strength for its log of GDP per capita (GDP
1
)
variable has been reduced to the 10% level (highlighted in red).
Furthermore, the R
2
values for all three regressions have dropped substantially after being
estimated without the African dummy variable. The standardized beta coefficients (refer to
Table A1 in the Appendix) did not change considerably after excluding AFR and hence the
order of relative importance among the explanatory variables remains the same. At this
stage, it is fairly clear that the inclusion of GIN
1
in the infant and under-5 mortality models
will not be necessary. Nevertheless, the situation is uncertain for the life expectancy model
and therefore the following test is performed.
4.5.3 Functional Form Stability and Omitted Variables Test
For the purpose of deciding whether to retain or exclude the Gini index (GIN
1
) from the life
expectancy model, the Regression Specifications Error Test (RESET) shall be utilized. The
RESET test is designed to detect misspecification in a model through the inclusion of
polynomial terms in the regression (Hill et al., 2008). In the above case, three possible
regressions will be examined under this test. The first one includes the GIN
1
variable but
excludes the African dummy variable (as seen in Table 4.7), the second includes both (as
seen in Table 4.3) and the last includes the African dummy variable (AFR) but excludes the
GIN
1
variable.
Results show that the RESET test (with 1 and 2 fitted terms) rejects the null hypothesis of no
misspecification at the 1% level of significance for the first regression indicating there could
be a problem of omitted variables, inclusion of irrelevant variables or inappropriate
functional form. However, there is a failure to reject the null hypothesis in both the second
and third regressions. Two implications arise from these outcomes. First, the omitted
50
variables problem seems to be mitigated with the inclusion of AFR. Second, the GIN
1
variable
appears to be an irrelevant variable since the failure of the RESET test to reject the null
hypothesis in the third regression suggests that the model is well specified. At this stage, it is
unlikely that GIN
1
will be considered as one of the candidates for the always included
regressors in the extreme bounds analysis (EBA) of Chapter 5. Based on the results of various
diagnostic tests performed above, it is perhaps more sensible instead to test the robustness
of GIN
1
as a variable of interest using the EBA.
4.6 Summary
The basic health regression model is estimated using data collected from 95 developed and
developing countries. After observing some preliminary scatter plots, the chapter provides
regression estimates of the effect of several potential determinants of health outcomes in
order to identify which variables will be always included in the extreme bounds analysis
(EBA) of Chapter 5. These variables are the log of GDP per capita (GDP
1
), female education
(EDU
1
), access to safe water (WAT
1
) and the African dummy variable (AFR). The Gini index
(GIN
1
) variable will not be treated as an always included regressor for three main reasons.
First, the scatter plots reveal a low correlation between GIN
1
and the three health indicators
thus suggesting low explanatory power. Second, GIN
1
fails to achieve the desired theoretical
sign in most specifications and appears to be irrelevant under the RESET test. Third, the
standardized beta coefficients show GIN
1
as the weakest among all regressed independent
variables in terms of importance. By taking all of these into consideration, it is then probably
more meaningful to designate GIN
1
as one of the variables of interest instead of the always
included regressors in the EBA.
51
Chapter 5: A Sensitivity Analysis
In this chapter, a sensitivity test of the potential health determinants will be performed by
using the extreme bounds analysis (EBA) method. The chapter begins with a background of
the EBA methodology and its application in the economic literature. Then, a detailed
explanation is given on the procedures of EBA in the context of investigating population
health. Following that, the subsequent interpretation of EBA estimation results for each
health indicator are discussed. The chapter concludes with a summary of the robust
potential determinants of population health.
5.1 Background of the Extreme Bounds Analysis
Empirical conclusions are often subjected to issues of robustness. In a typical regression
framework, it relates to the statistical significance of a particular variable when there are
changes in the choice of information set. For example, x
1
may be a significant explanatory
variable in a hypothetical regression model that includes x
2
and
x
3
but becomes insignificant
when x
4
is included. If it is indeed the case, then one cannot have complete confidence in the
conclusion that x
1
is important even though it was significant before the inclusion of x
4
. This
leads to several questions. In view of the situation, how does one interpret the relationship
between x
1
and the dependent variable? What if there are other variables in addition to x
4
that warrant consideration? How does one choose which variables to include?
The problem is that in many cases, other than the key variables of interest, it is not clear
which variables should be included as controls in the model. More often than not,
researchers have to resort to arbitrary selection of variables based on their preconceived
notions and ideas. Since different researchers hold different views about what kind of
variables should be in a model, this convention of estimation may give rise to biased
conclusions. The researcher could be tempted to report only those specifications that deliver
the expected/desired results and omit the others. Moosa and Cardak (2006) remark that
there are economists who estimate 1000 regressions, only to discard 999 of them after
successfully finding a regression which they find most convenient to report. This practice is
52
often associated with data mining and it may cast doubts about the validity of the
empirical results obtained.
In relation to the above quandary, Leamer (1983) argues that there is a need for empirical
studies to supply evidence of stability in their conclusions. Reliable conclusions are more
likely to be achieved if they are insensitive towards changes in the set of variables and
observations used. He postulates that a variable is only considered robust if its statistical
significance is not conditional on the choice of information set. Consequently, the extreme
bounds analysis (EBA) as discussed in Leamer (1983, 1985) was developed to reduce the
effect of model uncertainty and to provide a rigorous methodology in which the robustness
of variables can be assessed. It is a useful procedure to report the sensitivity of estimated
results to specification changes (Moosa and Cardak, 2006).
The relationship between a given explanatory variable and a dependent variable is deemed
to be robust if it produces an estimated coefficient that remains statistically significant and
possesses the theoretically predicted sign when the set of explanatory variables in the
regression changes. Generally, the EBA is based on equation (2):
(2) = o +[
I + [
m
m +[
z
Z + u
The independent variables are grouped into three different classifications. I is a vector of
explanatory variables that are always included in the regression. These variables are called
free (or fixed) variables. Typically, theories and past empirical studies are used to identify
the variables that should be entered under this category. m denotes the variable of interest
for which its robustness is being tested. Z is a set or vector of other variables which are
potentially important in explaining the dependent variable.
The EBAs objective is to find the widest range of coefficient estimates on the variable of
interest (m) that standard hypothesis tests do not reject. This is done by varying the set of Z-
53
variables in Equation (2) to produce regressions that will yield different values of coefficient
estimates ([
m
) for the variable of interest (m). Each regression therefore contains the free
variables ( I ), the variable of interest ( m) and many combinations of Z-variables
21
.
Subsequently, from these regressions results, the procedure identifies the highest and
lowest values of [
m
that cannot be rejected at a particular significance level.
Accordingly, the upper extreme bound is defined as the maximum value of [
m
plus two of its
standard deviation ([
m
mux
+ 2o
m
mux
) while the lower extreme bound is the minimum value of
[
m
minus two of its standard deviation ([
m
mn
2o
m
mn
). The partial correlation between the
variable of interest (m) and the dependent variable can then be inferred from these
extreme bounds. If both bounds are of the same sign (indicating that the interval does not
contain the value of zero), the variable of interest (m) is then said to be robust. Otherwise,
the result will be considered as fragile. The extreme values of [
m
mux
and [
m
mn
serve to
describe the ambiguity of inferences about the coefficient [
m
which is caused by model
uncertainty (Leamer and Leonard, 1983). In other words, the EBA is designed to expose any
biased estimators resulting from the arbitrary selection or exclusion of explanatory variables
in a model.
5.2 Applications of the Extreme Bounds Analysis
Levine and Renelt (1992) applied the extreme bounds analysis (EBA) to their study of cross-
country growth regressions by investigating the sensitivity of potential growth determinants
to changes in the explanatory variables used in the equation. The initial GDP per capita, ratio
of investment to GDP, initial secondary school enrolment and population growth rate were
included as the free variables (I) in the study. Over 50 other variables, including the authors
own constructed policy indicators, were identified as the Z-variables. Although there were
many variables examined, they found few to be robustly correlated with growth under the
EBA. Admittedly, however, the possible implication of introducing this large number of
21
The variables in this group are non-exhaustive and may even run up to 50 variables as in the case of growth
determinants. Therefore, based on the possible combinations of the Z-variables, the number of regressions
needed to perform the EBA could range from hundreds to millions.
54
regressors in an equation could lead to the problem of multicollinearity. This has an effect of
amplifying estimated coefficients and standard errors and hence affecting significance
results.
Nevertheless, Leamer (1978) reasons that multicollinearity is a manifestation of a weak data
problem rather than a procedural problem. If there is not enough independent variation in
an explanatory variable to explain the dependent variable, then the partial correlation
between the two in a cross-section regression is unlikely to be robust and therefore will be
reported as fragile by the EBA.
Similarly, Levine and Renelt (1992) believe that one should only place confidence in a
correlation if it remains significant after controlling for other relevant variables. In order to
mitigate the multicollinearity problem, their EBA is restricted in several ways. First, only a
maximum of three Z-variables are allowed in the analysis so that the total number of
explanatory variables in any one regression does not exceed eight. Second, to create a small
but reasonable conditioning set, the pool of variables from which the Z-variables are chosen
is limited to only seven variables. Third, for every variable of interest (m), the pool of
variables is further reduced by removing variables that, a priori, may measure the same
aspect. Consequently, they argue that these steps would reduce multicollinearity and hence
making the results of their EBA more credible.
The extreme bounds analysis is criticized, however, for its rather restrictive criterion for a
variable to be robust. A variable is deemed to be robust only if all estimated [
m
are
statistically significant and the extreme bounds share the same sign. This implies that a
variable will be considered fragile by the EBA even if the coefficient sign differs in only one of
the many regressions estimated. Critics of the EBA tend to highlight this as the underlying
reason why it is most likely only a few (or no) variables will be robust under the procedure
(for example see McAleer et al., 1985). Sala-i-Martin (1997a) suggests that the test is too
strong for any variables to really pass it and argues that if a large enough number of
55
regressions are run, the researcher is bound to find at least one specification in which the
estimated coefficient becomes insignificant or changes sign. As a result, even genuine
explanatory variables can be ruled out by the EBA simply because of one unfavorable
regression.
There are several variants of the extreme bounds analysis that attempt to circumvent the
above problem by relaxing the robustness criteria used in Leamer (1985) and Levine and
Renelt (1992). One of them is by Sala-i-Martin (1997a, 1997b) who revisits the study of
growth determinants by adopting a more lenient version of the EBA in which the entire
distribution of the estimators [
m
is analyzed for the purpose of evaluating robustness. He
reasons that it is more practical to place a level of confidence on the variable of interest
rather than strictly designating it as robust or fragile. According to him, if at least 95
percent of the cumulative distribution (density) function for [
m
lies on either side of zero,
one should be able to interpret confidently that the variable is robust. Based on this
approach, more variables are found to be strongly related to growth in his study when
compared to the ones claimed in Levine and Renelts EBA.
Another earlier variant of the EBA is forwarded by Granger and Uhlig (1990) in a study of the
velocity of money. They propose a more reasonable test by seeking to eliminate
models/regressions that have poor goodness of fit (R
2
) from the calculation of the extreme
bounds. The models with low R
2
are considered irrelevant and therefore their removal
reduces the difference between the upper and lower extreme bound. The narrower bounds
would mean it is less likely that the two would be of different signs and thus improving the
chances of a variable being classified as robust. Other recent applications of EBA or its
variants include studies by Baxter and Kouparitsas (2005) and Nguyen (2007) on the
determinants of business cycle comovement and synchronization.
56
5.3 Extreme Bounds Analysis for Population Health
In this thesis, the extreme bounds analysis (EBA) is applied to a linear regression that is
developed to explain population health. The EBA regression framework for health
determinants takes the following form:
(S) E
= o
0
+ o
]
I
]
n
]=1
+ [m
+ y
]
Z
]
p
]=1
+ u
where E represents the type of health indicator (life expectancy, infant mortality or under-5
mortality), I is a vector of the free variables, as identified by past studies, that are always
included in the regression, m is the single variable of interest for which its robustness is
being tested, and Z is a vector of the potentially important explanatory variables for
population health.
The I-variables consist of four variables which are the log of gross domestic product per
capita (GDP
1
), female education (EDU
1
), access to safe water (WAT
1
) and the African dummy
variable (AFR). Most empirical studies investigating population health report these variables
in their regressions as potential health determinants or as control variables. On the other
hand, there are five variables considered as candidates for the Z-variables and they are
number of physicians (PHY
1
), the log of health expenditure per capita (EXP
1
), Gini index
(GIN
1
), urbanization (URB
1
) and governance quality (GOV
1
). As discussed in Chapter 4, these
five variables have been studied in the literature and were often noted for their important
association with population health although they do not always appear simultaneously in
any given health regression.
The EBA will test nine variables of interest (m). Out of these nine, four are drawn from the
set of I-variables and five from the set of Z-variables. In other words, all of the variables will
be tested under the procedure. In the case where the variable of interest (m) is also one of
the I-variables, this variable will simply be regarded as m and therefore the number of I-
57
variables will be reduced by one in the regression equation. For example, if the variable of
interest is GDP
1
, the I-variables are only EDU
1,
WAT
1
and AFR. Likewise, if the variable of
interest is EDU
1
, the I-variables will then be only GDP
1,
WAT
1
and AFR. In the case where the
variable of interest (m) is also one of the Z-variables, this variable will also simply be
regarded as m therefore limiting the maximum number of Z-variables possible in the
regression equation to four. For example, when the variable of interest is PHY
1
, the set of Z-
variables will only consist of EXP
1
, GIN
1
, URB
1
and GOV
1
22
. Figure 5.1 shows a schematic
representation of the EBA criterion for a robust or fragile variable.
The upper and lower extreme bounds will be calculated for all variables of interest by
estimating regressions with various combinations of the Z -variables. Similar to the
explanation provided in the preceding section, the upper extreme bound here is defined as
the highest value of [ plus two of its standard error ([
hgh
+ 2o
hgh
) while the lower
extreme bound is the lowest value of [ minus two of its standard error ([
Iow
2o
Iow
). The
highest and lowest [s are identified from the values of [ that cannot be rejected at the 5%
significance level. With five Z-variables, the total possible combinations will be 31 (=2
5
1) permutations. In effect, this means there are 31 different regressions for each variable of
interest (m). Following Levine and Renelt (1992), the thesis performs the EBA test for each
health indicator and below are the discussion of the obtained results.
5.3.1 EBA Results for Life Expectancy
Table 5.1 shows EBA results for the free variables (I-variables) as variables of interest when
the dependent variable is life expectancy (LFE
1
). The coefficient estimates for the log of GDP
per capita (GDP
1
) and female education (EDU
1
) are found to be positive and robust. The
positive relationship between life expectancy and the two variables is compatible with the
outcomes of other health regression studies and therefore is within expectations. However,
access to safe water (WAT
1
) is reported as fragile with different signs for its extreme bounds.
22
Levine and Renelt (1992) limit the number of Z-variables in their EBA regression model to three because of
the large pool of variables from which the procedure chooses the Z-variables. In contrast, the EBA in this thesis
has only five variables as the Z-variables and therefore considers every possible combination of them.
58
The variable of interest (m) is said to be ROBUST if both the upper and lower extreme bounds
a) have the same coefficient sign and
b) are significant at the 5% level.
The variable of interest (m) is said to be FRAGILE if the upper and lower extreme bounds
a) have different coefficient sign AND/OR
b) are not significant at the 5% level.
Figure 5.1: A Schematic Representation of the EBA Criterion for Robustness
59
Table 5.1: EBA Results for the Free (I) Variables (Dependent Variable: Life Expectancy, LFE1)
Variable of
interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
1
)
High: 2.68 (0.41)*** 88 0.91 GIN
1
Upper: 3.50
Lower: 0.38
Robust Base: 2.65 (0.38)*** 95 0.91
Low: 1.74 (0.68)** 88 0.92 PHY
1
, GOV
1
, URB
1
EXP
1
, GIN
1
Female
education
(EDU
1
)
High: 0.44 (0.17)** 95 0.91 PHY
1
, URB
1
Upper: 0.79
Lower: 0.03
Robust Base: 0.40 (0.15)*** 95 0.91
Low: 0.36 (0.17)** 88 0.92 GOV
1
, URB
1
, EXP
1
, GIN
1
Access to safe
water
(WAT
1
)
High: 0.07 (0.04)* 88 0.91 GOV
1
, GIN
1
Upper: 0.14
Lower: -0.02
Fragile Base: 0.06 (0.04)* 95 0.91
Low: 0.05 (0.03) 95 0.91 GOV
1
, URB
1
, EXP
1
African dummy
variable
(AFR)
High: -10.38 (1.04)*** 95 0.91 GOV
1
, EXP
1
Upper: -8.29
Lower: -13.91
Robust Base: -10.54 (1.01)*** 95 0.91
Low: -11.34 (1.28)*** 88 0.92 PHY
1
, GOV
1
, URB
1
, GIN
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The Z-Variables are the other variables of interest in the regression that produce the extreme bounds. PHY is the number of physicians, GOV is the governance
quality, URB is urbanization, EXP is health expenditure per capita and GIN is the Gini index.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
60
Table 5.2: EBA Results for the Z-variables (Dependent Variable: Life Expectancy, LFE
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Number of
physicians
(PHY
1
)
High: 0.08 (0.43) 88 0.91 GIN
1
Upper: 0.95
Lower: -1.21
Fragile
a
Base: -0.02 (0.39) 95 0.91
Low: -0.35 (0.43) 95 0.91 URB
1
, EXP
1
Governance quality
(GOV
1
)
High: 0.75 (0.73) 88 0.92 PHY
1
, URB
1
, GIN
1
Upper: 2.21
Lower: -1.28
Fragile Base: 0.20 (0.59) 95 0.91
Low: -0.07 (0.60) 95 0.91 EXP
1
Urbanization
(URB
1
)
High: 0.04 (0.02)* 95 0.91 PHY
1
, GOV
1
Upper: 0.09
Lower: -0.03
Fragile Base: 0.04 (0.02) 95 0.91
Low: 0.02 (0.03) 88 0.92 EXP
1
, GIN
1
Health expenditure
per capita (log)
(EXP
1
)
High: 0.69 (0.35)* 88 0.92 PHY
1
, URB
1
, GIN
1
Upper: 1.40
Lower: -0.22
Fragile Base: 0.60 (0.37) 95 0.91
Low: 0.54 (0.38) 95 0.91 GOV
1
, URB
1
Gini index
(GIN
1
)
High: 0.03 (0.04) 88 0.92 GOV
1
, EXP
1
Upper: 0.10
Lower: -0.07
Fragile
a
Base: 0.02 (0.03) 88 0.91
Low: 0.01 (0.04) 88 0.92 PHY
1
, URB
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
61
There are two possible explanations for the non-robustness of WAT
1
. First, the variable in
the regression equation could be collinear to another explanatory variable in the model thus
making its respective partial correlation to life expectancy difficult to isolate. Second, there
is not enough independent variation in the variable to explain life expectancy in an equation
that includes GDP per capita and female education. On the other hand, the coefficient
estimate of the African dummy variable (AFR) is robust in various specifications and
possesses a strong negative relationship with life expectancy.
Table 5.2 shows the EBA results for the Z-variables. None of them appear to be robust. As a
matter of fact, several coefficient estimates displayed the incorrect sign upon entering the
base regression and others are reported as insignificant. In particular, urbanization (URB
1
)
seems to have little effect on life expectancy based on its relatively lower coefficient
estimate in many specifications. Additionally, the number of physicians (PHY
1
) and the Gini
index (GIN
1
) failed to produce the correct coefficient sign in their respective base regressions.
5.3.2 EBA Results for Infant Mortality
Table 5.3 presents the EBA results for the free variables (I-variables) when the dependent
variable is infant mortality (IFM
1
). The coefficient estimate for the log of GDP per capita
(GDP
1
) is reported as fragile and the variable does not appear to have a robust partial
correlation with infant mortality. This is because the upper extreme bound and the lower
extreme bound do not share the same sign which in turn implies that the interval between
these two bounds may contain the value of zero. A value of zero in this context will denote
the absence of a relationship between GDP
1
and infant mortality. It must be noted, however,
that the relationship is highly significant at the 1% level and both extreme bounds are of the
same sign in the base regression when there are no Z-variables present.
Unlike in the case of life expectancy (LFE
1
), infant mortality shares a robust relationship with
both female education (EDU
1
) and access to safe water (WAT
1
). EDU
1
and WAT
1
are
consistently significant in all 31 regressions at the 1% level. The strong association gives
62
Table 5.3: EBA Results for the Free (I) Variables (Dependent Variable: Infant Mortality, IFM1)
Variable of
interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
1
)
High: -3.73 (2.90) 88 0.88 GOV
1
, EXP
1
, GIN
1
Upper: 2.07
Lower: -11.87
Fragile Base: -6.83 (1.56)*** 95 0.88
Low: -5.85 (3.01)* 95 0.88 GOV
1
, URB
1
, EXP
1
Female
education
(EDU
1
)
High: -2.37 (0.76)*** 95 0.88 GOV
1
, URB
1
, EXP
1
Upper: -0.84
Lower: -4.41
Robust Base: -2.53 (0.74)*** 95 0.88
Low: -2.84 (0.78)*** 95 0.88 PHY
1
Access to safe
water
(WAT
1
)
High: -0.37 (0.13)*** 95 0.89 EXP
1
Upper: -0.10
Lower: -0.70
Robust Base: -0.37 (0.13)*** 95 0.88
Low: -0.43 (0.13)*** 88 0.89 PHY
1
, GOV
1
, URB
1
, GIN
1
African dummy
variable
(AFR)
High: 34.07 (4.52)*** 88 0.88 PHY
1
, GOV
1
, GIN
1
Upper: 43.10
Lower: 20.24
Robust Base: 28.57 (4.09)*** 95 0.88
Low: 28.20 (3.98)*** 95 0.89 URB
1
, EXP
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The Z-Variables are the other variables of interest in the regression that produce the extreme bounds. PHY is the number of physicians, GOV is the governance
quality, URB is urbanization, EXP is health expenditure per capita and GIN is the Gini index.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
63
Table 5.4: EBA Results for the Z-variables (Dependent Variable Infant Mortality, IFM
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Number of
physicians
(PHY
1
)
High: 2.86 (1.79) 95 0.88 GOV
1
, EXP
1
Upper: 6.44
Lower: -3.94
Fragile
a
Base: 2.24 (1.64) 95 0.88
Low: -0.67 (1.64) 88 0.88 URB
1
, GIN
1
Governance quality
(GOV
1
)
High: -2.42 (2.59) 95 0.88 URB
1
, EXP
1
Upper: 2.76
Lower: -11.32
Fragile Base: -3.09 (2.53) 95 0.88
Low: -5.28 (3.02)* 88 0.88 PHY
1
, GIN
1
Urbanization
(URB
1
)
High: 0.11 (0.11) 88 0.88 PHY
1
, EXP
1
, GIN
1
Upper: 0.33
Lower: -0.17
Fragile
a
Base: 0.08 (0.10) 95 0.88
Low: 0.03 (0.10) 95 0.88 PHY
1
, GOV
1
Health expenditure
per capita
(EXP
1
)
High: -0.53 (3.05) 95 0.88 GOV
1
Upper: 5.57
Lower: -7.48
Fragile Base: -1.25 (3.08) 95 0.88
Low: -1.97 (2.76) 88 0.88 URB
1
, GIN
1
Gini index
(GIN
1
)
High: -0.29 (0.14)** 88 0.88 PHY
1
Upper: -0.01
Lower: -0.71
Robust
a
Base: -0.30 (0.13)** 88 0.88
Low: -0.39 (0.16)** 88 0.89 PHY
1
, GOV
1
, URB
1
, EXP
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
64
support to the empirical evidence found in the literature regarding the importance of
maternal education and the availability of water and nutrition to growing infants. Not
surprisingly, the African dummy variable (AFR) is also robust under the EBA test and adds
further evidence of the continents distinct and poor state of population health. On average,
over 80% of the cross-section variation in infant mortality is explained by the free variables.
Table 5.4 reports the EBA results for the Z-variables. All of the variables except the Gini
index (GIN
1
) are fragile. The coefficient estimates for both number of physicians (PHY
1
) and
urbanization (URB
1
) possess the incorrect sign in their respective base regressions. The non-
robustness of URB
1
and PHY
1
are plausibly related to their effect already being captured by
the variable GDP per capita. This is because it is arguable that countries with higher income
per capita tend to have more urbanized areas and a higher number of medical practitioners.
An interesting observation is the robustness of the variable GIN
1
. This evidence gives support
to the notion of a positive relationship between income inequality and population health as
discussed previously in section 4.4.
5.3.3 EBA Results for Under-5 Mortality
Table 5.5 shows the EBA results for the free variables (I-variables) when the dependent
variable is under-5 mortality (UFM
1
). Almost 90% of the cross-section variation in under-5
mortality is explained by the free variables. All but the log of GDP per capita (GDP
1
) report a
robust partial correlation with the health indicator. Neither the upper nor lower extreme
bound value for GDP
1
are statistically significant at the 5% level and in addition they have
different sign thus rendering the variables relationship with under-5 mortality fragile.
Female education (EDU
1
) and access to safe water (WAT
1
) are both significant at the 1% level
in all 31 regressions estimated and their extreme bounds possess the same sign hence
making them robust under the EBA. This observation again confirms that EDU
1
and WAT
1
are
strongly associated with the health conditions of infants and young children and implies the
important role played by mothers in the development of their offspring. On the other hand,
65
Table 5.5: EBA Results for the Free (I) Variables (Dependent Variable: Under-5 Mortality, UFM1)
Variable of
interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
1
)
High: -5.67 (4.45) 88 0.89 GOV
1
, EXP
1
, GIN
1
Upper: 3.24
Lower: -17.58
Fragile Base: -7.35 (2.51)*** 95 0.88
Low: -8.42 (4.58)* 88 0.89 GOV
1
, URB
1
, EXP
1
Female
education
(EDU
1
)
High: -4.05 (1.02)*** 88 0.89 GOV
1
, EXP
1
, GIN
1
Upper: -2.01
Lower: -7.42
Robust Base: -4.17 (1.02)*** 95 0.88
Low: -5.06 (1.18)*** 95 0.89 PHY
1
, GOV
1
Access to safe
water
(WAT
1
)
High: -0.80 (0.25)*** 95 0.88 GOV
1
, EXP
1
Upper: -0.29
Lower: -1.34
Robust Base: -0.80 (0.25)*** 95 0.88
Low: -0.87 (0.24)*** 88 0.89 GOV
1
, URB
1
, GIN
1
African dummy
variable
(AFR)
High: 64.45 (7.86)*** 88 0.89 PHY
1
, GOV
1
, GIN
1
Upper: 80.17
Lower: 40.10
Robust Base: 55.01 (6.98)*** 95 0.88
Low: 53.93 (6.91)*** 95 0.88 GOV
1
, URB
1
, EXP
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The Z-Variables are the other variables of interest in the regression that produce the extreme bounds. PHY is the number of physicians, GOV is the governance
quality, URB is urbanization, EXP is health expenditure per capita and GIN is the Gini index.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
66
Table 5.6: EBA Results for the Z-variables (Dependent Variable Under-5 Mortality, UFM
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Number of
physicians
(PHY
1
)
High: 3.40 (2.91) 88 0.89 GOV
1
, EXP
1
, GIN
1
Upper: 9.22
Lower: -3.48
Fragile
a
Base: 6.46 (2.61)** 95 0.89
Low: 2.46 (2.97) 88 0.89 URB
1
, GIN
1
Governance quality
(GOV
1
)
High: 2.60 (4.29) 95 0.88 URB
1
, EXP
1
Upper: 11.17
Lower: -12.50
Fragile
a
Base: 1.06 (4.05) 95 0.88
Low: -2.33 (5.08) 88 0.89 PHY
1
, GIN
1
Urbanization
(URB
1
)
High: 0.14 (0.15) 95 0.88 GOV
1
, EXP
1
Upper: 0.45
Lower: -0.25
Fragile
a
Base: 0.12 (0.15) 95 0.88
Low: 0.04 (0.15) 95 0.89 PHY
1
, EXP
1
Health expenditure
per capita
(EXP
1
)
High: -1.13 (4.00) 95 0.88 URB
1
Upper: 6.87
Lower: -12.65
Fragile Base: -1.08 (4.07) 95 0.90
Low: -2.79 (4.93) 95 0.89 PHY
1
, GOV
1
Gini index
(GIN
1
)
High: -0.43 (0.24)* 88 0.89 PHY
1
Upper: 0.05
Lower: -1.09
Fragile
a
Base: -0.50 (0.22)** 88 0.89
Low: -0.49 (0.30) 88 0.89 PHY
1
GOV
1
, URB
1
, EXP
1
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
67
the partial correlation between under-5 mortality and the African dummy variable (AFR)
is robust as evident by the significance of the latter in all specifications.
Table 5.6 presents the EBA results for the Z-variables. Since infant mortality and under-5
mortality are closely linked in terms of the health aspect they measure, most of their EBA
results are identical. Not surprisingly, none of the variables of interest recorded a robust
partial correlation with under-5 mortality. The fragility of these relationships is even
more striking when one considers that four out of the five Z-variables failed to produce
the theoretically predicted sign in their respective base regressions.
On average, the R-squared value remains approximately the same before and after
including the Z-variables. The inclusion of the variables number of physicians (PHY
1
),
governance quality (GOV
1
), urbanization (URB
1
), health expenditure per capita (EXP
1
),
and Gini index (GIN
1
) did not add any significant explanatory power to the model.
5.4 A Reasonable Extreme Bounds Analysis
It has been debated in the literature that the extreme bounds analysis (EBA) proposed by
Leamer (1983, 1985) and Levine and Renelt (1992) uses very restrictive criterion to label
a variable robust. Granger and Uhlig (1990) suggest a more reasonable variant of the EBA
by relaxing the criterion through the elimination of models with poor goodness of fit as
measured by the R
2
. Consequently, these models will be irrelevant and excluded from the
calculations of the upper and lower extreme bounds. The condition for a model under
this procedure to be included in the EBA calculation is presented in equation (4)
(4) R
0
2
|(1 0)R
mux
2
+ 0R
mn
2
]
where 0 < 0 < 1, such that if 0=0 then the extreme bounds will be derived from the
model with the highest value of R
2
(R
mux
2
) and if 0=1 then the extreme bounds will be
determined from all models including the one with the lowest value of R
2
(R
mn
2
).
Effectively, any other value of 0 denotes that the extreme bounds are calculated from
the models with R
2
in the top 0 percent of the (R
mux
2
0R
mn
2
) range.
68
Since the EBA results have not been encouraging so far, the thesis performs the
reasonable extreme bounds analysis (REBA) on equation (3) for all three health indicators
to examine whether the non-robustness of the Z-variables are due to the stringent
criterion used in the original EBA. Different arbitrary values of 0 are used to identify
models with R
2
in the top 75, 50, 10 and 5 percent of the (R
mux
2
0R
mn
2
) range.
Accordingly, in the context of this investigation, 0 takes the value of 0.75, 0.5, 0.1 and
0.05. In addition, the estimation where 0 equals to zero is also included to examine the
robustness of the variable of interest (m) based on the model with the highest R
2
.
The REBA test reports no changes in the original results obtained earlier (refer to Table
B1 to B3 in the Appendix). Table 5.7 presents a summary of both the original EBA and
REBA results.
Table 5.7: Summary of the Original EBA and REBA Results
Variable of interest
(m)
Dependent variable
Life expectancy
(LFE
1
)
Infant mortality
(IFM
1
)
Under-5 mortality
(UFM
1
)
Original
EBA
REBA
Original
EBA
REBA
Original
EBA
REBA
GDP per capita
(GDP
1
)
Robust Robust Fragile Fragile Fragile Fragile
Female education
(EDU
1
)
Robust Robust Robust Robust Robust Robust
Access to safe water
(WAT
1
)
Fragile Fragile Robust Robust Robust Robust
African dummy variable
(AFR)
Robust Robust Robust Robust Robust Robust
Number of physicians
(PHY
1
)
Fragile Fragile Fragile Fragile Fragile Fragile
Governance quality
(GOV
1
)
Fragile Fragile Fragile Fragile Fragile Fragile
Urbanization
(URB
1
)
Fragile Fragile Fragile Fragile Fragile Fragile
69
Health expenditure per
capita (EXP
1
)
Fragile Fragile Fragile Fragile Fragile Fragile
Gini index
(GIN
1
)
Fragile Fragile Robust Robust Fragile Fragile
5.5 A Further Sensitivity Test
In addition to the above, a test is also performed using the lag interval data previously
discussed in section 4.3. Generally, this data represents the past values of the potential
population health determinants. Equation (3) is estimated again for each (current value)
health indicator by using past values of the explanatory variables as regressors. For
instance, the EBA test for life expectancy (LFE
1
) is repeated but now with GDP
0
, EDU
0
and
other lag variables on the right hand side of the equation. The objective of this exercise is
to investigate the robustness of the statistical relationship between a lag potential
determinant and population health.
Several explanatory variables, however, have been excluded from the equation due to
unavailability of data. They are access to safe water and health expenditure per capita.
The exclusion of these two variables means they are no longer examinable as variable of
interest. Although there are no past data available for governance quality, it is still
included in the equation as a lag variable because it can be argued that the quality of
institutions pertaining to governance is most likely to be relatively stable over time in the
sample period of 48 years (1960 to 2008).
In this scenario, the I-variables are GDP per capita (GDP
0
), female education (EDU
0
) and
the African dummy variable (AFR) while the Z-variables are the number of physicians
(PHY
0
), governance quality (GOV
1
), urbanization (URB
0
) and Gini index (GIN
0
). In order to
economize space, the full EBA results is not shown here (refer to Tables C1 to C3 in the
Appendix for complete results). A summary of the robustness of potential population
health determinants across the three health indicators is presented in Table 5.8. In the
life expectancy and infant mortality models, all of the variables of interest are not robust
except AFR and GOV
1
. Likewise, there are no robust variables in the under-5 mortality
70
model except AFR. This rather negative result seems to suggest that the associative
relationship between a lag potential determinant and population health is largely fragile.
Table 5.8: Summary of EBA Results for the Lag Variables of Interest
Dependent variable
Variable of interest (m)
Life expectancy
(LFE
1
)
Infant mortality
(IFM
1
)
Under-5 mortality
(UFM
1
)
GDP per capita (log) (GDP
0
) Fragile Fragile Fragile
Female education (EDU
0
) Fragile Fragile Fragile
African dummy variable (AFR) Robust Robust Robust
Number of physicians (PHY
0
) Fragile
a
Fragile
a
Fragile
a
Governance quality (GOV
1
) Robust Robust Fragile
Urbanization (URB
0
) Fragile Fragile
a
Fragile
a
Gini index (GIN
0
) Fragile
a
Fragile
a
Fragile
a
a
The variable of interest (m) entered the base regression with the incorrect sign.
5.6 Summary
The extreme bounds analysis (EBA) has been developed as a procedure to examine the
sensitivity of empirical conclusions to changes in the model specification. EBA functions
by testing the statistical significance of the variable of interest in regressions with
different combinations of explanatory variables. Even though the procedure is often
criticized for its stringent results and biased estimations due to multicollinearity, there
were attempts made to reduce these problems by relaxing the criterion required to
classify a partial correlation between the dependent variable and the variable of interest
as robust.
Accordingly, the EBA test was then performed on the basic health regression model for
three different health indicators. Only the log of GDP per capita and the African dummy
variable are robust with respect to their partial correlation with life expectancy. On the
other hand, the EBA test for both infant mortality and under-5 mortality returned fairly
similar results with only female education, access to safe water and the African dummy
71
variable being robust. The variable Gini index (representing income inequality) is found to
be robust only when the dependent variable is infant mortality.
The test criterion is further relaxed with the application of the reasonable extreme
bounds analysis (REBA) but the main conclusion remains unchanged. In addition, the EBA
test for the lag variables of interest reported mostly fragile results. The overall
implication of these tests seem to suggest that life expectancy, infant mortality and
under-5 mortality are explained by a very small set of key variables such as income per
capita, female education, access to safe water and the African dummy variable. However,
it must be stressed again that the overall objective here is to evaluate the statistical
significance of potential population health determinants across different specifications.
The EBA carried out above is a test for robustness and does not seek to establish
evidence of causal effects.
72
Chapter 6: Conclusion
The thesis attempts to examine the robustness of findings regarding population health
determinants in the literature. The theoretical literature reveals three main themes in
population health which are namely nutrition, income and inequality. Nutrition and food
supply played an important part in improving life expectancy during the nineteenth
century by increasing the nutritional intake of households and reducing hunger and
susceptibility to diseases. Of particular interest is the relationship between income and
nutrition as explained in the nutritional/efficiency wage model. The role of income then
became more prominent in the twentieth century and is explored in many studies
including the seminal work of Preston (1975) who reports the cross-country positive
relationship between life expectancy and income per capita and subsequently
popularized the income hypothesis. Contemporary health studies introduce the issue of
income inequality in the determination of health and argue that a less egalitarian society
is more likely to have poor population health. Nevertheless, this argument has also
drawn many criticisms as evident in several studies disapproving the relationship
between income inequality and several health outcomes.
The empirical literature provides further evidence of population health determinants.
Fairly popular determinants such as income per capita, female education and access to
safe water have often been cited in health regression studies as explanatory variables or
control variables. Most of the variables related to population health can be grouped into
areas of economic development, human capital, health resources and governance.
Economic development variables such as income per capita and income inequality are
found to be significantly related to various population health measures in quite a number
of studies. In addition, human capital in the form of education is also positively
correlated with population health. Female education, in particular, is claimed to have a
strong effect in lowering mortality rates of infants and children. Advocates of the
education hypothesis further suggest that education has a positive effect on health which
is independent of income.
73
Health resources variables yield potential health determinants in access to safe water,
the number of physicians and health expenditure. The availability of health-related
personnel such as doctors, nurses and midwives enables households to receive prompt
and proper health care and treatment. Access to safe water or clean water supply is
necessary to reduce the virulence of diseases by improving sanitation processes while
health expenditure seeks to make health care services more accessible to the public.
Additionally, the quality of governance as a potential population health determinant is
proposed in several studies investigating the relationship between democracy and health.
They argue that elements of bureaucracy and corruption in less democratic countries
tend to hinder the provision of public goods such as health care and this in turn will have
a detrimental effect on population health.
In view of the many existing variables claiming to be population health determinants,
there exists a need to examine their robustness across different model specifications.
Robustness is important because it determines the reliability of empirical results
obtained. The scarcity of rigorous sensitivity analyses in the empirical health literature
provides further motivation for this research. The thesis employs an empirical technique
called the extreme bounds analysis (EBA) to examine whether these variables are
statistically robust. Life expectancy, infant mortality and under-5 mortality are used to
represent population health in a basic health regression model. Preliminary estimations
report a good fit between the data and the basic model using the explanatory variables
income per capita, female education, access to safe water and the African dummy
variable. Further diagnostic tests confirm that these variables should be included in the
EBA as the always included or free variables.
The extreme bounds analysis reveals very few robust or consistently significant
variables that are associated with population health. For example, only income per capita
and the African dummy variable are classified as robust under the EBA in a model where
the dependent variable is life expectancy. Likewise, in the infant mortality and under-5
mortality models, only female education, access to safe water and the African dummy
variable are robust while the rest of the explanatory variables are reported fragile. One
interesting exception is the Gini index (a proxy for income inequality) which is robust only
74
when the dependent variable is infant mortality. This research provided further
evidential support for the findings of other studies that claim these robust variables as
population health determinants.
In hindsight, however, one should not completely dismiss the importance of the fragile
variables reported by the EBA. The EBA is not a test of causality but rather a test of
robustness. A variable may still be a causal factor even though it is reported fragile under
the procedure. Nevertheless, the overall implication that can be gathered from this
research is that population health can be explained by a small set of key variables. In
addition, the significance of the African dummy variable implies that further efforts
should be made to assist sub-Saharan African countries to improve their population
health. Further research is possible in exploring the issue of causality and establishing
evidence of causal effects for the variables discussed above.
While it is acknowledged that the extreme bounds analysis used in this thesis has some
shortcomings, it remains a relatively good procedure to use in the context of population
health because it eliminates the need for arbitrary selection of results to report.
Essentially, the results presented here are impartial and are not dependent upon the
empirical outcomes obtained regardless whether they are favourable or not. In a way,
the EBA encourages the adoption of an unbiased approach towards investigating
population health determinants that are relevant in the literature. Therefore, this
research should be considered as a complement to other health determinant studies
which are often preconceived and ideologically driven.
75
References
ADELMAN, I. 1963. An econometric analysis of population growth. American Economic
Review, 53(3), 314-339.
ANAND, S. & BRNIGHAUSEN, T. 2004. Human resources and health outcomes: cross-
country econometric study. The Lancet, 364(9445), 1603-1609.
ANAND, S. & RAVALLION, M. 1993. Human development in poor countries: on the role of
private incomes and public services. Journal of Economic Perspectives, 7(1), 133-
150.
ARROW, K. J. 1963. Social choice and individual values. 2nd ed., New York: Wiley.
BARRO, R. J. & LEE, J.-W. 2010. A new data set of educational attainment. NBER Working
Paper No. 15902. National Bureau of Economic Research, Inc.
BAXTER, M. & KOUPARITSAS, M. A. 2005. Determinants of business cycle comovement: a
robust analysis. Journal of Monetary Economics, 52(1), 113-157.
BESLEY, T. & KUDAMATSU, M. 2006. Health and democracy. American Economic Review,
96(2), 313.
BLISS, C. & STERN, N. 1978a. Productivity, wages and nutrition: part I: the theory. Journal
of Development Economics, 5(4), 331-362.
BLISS, C. & STERN, N. 1978b. Productivity, wages and nutrition: part II: some observations.
Journal of Development Economics, 5(4), 363-398.
CAIN, L. P. & ROTELLA, E. J. 2001. Death and spending: urban mortality and municipal
expenditure on sanitation. Annales de Demographie Historique, 1139-154.
CALDWELL, J. C. 1979. Education as a factor in mortality decline an examination of
Nigerian data. Population Studies, 33(3), 395-413.
CALDWELL, J. C. 1986. Routes to low mortality in poor countries. Population and
Development Review, 12(2), 171-220.
CALDWELL, J. C. 1990. Cultural and social factors influencing mortality levels in
developing countries. Annals of the American Academy of Political and Social
Science, 510(July 1990), 44-59.
CHRISTENSON, B. A. & JOHNSON, N. E. 1995. Educational inequality in adult mortality: an
assessment with death certificate data from Michigan. Demography, 32(2), 215-
229.
CRMIEUX, P. Y., OUELLETTE, P. & PILON, C. 1999. Health care spending as determinants
of health outcomes. Health Economics, 8(7), 627-639.
CUTLER, D. & LLERAS-MUNEY, A. 2006. Education and health: evaluating theories and
evidence. NBER Working Paper No. 12352. National Bureau of Economic Research,
Inc.
CUTLER, D. & MILLER, G. 2005. The role of public health improvements in health
advances: the twentieth-century United States. Demography, 42(1), 1-22.
76
DEATON, A. 2003. Health, inequality, and economic development. Journal of Economic
Literature, 41(1), 113-158.
DEATON, A. 2009. Aging, religion, and health. NBER Working Paper No. 15271. National
Bureau of Economic Research, Inc.
DEATON, A., JACK, W. & BURTLESS, G. 2004. Health in an age of globalization. Brookings
Trade Forum, 83-130.
DEATON, A. & PAXSON, C. 1999. Mortality, education, income, and inequality among
American cohorts. NBER Working Paper No. 7140. National Bureau of Economic
Research, Inc.
DULEEP, H. O. 1995. Mortality and income inequality among economically developed
countries. Social Security Bulletin, 58(2), 34-50.
EASTERLIN, R. A. 1999. How beneficent is the market? A look at the modern history of
mortality. European Review of Economic History, 3(03), 257-294.
ELO, I. T. & PRESTON, S. H. 1996. Educational dfferentials in mortality: United States,
1979-1985. Social Science & Medicine, 42(1), 47-57.
FARAHANI, M., SUBRAMANIAN, S. V. & CANNING, D. 2009. Short and long-term
relationship between physician density on infant mortality: a longitudinal
econometric analysis. PGDA Working Papers No. 49. Program on the Global
Demography of Aging.
FILMER, D. & PRITCHETT, L. 1999. The impact of public spending on health: does money
matter? Social Science & Medicine, 49(10), 1309-1323.
FLEGG, A. T. 1982. Inequality of income, illiteracy and medical care as determinants of
infant mortality in underdeveloped countries. Population Studies, 36(3), 441-458.
FOGEL, R. W. 1986. Nutrition and the decline in mortality since 1700: some additional
preliminary findings. NBER Working Paper No. 1802. National Bureau of Economic
Research, Inc.
FOGEL, R. W. 1994. Economic growth, population theory, and physiology: the bearing of
long-term processes on the making of economic policy. American Economic
Review, 84(3), 369-395.
FOGEL, R. W., MARK, R. R. & ODED, S. 1997. New findings on secular trends in nutrition
and mortality: some implications for population theory. Handbook of Population
and Family Economics. Elsevier.
FRANCO, ., LVAREZ-DARDET, C. & RUIZ, M. T. 2004. Effect of democracy on health:
ecological study. British Medical Journal, 329(7480), 1421-1423.
GOVINDARAJ, R. & RANNAN-ELIYA, R. 1994. Democracy, communism and health status: a
cross-national study. Data for Decision Making Project Working Papers No. 7.2.
Harvard University.
GRANGER, C. W. J. & UHLIG, H. F. 1990. Reasonable extreme-bounds analysis. Journal of
Econometrics, 44(1-2), 159-170.
GROSSMAN, M. 1972. On the concept of health capital and the demand for health.
Journal of Political Economy, 80(2), 223-255.
77
HARPHAM, T. 1994. Urbanization and mental health in developing countries: a research
role for social scientists, public health professionals and social psychiatrists. Social
Science & Medicine, 39(2), 233-245.
HILL, R. C., GRIFFITHS, W. E. & LIM, G. C. 2008. Principles of econometrics. 3rd ed.,
Hoboken, NJ: Wiley.
HOBCRAFT, J. N., MCDONALD, J. W. & RUTSTEIN, S. O. 1984. Socio-economic factors in
infant and child mortality: a cross-national comparison. Population Studies, 38(2),
193-223.
JUDGE, K., MULLIGAN, J.-A. & BENZEVAL, M. 1998. Income inequality and population
health. Social Science & Medicine, 46(4-5), 567-579.
KAKWANI, N. 1993. Performance in living standards: an international comparison. Journal
of Development Economics, 41(2), 307-336.
KAPLAN, G. A., PAMUK, E. R., LYNCH, J. W., COHEN, R. D. & BALFOUR, J. L. 1996.
Inequality in income and mortality in the United States: analysis of mortality and
potential pathways. British Medical Journal, 312(7037), 999-1003.
KENKEL, D. S. 1991. Health behavior, health knowledge, and schooling. Journal of Political
Economy, 99(2), 287-305.
KENNEDY, B. P., KAWACHI, I. & PROTHROW-STITH, D. 1996. Income distribution and
mortality: cross sectional ecological study of the Robin Hood index in the United
States. British Medical Journal, 312(7037), 1004-1007.
KLOMP, J. & DE HAAN, J. 2009. Is the political system really related to health? Social
Science & Medicine, 69(1), 36-46.
KNOWLES, S. & OWEN, P. D. 2008. Which institutions are good for your health? The deep
determinants of ccomparative cross-country health status. University of Otago
Economics Discussion Papers No. 0811. University of Otago.
LEAMER, E. 1978. Specification searches: ad hoc inference with nonexperimental data.
New York: Wiley.
LEAMER, E. & LEONARD, H. 1983. Reporting the fragility of regression estimates. The
Review of Economics and Statistics, 65(2), 306-317.
LEAMER, E. E. 1983. Let's take the con out of econometrics. American Economic Review,
73(1), 31-43.
LEAMER, E. E. 1985. Sensitivity analyses would help. American Economic Review, 75(3),
308-313.
LEIBENSTEIN, H. 1957. Economic backwardness and economic growth: studies in the
theory of economic development. New York: Wiley.
LEVINE, R. & RENELT, D. 1992. A sensitivity analysis of cross-country growth regressions.
The American Economic Review, 82(4), 942-963.
LEVKOFF, S. E., MACARTHUR, I. W. & BUCKNALL, J. 1995. Elderly mental health in the
developing world. Social Science & Medicine, 41(7), 983-1003.
78
LLERAS-MUNEY, A. 2005. The relationship between education and adult mortality in the
United States. Review of Economic Studies, 72(1), 189-221.
LYNCH, J., SMITH, G. D., HARPER, S., HILLEMEIER, M., ROSS, N., KAPLAN, G. A. &
WOLFSON, M. 2004. Is income inequality a determinant of population health?
Part 1. A systematic review. The Milbank Quarterly, 82(1), 5-99.
MACINKO, J. A., SHI, L. & STARFIELD, B. 2004. Wage inequality, the health system, and
infant mortality in wealthy industrialized countries, 1970-1996. Social Science &
Medicine, 58(2), 279-292.
MARMOT, M. 2002. The influence of income on health: views of an epidemiologist.
Health Affairs, 21(2), 31-46.
MARTIN, S., RICE, N. & SMITH, P. C. 2008. Does health care spending improve health
outcomes? Evidence from English Programme Budgeting data. Journal of Health
Economics, 27(4), 826-842.
MCALEER, M., PAGAN, A. R. & VOLKER, P. A. 1985. What will take the con out of
econometrics? American Economic Review, 75(3), 293-307.
MCGUIRE, J. W. 2006. Basic health care provision and under-5 mortality: a cross-national
study of developing countries. World Development, 34(3), 405-425.
MCISAAC, S. J. & WILKINSON, R. G. 1997. Income distribution and cause-specific mortality.
European Journal of Public Health, 7(1), 45-53.
MCKEOWN, T. & BROWN, R. G. 1955. Medical evidence related to English population
changes in the eighteenth century. Population Studies, 9(2), 119-141.
MCKEOWN, T., BROWN, R. G. & RECORD, R. G. 1972. An interpretation of the modern
rise of population in Europe. Population Studies, 26(3), 345-382.
MCKEOWN, T. & RECORD, R. G. 1962. Reasons for the decline of mortality in England and
Wales during the nineteenth century. Population Studies, 16(2), 94-122.
MELLOR, J. M. & MILYO, J. 2001. Reexamining the evidence of an ecological association
between income inequality and health. Journal of Health Politics, Policy and Law,
26(3), 487-522.
MOOSA, I. A. & CARDAK, B. A. 2006. The determinants of foreign direct investment: an
extreme bounds analysis. Journal of Multinational Financial Management, 16(2),
199-211.
MULLER, A. 2002. Education, income inequality, and mortality: a multiple regression
analysis. British Medical Journal, 324(7328), 23.
NGUYEN, T. 2007. Determinants of business cycle synchronization in East Asia: an
extreme bound analysis. Working Papers No. 14. Development and Policies
Research Center (DEPOCEN), Vietnam.
NIXON, J. & ULMANN, P. 2006. The relationship between health care expenditure and
health outcomes. The European Journal of Health Economics, 7(1), 7-18.
NORTH, D. C. 1990. Institutions, institutional change, and economic performance.
Melbourne: Cambridge University Press.
79
OMRAN, A. R. 1971. The epidemiologic transition: a theory of the epidemiology of
population change. The Milbank Memorial Fund Quarterly, 49(4), 509-538.
OR, Z., WANG, J. & JAMISON, D. 2005. International differences in the impact of doctors
on health: a multilevel analysis of OECD countries. Journal of Health Economics,
24(3), 531-560.
PAMPEL, F. C., JR. & PILLAI, V. K. 1986. Patterns and determinants of infant mortality in
developed nations, 1950-1975. Demography, 23(4), 525-542.
PRESTON, S. H. 1975. The changing relation between mortality and level of economic
development. Population Studies, 29(2), 231-248.
PRESTON, S. H. 1977. Mortality trends. Annual Review of Sociology, 3(1997), 163-178.
PRITCHETT, L. & SUMMERS, L. H. 1996. Wealthier is healthier. The Journal of Human
Resources, 31(4), 841-868.
RAM, R. 2006. Further examination of the cross-country association between income
inequality and population health. Social Science & Medicine, 62(3), 779-791.
ROBINSON, J. & WHARRAD, H. 2000. Invisible nursing: exploring health outcomes at a
global level. Relationships between infant and under-5 mortality rates and the
distribution of health professionals, GNP per capita, and female literacy. Journal
of Advanced Nursing, 32(1), 28-40.
RODGERS, G. B. 1979. Income and inequality as determinants of mortality: an
international cross-section analysis. Population Studies, 33(2), 343-351.
SALA-I-MARTIN, X. 1997a. I just ran four million regressions. NBER Working Paper No.
6252. National Bureau of Economic Research, Inc.
SALA-I-MARTIN, X. 1997b. I just ran two million regressions. American Economic Review,
87(2), 178-183.
SCHULTZ 2005. Productive benefits of health: evidence from low-income countries.
Economic Growth Center Discussion Paper No. 903. Yale University.
SEN, A. 1984. The living standard. Oxford Economic Papers, 36(1984), Supplement, 74-90.
SEN, A. 1995. Mortality as an indicator of economic success and failure. Innocenti
Lectures. UNICEF Innocenti Research Centre.
SEN, A., MUELLBAUER, J. & HAWTHORN, G. 1987. The standard of living. New York:
Cambridge University Press.
SHANDRA, J. M., NOBLES, J., LONDON, B. & WILLIAMSON, J. B. 2004. Dependency,
democracy, and infant mortality: a quantitative, cross-national analysis of less
developed countries. Social Science & Medicine, 59(2), 321-333.
SMITH, J. P. 1999. Healthy bodies and thick wallets: the dual relation between health and
economic status. Journal of Economic Perspectives, 13(2), 145-166.
STECKEL, R. H. 1995. Stature and the standard of living. Journal of Economic Literature,
33(4), 1903-1940.
STRAUSS, J. & THOMAS, D. 1998. Health, nutrition, and economic development. Journal
of Economic Literature, 36(2), 766-817.
80
SUBBARAO, K. & RANEY, L. 1995. Social gains from female education: a cross-national
study. Economic Development and Cultural Change, 44(1), 105-128.
SUBRAMANIAN, S. V., BELLI, P. & KAWACHI, I. 2002. The macroeconomic determinants of
health. Annual Review of Public Health, 23(1), 287-302.
SZRETER, S. 2002. Rethinking McKeown: the relationship between public health and
social change. American Journal of Public Health, 92(5), 722-725.
THEODOSSIOU, I. & ZANGELIDIS, A. 2009. The social gradient in health: the effect of
absolute income and subjective social status assessment on the individual's
health in Europe. Economics & Human Biology, 7(2), 229-237.
VON DEM KNESEBECK, O., VERDE, P. E. & DRAGANO, N. 2006. Education and health in 22
European countries. Social Science & Medicine, 63(5), 1344-1351.
WALDMANN, R. J. 1992. Income distribution and infant mortality. Quarterly Journal of
Economics, 107(4), 1283-1302.
WENNEMO, I. 1993. Infant mortality, public policy and inequality: a comparison of 18
industrialised countries 1950-85. Sociology of Health & Illness, 15(4), 429-446.
WILKINSON, R. G. 1992. Income distribution and life expectancy. British Medical Journal,
304(6820), 165-168.
WILKINSON, R. G. 1994. The epidemiological transition: from material scarcity to social
disadvantage? Daedalus, 123(4), 61.
WILKINSON, R. G. 1997. Socioeconomic determinants of health: health inequalities:
relative or absolute material standards? British Medical Journal, 314(7080), 591.
WILKINSON, R. G. & PICKETT, K. E. 2006. Income inequality and population health: a
review and explanation of the evidence. Social Science & Medicine, 62(7), 1768-
1784.
WOLFE, B. L. & BEHRMAN, J. R. 1982. Determinants of child mortality, health, and
nutrition in a developing country. Journal of Development Economics, 11(2), 163-
193.
WORLD BANK 1991. Managing development: the governance dimension. Working Paper
No. 34899. The World Bank.
WORLD BANK 2010. World development indicators 2010. Washington D.C.: The World
Bank.
81
Appendix
List of Countries
Albania
Algeria
Argentina
Australia
Austria
Bangladesh
Belize
Benin
Bolivia
Botswana
Brazil
Bulgaria
Burundi
Cameroon
Canada
Central African Republic
Chile
China
Colombia
Congo, Democratic
Republic
Congo, Republic
Costa Rica
Cote dIvoire
Denmark
Dominican Republic
Ecuador
Egypt, Arab Republic
El Salvador
Estonia
Finland
Gabon
Gambia, The
Germany
Ghana
Greece
Guatemala
Guyana
Honduras
Hungary
India
Indonesia
Iran, Islamic Republic
Jamaica
Japan
Jordan
Kenya
Lao PDR
Latvia
Lesotho
Liberia
Malawi
Malaysia
Mali
Mauritania
Mexico
Mongolia
Morocco
Mozambique
Namibia
Nepal
Netherlands
Nicaragua
Niger
Pakistan
Panama
Papua New Guinea
Paraguay
Peru
Philippines
Portugal
Romania
Rwanda
Senegal
Sierra Leone
Singapore
Slovak Republic
South Africa
Spain
Sri Lanka
Swaziland
Sweden
Switzerland
Tajikistan
Thailand
Togo
Trinidad and Tobago
Tunisia
Turkey
Uganda
United States
Uruguay
Venezuela, RB
Vietnam
Zambia
Zimbabwe
82
Table A1: Standardized Beta Coefficients (without AFR)
Dependent variable
Life expectancy
(LFE
1
)
Infant mortality
(IFM
1
)
Under-5 mortality
(UFM
1
)
GDP per capita (log) (GDP
1
) 0.39 -0.29 -0.19
Gini index (GIN
1
) -0.11 -0.00 0.02
Female education (EDU
1
) 0.20 -0.31 -0.30
Access to safe water (WAT
1
) 0.31 -0.37 -0.45
83
Table B1: REBA Results (Dependent Variable: Life Expectancy, LFE
1
)
GDP per capita (log) (GDP
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 3.50 3.50 3.04 3.10 3.10
Lower extreme bound ([
Iow
2o
Iow
) 0.38 0.38 0.38 0.38 0.38
Robust/Fragile Robust Robust Robust Robust Robust
Female education (EDU
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.79 0.77 0.76 0.75 0.75
Lower extreme bound ([
Iow
2o
Iow
) 0.03 0.03 0.03 0.03 0.03
Robust/Fragile Robust Robust Robust Robust Robust
Access to safe water (WAT
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.14 0.14 0.13 0.13 0.13
Lower extreme bound ([
Iow
2o
Iow
) -0.02 -0.02 -0.01 -0.01 -0.01
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
African dummy variable (AFR) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -8.47 -8.52 -8.49 -8.55 -8.58
Lower extreme bound ([
Iow
2o
Iow
) -13.91 -13.91 -13.80 -13.80 -13.80
Robust/Fragile Robust Robust Robust Robust Robust
84
Table B1: REBA Results (Dependent Variable: Life Expectancy, LFE
1
) (continued)
Number of physicians (PHY
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.95 0.95 0.74 0.73 0.73
Lower extreme bound ([
Iow
2o
Iow
) -1.22 -1.22 -1.18 -1.18 -1.18
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Governance quality (GOV
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 2.21 2.21 2.00 2.00 2.00
Lower extreme bound ([
Iow
2o
Iow
) -1.08 -1.09 -1.01 -1.01 -0.99
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Urbanization (URB
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.09 0.09 0.08 0.08 0.08
Lower extreme bound ([
Iow
2o
Iow
) -0.03 -0.03 -0.03 -0.03 -0.02
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Health expenditure per capita (log)
(EXP
1
)
0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 1.40 1.40 1.40 1.32 1.32
Lower extreme bound ([
Iow
2o
Iow
) -0.22 -0.13 -0.13 -0.13 -0.13
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Gini index (GIN
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.10 0.10 0.10 0.10 0.10
Lower extreme bound ([
Iow
2o
Iow
) -0.07 -0.06 -0.06 -0.06 -0.06
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
85
Table B2: REBA Results (Dependent Variable: Infant Mortality, IFM
1
)
GDP per capita (log) (GDP
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 2.07 2.07 1.60 1.60 1.70
Lower extreme bound ([
Iow
2o
Iow
) -11.87 -10.83 -10.73 -10.73 -10.73
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Female education (EDU
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -0.84 -0.99 -0.87 -0.87 -0.87
Lower extreme bound ([
Iow
2o
Iow
) -4.37 -4.30 -3.95 -3.95 -4.04
Robust/Fragile Robust Robust Robust Robust Robust
Access to safe water (WAT
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -0.12 -0.12 -0.16 -0.16 -0.16
Lower extreme bound ([
Iow
2o
Iow
) -0.70 -0.70 -0.70 -0.70 -0.70
Robust/Fragile Robust Robust Robust Robust Robust
African dummy variable (AFR) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 43.10 43.10 42.66 42.66 42.33
Lower extreme bound ([
Iow
2o
Iow
) 21.01 22.31 25.02 25.02 25.02
Robust/Fragile Robust Robust Robust Robust Robust
86
Table B2: REBA Results (Dependent Variable: Infant Mortality, IFM
1
) (continued)
Number of physicians (PHY
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 6.44 3.88 3.24 3.24 3.24
Lower extreme bound ([
Iow
2o
Iow
) -3.94 -3.87 -3.57 -3.57 -3.59
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Governance quality (GOV
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 1.64 1.28 1.28 1.28 1.28
Lower extreme bound ([
Iow
2o
Iow
) -11.32 -11.32 -11.08 -11.08 -10.34
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Urbanization (URB
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.33 0.33 0.30 0.30 0.30
Lower extreme bound ([
Iow
2o
Iow
) -0.17 -0.13 -0.13 -0.13 -0.13
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Health expenditure per capita (log)
(EXP
1
)
0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 4.52 4.52 4.31 4.31 4.31
Lower extreme bound ([
Iow
2o
Iow
) -7.48 -7.48 -6.32 -6.32 -6.37
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Gini index (GIN
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -0.04 -0.04 -0.07 -0.07 -0.06
Lower extreme bound ([
Iow
2o
Iow
) -0.71 -0.71 -0.71 -0.71 -0.71
Robust/Fragile Robust Robust Robust Robust Robust
87
Table B3: REBA Results (Dependent Variable: Under-5 Mortality, UFM
1
)
GDP per capita (log) (GDP
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 3.24 3.24 3.15 3.15 2.98
Lower extreme bound ([
Iow
2o
Iow
) -14.88 -14.88 -14.88 -15.12 -16.27
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Female education (EDU
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -2.01 -2.01 -2.03 -2.02 -2.02
Lower extreme bound ([
Iow
2o
Iow
) -7.41 -7.41 -6.92 -6.93 -6.87
Robust/Fragile Robust Robust Robust Robust Robust
Access to safe water (WAT
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) -0.33 -0.33 -0.35 -0.36 -0.38
Lower extreme bound ([
Iow
2o
Iow
) -1.34 -1.34 -1.34 -1.34 -1.34
Robust/Fragile Robust Robust Robust Robust Robust
African dummy variable (AFR) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 80.17 80.17 80.17 79.62 79.24
Lower extreme bound ([
Iow
2o
Iow
) 44.56 44.56 47.75 48.42 48.30
Robust/Fragile Robust Robust Robust Robust Robust
88
Table B3: REBA Results (Dependent Variable: Under-5 Mortality, UFM
1
)
(continued)
Number of physicians (PHY
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 9.23 9.23 8.76 8.76 8.76
Lower extreme bound ([
Iow
2o
Iow
) -3.48 -3.48 -3.33 -3.33 -3.20
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Governance quality (GOV
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 8.07 8.07 9.28 8.48 8.48
Lower extreme bound ([
Iow
2o
Iow
) -12.50 -12.50 -12.50 -11.01 -10.78
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Urbanization (URB
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.46 0.46 0.43 0.43 0.43
Lower extreme bound ([
Iow
2o
Iow
) -0.25 -0.25 -0.26 -0.25 -0.25
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Health expenditure per capita (log)
(EXP
1
)
0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 5.77 5.77 5.39 6.08 5.78
Lower extreme bound ([
Iow
2o
Iow
) -12.65 -12.65 -10.67 -10.67 -10.03
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
Gini index (GIN
1
) 0 = 0.75 0 = 0.50 0 = 0.10 0 = 0.05 0 = 0.00
Upper extreme bound ([
hgh
+2o
hgh
) 0.05 0.05 0.11 0.08 0.11
Lower extreme bound ([
Iow
2o
Iow
) -1.09 -1.09 -1.04 -1.09 -1.09
Robust/Fragile Fragile Fragile Fragile Fragile Fragile
89
Table C1: EBA Results for the Lag Variables of Interest (Dependent Variable: Life Expectancy, LFE
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
0
)
High: 1.69 (0.43)* 35 0.91 PHY
0
, GIN
0
, GOV
1
Upper: 3.61
Lower: -0.56
Fragile Base: 3.03 (0.42)*** 95 0.88
Low: 1.55 (1.06) 35 0.90 PHY
0
, URB
0
, GIN
0
, GOV
1
Female education
(EDU
0
)
High: 0.39 (0.33) 35 0.87 PHY
0
, URB
0
, GIN
0
Upper: 1.05
Lower: -0.30
Fragile Base: 0.42 (0.19)** 95 0.88
Low: 0.10 (0.20) 39 0.90 URB
0
, GIN
0
, GOV
1
African dummy
variable
(AFR)
High: -8.19 (2.11)*** 35 0.90 PHY
0
, URB
0
, GIN
0
, GOV
1
Upper: -3.97
Lower: -15.23
Robust Base: -12.73 (1.01)*** 95 0.88
Low: -13.11 (1.06)*** 89 0.89 PHY
0
, GOV
1
Number of
physicians
(PHY
0
)
High: -0.01 (1.10) 35 0.87 URB
0
, GIN
0
Upper: 2.18
Lower: -2.98
Fragile
a
Base: -0.76 (0.33)** 95 0.88
Low: -1.21 (0.89) 35 0.90 GIN
0
, GOV
1
Governance quality
(GOV
1
)
High: 2.94 (0.99)*** 35 0.90 PHY
0
, GIN
0
Upper: 4.91
Lower: 0.58
Robust Base: 1.72 (0.56)*** 95 0.88
Low: 1.71 (0.56)*** 89 0.89 PHY
0
, URB
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
90
Table C1: EBA Results for the Lag Variables of Interest (Dependent Variable: Life Expectancy, LFE
1
) (continued)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Urbanization
(URB
0
)
High: 0.05 (0.03)* 89 0.89 PHY
0
Upper: 0.12
Lower: -0.06
Fragile Base: 0.04 (0.03) 95 0.88
Low: 0.01 (0.03) 39 0.87 GIN
0
Gini index
(GIN
0
)
High: 0.09 (0.07) 35 0.90 PHY
0
, GOV
1
Upper: 0.23
Lower: -0.15
Fragile
a
Base: 0.04 (0.06) 95 0.88
Low: 0.01 (0.07) 35 0.87 PHY
0
, URB
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
91
Table C2: EBA Results for the Lag Variables of Interest (Dependent Variable: Infant Mortality, IFM
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
0
)
High: -2.83 (4.48) 35 0.84 PHY
0
, URB
0
, GIN
0
, GOV
1
Upper: 6.13
Lower: -10.99
Fragile Base: -8.87 (1.88)*** 95 0.88
Low: -4.04 (3.47) 39 0.85 GIN
0
, GOV
1
Female education
(EDU
0
)
High: -0.32 (0.93) 35 0.84 PHY
0
, GIN
0
, GOV
1
Upper: 1.53
Lower: -3.43
Fragile Base: -2.35 (0.81)*** 95 0.88
Low: -1.53 (0.95) 89 0.83 PHY
0
, GOV
1
African dummy
variable
(AFR)
High: 50.28 (12.22)*** 35 0.79 PHY
0
, GIN
0
Upper: 74.73
Lower: 12.85
Robust Base: 39.78 (4.91)*** 95 0.88
Low: 36.45 (11.80)*** 35 0.84 PHY
0
, URB
0
, GIN
0
, GOV
1
Number of
physicians
(PHY
0
)
High: 3.14 (5.96) 35 0.79 URB
0
, GIN
0
Upper: 15.06
Lower: -10.38
Fragile
a
Base: 1.64 (1.93) 95 0.88
Low: -2.53 (3.93) 35 0.84 GIN
0
, GOV
1
Governance quality
(GOV
1
)
High: -10.32 (3.58)*** 95 0.83 URB
0
Upper: -3.15
Lower: -25.06
Robust Base: -10.30 (3.56)*** 95 0.88
Low: -14.61 (5.22)*** 35 0.84 PHY
0
, GIN
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
92
Table C2: EBA Results for the Lag Variables of Interest (Dependent Variable: Infant Mortality, IFM
1
) (continued)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Urbanization
(URB
0
)
High: 0.06 (0.12) 95 0.83 GOV
1
Upper: 0.30
Lower: -0.38
Fragile
a
Base: 0.05 (0.15) 95 0.88
Low: -0.08 (0.15) 35 0.79 PHY
0
, GIN
0
Gini index
(GIN
0
)
High: -0.27 (0.32) 35 0.79 PHY
0
, URB
0
Upper: 0.38
Lower: -0.87
Fragile
a
Base: -0.30 (0.26) 95 0.88
Low: -0.32 (0.27) 35 0.79 PHY
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
93
Table C3: EBA Results for the Lag Variables of Interest (Dependent Variable Under-5 Mortality, UFM
1
)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
GDP per capita
(log) (GDP
0
)
High: -3.18 (5.77) 35 0.88 PHY
0
, URB
0
, GIN
0
, GOV
1
Upper: 8.37
Lower: -14.33
Fragile Base: -13.02 (2.80)*** 95 0.88
Low: -5.46 (4.44) 39 0.88 GIN
0
, GOV
1
Female education
(EDU
0
)
High: -0.13 (1.14) 35 0.88 PHY
0
, GIN
0
, GOV
1
Upper: 2.14
Lower: -5.50
Fragile Base: -3.20 (1.23)** 95 0.88
Low: -2.37 (1.57) 89 0.82 PHY
0
, GOV
1
African dummy
variable
(AFR)
High: 84.91 (15.20)*** 35 0.84 PHY
0
, GIN
0
Upper: 115.30
Lower: 38.03
Robust Base: 75.10 (8.36)*** 95 0.88
Low: 66.78 (14.38)*** 35 0.84 PHY
0
, URB
0
, GIN
0
, GOV
1
Number of
physicians
(PHY
0
)
High: 3.29 (2.37) 89 0.82 GOV
1
Upper: 8.03
Lower: -15.37
Fragile
a
Base: 2.72 (2.68) 95 0.88
Low: -5.51 (4.93) 35 0.88 GIN
0
, GOV
1
Governance quality
(GOV
1
)
High: -12.50 (6.37)* 89 0.82 PHY
0
Upper: 0.23
Lower: -25.34
Fragile Base: -11.87 (5.94)** 95 0.88
Low: -12.53 (5.22)* 89 0.82 PHY
0
, URB
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.
94
Table C3: EBA Results for the Lag Variables of Interest (Dependent Variable: Under-5 Mortality, UFM
1
) (continued)
Variable of interest
(m)
Estimated Coefficient
()
Countries R-squared Z-Variables
Extreme Bound
Value
Robust/Fragile
Urbanization
(URB
0
)
High: 0.15 (0.21) 95 0.82 GOV
1
Upper: 0.56
Lower: -0.49
Fragile
a
Base: 0.14 (0.23) 95 0.88
Low: -0.13 (0.18) 35 0.84 PHY
0
, GIN
0
Gini index
(GIN
0
)
High: -0.41 (0.37) 39 0.84 URB
0
Upper: 0.33
Lower: -1.23
Fragile
a
Base: -0.45 (0.32) 95 0.88
Low: -0.54 (0.34) 35 0.84 PHY
0
The base [ is the estimated coefficient of the variable of interest (m) in a regression that only includes the free variables and no Z-variables.
The high [ is the estimated coefficient of the variable of interest (m) from the regression with the upper extreme bound value ([
hgh
+2o
hgh
).
The low [ is the estimated coefficient of the variable of interest (m) from the regression with the lower extreme bound value ([
Iow
2o
Iow
).
Figures in parentheses are standard errors.
Significance at 1%, 5% and 10% level are denoted with ***, **, and * respectively.
The robust/fragile column reports whether the variable of interest (m) is robust or fragile.
a
The variable of interest (m)entered the base regression with the incorrect sign.