You are on page 1of 12

Stoch Environ Res Risk Assess (2009) 23:11431154

DOI 10.1007/s00477-008-0288-5

ORIGINAL PAPER

Adaptive Neuro-Fuzzy Inference System for drought forecasting


Ulker Guner Bacanli Mahmut Firat
Fatih Dikbas

Published online: 24 October 2008


Springer-Verlag 2008

Abstract Drought causes huge losses in agriculture and


has many negative influences on natural ecosystems. In this
study, the applicability of Adaptive Neuro-Fuzzy Inference
System (ANFIS) for drought forecasting and quantitative
value of drought indices, the Standardized Precipitation
Index (SPI), is investigated. For this aim, 10 rainfall
gauging stations located in Central Anatolia, Turkey are
selected as study area. Monthly mean rainfall and SPI
values are used for constructing the ANFIS forecasting
models. For all stations, data sets include a total of 516 data
records measured between in 1964 and 2006 years and data
sets are divided into two subsets, training and testing.
Different ANFIS forecasting models for SPI at time scales
112 months were trained and tested. The results of ANFIS
forecasting models and observed values are compared and
performances of models were evaluated. Moreover, the
best fit models have been also trained and tested by Feed
Forward Neural Networks (FFNN). The results demonstrate that ANFIS can be successfully applied and provide
high accuracy and reliability for drought forecasting.
Keywords Drought forecasting  ANFIS  Drought
indices  Central Anatolia  Turkey

1 Introduction
Drought is a threatening global and local problem that has
many damages in various ways. It causes huge losses in
agriculture and has many negative influences on natural
U. G. Bacanli  M. Firat (&)  F. Dikbas
Civil Engineering Department, Faculty of Engineering,
Pamukkale University, 20017 Denizli, Turkey
e-mail: mfirat@pau.edu.tr

ecosystems. Drought causes degradation of soils and


desertification (Nicholson et al. 1990; Pickup 1998), social
alarm and famine and impoverishment. Studies on climate
change also drew attention to drought in recent years (Byun
and Wilhite 1999) and many studies were made to analyze
the spatial patterns of drought risk in order to assist agricultural or environmental management (Dracup et al.
1980). The development of drought monitoring plans is
a priority in many of these studies (Wilhite 1997; Hayes
et al. 1999) and drought prediction is the subject of some
other studies that investigate the atmospheric causes of
droughts. Drought risk analysis aiming at improving techniques for drought prediction and management are based
on the spatial variation of drought and are mainly focused
on the magnitude, duration, intensity and spatial extent of
droughts. Currently, indirect characteristic features of soil
moisture time series namely drought indices are widely
used. Spatial and temporal extent and severity of drought
can be determined by the help of these indices (Palmer
1995; McKee et al. 1993; Edwards and Mckee 1997; Hayes
1997; Guttmann 1998; Hayes 2000). The Standardized
Precipitation Index (SPI), developed by McKee et al.
(1993), is an effective drought index which has several
advantages over the others. Calculation of the SPI is easier
than the more complex indices such as the Palmer Drought
Severity Index (PDSI; Palmer 1965), because the SPI
requires only precipitation data, whereas the PDSI uses
several parameters. The SPI is comparable in both time and
space and it can be calculated for several time scales
(Srdas and S en 2003; McKee et al. 1995) and it allows the
determination of duration, magnitude and intensity of
droughts. The SPI identifies various drought types as
hydrological, agricultural or environmental and it has been
extensively used for drought analysis of many areas of the
world. Several studies focused on the SPIs calculation

123

1144

procedures, which identify the most appropriate frequency


distributions (Guttmann 1998), the effect of time scales on
the parameters (Ntale and Gan 2003), and spatial and
temporal comparability (Keyantash and Dracup 2002).
However, the SPIs spatial stability and coherence in
relation to time scales have not been analysed. Mishra et al.
(2008) investigated the distribution of drought interval
time, mean drought interarrival time, joint probability
density function and transition probabilities of drought
events using the alternative renewable process and run
theory in the Kansabati River basin in India. For this aim,
the Standardized Precipitation Index (SPI) series were
employed and the time interval of SPI was found to have a
significant effect of the probabilistic characteristics of
drought. Mishra and Desai (2005) used the linear stochastic
models known as ARIMA and multiplicative Seasonal
Autoregressive Integrated Moving Average (SARIMA)
models to forecast droughts based on the procedure of
model development. The models were applied to forecast
droughts using standardized precipitation index (SPI) series
in the Kansabati river basin in India. Cancelliere et al.
(2007) proposed two methodologies for the seasonal forecasting of SPI, under the hypothesis of uncorrelated and
normally distributed monthly precipitation aggregated at
various time scales. In the first methodology, the autocovariance matrix of SPI values was analytically derived,
as a function of the statistics of the underlying monthly
precipitation process. In the second methodology, SPI
forecasts at a generic time horizon M were analytically
determined, in terms of conditional expectation, as a
function of past values of monthly precipitation. The
results showed that the proposed methodologies can be
applied for drought monitoring system. Hughes and
Saunders (2002) used monthly SPIs at time scales of 3, 6,
9, 12, 18, and 24 months for characterizing the drought
climatology of Europe. Bonaccorso et al. (2003) used the
SPI for drought analysis in Italy and Loukas et al. (2004)
applied the SPI for drought forecasting in Greece. VicenteSerrano and Lopez-Moreno (2005) analyzed the usefulness
of different SPI time scales to monitor droughts in river
discharges and reservoir storages. The objective was to
determine the most adequate time scales of SPI to monitor
droughts in two basic water usable sources: river discharges and reservoir storages. They found that Time
scales of SPI longer than 12 months do not seem useful to
monitor any drought type in their study areas. Moreira et al.
(2006) analyzed the SPI with the 12-month time scale
through adjusting loglinear models to the probabilities of
transitions between the SPI drought classes.
The new techniques such as artificial neural networks
(ANN), Fuzzy Logic (FL) and ANFIS have been recently
accepted as an efficient alternative tool for modeling
of complex hydrologic systems and widely used for

123

Stoch Environ Res Risk Assess (2009) 23:11431154

forecasting. Some specific applications of ANN to


hydrology include modeling rainfall-runoff process (Jeong
and Kim 2005; Kumar et al. 2005; Rajurkar et al. 2004),
hydrologic time series modeling (Jain and Kumar 2007),
sediment concentration estimation (Nagy et al. 2002), estimation of heterogeneous aquifer parameters (Mantoglou
2003), runoff and sediment yield modeling (Agarwal et al.
2006). Morid et al. (2007) examined the utility of ANN
approach for medium and long-term forecasting of both
the likelihood of drought events and their severity. Mishra
and Desai (2006) applied the feed-forward recursive
neural network and ARIMA models for drought forecasting using standardized precipitation index (SPI) series
as drought index. The results have demonstrated that
neural network method can be successfully applied for
drought forecasting. Wu et al (2008) applied the neural
network method to establish a risk evaluation model of
heavy snow disaster using back-propagation artificial
neural network (BP-ANN). According to results, BP-ANN
model showed an advantage in heavy snow risk evaluation in Xilingol compared to the conventional method.
Moreover, ASCE Task Committee reports (2000) did a
comprehensive review of the applications of ANN in the
hydrological forecasting context. On the other hand,
several studies have also been carried out using FL in
hydrology and water resources planning (Mahabir et al.
2000; Liong et al. 2000; Nayak et al. 2005; Altunkaynak
et al. 2005). In recent years, Adaptive Neuro-Fuzzy
Inference System (ANFIS), which is integration of ANN
and FL methods, has been used in the modeling of nonlinear engineering and water resources problems (Chang
and Chang 2006; Nayak et al. 2004; S en and Altunkaynak
2006; Firat 2007; Firat and Gungor 2007, 2008). Moreover, Chou and Chen (2007) have used the neuro fuzzy
computing technique for the development of drought early
warning index. For this aim, an approach has been proposed to develop drought early warning index (DEWI) for
southern Taiwan to detect the drought in advance for
setting up proper plans to mitigate the water shortage
impact.
Drought forecasting plays an important role in the
mitigation of impacts of drought on water resources systems.
Because SPI is one of the most widely used methods
related to drought, accurate and reliable estimation of SPI
is very important. Traditional methods like regression
analysis and autoregressive moving average models
are commonly used in the estimation of hydrological
processes. Moreover FL and ANN methods offer real
advantages over conventional modeling especially when
the underlying physical relationships are not fully understood. FL is employed to describe human thinking and
reasoning in a mathematical framework. The main problem
with FL is that there is no systematic procedure to define

Stoch Environ Res Risk Assess (2009) 23:11431154

the MF parameters and to design of fuzzy rules. The construction of the fuzzy rule necessitates the definition of
premises and consequences as fuzzy sets.
In this paper, Adaptive Neuro-Fuzzy Inference System
(ANFIS), which is an integration of ANN and FL methods,
is proposed as an alternative to the traditional methods for
drought forecasting using SPI for multiple time scales. The
main contribution of ANFIS method is that it eliminates the
basic problems in fuzzy modeling (defining the membership function parameters and design of fuzzy ifthen rules)
by using the learning capability of ANN for automatic
fuzzy rule generation and parameter optimization. To
illustrate the applicability of ANFIS method in drought
forecasting, 10 rainfall gauging stations located in Central
Anatolia, Turkey are selected as study area. Monthly mean
precipitation and SPI values are used for constructing the
ANFIS forecasting models. The best fit forecasting model
structure was determined by comparing the forecasted and
observed values.

2 Standard Precipitation Index (SPI)


Standard Precipitation Index calculation is based on longterm precipitation data. SPI is obtained by dividing the
difference between precipitation and mean to standard
deviation in a specific duration (McKee et al 1993). SPI is a
dimensionless index that takes negative values in drought
periods and positive values in wet periods. The magnitude,
length and duration of drought can be calculated with SPI.
The calculation of SPI is complex because the precipitation
does not fit normal distribution for the periods of
12 months and less and for this reason the precipitation
series are fitted to normal distribution
xi  xi
SPI
:
1
r
SPI permits to determine the rarity of a drought or an
anomalously wet event at a particular time scale for any
location that has a precipitation record. A drought event is
considered to occur at a time when the value of SPI is
continuously negative and end when SPI becomes positive
(Mishra et al. 2008). The classes according to the SPI index
are given in the Table 1.
The following steps are applied in the SPI method:
1.

Monthly precipitation data sets are organized for a


period of at least 30 years. Different time steps are
determined like 3, 6, 9, 12, 24 or 48 months to monitor
the variations of the indices by considering the
influence of precipitation deficit on various resources.
The time steps may vary according to the condition of
water resources in the area. In the proposed study,
estimation models were constructed with ANFIS

1145
Table 1 Classification according to the SPI values
SPI

Drought category

2[

Extremely wet

1.991.5

Very wet

1.491.0

Moderately wet

0.99(-0.99)

Near normal

(1.0)(-1.49)

Moderately dry

(1.5)(-1.99)

Severely dry

2\

Extremely dry

2.

method by using the SPI outputs for 1, 3, 6, 9 and


12 months.
Then Gamma distribution is fitted to the data set and
thus the observed precipitation probabilities are
defined. Gamma distribution is the best fitting distribution to the climatologic time series. Gamma
distribution is defined by either the frequency distribution or the probability density function

g x

1
xa1 ex=b
b Ca
a

for x [ 0:

a([0) is the shape parameter; b([0) is the scale parameter;


x([0) is the precipitation amount, and C (a) is the Gamma
function. In the calculation of a and b, maximum
probability solutions are used. According to this:
r!
1
4A
a
1 1
3
4A
3
b

x
a

4
P

A lnx 

ln x
n

3.

These probability definitions obtained from the present


data may later be used to determine the cumulative
probability of a value observed at any month. In this
situation, the cumulative probability distribution
function is defined as follows:
Zx
Zx
1
G x g xdx a
xa1 ex=b dx
6
b C a
0

4.

Gamma function is undefined for x = 0 and


precipitation distribution can have zero values. When
this is the case, the cumulative probability distribution
is defined as follows:

H x q 1  q  G x

In the equation above, n is the number of precipitation


observations, q represents the probability for zero value. If
m is used for denoting the zero values in a precipitation
series then the following definition can be made: q = m/n.

123

1146

5.

Stoch Environ Res Risk Assess (2009) 23:11431154

The cumulative probability value H(x) is converted to


Z variable with a standard normal random value
denoting the SPI value having zero mean value and
variance equal to 1. H(x) is the value of SPI.
Normalization of SPI values enables the consideration
of the variations of precipitation series of that station
by both time and place (McKee et al. 1993; Guttmann
1998).

basic problem in fuzzy system design (defining the membership function parameters and design of fuzzy ifthen
rules) by effectively using the learning capability of ANN
for automatic fuzzy rule generation and parameter optimization. There are two types of FISs, Sugeno-Takagi FIS
and Mamdani FIS, in literature. In this study, SugenoTakagi FIS is used for drought forecasting. The most
important difference between these systems is definition of
the consequent parameter. The consequence parameter in
Sugeno FIS is either a linear equation, called first-order
Sugeno FIS, or constant coefficient, zero-order Sugeno FIS
(Jang et al. 1997). It is assumed that the FIS includes two
inputs, SPI(t - 1) and P(t - 1) and one output, SPI(t). The
membership functions and the structure of are shown in
Fig. 1. For the first-order Sugeno-Takagi FIS, typical two
rules can be expressed as:

3 Adaptive Neuro Fuzzy Inference System (ANFIS)


The FL approach proposed by Zadeh (1965) is based on the
linguistic uncertainly expression rather than numerical
uncertainty. FL approach has become popular and has
been successfully used in various engineering problems
(Mahabir et al. 2000; Liong et al. 2000; Nayak et al. 2005;
S en 2001). Fuzzy inference system (FIS) is a rule based
system consisting of three conceptual components. These
are: (1) a rule-base, containing fuzzy if-then rules, (2) a
data-base, defining the Membership Function (MF) and (3)
an inference system, combining the fuzzy rules and produces the system results (Firat and Gungor 2007, 2008; S en
2001). The main problem with fuzzy logic is that there is
no systematic procedure to define the membership function
parameters and to design of fuzzy rules. In recent years,
ANFIS method, which is integration of ANN and FL
methods, has the potential to capture the benefits of both
these methods in a single framework. ANFIS eliminates the
Fig. 1 The scheme of Adaptive
Neuro-Fuzzy Inference System

Rule 1: IF SPIt  1 is A1 and Pt  1 is B1


THEN f1 p1  SPIt  1 q1  Pt  1 r1
Rule 2: IF SPIt  1 is A2 and Pt  1 is B2
THEN f2 p2  SPIt  1 q2  Pt  1 r21
Input notes (Layer 1) Each node in this layer generates
membership grades of the crisp inputs and each nodes
output O1i is calculated by:
O1i lAi SPIt  1
O1i

for i 1; 2;

lBi2 Pt  1 for i 3; 4

where, SPI(t - 2) is the SPI value at time (t - 2) to the


node i, P(t - 1) is the actual precipitation at time (t - 1),
Membership
B1
Degree

Membership A1
Degree

B1
f1 = p1 * SPI (t 1) + q1 * P(t 1) + r1

B2

A2

f2 = p2 *SPI(t 1) + q2 * P(t 1) + r2

SPI (t-2)

A1

Rule Base and Inference System

w1 ( SPI (t 1), P (t 1) w1 = w1
w1 + w2

SPI (t-2)

P (t-1)

f 1 ( SPI (t 1), P (t 1)

A2

w1 f1
f 1 ( SPI (t 1), P (t 1)

B1
P (t-1)

B2
Layer 1

123

f 2 ( SPI (t 1), P (t 1)

w2 f 2

w2
w2 ( SPI (t 1), P (t 1) w2 = w + w
1
2
Layer 2

Layer 3

Layer 4

Layer 5

Stoch Environ Res Risk Assess (2009) 23:11431154

1147

SPI(t) is the SPI value at time (t) to the node i, Ai and Bi are
the linguistic labels, pi, qi and ri are the consequence
parameters, lAi and lBi are the MFs for Ai and Bi linguistic
labels, respectively and in this study, the Gauss MF is used,
as
SPIt1c2
2r2

O1i lAi x e

consequent parameters. All these parameters are updated


using this hybrid learning algorithm until acceptable error
is reached. The details and mathematical background of
these algorithms can be found in Jang et al. (1997) and in
Nayak et al. (2004).

4 Study area and data

Rule nodes (Layer 2) The outputs of this layer, called


firing strengths O2i , are the products of the corresponding
degrees obtained from layer 1, named as w as follows:
O2i wi lAi SPIt  1lBi Pt  1;

i 1; 2

10

Average nodes (Layer 3) Main target is to compute the


ratio of firing strength of each ith rule to the sum firing
strength of all rules. The firing strength in this layer is
normalized as
wi
i P
O3i w
i 1; 2
11
i wi
Consequent nodes (Layer 4) The contribution of ith rule
towards the total output or the model output and/or the
function defined is calculated by Eq. (12)
 i fi w
i pi SPIt  1 qi Pt  1 ri i 1; 2
O4i w
12
Output nodes (Layer 5) This layer is called as the output
nodes in which the single node computes the overall output
by summing all incoming signals

The temperature difference between summer and winter is


high, the precipitation generally occurs in spring and winter
and dry periods dominate summers. This climate is experienced in Central, East, Southeast Anatolia and Trakya
region. Climate of Central Anatolia has the following
properties: The weather in the summer is a little hot and
winters are cold. The severity of cold weather increases
towards the east parts of Central Anatolia. Natural flora
consists of steppes in the lower regions and dry forests in the
higher regions because of summer droughts. Mean temperature of January, the coldest month, is 0.7C and it is 22C in
July, the hottest month. Annual mean temperature is 10.8C.
Mean annual precipitation is 413.8 mm and most of the
precipitation occurs in winter and spring seasons. The percent of summer rains among the annual total is 14.7%. The
annual mean proportional moisture in the region is 63.7%.
Observed monthly rainfall data records from ten meteorological stations (Aksaray, Ankara, Cankr, Eskis ehir,
Karaman, Kayseri, Konya, Krs ehir, Nevs ehir and Yozgat)
located in Central Anatolia, Turkey, have been selected for

w1 f1 w2 f2
w1 w2
w1 SPIt  1; Pt  1f1 SPIt  1; Pt  1 w2 SPIt  1; Pt  1f2 SPIt  1; Pt  1

w1 SPIt  1; Pt  1 w2 SPIt  1; Pt  1

f x; y

Q5i f SPIt  1; Pt  1
P
wi fi
Pi
i wi

i f1 w
 i f2
 i  fi w
w

13

this study. The length of available records at these stations is


between 1964 and 2006. The SPI for this study have been
calculated on the basis of these rainfall data.

14

i is the ith node output from the previous layer as


where w
demonstrated in the third layer. ANFIS applies the hybridlearning algorithm, which consists of the combination of
the gradient descent and the least-squares methods to
determine the input and output model parameters. The task
of the learning algorithm is to tune all the antecedent and
consequence parameters to make the ANFIS response
match the training data. The gradient descent method is
used to assign the nonlinear antecedent parameters and the
least-squares method is employed to identify the linear

5 Drought forecasting by ANFIS


5.1 Input variables
Different time steps like 3, 6, 12, 24 and 48 months are
determined as (1) for monitoring the variations in the
indexes by considering the effect of precipitation lack on
different water resources. In this study, the values of SPI
and precipitation in the previous months are used for
generating a drought estimation model with ANFIS

123

1148

Stoch Environ Res Risk Assess (2009) 23:11431154

Table 2 The statistical parameters for training data sets (19641986)


Min. Max. Mean SD

Variance Skewness Kurtosis

Aksaray

0.0

110.1 28.96 23.17

536.98

0.775

0.142

Ankara

0.0

121.5 34.86 26.08

680.54

0.799

-0.005

Cankr

0.0

137.7 34.62 26.64

709.96

1.012

0.856

Eskis ehir 0.0

128.2 34.07 25.92

671.92

0.954

0.949

Karaman 0.0

144.1 29.15 26.88

722.91

1.182

1.469

Kayseri

0.0

133.2 30.50 23.70

562.01

0.927

0.869

Konya

0.0

112.2 28.74 23.47

551.11

0.876

0.433

Krs ehir

0.0

145.8 31.84 26.49

702.12

0.820

0.430

Nevs ehir 0.0

116.7 34.29 26.56

705.89

0.668

-0.107

Yozgat

192.3 48.79 38.68 1496.22

0.876

0.475

0.0

Table 3 The statistical parameters for testing data sets (19872006)


Min. Max. Mean SD

Variance Skewness Kurtosis

Aksaray

0.0

101.3 28.47 23.21

539.14

0.813

0.100

Ankara

0.0

122.4 32.33 25.92

672.28

0.953

0.630

Cankr

0.0

149.8 32.76 27.46

754.28

1.381

2.175

Eskis ehir 0.0

129.7 29.16 22.36

500.28

1.329

2.823

Karaman 0.0

121.8 26.27 22.60

510.87

0.956

0.802

and the results of the model. Here, different estimation


models were constructed for each phase. The models for 1,
3, 6, 9 and 12 months were named as SPI-1, SPI-3, SPI-6,
SPI-9 and SPI-12, respectively. Here, SPI-1, SPI-3 and
SPI-6 were considered as the index for short term or seasonal variation, SPI-9 for short term drought and SPI-12
was considered as the drought index for long term. 20
models with different input numbers and structures were
constructed for each phase by using these variables. In this
study, forecasting models based on various combinations
of antecedent values of actual precipitations and SPI values
were constructed (Table 4). In each model every input
variable must be clustered into several class values in layer
1 to build up fuzzy rules. And each fuzzy rule would be
constructed through several parameters of membership
function in layer 2. As the number of parameters increases
with the fuzzy rule increment, the model structure becomes
more complicated. In this study, the subtractive fuzzy
clustering function was used to establish the fuzzy rule
based on the relationship between the inputoutput variables. In order to determine the nonlinear input and linear
output parameters, the hybrid algorithm was used. The

Kayseri

0.0

164.7 33.62 27.56

760.07

1.078

1.663

Konya

0.0

124.0 24.94 23.87

570.18

1.540

2.909

Table 4 The structures of forecasting models

Krs ehir

Model Input structure

Output

M1

SPI(t - 1)

SPI(t)

M2

SPI(t - 1), SPI(t - 2)

SPI(t)

M3
M4

SPI(t - 1), SPI(t - 2), SPI(t - 3)


SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4)

SPI(t)
SPI(t)

M5

SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4),


SPI(t - 5)

SPI(t)

M6

SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4),


SPI(t - 5), SPI(t - 6)

SPI(t)

M7

R(t - 1)

SPI(t)

M8

R(t - 1), R(t - 2)

SPI(t)

0.0

121.0 31.39 25.42

646.24

0.916

0.764

Nevs ehir 0.0

148.8 33.91 28.14

792.11

1.053

1.318

Yozgat

172.0 50.01 37.61 1414.92

0.819

0.245

0.0

method. For this, the SPI outputs for 1, 3, 6, 9 and


12 months were considered. In the construction of estimation models, again, different models were generated for
each of the SPI output for 1, 3, 6, 9 and 12 months. The
data sets for all stations were divided into two subsets,
training and testing data set. The training data set includes
data records measured between 1964 and 1986 years. In
order to get more reliable evaluation and comparison,
models are tested by evaluating a data set which was not
used during the training process. Testing data set consists
of data records observed between 1987 and 2006 years.
The statistical parameters, minimum value, maximum
value, mean, standard deviation, variance, skewness coefficient and Kurtosis for training and testing data sets are
calculated and given in Tables 2 and 3 to see a comparison
of the training and testing data sets.

M9

R(t - 1), R(t - 2), R(t - 3)

SPI(t)

M10

R(t - 1), R(t - 2), R(t - 3), R(t - 4)

SPI(t)

M11
M12

R(t - 1), R(t - 2), R(t - 3), R(t - 4), R(t - 5)


R(t - 1), R(t - 2), R(t - 3), R(t - 4), R(t - 5),
R(t - 6)

SPI(t)
SPI(t)

M13

SPI(t - 1) R(t - 1)

SPI(t)

M14

SPI(t - 1), SPI(t - 2) R(t - 1)

SPI(t)

M15

SPI(t - 1), SPI(t - 2) R(t - 1), R(t - 2)

SPI(t)

M16

SPI(t - 1), SPI(t - 2), SPI(t - 3) R(t - 1)

SPI(t)

M17

SPI(t - 1), SPI(t - 2), SPI(t - 3) R(t - 1),


R(t - 2)

SPI(t)

5.2 Model structures

M18

SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4)


R(t - 1)

SPI(t)

One of the most important steps in developing a satisfactory forecasting model is the selection of the input
variables. Because, these variables determine the structure
of forecasting model and affect the weighted coefficient

M19

SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4)


R(t - 1), R(t - 2)

SPI(t)

M20

SPI(t - 1), SPI(t - 2), SPI(t - 3), SPI(t - 4),


SPI(t - 5) R(t - 1)

SPI(t)

123

Stoch Environ Res Risk Assess (2009) 23:11431154

1149

learning procedure and the construction of the rules were


provided by this algorithm. The performance of ANFIS
models for training and testing data sets were evaluated
according to statistical criteria such as, Correlation Coefficient (CORR), Efficiency (E), and Root Mean Square
Error (RMSE). The CORR is a commonly used statistic
and provides information on the strength of linear relationship between the observed and the computed values.
The E is one of the widely employed statistics to evaluate
model performance. The values of CORR and E close to
1.0 indicate good model performance. The RMSE statistic
indicates a models ability to predict a value away from the
mean.
As it is impossible to show the model results for each
phase having 20 models because of space restrictions, only
the results for SPI-6 at Ankara station (Ankara is the capital
of Turkey and it is one of the cities where water shortage
and drought is severely experienced) are presented. The
testing performances of ANFIS models for SPI-6 are given
in Fig. 2.
When the results of the ANFIS models are compared, it
is seen that the performances of models composed of
precipitation values belonging to the previous time step are
lower than the performances of the other models. The
results of models in which SPI is used, show that the
performances of the models at all stations are close to
each other and that the model defined as M5 has a better
performance than the others. A general decrease in performance was observed in all models when the values at
(t - 6) time step were used. When the results of the models
consisting of only the precipitation values are evaluated, it
is seen that M11 is the model with the best performance for
all stations. It was also observed that the model (M7)
composed of precipitation values at the (t - 1) time step
has the lowest performance. The figure shows that the
model (M12) generated by using the values at (t - 6) time
step generally have lower performances. On the other side,
the investigation of the results given in the graphs for SPI-6
show that the models generated with the previous values of
SPI and precipitation data have a better performance. By
using the precipitation and SPI variables together for all
stations, an improvement has been achieved in the model
performances. According to the criteria, the model defined

Testing set
E

RMSE

CORR

RMSE

Aksaray (M20)

0.837

0.686

0.628

0.810

0.656

0.514

Ankara (M20)

0.824

0.685

0.549

0.876

0.754

0.526

Cankr (M5)
Eskis ehir(M5)

0.773
0.825

0.599
0.710

0.644
0.547

0.870
0.879

0.767
0.781

0.508
0.471

Karaman (M20)

0.826

0.714

0.578

0.860

0.741

0.521

Kayseri (M20)

0.846

0.712

0.601

0.855

0.731

0.462

Krs ehir (M20)

0.804

0.642

0.615

0.828

0.694

0.541

Konya (M20)

0.815

0.68

0.603

0.872

0.761

0.443

Nevs ehir (M14)

0.841

0.701

0.608

0.810

0.710

0.470

Yozgat (M20)

0.818

0.667

0.578

0.821

0.704

0.549

as M20 ANFIS for Aksaray, Ankara, Karaman, Kayseri,


Krs ehir, Konya and Yozgat stations, had the best results
over the other models. On the other hand, while the best
results are obtained from the M5 model for Eskis ehir and
Cankr stations, it was determined that M14 ANFIS model
had the best performance for Nevs ehir station. As a result,
the performances of the best fit ANFIS models for SPI-6 at
all stations (after the analysis of all stations, only performances of the models giving the most suitable results are
presented.) are shown in Table 5.
It can be stated that the model performances of ANFIS
models for all stations are at an acceptable level for SPI-6.
Figure 3 shows the performances of ANFIS models at
Ankara station for the data from 1 month to 12 months
(SPI-1 to SPI-12). In this figure, the variations of CORR, E
and RMSE criteria for SPI-1 to SPI-12 at Ankara station
during the testing period are demonstrated.
The results of other stations that are not presented here
due to space restrictions indicate that the ANFIS models
for SPI-12 have shown the best performance at all stations. It is seen that the performances of ANFIS model for
SPI-12 at Ankara station is better than those of other
models. The values of CORR and E of ANFIS models for
SPI-1 are lower than those of other models. The reason
for the ANFIS models developed by using SPI outputs of
12 months to show a better performance is that the SPI

Ankara Station

1.1

0.8

0.8

1.0

0.7

0.7

0.6

0.6

0.5

0.5
CORR
E

0.4

Training set

CORR

0.9

0.9

CORR

Station

0.4

E
RMSE

Fig. 2 Comparison of
performances of ANFIS Models
for SPI-6 at Ankara station

Table 5 The performances of the best fit models for SPI-6 at all
stations

Ankara Station
RMSE

0.9
0.8
0.7

0.3

0.3

0.6

0.2

0.2

0.5

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Model

Model

123

Stoch Environ Res Risk Assess (2009) 23:11431154


Ankara Station

1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1

0.5
0.4
0.3
0.2
0.1

CORR
E

Ankara Station

1
0.9
0.8
0.7
0.6

1.2
1

CORR

Fig. 3 The performances of


ANFIS models for SPI-1, SPI-3,
SPI-6, SPI-9 and SPI-12 at
Ankara station

12

Month

values calculated for a long term include dry and wet


periods for longer duration. Short term periods like 1 or
3 months may include a wet or a dry period for a short
time. For example, in 3 months period, drought occurs
more frequently and for a shorter time and when the
period increases the duration of drought increases but its
frequency decreases. This means that for shorter periods
the SPI values may contain 1 month dry and 1 month wet
period and this causes instability. Passages between
positive and negative values occur more frequently and
this also results with instability. For this reason, the
ANFIS estimation models constructed with the SPI values
calculated for shorter periods, cannot catch dry and wet
periods and give unsuccessful results. Besides, the SPI
outputs for 12 months have a more stable run. Thus, the
ANFIS models developed by using SPI outputs for
12 months can catch dry and wet periods and give better
results. Figure 4 shows the results of ANFIS models for
Ankara station from SPI-1 to SPI-12.
In order to evaluate the results of ANFIS models, the
best fit models for Ankara station (SPI-1 to SPI-12) have
also been tested by Feed Forward Neural Networks
(FFNN) and Multiple Linear Regression (MLR). The
FFNN models have been trained and tested using the same
data sets. The error back propagation algorithm and tangent
activation function is used for training/testing of the FFNN
models. The number of hidden layers and the hidden
neurons in this layer, the learning rate, the coefficient of
momentum and epochs were selected by trial and error
method during the training. The results of FFNN and MLR
models for SPI-12 at Ankara station are shown in Table 6
and Fig. 5.
Comparing performances of ANFIS models for Ankara
station, it is seen that the performance of the ANFIS model
for SPI-12 are better than other ANFIS models for SPI-1 to
SPI-9. As a result, it is said that ANFIS can be successfully
applied and provide high accuracy and reliability for
drought forecasting. On the other hand, comparing the
results of ANFIS and FFNN forecasting models for Ankara
station (SPI-1 to SPI-12), it can be seen that the RMSE
values of the ANFIS models are lower than that of FFNN
model. In addition, the values of E and CORR of the

123

RMSE

1150

RMSE

0.8
0.6
0.4
0.2
1

12

Month

ANFIS model are also higher than those of FFNN models.


The results suggest that the ANFIS method is superior to
the FFNN method in the forecasting of drought. It may be
noted that a trial and error procedure has to be performed
for FFNN models to develop the best network structure
while such a procedure is not required in developing an
ANFIS model. Figure and table indicate that the best result
was obtained from the models developed for SPI-12 as
in the ANFIS method. Comparing the performances of
ANFIS and MLR models, it can be seen that the values of
E and CORR of the ANFIS model are also higher than
those of MLR models. The NRMSE values of ANFIS
model are also lower than those of MLR models. The
results suggest that the ANFIS method is also superior to
the MLR method in the drought forecasting. The results
show that ANFIS method can be successfully applied to
establish accurate and reliable drought forecasting models.

6 Conclusions
SPI is one of the most widely used methods related to
drought and SPI should be estimated accurately and reliably. Traditional methods like regression analysis and
autoregressive moving average models are commonly used
in the estimation of hydrological processes.
In this paper, Adaptive Neuro-Fuzzy Inference System
(ANFIS) was proposed as an alternative drought forecasting tool to the traditional methods. The main contribution
of ANFIS method is that it eliminates the basic problems
in fuzzy modeling (defining the membership function
parameters and design of fuzzy ifthen rules) by using
the learning capability of ANN for automatic fuzzy rule
generation and parameter optimization.
To illustrate the applicability of ANFIS method in
drought forecasting, 10 rainfall gauging stations located in
Central Anatolia, Turkey were selected as study area.
Different ANFIS forecasting models for SPI-1, SPI-3,
SP-6, SPI-9 and SPI-12 were trained and tested. When the
results of the ANFIS models are compared, it is seen that
only the performances of models composed of precipitation values belonging to the previous time step are lower

Stoch Environ Res Risk Assess (2009) 23:11431154


Fig. 4 The results of ANFIS
models for for Ankara station
(SPI-1 to SPI-12)

1151
Ankara (SPI-1)

4.0

SPI

2.0
0.0
1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

Forecasted

Forecasted
Observed

0
-4

-2.0

-2

-2
-4.0

-4

-6.0

Observed

Month
Ankara (SPI-3)

4.0

Forecasted

Forecasted
Observed

SPI

2.0

0.0

0
-4

1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

-2

0
-2

-2.0

-4

-4.0

Month

Observed

Ankara (SPI-6)

4.0

Forecasted

Forecasted
Observed

SPI

2.0

0.0

0
-4

1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

-2

0
-2

-2.0

-4

-4.0

Month

Observed

Ankara (SPI-9)

4.0

Forecasted

Forecasted
Observed

SPI

2.0

0.0
1

0
-4

14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

-2

0
-2

-2.0

-4

-4.0

Month

Observed

Ankara (SPI-12)

4.0

Forecasted

Forecasted
Observed

SPI

2.0

0.0
1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

0
-4

-2

0
-2

-2.0

-4

-4.0

Month

Observed

123

1152

Stoch Environ Res Risk Assess (2009) 23:11431154

Table 6 Comparison of performances of ANFIS, FFNN and MLR models for Ankara station
Station

Testing set

Training set

CORR

RMSE

CORR

RMSE

M20 ANFIS (for SPI-1)

0.392

0.371

1.016

0.573

0.502

0.968

M20 ANFIS (for SPI-3)

0.490

0.422

0.707

0.584

0.569

0.654

M20 ANFIS (for SPI-6)

0.824

0.685

0.549

0.876

0.754

0.526

M20 ANFIS (for SPI-9)

0.851

0.733

0.507

0.920

0.847

0.401

M20 ANFIS (for SPI-12)

0.893

0.808

0.425

0.930

0.865

0.375

M20 FFNN (for SPI-1)

0.314

0.298

1.254

0.487

0.451

1.026

M20 FFNN (for SPI-3)


M20 FFNN (for SPI-6)

0.417
0.752

0.402
0.625

0.916
0.652

0.561
0.833

0.524
0.694

0.845
0.575

M20 FFNN (for SPI-9)

0.813

0.674

0.577

0.858

0.738

0.525

M20 FFNN (for SPI-12)

0.851

0.722

0.512

0.887

0.794

0.453

M20 MLR (for SPI-1)

0.306

0.257

1.291

0.380

0.302

1.354

M20 MLR (for SPI-3)

0.411

0.398

1.096

0.576

0.528

0.885

M20 MLR (for SPI-6)

0.719

0.584

0.669

0.829

0.687

0.582

M20 MLR (for SPI-9)

0.804

0.600

0.621

0.894

0.799

0.462

M20 MLR (for SPI-12)

0.811

0.593

0.619

0.904

0.822

0.435

Ankara FFNN Mode l (SPI-12)

4.0

4
Forecasted
Observed

SPI

2.0

0.0

Forecasted

Fig. 5 The results of FFNN and


MLR models for SPI-12 at
Ankara station

0
-4

1 14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

-2

-4

-4.0

Month

Observed

Ankara MLR Mode l (SPI-12)

Forecasted
Observed
2.0

0.0
1

14 27 40 53 66 79 92 105 118 131 144 157 170 183 196 209 222 235

0
-4

-2

0
-2

-2.0

-4

-4.0

Month

than the performances of the other models. The results of


models in which only the SPI is used, show that the
model named M5 has a better performance than the other
models. It was also observed that when the SPI value at
(t - 6) time step is used, there is a decrease in performance for all stations generally. On the other hand, when
the results of models containing only the precipitation
values were investigated, it was found that M11 has

Forecasted

4.0

SPI

-2

-2.0

123

Observed

shown the best performance for all stations. The model


defined as M7 composed of the precipitation value at time
step (t - 1) had the lowest performance. The results
indicate that the models (M12) generated by using the
precipitation values at time step (t - 6) generally have a
lower performance. By using the precipitation and SPI
variables together for all stations, an improvement was
achieved in the model performances. According to the

Stoch Environ Res Risk Assess (2009) 23:11431154

criteria given in the figures, the model defined as M20


ANFIS model, which consists of the combination of the
antecedent values of the rainfall and SPI variables, for
Aksaray, Ankara, Karaman, Kayseri, Krs ehir, Konya and
Yozgat stations, had the best results over the other
models. Moreover, while the best results are obtained
from the M5 ANFIS model, which includes the antecedent values of SPI variable, for Eskis ehir and C
ankr
stations, it was determined, that M14 ANFIS model had
the best performance for Nevs ehir station. Comparing the
performances of ANFIS models for SPI-1, SPI-3, SP-6,
SPI-9 and SPI-12 at 10 stations during the testing period,
it was seen that the performances of the models for SPI12 at all stations are better than those of other models.
The reason for the ANFIS models to show a better performance is that the SPI values calculated for long
periods contain longer periods of dry and wet periods.
This means that for shorter periods the SPI values may
contain 1-month dry and 1-month wet period and this
causes instability. Passages between positive and negative
values occur more frequently and this also results with
instability. For this reason, the ANFIS estimation models
constructed with the SPI values calculated for shorter
periods, cannot catch dry and wet periods and give
unsuccessful results. Besides, the SPI outputs for
12 months have a more stable run. Thus, the ANFIS
models developed by using SPI outputs for 12 months can
catch dry and wet periods and give better results. In order
to evaluate the results of ANFIS models, the best fit
models for Ankara station (SPI-1, SPI-3, SP-6, SPI-9 and
SPI-12) have also been trained and tested by FFNN
method. The FFNN models have been trained and tested
using the same data sets. Comparing the results of ANFIS
and FFNN forecasting models for Ankara station, it can
be seen that the RMSE values of the ANFIS models were
lower than that of FFNN model. In addition, the values of
E and CORR of the ANFIS model were also higher than
those of FFNN models. To get more reliable evaluation of
performance of ANFIS model, the best fit models for
Ankara station were compared to MLR model. It can be
seen that the NRMSE value of ANFIS models were lower
than those of MLR models. The values of E and CORR
of ANFIS models were also higher than those of MLR
models.
The results suggest that the ANFIS method is superior to
the FFNN and MLR methods in the forecasting of drought.
Moreover, the result showed that ANFIS method can be
successfully applied to establish accurate and reliable
drought forecasting models.
Acknowledgments The authors are grateful for editors and anonymous reviewers for their helpful and constructive comments on an
earlier draft of this paper.

1153

References
Agarwal A, Mishra SK, Ram S, Singh JK (2006) Simulation of runoff
and sediment yield using artificial neural networks. Biosyst Eng
94(4):597613
zger M, Cakmakc M (2005) Water consumption
Altunkaynak A, O
prediction of Istanbul City by using fuzzy logic approach. Water
Resour Manage 19:641654
ASCE Task Committee (2000) Artificial neural networks in hydrology. II. Hydrologic applications. J Hydrol Eng ASCE 5(2):124
137
Bonaccorso B, Bordi I, Cancielliere A, Rossi G, Sutera A (2003)
Spatial variability of drought: an analysis of the SPI in Sicily.
Water Resour Manage 17:273296
Byun HR, Wilhite DA (1999) Objective quantification of drought
severity and duration. J Clim 12:27472756
Cancelliere A, Di Mauro G, Bonaccorso B, Rossi G (2007) Drought
forecasting using the standardized precipitation index. Water
Resour Manage 21:801819
Chang FJ, Chang YT (2006) Adaptive neuro-fuzzy inference system
for prediction of water level in reservoir. Adv Water Resour
29:110
Chou FNF, Chen BPT (2007) Development of drought early warning
index: using neuro-fuzzy computing technique. In: 8th international symposium on advanced intelligence systems 2007,
Korea. Paper No:A1469
Dracup JA, Lee KS, Paulson EG (1980) On the statistical characteristics of drought events. Water Resour Res 16:289296
Edwards DC, McKee TB (1997) Characteristics of 20th century
droughts in the United States at multiple time scales. Climatology Report, 972, Department of Atmospheric Sciences,
Colorado State University, Fort Collins, CO, pp 155
Firat M (2007) Watershed modeling by adaptive Neuro-fuzzy
inference system approach, Doctor of Philosophy Thesis,
Pamukkale University, Turkey (in Turkish)
Firat M, Gungor M (2007) River flow estimation using adaptive neurofuzzy inference system. Math Comput Simul 75(34):8796
Firat M, Gungor M (2008) Hydrological time-series modeling using
adaptive neuro-fuzzy inference system. Hydrol Process
22(13):21222132
Guttmann NB (1998) Comparing the Palmer drought index and the
standardized precipitation index. J Am Water Resour Assoc
34:113121
Hayes M (1997) Drought indices, p 11. Available at: www.drought.
unl.edu
Hayes MJ (2000) Revisiting the SPI: Clarifying the Process. Drought
Network News, A Newsletter of the International Drought
Information Center and the National Drought Mitigation Center
12/1 (Winter 1999Spring 2000), 1315
Hayes MJ, Svoboda MD, Wilhite DA, Vanyarkho OV (1999)
Monitoring the 1996 drought using the standardized precipitation
index. Bull Am Meteorol Soc 80:429438
Hughes BL, Saunders MA (2002) A drought climatology for Europe.
Int J Climatol 22:15711592
Jain A, Kumar AM (2007) Hybrid neural network models for
hydrologic time series forecasting. Appl Soft Comput 7:585592
Jang JSR, Sun CT, Mizutani E (1997) Neuro-fuzzy and soft
computing. Prentice Hall, Englewood Cliffs, NJ, USA, 607pp.
ISBN 0-13-261066-3
Jeong D, Kim YO (2005) Rainfall-runoff models using artificial
neural networks for ensemble stream flow prediction. Hydrol
Process 19:38193835
Keyantash J, Dracup J (2002) The quantification of drought: an
evaluation of drought indices. Bull Am Meteorol Soc 83:167
118

123

1154
Kumar ARS, Sudheer KP, Jain SK, Agarwal PK (2005) Rainfallrunoff modelling using artificial neural networks: comparison of
network types. Hydrol Process 19:12771291
Liong SY, Lim WH, Kojiri T, Hori T (2000) Advance flood
forecasting for flood stricken Bangladesh with a fuzzy reasoning
method. Hydrol Process 14:431448
Mahabir C, Hicks FE, Fayek AR (2000) Application of fuzzy logic to
the seasonal runoff. Hydrol Process 17:37493762
Mantoglou A (2003) Estimation of heterogeneous aquifer parameters
from piezometric data using ridge functions and neural networks.
Stoch Environ Res Risk Assess 17:339352
McKee TB, Doesken NJ, Kleist J (1993) The relationship of drought
frequency and duration to time steps. Preprints, 8th Conference
on Applied Climatology, January 1722 Anaheim, California,
pp 179184
McKee TB, Doesken NJ, Kleist J (1995) Drought monitoring of
climate. Geogr Rev 38:5594
Mishra AK, Desai VR (2005) Drought forecasting using stochastic
models. Stoch Environ Res Risk Assess 19:326339
Mishra AK, Desai VR (2006) Drought forecasting using feed-forward
recursive neural network. Ecol Model 198:127138
Mishra AK, Singh VP, Desai VR (2008) Drought characterization: a
probabilistic approach. Stoch Environ Res Risk Assess (in
press). doi:10.1007/s00477-007-0194-2
Moreira EE, Paulo AA, Pereira LS, Mexia JT (2006) Analysis of SPI
drought class transitions using loglinear models. J Hydrol
331:349359
Morid S, Smakhtin V, Bagherzadeh K (2007) Drought forecasting
using artificial neural networks and time series of drought
indices. Int J Climatol 27:21032111
Nagy HM, Watanabe K, Hirano M (2002) Prediction of sediment load
concentration in rivers using artificial neural network model.
J Hydr Eng 128:588595
Nayak PC, Sudheer KP, Rangan DM, Ramasastri KS (2004) A Neuro
Fuzzy computing technique for modeling hydrological time
series. J Hydrol 291:5266

123

Stoch Environ Res Risk Assess (2009) 23:11431154


Nayak PC, Sudheer KP, Ramasastri KS (2005) Fuzzy computing
based rainfall-runoff model for real time flood forecasting.
Hydrol Process 19:955968
Nicholson SE, Davenport ML, Malo AR (1990) A comparison of the
vegetation response to rainfall in the Sahel and east Africa, using
normalized difference vegetation index from NOAA-AVHRR.
Clim Change 17(23):209241
Ntale HK, Gan T (2003) Drought indices and their application to East
Africa. Int J Climatol 23:13351357
Palmer WC (1965) Meteorological drought. US Weather Bureau
Research Paper 45. Washington, DC
Palmer WC (1995) Meteorological Drought. Research Paper, US
Pickup G (1998) Desertification and climate change the Australian
perspective. Clim Res 11:5163
Rajurkar MP, Kothyari UC, Chaube UC (2004) Modeling of the
daily rainfall-runoff relationship with artificial neural network.
J Hydrol 285:96113
Srdas S, S en Z (2003) Spatio-temporal drought analysis in the
Trakya region, Turkey. Hydrol Sci 48(5):809820
S en Z (2001) Fuzzy logic and foundation. BKS Publisher, 172pp.
ISBN: 9758509233 (in Turkish)
S en Z, Altunkaynak A (2006) A comparative fuzzy logic approach to
runoff coefficient and runoff estimation. Hydrol Process
20:19932009
Wilhite DA (1997) A methodology for drought preparedness. Nat
Hazards 13:229252
Wu JD, Li N, Yang HJ, Li CH (2008) Risk evaluation of heavy snow
disasters using BP artificial neural network: the case of Xilingol
in Inner Mongolia. Stoch Environ Res Risk Assess (in press).
doi:10.1007/s00477-007-0181-7
Vicente-Serrano SM, Lopez-Moreno JI (2005) Hydrological response
to different time scales of climatological drought: an evaluation
of the Standardized Precipitation Index in amountainous
Mediterranean basin. Hydrol Earth Syst Sci 9:523533
Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338353

You might also like