Professional Documents
Culture Documents
net/publication/321947685
CITATIONS READS
0 47
6 authors, including:
Thao Nguyen-Trang
Ton Duc Thang University
32 PUBLICATIONS 70 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Tai Vo-Van on 19 March 2018.
1 Introduction
Forecasting is the process of making prediction for the future based on summing
experiences, assembling knowledge and analyzing related problems. It is consid-
ered as the basis process, the first step for organizations as well as governments to
build their policies and objectives. Because of its important role in many fields,
forecasting has received much attention from scientists. Despite several discus-
sions in the literature, the problems of forecasting have not yet been completely
solved. Based on the historical data, looking for principles and rules to establish
a suitable forecasting model is the major method of statistics. Time series and
regression models have important roles in forecasting using statistical methods,
but they have many disadvantages in practice. A regression model (Galton (1888);
Pearson (1896)) requires a number of assumptions that are unsatisfactory, whereas
a time series model, like ARIMA (Box and Jenkins (1976)), performs poorly when
there are abnormal changes or the time series is nonstationary. To overcome the
distadvantages of these two models, various models have recommended by many
researches, such as (Zecchin et al. (2011); Wang and Fu (2006); Wang et al. (2001);
Ren et al. (2016); Gupta and Wang (2010); Zhu and Wang (2010); Park (2010);
c Springer International Publishing AG 2018
L. H. Anh et al. (eds.), Econometrics for Financial Applications, Studies in Computational
Intelligence 760, https://doi.org/10.1007/978-3-319-73150-6_38
An Improved Fuzzy Time Series Forecasting Model 475
Teo et al. (2001); Ghazali et al. (2009)). These proposals are the important con-
tributions for forecasting problem because they have given good results in consid-
ered data sets. However, we could not obtain optimum for all cases. Some other
models like Artificial Neuron Network, Supported Vector Regression (Cortes and
Vapnik (1995)), Multivariable Adaptive Regression Spline (Friedman (1991)),
Adaptive Spline Threshold Autoregressive (Lewis and Stevens (1991)), Autore-
gressive conditional heteroscedasticity (Engle (1982)) or hybrid models (Zhang
(2003)); (de Oliveira and Ludermir (2014)) were also proposed; however, most of
them still have many disadvantages in real forecasting applications.
Based on the fuzzy theory of Zadeh (Zadeh (1965)), fuzzy time series (FTS)
introduced by Song (Song and Chissom (1993)) can solve the gap mentioned
above. FTS has been then interested to research and have been shown to be
more efficient than traditional statistical techniques (Song and Chissom (1993));
(Tseng and Tzeng (2002)). Among them, Abbasov and Manedova (AM) pro-
posed the model where the variations of data are represented by language level to
forecast the population of Azerbaijan (Abbasov and Mamedova (2003)). Because
of its better performance for some kinds of forecasting problems, AM model has
been applied in many applications; for instance, Sasu utilized the AM model to
forecast the population of Romanian (Sasu (2010)). Some other important stud-
ies of FTS can be listed as the models in (Chen (2004); Huarng (2001); Singh
(2008)). Nonetheless, all of the above methods use only historical fuzziness data
without forecasting. Moreover, the parameters in the models are not properly
investigated to find the optimal values for each data set. One model is only rated
as better than the others in some specific cases. As a result, there is no model
that is considered optimal in all situations.
To overcome the gap mentioned above, this article proposes the methods to
identify the suitable parameters in the AM model. Specifically, w, the number of
elements in the data set used as prior information to forecast the data is chosen
through the value of partial autocorrelation function (PACF), the number of
fuzzy sets n is selected through an index that can evaluating the compactness of
the divided intervals. After determining suitable w and n, the optimal choice of C
is searched via an efficient algorithm so that the forecasting error is the smallest.
The numerical examples illustrate the proposed theories in detail and prove that
this method can improve the performance in term of forecasting accuracy.
The remainder of this paper is organized as follows. Section 2 reviews the AM
model and some the related definitions. Section 3 proposes the modifications for
AM model, in which the suitable parameters are determined by new methods and
algorithm. The numerical examples are presented in Sects. 4 and 5 is the conclusion.
Vt = Xt − Xt−1 (4)
or ⎡ ⎤
R11 R12 ... R1j
⎢ R R22 ... R2j ⎥
R(t) = Ow (t) ⊗ K(t) = ⎢ 21
⎣ ...
⎥,
... ... ... ⎦
Ri1 Ri2 ... Rij
where Ow (t) is the operation matrix, K(t) is the criteria matrix, ⊗ is the min
operator (∩).
Define F (t), the fuzzy forecasting of variations for the year t, in a fuzzy form
as follows.
F (t) = [max(R11 , . . . , Ri1 ), . . . , max(R1j , . . . , Rij )]
= [μA1 (Vt ), μA2 (Vt ), . . . , μAm (Vt )].
– Step 6: Defuzzify the obtained results of the 5-th step according to the
Formula 7.
m
μAi (Vt ) × uim
i=1
V (t) =
m , (7)
μAi (Vt )
i=1
where μAi (Vt ) is the value of membership function of the forecast variation
in interval i, V (t) is the defuzzified forecast variation.
In orders to estimate the forecast value X(t) for year t, the following formula
is utilized:
result is better if we utilize a less complex model, with smaller w (Song and
Chissom (1993)). However, this conclusion is only drawn from a few specific sur-
veys, so lose generality. In fact, w is number of previous times that have a strong
influence on current value of time series (it is similar to the partial autocorre-
lation p in autoregressive integrated moving average, ARIMA). Therefore, it is
not reasonable if we utilize a model, with a fixed value of w, for all type of time
series. For instance, when dealing with a monthly or quarterly data, w = 7 is
consider as an unreasonable parameter. According to above remarks, it is certain
that the forecasting performance of the AM model can be significantly improved
if its parameters are determined in reasonable ways. To overcome the limita-
tions mentioned above, this section proposes a method called MAM (Modified
Abbasov-Mamedova model), which can identify the parameters n, w and C in a
reasonable way. Details of the proposed method are presented as follows.
In the fuzzification step, the middle point ui0 is used as the representative element
of ith interval. Therefore, if the data in each interval are well-represented by
ui0 , the forecasting performance can be improved. In general, we can evaluate
whether data are well-represented by ui0 or not according to the compact measure
between this middle point and elements in the interval i. Figure 2 illustrates a
few cases of representative elements. It can be seen that the universal set U
is defined as the interval (0, 3) and divided into three equal-length intervals.
The distance between middle points (red points) and elements belonging to the
intervals (0, 1) and (1, 2) are really large; therefore, using u10 and u20 as the
representative elements can lead to a low forecasting performance. Conversely,
u30 is close to the elements in 3rd interval, and it can lead to a good measure of
compactness as well as a high forecasting performance. Therefore, it is important
to point out the number of intervals n so that the measure of compactness is
optimized. According to mentioned idea, this paper proposes a measure denoted
as MMSE to evaluate the compactness of algorithm. MMSE is computed by (9):
n 2
1 vt − ui0
M M SE = , (9)
n i=1 v ∈u ni
t i
3.2 Determine w
p
rp+1 − φp,j rp+1−j
j=1
φp+1,p+1 = , (11)
p
1− φp,j rj
j=1
Note that, when PACF presents the largest value at lag 1, w is considered as
2 so that the AM model conditions are fitted. In practice, based on statistical
programs including R, Matlab, etc., it is possible to calculate the PACF and
determine the reasonable w for fuzzy time series model.
b(t) − a(t)
b(t) = a(t−1) + n(t−1) + 1 ΔC (t−1) , ΔC (t) =
k
(t)
If a=0 and b = 1, then Ci = a(t) + iΔC (t) , i = 1, 2, . . . , k − 1.
(t)
If a=0 and b = 1, then Ci = a(t) + iΔC (t) , i = 1, 2, . . . , k.
(t)
If a = 0 and b = 1, then Ci = a(t) + iΔC (t) , i = 1, 2, . . . , k − 1.
(t)
If a = 0 and b = 1, then Ci = a(t) + iΔC (t) , i = 1, 2, . . . , k.
(t)
Step 4. Run the Abbasov-Mamedova model with all the values Ci in Step 3.
(t)
Find Cn at which the criterion CEF is the current best.
(m)
Step 5. With the new n, repeat the Step 3 and Step 4 to find C = Cn =
a(m) + nΔC (m) until b(m) − a(m) < ε.
Note that,
4 Numerical Examples
Section 3 proposes the methods of determining w, n and C in order to improve
the forecasting performance of AM model. In Sect. 4, this paper presents two
examples to illustrate and test the forecasting performance of proposed method.
Specifically, Example Sect. 4.1 presents in detail the proposed method when deal-
ing with the well-known data, Azerbaijan’s population. This example, in addi-
tion to clarifying the proposed algorithm, can test its forecasting performance.
In Example Sect. 4.2, the new method is applied to forecast the GDP per capita
in Vietnam. In each example, we compare the forecasting results of proposed
method with those of the AM model (Abbasov and Mamedova (2003)), the
Chen model (Chen (1996)) and the Huarng model (Huarng (2001)). Further-
more, to present that the proposed method is more efficient in predicting time
series than the traditional statistical methods, MAM is also compared with the
auto-regressive model AR(p) where p is choose based on AIC criterion.
An Improved Fuzzy Time Series Forecasting Model 483
4.1 Example 1
Series variation
25
0.0 0.2 0.4 0.6
20
Partial ACF
15
10
−0.4
2 4 6 8 10 12
0
2 3 4 5 6 7 8 9 10 11 12
Lag
Step 5. Apply the min − max operator to forecast the population in 1990, we
have the results:
O2 (1990) = [0.01 0.01 0.01 0.02 0.04 0.09 0.36 0.76 0.14]
K(1900) = [0.01 0.02 0.04 0.09 0.40 0.70 0.13 0.05 0.02]
R(1990) = [0.01 0.01 0.01 0.02 0.04 0.09 0.13 0.05 0.02]
Hence, the fuzzy forecasting of the variation for the year 1990, F (1990), is
Perform in a similar way for the remainder, the forecasting results are
presented in Table 2 and Fig. 6. As shown in Table 3, in addition to the pro-
posed method, the performance of models presented in (Abbasov and Mamedova
(2003)); Chen (1996); Huarng (2001)) are examined for comparison purpose.
It can be observed that MAM model outperforms others in term of accuracy
for all cases of criterion. The result verifies that the proposed method is suitable
at first and need to be retested in actual application as follows.
486 H. Che-Ngoc et al.
Actual Forecasted
7500
data
6500
point
4.2 Example 2
2000
Actual GDP
Abbasov−Mamedova
New model
Chen
Huarng
GDP
Year Forecast
2016 5908.166
2017 6148.920
2018 6372.591
2019 6588.328
2020 6796.071
Table 6. The results of MAM and AR models for Example 1 and Example 2
Example 1 Example 2
MAM AR MAM AR
MAE 9.989 93.908 25.669 165.612
MSE 127.751 8995.331 1117.06 30881.76
MAPE 0.136 1.329 0.744 5.323
5 Conclusion
This study proposes an improved fuzzy time series forecasting model based on
the methods to determine the suitable parameters for each data set in the AM
model. The numerical examples prove that the proposed method is more feasible
and capable of practical problems. In future, a program will be written in the
R statistical software to apply the proposed model in many different practice
problems.
An Improved Fuzzy Time Series Forecasting Model 489
References
Abbasov, A.M., Mamedova, M.H.: Application of fuzzy time series to population fore-
casting. Vienna Univ. Technol. 12, 545–552 (2003)
Box, G.E.P., Jenkins, G.M.: Time series analysis: forecasting and control. Holden-Day
Series in Time Series Analysis, Revised edn. Holden-Day, San Francisco (1976)
Chen, S.M.: Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst. 81(3),
311–319 (1996)
Chen, S.M., Hsu, C.C.: A new method to forecast enrollments using fuzzy time series.
Int. J. Appl. Sci. Eng. 2(3), 234–244 (2004)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Durbin, J.: The fitting of time-series models. Revue de l’Institut International de Statis-
tique 28(3), 233–244 (1960)
Engle, R.F.: Autoregressive conditional heteroscedasticity with estimates of the vari-
ance of United Kingdom inflation. Econometrica 50, 987–1007 (1982). Journal of
the Econometric Society
Friedman, J.H.: Multivariate adaptive regression splines. Ann. Stat. 19, 1–67 (1991)
Galton, F.: Co-relations and their measurement, chiefly from anthropometric data.
Proc. Roy. Soc. Lond. 45(273–279), 135–145 (1888)
Ghazali, R., Hussain, A.J., Al-Jumeily, D., Lisboa, P.: Time series prediction using
dynamic ridge polynomial neural networks. In: 2009 Second International Conference
on Developments in eSystems Engineering (DESE), pp. 354–363. IEEE (2009)
Gupta, S., Wang, L.P.: Stock forecasting with feedforward neural networks and gradual
data sub-sampling. Aust. J. Intell. Inf. Process. Syst. 11(4), 14–17 (2010)
Huarng, K.: Heuristic models of fuzzy time series for forecasting. Fuzzy Sets Syst.
123(3), 369–386 (2001)
Lewis, P.A.W., Stevens, J.G.: Nonlinear modeling of time series using multivariate
adaptive regression splines (MARS). J. Am. Stat. Assoc. 86(416), 864–877 (1991)
de Oliveira, J.F.L., Ludermir, T.B.: A distributed PSO-ARIMA-SVR hybrid system
for time series forecasting. In: 2014 IEEE International Conference on Systems, Man
and Cybernetics (SMC), pp. 3867–3872. IEEE (2014)
Park, D.C.: A time series data prediction scheme using bilinear recurrent neural net-
work. In: 2010 International Conference on Information Science and Applications
(ICISA), pp 1–7. IEEE (2010)
Pearson, K.: Mathematical contributions to the theory of evolution. III. Regression,
heredity, and panmixia. Philos. Trans. Roy. Soc. Lond. Ser. A Contain. Pap. Math.
Phys. Character 187, 253–318 (1896)
Ren, Y., Suganthan, P.N., Srikanth, N., Amaratunga, G.: Random vector functional
link network for short-term electricity load demand forecasting. Inf. Sci. 367, 1078–
1093 (2016)
Sasu, A.: An application of fuzzy time series to the Romanian population. Bulletin
Transilv. Univ. Brasov 3, 52 (2010)
Singh, S.R.: A computational method of forecasting based on fuzzy time series. Math.
Comput. Simul. 79(3), 539–554 (2008). https://doi.org/10.1016/j.matcom.2008.02.
026
Song, Q., Chissom, B.S.: Forecasting enrollments with fuzzy time series part I. Fuzzy
Sets Syst. 54(1), 1–9 (1993)
Teo, K., Wang, L., Lin, Z.: Wavelet packet multi-layer perceptron for chaotic time series
prediction: effects of weight initialization. In: Computational Science-ICCS 2001, pp
310–317 (2001)
490 H. Che-Ngoc et al.
Tseng, F.M., Tzeng, G.H.: A fuzzy seasonal ARIMA model for forecasting. Fuzzy
Sets Syst. 126(3), 367–376 (2002). https://doi.org/10.1016/S0165-0114(01)00047-1.
http://www.sciencedirect.com/science/article/pii/S0165011401000471
Wang, L., Fu, X.: Data Mining with Computational Intelligence. Springer Science &
Business Media, New York (2006)
Wang, L., Teo, K.K., Lin, Z.: Predicting time series with wavelet packet neural net-
works. In: Proceedings of the International Joint Conference on Neural Networks,
IJCNN 2001. vol 3, pp 1593–1597. IEEE (2001)
Zadeh, L.A.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)
Zecchin, C., Facchinetti, A., Sparacino, G., De Nicolao, G., Cobelli, C.: A new neural
network approach for short-term glucose prediction using continuous glucose moni-
toring time-series and meal information. In: 2011 Annual International Conference
of the IEEE Engineering in Medicine and Biology Society, EMBC, pp 5653–5656.
IEEE (2011)
Zhang, G.P.: Time series forecasting using a hybrid ARIMA and neural network model.
Neurocomputing 50, 159–175 (2003)
Zhu, M., Wang, L.: Intelligent trading using support vector regression and multilayer
perceptrons optimized with genetic algorithms. In: The 2010 International Joint
Conference on Neural Networks (IJCNN), pp. 1–5. IEEE (2010)