You are on page 1of 8

Fuzzy Deep Learning for Diabetes

Detection

Tushar Deshmukh and H. S. Fadewar

Abstract The use of science for the betterment of society is the main cause for
research for years. That is the reason the framework of diabetes diagnosis is always
changing with new dimensions. The new and advance algorithms on the horizons
are tried in hope of getting better accuracy and speed. Apart from normal algorithms
researchers have tried the possible hybrid combinations. In recent times, the Con-
volution Neural Network (CNN) has outperformed most of the application areas of
traditional prediction algorithms. Here is an attempt to use the deep convolutional
neural network for diagnosis of diabetes. This work has two major contributions,
first is the application of CNN for diabetes detection and second is data fuzzification
in matrix form to suit needs of CNN. In the experiments, the comparison is made
between classical NN and CNN for diabetes detection. Results prove that fuzzifi-
cation of data significantly improves the accuracy of CNN and CNN outperforms
classical NN.

Keywords Deep learning · Convolutional neural network · Fuzzy deep learning


Classification · Diabetes detection

1 Introduction

The defects in the secretion of insulin, insulin action or both can cause diabetes,
which is a metabolism disorder characterized by hyperglycemia [1]. The severity
of disease is associated with dysfunction or damage of several organs like eyes,
kidneys, nerves, and heart and blood vessels [2]. Even though there are various
symptoms which signifies the disease, in many of the cases, the symptoms are either
go unnoticed or no significant symptoms are shown out. So it is highly required

T. Deshmukh · H. S. Fadewar (B)


School of Computational Sciences, SRTMU, Nanded, India
e-mail: fadewar_hsf@yahoo.com
T. Deshmukh
e-mail: gemini.tushar@gmail.com

© Springer Nature Singapore Pte Ltd. 2019 875


B. Iyer et al. (eds.), Computing, Communication and Signal Processing,
Advances in Intelligent Systems and Computing 810,
https://doi.org/10.1007/978-981-13-1513-8_89
876 T. Deshmukh and H. S. Fadewar

that diabetes should be diagnosed in time. Various methods and techniques, and
algorithms have been used for detecting diabetes.
Lofti Zedah has coined the term fuzzy logic for the first time. Fuzzy logic is
used to represent the uncertainties or the approximations in reasoning [3]. Wherever
one has to take decisions based on incomplete information or the information where
vagueness is involved fuzzy logic can be the solution.
Artificial neural network is such parallel system which is modeled on human
nervous system. It is a network of highly interconnected small processing units
called neurons which learn by example [4]. The most important characteristics of
ANN is their adaptability and the network can deal with incomplete, noisy data.
Neural networks are considered to be the best tool for generalization.
Even though both fuzzy logic and artificial neural network are strong in their
respective domains, there is always a need for a hybrid system, which is a combina-
tion of best of both worlds [5]. Fuzzy neuro systems are integration of fuzzy logic
and neural network. Here, the dataset is fuzzified and then passed as input to the
multilayered neural network. Then the neural network is trained on the basis of the
fuzzy inputs, it then auto adjusts the weights and then produces the desired output.
The branch of machine learning that is based on different level of representation
each of which corresponds to the different feature available. Whenever the data
like image, text, or audio contains different level of representation, deep learning
can be the best way to learn. The deep neural network simply specifies that they
should have more than one hidden layer. Here each layer’s training features are
decided by the output of the previous layer [6]. The notable difference between
normal neural network and deep neural network is that signal function. When we
are using composite multiple linear function, there combine result would be another
linear function. But when we are training deep neural network, we have to be very
specific in choosing a separate nonlinear activation function for each of the hidden
layer.
Convolutional neural network is a special kind of neural network where the com-
plexity of multilayer neural network has been minimized. Convolutional neural net-
work has got a good ability to abstract the data that could be used for the predictions
[7]. CNN got a wide range of applications in pattern recognition and image process-
ing.
The rest of the research work is organized as follows: Sect. 2 discusses the related
work in the domain; in Sect. 3, the proposed framework is documented; whereas the
Sect. 4 is about the conclusion.

2 Related Work

The related research work can be viewed as detection of diabetes using neural net-
work, the use of deep neural network for diabetic retinopathy and deep neural network
in other related area.
Fuzzy Deep Learning for Diabetes Detection 877

When the data is highly multidimensional, it becomes very difficult to store, visu-
alize or classify such data. Hinton and Salakhutdinov, [8] in their research, suggest
a multilayer encoder network which is used to transform the high dimensional data
to a lower dimensional data. It is a nonlinear generalization of principal component
analysis. These auto encoders with back propagation are very effective for nonlinear
data reduction and that can be in both the directions between data and code space.
In 2016, Kamble and Patil [9] use deep learning approach for the prediction of
diabetes. The researcher uses restricted Boltzmann machine for the classification of
data that is whether the patient is diabetic or not, and once it is tagged as diabetic,
then uses a decision tree to check whether it is Type 1 or Type 2 diabetes.
Convolutional neural network has been used for diabetes retinopathy by Pratt et al.
[10]. The network goes on learning more features as the number of layers goes on
increasing. The network could classify the data with 75% accuracy with five different
classes. The research has used Keras and Theano as machine learning tool library.
Tharani S. and C. Yamini have used CNN to train the neural network for prediction
of diabetes [11]. The first layer produces the feature map to forward it to the next
layer where the convolution calculations are done and then find the feature value.
Then input deviation is back propagated and if the feature is selected, the output
layer gives result for classification.
In an attempt of automatic detection of diabetic retinopathy, Chandore Vishakha
and Asati Shivam have used CNN [12]. The researcher has used three consecutive
convolution layer and then a maxpool layer to increase accuracy. To train the network
faster, the researchers made RELU layer after every convolutional layer. To reduce
the over fitting, horizontal and vertical reflections with 50% probability are generated
called data augmentation which increases the size of dataset. Also, dropout layers
are used for avoiding over fitting.
Daojian Zeng, Kang Liu, and team have used the convolutional deep neural net-
work to extract lexical and sentence level features for relation classification [13]. The
researchers have shown that there is significant improvement when position features
are added. The sentence level features are learned using convolutional approach.
In the paper published in 2014, Karen Simonyan and Andrew Zisserman use very
deep convolutional neural network for large image classification [14]. They have
used a small convolutional filter of size (3, 3). The convolutional layers are followed
by multiple fully connected layers in their architecture. The very deep convolutional
layer up to 19 layers proved that it benefits accuracy by large degree.
We have previously undergone a literature survey to find out, what are other
techniques that have been used for diabetes prediction [15, 16]. There are a lot of work
that has been done for such kind of prediction, but we can no find any related work
where the researchers have used fuzzy deep learning method for diabetes detection.
So in that sense, we can say it is a novel approach we are proposing.
878 T. Deshmukh and H. S. Fadewar

Fig. 1 Matrix form of input


after fuzzification

3 Proposed Framework

In this system, convolutional neural network (CNN) is used for processing the fuzzy
inputs. The unique feature of CNN is that it accepts an image like inputs and identifies
unique patterns in them. In the proposed system, after fuzzification, each sample is
represented as a matrix than in a row.
So, as shown in Fig. 1, each sample of original input is converted to one matrix.
Matrix consists of feature as column and fuzzy value as row. So, each original sample
is treated as a matrix input to CNN. This allows deeper pattern identification leading
to better accuracy compared to classical Neural Networks.

3.1 Experiment and Results

Dataset used for training the network is taken from National Institute of Diabetes
and Digestive and Kidney Diseases [17]. The dataset consists of nine parameters out
of which eight are used as input parameter and the last one is the output which is a
binary classification about diabetes or nondiabetes. There are a total of 768 samples.
In fuzzification step, each feature has defined ranges. Table 1 shows standard
range of all attributes. As per the range with 10% overlapping, the fuzzification of
data is done. In this experimentation, NN are configured as per the standard method.
The number of neurons in first hidden layer depends on number of features, number
of samples and number of outputs expected. Formula is described by Eq. 1.
n samples
Nneur ons     , where 2 ≤ α ≤ 10 (1)
α n f eatur es + n out put
Fuzzy Deep Learning for Diabetes Detection 879

Table 1 Analysis of the proposed technique


S. no Name of attribute Term applied Range Comment
1 No of times pregnant Nulli para No pregnancy Not fuzzified
Primi-para First time pregnant
Multi-para More than one
pregnancy
Multi-para Low 1, 2
Medium 3, 4, 5
High >6
2 Plasma glucose Normal Less than 140 mg/dl Oral glucose test
concentration a 2 h
in an oral glucose
tolerance test
Prediabetes or impaired glucose tolerance 140–190 mg/dl
Diabetes ≥200
After 1 h >140 (but if reading OGTT in pregnancy
is >190 then without
further test diabetes)
After 2 h <155 mg/dl
After 3 h <140 mg/dl
3 Diastolic blood Low 40–60
pressure (mm Hg)
Ideal 60–80
Pre-high 80–90
High 90–100
4 2 h serum insulin Fasting <25 mlu/L
(mu U/ml)
30 min after glucose admin 30–230
1 h after glucose admin 18–276
2 h after glucose admin 16–166
5 Body mass index Underweight <18.5
(weight in kg/(height
in m)ˆ2)
Normal/healthy 18.5–24.9
Overweight 25–29.9
Obese 30–39.9
6 Diabetes pedigree Low <40%
function
Medium 40–80%
High >80
7 Age Young 16–25
Lower middle 26–35
Middle age 36–45
Senior >46
880 T. Deshmukh and H. S. Fadewar

Fig. 2 Comparison of accuracy of NN (“α”  2), NN (“α”  5) and CNN

In this experiment, “α” has value 2 and 5. In CNN, configuration standard size of
pooling matrix (2, 2) is used. Also, the convolution matrix is of size (3, 3). Number of
epoch is kept 2000, for all. Dropout layers are used at appropriate places for avoiding
overfitting.
Results are shown in Fig. 2. Neural network with “α”  2 performs better than
“α”  5 in terms of accuracy. Smaller values of “α” enable more number of neurons
per layer, so the results are obvious. CNN performs better than both of them. Pattern
of accuracy shows in the first phase of epochs neural networks learn and go on
increasing their accuracy. In the second phase, the accuracy is stabilized and will be
steady even if the number of epochs increases.
In future, with the notion of IoT based systems, it is expected that a huge data
will be generated for the analysis [18]. Under such scenario, techniques that will
reduce the dimensionality during analysis will be most preferred for speedy analysis
and data processing. PCA and ICA can be the best candidate to process the big data
arised from medical applications [19]. Hence, in future, we will extend the present
work towards the integration of PCA and ICA based module.

4 Conclusion

In this work, novel approach to apply CNN to detect diabetes is showcased. Also,
fuzzification in matrix form is a proposed here. The entire data set is fuzzified and
Fuzzy Deep Learning for Diabetes Detection 881

thus it gets multiple values for each of the feature. In other words, each attribute got
different membership in different classes, and thus the entire data set is populated.
So data is represented in a form of matrix of fuzzy values.
Here, three different experiments have been conducted. The first two experiments
were on classical neural networks with value of “α”  2 and “α”  5 and then we
compare the results with convolutional network with fuzzified input. It is found that
using CNN with fuzzification for diabetes detection is better than normal NN.

References

1. American Diabetes Association: Diagnosis and classification of diabetes mellitus. Diabetes


Care 37(1), s81–s90 (2014)
2. Diagnosis and classification of diabetes mellitus: new criteria. Am. Fam. Physician 58(6),
1355–1362 (1998)
3. Ross, T.: Fuzzy Logic with Engineering Applications, 3 edn. Wiley Student Edition
4. Kumar, S.: Neural Networks: A Classroom Approach, 2 edn. Tata McGraw-Hill Education
(2012)
5. Rajasekaran, S., Pai, G.A.V.: Neural Networks, Fuzzy Logic, and Genetic Algorithms Synthesis
and Applications, 8 edn. PHI (2003)
6. Gulshan, V., Peng, L., Coram, M.: Development and validation of a deep learning algorithm
for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316(22), 2402–2410
(2016)
7. Pinheiro, P., Collobert, R.: Recurrent convolutional neural networks for scene labeling. Proc.
Mach. Learn. Res. 32(1), 82–90 (2014)
8. Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks.
Science 313(5786), 504–507 (2006)
9. Kamble, T.P., Patil, S.T.: Diabetes detection using deep learning approach. Int. J. Innov. Res.
Sci. Technol. 2(12), 342–349 (2016)
10. Coenen, F., Broadbent, D.M., Harding, S.P., Zheng, Y., Pratt, H.: Convolutional neural networks
for diabetic retinopathy. Procedia Comput. Sci. (2016)
11. Yamini, C., Tharani, S.: Classification using convolutional neural network for heart and dia-
betics datasets. Int. J. Adv. Res. Comput. Commun. Eng. 5(12), 417–422 (2016)
12. Asati, V.C.S.: Automatic detection of diabetic retinopathy using deep convolutional neural
network. Int. J. Adv. Res. Ideas Innov. Technol. 3, 633–641 (2017)
13. Liu, K., Lai, S., Zhou, G., Zhao, J., Zeng, D.: Relation classification via convolutional deep
neural network. In: National Laboratory of Pattern Classification (2014)
14. Karen Simonyan, Z.: Very deep convolutional neural network for large scale image recognition.
ICLR (2015)
15. Fadewar, H.S., Deshmukh, T.: Machine predicts the diagnosis a brief review of medical diag-
nosis by machine learning techniques. Indian J. Comput. Sci. Eng. 8(5), 636–638 (2017)
16. Fadewar, H., Deshmukh, T.: Data mining techniques for diagnosis of diabetes: a review. Int. J.
Emerg. Res. Manag. Technol. 6(9), 212–214 (2017)
17. Lichman, M. http://archive.ics.uci.edu/ml (2013)
18. Deshpande, P., Iyer, B.: Research directions in the Internet of Every Things (IoET). In: Interna-
tional Conference on Computing, Communication and Automation (ICCCA), pp. 1353–1357
(2017)
882 T. Deshmukh and H. S. Fadewar

19. Patil, M., Iyer, B., Arya, R.: Performance evaluation of PCA and ICA algorithm for facial
expression recognition application. In: Proceedings of Fifth International Conference on Soft
Computing for Problem Solving, pp. 965–976 (2016)
20. Chandore, V., Asati, S.: Automatic detection of diabetic retinopathy using deep convolutional
neural network. Int. J. Adv. Res. Ideas Innov. Technol. 3(4), 633–641 (2017)
21. https://deeplearning4j.org/neuralnet-overview. (2017)

You might also like