Professional Documents
Culture Documents
THESIS SUBMITTED TO
BHARATI VIDYAPEETH UNIVERSITY, PUNE
FOR AWARD OF DEGREE OF
DOCTOR OF PHILOSOPHY IN COMPUTER APPLICATION
UNDER THE FACULTY OF MANAGEMENT STUDIES
SUBMITTED BY
Miss Swati Sah
RESEARCH CENTRE
Bharati Vidyapeeth Deemed University
Institute of Management
Kolhapur, Maharashtra, India
MARCH 2016
CERTIFICATE
subject of Computer Application under the faculty of Management Studies has been
carried out by Miss Swati Sah in the Department of Computers at Bharati Vidyapeeth
period from November 2012 to February 2016 under the guidance of Dr. Ashutosh
Gaur.
Seal
i
CERTIFICATION OF GUIDE
Management Studies has been carried out in the Department of Computers at Bharati
India during the period from November 2012 to February 2016 under my direct
Associate Professor
Bharati Vidyapeeth University, Pune
Place :
Date :
ii
DECLARATION BY THE CANDIDATE
submitted by me to the Bharati Vidyapeeth University, Pune for the degree of Doctor
Studies, is original piece of work carried out by me under the supervision of Dr.
Ashutosh Gaur.
I, further declare that it has not been submitted to this or any other university
I also confirm that all the materials, which I have borrowed from other
sources and incorporated in my thesis, are duly acknowledged. If any material is not
responsibility. I am fully aware of the implications of any such act which might have
iii
Acknowledgement
First and foremost, my deepest gratitude goes to God for giving me the grace,
the strength and the wisdom to undertake and complete this research work even when
several people. I would like to express my sincere gratitude to all of them. First of all,
Bharati Vidyapeeth Deemed University, and for his valuable guidance, scholarly
inputs and consistent encouragement received throughout the research work. This
achievement was possible only because of the unconditional support provided by him.
A person with an friendly and positive disposition, my supervisor has always made
himself available to clarify my doubts despite his busy schedules and I consider it as a
great opportunity to do my doctoral programme under his guidance and to learn from
his research expertise. Thank you very much for all your help and support.
I thank to Dr. Nitin Nayak, Director, BVIM Kolhapur, for the academic
support and the facilities provided to carry out the research work at the institute. As I
enrolled in Ph.D. program during his tenure He, has been very encouraging and
I would like to thank faculty members of the Institute have been very
kind enough to extend their help at various phases of this research, whenever I
personal and academic life, and longed to see this achievement come true. I deeply
iv
miss my mother Late Mrs Saroj Sah, and my father Late Mr Madan Lal Sah who is
I am thankful to my family for their constant support and for all their
My Bhabhi Mrs Nupur Sah who has been a great support. I owe them so much for
their care. I would also like to thank my sister Mrs Shalini Sah and My brother in law
Mr Manu Sah.
I would also like to thank Sweet niece Myra for not disturbing me when I used
I further acknowledge Mr. Ganesh Dixit for his continuous help and support in
preparation of this thesis. I also like to thank to Mrs Anuja Sharma and Mr. T. P.
Sharma for their kind help and support to achieve this journey of research.
Above all, I owe it all to Almighty God for granting me the wisdom, health
and strength to undertake this research task and enabling me to its completion.
v
List of figures
vi
Figure 4.22 3D_Scatterplot_Petal 90
Figure 4.23 Matrix_Scatterplot 91
Figure 4.24 Levelplot 92
Figure 4.25 Parallel_coordinate_plot 92
Figure 4.26 Parallel_plot 93
Figure 4.27 qp_plot 93
Figure 4.28 Contour Plot 94
Figure 4.29 Self-Organizing Map 100
Figure 4.30 Correlation Analysis with pair plot 104
Figure 4.31 Correlation Analysis 104
Figure 4.32 Correlation matrix plot 105
Figure 4.33 K-Means Clustering with 3 clusters 106
Figure 4.34 a clusters and their center point 106
Figure 4.34 b K-means clustering (K=3) 107
Figure 4.35 K-means clustering with 4 clusters 107
Figure 4.36 clusters and their center point 108
Figure 4.37 Distribution of data in 3 clusters using K medoid 110
Figure 4.38 Comparison of time complexity of K-medoid and 110
K-means
Figure 4.39 Comparison of time complexity of K-medoid and 112
K-means
Figure 4.40 Comparison of space complexity of K-medoid an 112
d K-means
Figure 4.41 Dendrogram of newiris dataset 112
Figure 4.42 Clusters of newiris dataset using agglomerative 113
algorithm
Figure 4.43 Dendrogram showing four Clusters of newiris 113
dataset using agglomerative algorithm
Figure 4.44 a 3X5 vector code books of newiris data 114
Figure 4.44 b Energy evolution of SOM clustering 114
Figure 4.45 SOM Grid 116
Figure 4.46 Overview of SOM clusters 116
Figure 4.47 overview of Fuzzy c-means clustering 118
Figure 4.48 center points of Fuzzy c-means clustering 118
Figure 4.49 Time complexity of fuzzy c-means and k-means 119
on number of clusters
Figure 4.50 Time complexity of fuzzy c-means and k-means 120
on number of iterations
vii
List of Tables
Page
S. No. Title of Table
No.
Table 1.1 Evolution of internet from 1995 to 2015 3
Comparison between k-means and k-medoids
Table 2.1 32
algorithms
Difference between Descriptive & Predictive data
Table 2.2 41
mining
Table 2.3: Example applications of large-scale data
Table 2.3 clustering
45
Table 4.1 Summary of the dataset 80
Table 4.2 Total no of species in data set 84
Table 4.3 Summary of dataset 103
Table 4.4 Mean of 3 clusters by k-means method 105
Table 4.5 Distribution of species in 3 clusters of k-means method 107
Table 4.6 Mean of 4 clusters by k-means method 108
Table 4.7 Distribution of species in 3 clusters of k-means method 108
Table 4.8 Distribution of data in 3 clusters using K-medoid 109
Comparison of time complexity of K-medoid and K-
Table 4.9 110
means
Comparison of time complexity of K-medoid and K-
Table 4.10 111
means considering no. of iterations
Table 4.11 Comparison of space complexity of K-medoid 111
Comparative results of the hierarchical and K-means
Table 4.12 113
clustering
Comparison of K-means, Hierarchical Clustering and
Table 4.13 117
SOM
Table 4.14 Membership functions 118
Table 4.15 Distribution of different species among all 3 clusters 119
Comparison of time complexity of fuzzy c-means and
Table 4.16 119
k-means on no. of cluster
Comparison of time complexity of fuzzy c-means and
Table 4.17 120
k-means on no. of iterations
viii
Table of Contents Page No.
Certificate i
Declaration iii
Acknowledgement iv
List of Figures vi
1.1 Introduction 1
1.7 Motivation. 15
1.8 Objectives. 16
References 18
ix
Chapter 2 Review of Literature 20-66
2.1 Introduction 20
2.6.1 BIRCH 32
2.6.3 CLARA 32
2.6.4 DBSCAN 33
References 55
3.1 Introduction 67
References 73
4.1 Introduction 75
4.2.2 R Language 76
xi
4.5 Experiments and results 101
xii