
Machine Learning CS5011

Assignment #2
Submission Date: 07 October 2016

Instructor: Dr. B. Ravindran

Avinash Sharma(CS16D401)


Q1: SVM

You have been provided with training instances for an image classification problem DS2. You
have to train an SVM to classify the test images into one of the following four categories:
coast, forest, inside-city, mountain.
Use the training data to build classification models using the following kernels: Linear,
Polynomial, Gaussian and Sigmoid.
Come up with the kernel parameters for the various models. You can use a fraction of the
supplied data to do an n-fold cross validation to find the best model parameters.
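A minimal sketch of such a grid search, assuming scikit-learn's SVC (the tables below refer to SVC) and assuming the DS2 features and labels are already loaded into arrays X_train and y_train (placeholder names, not from the assignment); the parameter ranges are illustrative and include the values reported as optimal below:

from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# X_train, y_train: DS2 features/labels, assumed already loaded.
# One parameter grid per kernel; C, gamma and degree ranges are illustrative.
param_grids = [
    {"kernel": ["linear"], "C": [1, 2, 10, 20]},
    {"kernel": ["poly"], "C": [1, 2, 10, 20], "gamma": [0.1, 0.5, 1], "degree": [2, 3, 4]},
    {"kernel": ["rbf"], "C": [1, 2, 10, 20], "gamma": [0.1, 0.5, 1]},
    {"kernel": ["sigmoid"], "C": [1, 2, 10, 20], "gamma": [0.1, 0.5, 1]},
]

# n-fold (here 5-fold) cross validation over each grid; keep the best model per kernel.
for grid in param_grids:
    search = GridSearchCV(SVC(), grid, cv=5, scoring="accuracy")
    search.fit(X_train, y_train)
    print(grid["kernel"][0], search.best_params_, search.best_score_)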

Optimal hyperparameter values for the different SVC kernels are:

Linear Kernel

C = 20

Polynomial Kernel

C = 2
γ = 1
Degree = 4

RBF (Gaussian) Kernel

C = 10
γ = 0.5

Sigmoid Kernel

C = 20
γ = 0.1

Table 1: Performance index for SVC using different kernels

Kernel           Accuracy  Precision  Recall  F-measure
Linear           0.6       0.626      0.626   0.626
Polynomial       0.65      0.642      0.642   0.642
RBF (Gaussian)   0.7       0.704      0.704   0.704
Sigmoid          0.537     0.578      0.578   0.578
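The table entries can be reproduced with scikit-learn's metric helpers; a sketch, assuming y_test and the best model's predictions y_pred are available (macro averaging over the four classes is an assumption, since the assignment does not state the averaging mode):

from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# y_test, y_pred: true and predicted labels, assumed already computed.
acc = accuracy_score(y_test, y_pred)
# average="macro" treats the four scene classes equally.
prec, rec, f1, _ = precision_recall_fscore_support(y_test, y_pred, average="macro")
print(f"Accuracy={acc:.3f} Precision={prec:.3f} Recall={rec:.3f} F={f1:.3f}")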


Q2: Neural Network

Implement the original back-propagation algorithm. Use DS2 for training your neural network.
Report per-class precision, recall and F-measure on the test data used in Question 1. Now
consider the alternate error function R, which adds an L2 weight penalty scaled by λ to the
squared error. Derive the gradient descent update rule for this definition of R. Now train
your neural network with this new error function. Report per-class precision, recall and
F-measure on the same test data. What will happen when you vary the value of λ? Vary the
value of λ from 10^-2 to 10^2 in multiples of 10, repeat the experiment, and report the
results. Can you figure out the effect of λ on the results? Look at the weights learnt using
the new error function. What do you infer from them?
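A minimal NumPy sketch of one plain back-propagation step for a single-hidden-layer network with sigmoid units and squared error; the layer sizes, learning rate, and one-hot target encoding are illustrative assumptions, not taken from the assignment:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)
n_in, n_hid, n_out, eta = 96, 32, 4, 0.1           # illustrative sizes and learning rate
W1 = rng.normal(scale=0.1, size=(n_in, n_hid))     # input -> hidden weights
W2 = rng.normal(scale=0.1, size=(n_hid, n_out))    # hidden -> output weights

def backprop_step(x, t):
    """One gradient-descent step on squared error; x is one input, t a one-hot target."""
    global W1, W2
    # Forward pass.
    h = sigmoid(x @ W1)                 # hidden activations
    y = sigmoid(h @ W2)                 # network output
    # Backward pass: delta terms for sigmoid units with squared error.
    delta_out = (y - t) * y * (1 - y)
    delta_hid = (delta_out @ W2.T) * h * (1 - h)
    # Weight updates.
    W2 -= eta * np.outer(h, delta_out)
    W1 -= eta * np.outer(x, delta_hid)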

Table 2: Performance index for NN using different λ values

λ       Accuracy  Precision  Recall  F-measure
0       0.6875    0.689      0.6875  0.6775
0.01    0.725     0.742      0.725   0.728
0.1     0.7       0.713      0.7     0.686
1       0.6875    0.718      0.6875  0.656
10      0.25      0.0625     0.25    0.1
100     0.25      0.0625     0.25    0.1

The gradient descent weight update rule for the L2-regularized R is given in Eq. 1 and 2, one
for each layer of weights (w for hidden-to-output, v for input-to-hidden; η is the learning
rate, N the number of training examples, and E the unregularized squared error):

w_km ← w_km (1 − ηλ/N) − η ∂E/∂w_km    (1)

v_km ← v_km (1 − ηλ/N) − η ∂E/∂v_km    (2)
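Plugging the penalty into the back-propagation step sketched earlier, the only change is the shrinkage factor (1 − ηλ/N) applied to each weight before the usual gradient step; a sketch reusing the names from the previous snippet (lam and N are illustrative values):

lam, N = 0.01, 1600   # illustrative lambda and training-set size

def backprop_step_l2(x, t):
    """One step with the L2-regularized error: weights shrink, then descend."""
    global W1, W2
    h = sigmoid(x @ W1)
    y = sigmoid(h @ W2)
    delta_out = (y - t) * y * (1 - y)
    delta_hid = (delta_out @ W2.T) * h * (1 - h)
    shrink = 1.0 - eta * lam / N       # the (1 - eta*lambda/N) factor from Eq. 1 and 2
    W2 = shrink * W2 - eta * np.outer(h, delta_out)
    W1 = shrink * W1 - eta * np.outer(x, delta_hid)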

On increasing λ, the accuracy, precision, recall and F-measure decrease. But at λ = 0.01 and
λ = 0.1 the performance in terms of these indices is better than without regularization. This
is due to a decrease in the variance of the fit. Further, as λ increases the learned weights
shrink, since the penalty term directly punishes large weights in the network.

Q3: Decision Tree

You need to use Weka for this question. We will use the Mushroom dataset from the UCI
machine learning repository (https://archive.ics.uci.edu/ml/datasets/Mushroom). This is a
2-class problem with 8124 instances. Use the last 1124 instances as test data and the rest as
training data.

Convert the data into ARFF format (a conversion sketch follows this task list).


Run the J48 Decision Tree algorithm from Weka. Report precision, recall and f1-measure.
What is the effect of MinNumObj on the performance? What happens when you do
reducedErrorPruning?

What are the important features in deciding whether a mushroom is edible or not?

Turn in the Decision Tree learnt by the model (the decision tree with the best performance).
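As referenced in the first task above, a minimal sketch of the ARFF conversion in Python, assuming the raw UCI file agaricus-lepiota.data (comma-separated, class label e/p in the first column, '?' marking missing values); the attribute names attr0..attr21 are placeholders:

import csv

# Read the raw UCI mushroom file: class label first, then 22 categorical attributes.
with open("agaricus-lepiota.data") as f:
    rows = [r for r in csv.reader(f) if r]

n_attrs = len(rows[0]) - 1
# Observed value set per attribute; '?' is excluded since it is ARFF's missing marker.
domains = [sorted({r[i + 1] for r in rows} - {"?"}) for i in range(n_attrs)]

def write_arff(path, subset):
    with open(path, "w") as out:
        out.write("@relation mushroom\n")
        for i, dom in enumerate(domains):
            out.write("@attribute attr%d {%s}\n" % (i, ",".join(dom)))
        out.write("@attribute class {e,p}\n@data\n")
        for r in subset:
            out.write(",".join(r[1:] + [r[0]]) + "\n")

# Last 1124 instances as test data, the remaining 7000 as training data.
write_arff("mushroom_train.arff", rows[:-1124])
write_arff("mushroom_test.arff", rows[-1124:])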

Unpruned Tree:

Accuracy = 1
Precision = 1
Recall = 1
F1-measure = 1

By default the value of MinNumObj is 2. On increasing its value, the performance of the tree
stays the same up to MinNumObj = 24; beyond that the performance indices drop slightly:

Accuracy = 0.994
Precision = 0.985
Recall = 1
F1-measure = 0.993

On using reduced error pruning, the performance of the tree is not affected by changes in the
confidence interval, but it drops at 5 folds. Important features are: odor, stalk-shape,
spore-print-color, gill-size, gill-spacing and population.
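The MinNumObj and reducedErrorPruning experiments can also be scripted rather than run through the Weka GUI; a sketch driving Weka's J48 from Python (the weka.jar classpath is a placeholder; -M sets minNumObj, and -R together with -N enables reduced error pruning with the given number of folds):

import subprocess

# Illustrative J48 run: train on the ARFF training split, evaluate on the test split.
subprocess.run([
    "java", "-cp", "weka.jar", "weka.classifiers.trees.J48",
    "-t", "mushroom_train.arff",  # training data
    "-T", "mushroom_test.arff",   # test data
    "-M", "24",                   # minNumObj
], check=True)

# Same evaluation with reduced error pruning using 5 folds.
subprocess.run([
    "java", "-cp", "weka.jar", "weka.classifiers.trees.J48",
    "-t", "mushroom_train.arff", "-T", "mushroom_test.arff",
    "-R", "-N", "5",
], check=True)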

Figure 1: Unpruned tree

Figure 2: Pruned tree at 4 folds

Figure 3: Pruned tree at 10 folds

Figure 4: Pruned tree at 70 folds
