You are on page 1of 2

Big Data Analytics (15CS82)

BIG DATA ANALYTICS

Module 4 Question Bank

1. Define Decision tree problem. Explain how to construct decision tree.


2. Write pseudo code for decision tree algorithm. Give the difference between popular
decision tree algorithms.
3. Define Regression. Give the key steps of regression. Explain the different types of
relationship among different variables.
4. Write a note on non-linear regression.
5. Construct the decision tree for the given data set.
Outlook Temp Humidity Windy Play
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No

Predict the play decision given the atmospheric conditions.


Outlook Temp Humidity Windy Play
Sunny Hot Normal True ??

6. Explain the advantages and disadvantages of regression models.


7. Briefly explain design principles of ANN.
8. Give the advantages and disadvantages of using ANN.
9. Describe the different steps involved in building a ANN and ANN Training dataset.
10. Define Cluster Analysis. Also explain application of Cluster Analysis.
11. Write a note on cluster analysis technique. Give the pseudo code for clustering.
12. Write the advantage and disadvantage of K-Means Algorithm.
13. Explain the k-means algorithm for clustering with its pseudo code.

DEPT OF CSE/BNMIT 1
Big Data Analytics (15CS82)

14. Define Associate rule mining. Explain the apriori algorithm.


15. Write a note on Pruning.

DEPT OF CSE/BNMIT 2

You might also like