1. Define the decision tree problem. Explain how to construct a decision tree.
2. Write pseudocode for the decision tree algorithm. Give the differences between popular decision tree algorithms.
3. Define regression. Give the key steps of regression. Explain the different types of relationships among variables.
4. Write a note on non-linear regression.
5. Construct the decision tree for the given data set.
Outlook   Temp  Humidity  Windy  Play
Sunny     Hot   High      False  No
Sunny     Hot   High      True   No
Overcast  Hot   High      False  Yes
Rainy     Mild  High      False  Yes
Rainy     Cool  Normal    False  Yes
Rainy     Cool  Normal    True   No
Overcast  Cool  Normal    True   Yes
Sunny     Mild  High      False  No
Sunny     Cool  Normal    False  Yes
Rainy     Mild  Normal    False  Yes
Sunny     Mild  Normal    True   Yes
Overcast  Mild  High      True   Yes
Overcast  Hot   Normal    False  Yes
Rainy     Mild  High      True   No
Predict the play decision given the atmospheric conditions.
Outlook   Temp  Humidity  Windy  Play
Sunny     Hot   Normal    True   ??
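One possible way to check an answer to question 5 is sketched below. The question does not name an algorithm, so the sketch assumes an ID3-style construction (entropy and information gain); the function names entropy, info_gain, build_tree and predict are hypothetical helpers, not part of any library. It has no handling for attribute values that never appear in the training rows.

```python
from collections import Counter
from math import log2

ATTRS = ["Outlook", "Temp", "Humidity", "Windy"]
# The 14 training rows from question 5; the last column is the class label (Play).
ROWS = [line.split() for line in """
Sunny Hot High False No
Sunny Hot High True No
Overcast Hot High False Yes
Rainy Mild High False Yes
Rainy Cool Normal False Yes
Rainy Cool Normal True No
Overcast Cool Normal True Yes
Sunny Mild High False No
Sunny Cool Normal False Yes
Rainy Mild Normal False Yes
Sunny Mild Normal True Yes
Overcast Mild High True Yes
Overcast Hot Normal False Yes
Rainy Mild High True No
""".strip().splitlines()]

def entropy(rows):
    """Shannon entropy (in bits) of the class labels in rows."""
    counts = Counter(r[-1] for r in rows)
    return -sum(c / len(rows) * log2(c / len(rows)) for c in counts.values())

def info_gain(rows, col):
    """Entropy reduction obtained by splitting rows on column index col."""
    remainder = 0.0
    for value in {r[col] for r in rows}:
        subset = [r for r in rows if r[col] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(rows) - remainder

def build_tree(rows, cols):
    """Return a leaf label, or (column index, {attribute value: subtree})."""
    labels = [r[-1] for r in rows]
    if len(set(labels)) == 1 or not cols:
        return Counter(labels).most_common(1)[0][0]  # pure node or majority vote
    best = max(cols, key=lambda c: info_gain(rows, c))
    rest = [c for c in cols if c != best]
    return (best, {v: build_tree([r for r in rows if r[best] == v], rest)
                   for v in sorted({r[best] for r in rows})})

def predict(tree, sample):
    """Follow the branches matching sample until a leaf label is reached."""
    while isinstance(tree, tuple):
        col, branches = tree
        tree = branches[sample[col]]
    return tree

tree = build_tree(ROWS, list(range(len(ATTRS))))
gains = {ATTRS[c]: info_gain(ROWS, c) for c in range(len(ATTRS))}
prediction = predict(tree, "Sunny Hot Normal True".split())

for name, gain in gains.items():
    print(f"Gain({name}) = {gain:.3f}")
print("Root attribute:", ATTRS[tree[0]])
print("Predicted Play:", prediction)
```

Under these assumptions Outlook has the largest information gain and becomes the root, the Sunny branch is split on Humidity, and the query row (Sunny, Hot, Normal, True) is classified as Yes. A different splitting criterion, such as the Gini index used by CART, may give a differently shaped tree.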
6. Explain the advantages and disadvantages of regression models.
7. Briefly explain the design principles of ANN.
8. Give the advantages and disadvantages of using ANN.
9. Describe the different steps involved in building an ANN and an ANN training dataset.
10. Define cluster analysis. Also explain the applications of cluster analysis.
11. Write a note on cluster analysis techniques. Give the pseudocode for clustering.
12. Write the advantages and disadvantages of the K-Means algorithm.
13. Explain the K-Means algorithm for clustering with its pseudocode.
DEPT OF CSE/BNMIT 1 Big Data Analytics (15CS82)
14. Define association rule mining. Explain the Apriori algorithm.