You are on page 1of 2

ACHARYA NAGARJUNA UNIVERSITY

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING


PRE Ph.D PAPER II DATA WAREHOUSING AND DATA MINING
(Syllabus of Pre Ph.D Paper 2 form the academic year 2013-2014)

UNIT-I
Introduction: Fundamentals of data mining, Data Mining Functionalities, Classification of Data
Mining systems, Major issues in Data Mining. Data Warehouse and OLAP Technology for Data
Mining: Data Warehouse, Multidimensional Data Model, Data Warehouse Architecture, Data
Warehouse Implementation, Further Development of Data Cube Technology, From Data
Warehousing to Data Mining.

UNIT-II
Data Preprocessing: Need for Preprocessing the Data, Data Cleaning, Data Integration and
Transformation, Data Reduction, Discretization and Concept Hierarchy Generation.
Data Mining Task Primitives, languages and System Architecture: Data Mining Primitives,
Data Mining Query Languages, Designing Graphical User Interfaces Based on Data Mining
Query Language Architectures of Data Mining Systems.

UNIT-III
Characterization and Comparison: Data Generalization and summarization based
Characterization, Analytical Characterization: Analysis of Attribute Relevance, Mining class
comparisons: Discrimination between different classes, Mining Descriptive Statistical Measures
in Large Databases.
Mining Association Rules in Large Databases: Association rule mining, mining single-
Dimensional Boolean association rules from Transactional databases, mining multilevel
association rules from Transaction databases, Mining multi-Dimensional Association rules from
relational Databases and Data Warehouses From Associaiton Mining to Correlation Analysis,
Constraint-Based Association Mining

UNIT-IV
Classification and Prediction: Issues Regarding Classification and Prediction, Classification by
Decision Tree Induction, Bayesian Classification, Classification by Backpropagation
Classification Based on Concepts from Association Rule Mining, Other Classification Methods,
Prediction, Classifier Accuracy.
Cluster Analysis Introduction: Types of Data in Cluster Analysis, A Categorization of Major
Clustering Methods, Partitioning Methods, Density-Based Methods, Grid-Based Methods,
Model-Based Clustering Methods, Outlier Analysis.

UNIT-V
Mining Complex types of Data: Multidimensional Analysis and Descriptive Mining of
Complex Data Objects, Spatial Data Mining, Multimedia Data Mining, mining time-series and
sequence data, Text databse Mining, Mining the World Wide Web.
TEXT BOOKS:
Data Mining Concepts and Techniques - Jiawei Han & Micheline Kamber, Morgan Kaufmann
Publishers, Elsevier,2nd Edition, 2006.
REFERENCE BOOKS:
1 Data Mining introductory and advanced topics margarate h dunham, pearson education.
2. Data Mining Techniques Arun K Pujari,2nd edition, Universities Press.
3. Data Warehousing in the Real World Sam Aanhory & Dennis Murray Pearson Edn Asia.
4. Data Warehousing Fundamentals : Paulraj ponnaiah , wiley student edition.
5. Data Warehousing Fundamentals Paulraj Ponnaiah Wiley student Edition
ACHARYA NAGARJUNA UNIVERSITY

PART1/PRE Ph.D PAPER II: DATA WAREHOUSING AND DATA MINING


Time: 3 Hrs Model Question Paper Max.Marks: 100
Answer Any FIVE questions.
All questions carry equal marks. (5x20=100)

1. (a) Explain data mining as a step in the process knowledge discovery.


(b) Explain major issues of data mining.

2. a) Explain the differences between OLAP & OLTP systems


b) Explain in detail the architecture of data warehousing

3. Explain data pre-processing in detail

4. a) Explain data mining primitives


b) Explain designing graphical user interfaces based on a data mining query language

5. a) Write in detail about data generalization and summarization


b) Write about statistical measures in large databases for mining

6. a) Explain different types of association rule mining


b) Explain APRIORI algorithm with a sample transactional dataset

7. a) Compare classification and prediction


b) Explain ID3 algorithm with an example

8. a) Explain types of clusters


b) Explain different types of methods in clustering

9. Explain (a) Multimedia Mining (b) Spatial Mining

10. Explain (a) Text Mining (b) Mining World wide web