Professional Documents
Culture Documents
Code: 9A05706
B.Tech IV Year I Semester (R09) Regular & Supplementary Examinations December/January 2013/14
Max. Marks: 70
Answer any FIVE questions
All questions carry equal marks
*****
(a) Draw and explain the architecture of typical data mining system.
(b) Differentiate OLTP and OLAP.
(a) How can we specify a data mining query for characterization with DMQL?
(b) Describe the transformation of a data mining query to a relational query.
(a) Explain the basic concept of association rule mining and a road map of it.
(b) Briefly explain about constraint based association mining.
(a)
(b)
(c)
(d)
(a) What is information retrieval? What methods are there for information retrieval?
(b) What is sequential pattern mining? Explain.
(c) Discuss about mining the webs link structures to identify authoritative web pages.
*****
Code: 9A05706
B.Tech IV Year I Semester (R09) Regular & Supplementary Examinations December/January 2013/14
Max. Marks: 70
Answer any FIVE questions
All questions carry equal marks
*****
(a) Describe three challenges to data mining regarding data mining methodology and user
interaction issues.
(b) Draw and explain the three-tier architecture of a data warehouse.
(a) How can we specify a data mining query for characterization with DMQL?
(b) Describe the transformation of a data mining query to a relational query.
(a) Give the algorithm to generate a decision tree from the given training data.
(b) Explain the concept of integrating data warehousing techniques and decision tree
induction.
(c) Describe multilayer feed-forward neural network.
(a) What is cluster analysis? What are some typical applications of clustering? What are
some typical requirements of clustering in data mining?
(b) Define data matrix and dissimilarity matrix. Discuss about interval-scaled variables.
Each scientific or engineering discipline has its own subject index classification standard
that is often used for classifying documents in its discipline.
(a) Design a web document classification method that can take such a subject index to
classify a set of web documents automatically.
(b) Discuss how to use web linkage information to improve the quality of such classification.
(c) Discuss how to use web usage information to improve the quality of such classification.
*****
Code: 9A05706
B.Tech IV Year I Semester (R09) Regular & Supplementary Examinations December/January 2013/14
Max. Marks: 70
Answer any FIVE questions
All questions carry equal marks
*****
(a) Discuss the importance of establishing a standardized data mining query language.
(b) What are some of the potential benefits and challenges involved in such a task?
(c) List and explain a few of the recent proposals in this area.
(a) Why is tree pruning useful in decision tree induction? What is a drawback of using a
separate set of samples to evaluate pruning?
(b) How rough set approach and fuzzy set approaches are useful for classification?
Explain.
(a) Give an example of how specific clustering methods may be integrated, for example,
where one clustering algorithm is used as a preprocessing step for another.
(b) Write CURE algorithm and explain.
(a) Explain about multidimensional analysis and descriptive mining of complex data
objects.
(b) Describe similarity search in time series analysis.
(c) What are cases and parameters for sequential pattern mining?
*****
Code: 9A05706
B.Tech IV Year I Semester (R09) Regular & Supplementary Examinations December/January 2013/14
Max. Marks: 70
Answer any FIVE questions
All questions carry equal marks
*****
The four major types of concept hierarchies are: schema hierarchies, set-grouping
hierarchies, operation derived hierarchies and rule based hierarchies.
(a) Briefly define each type of hierarchy.
(b) For each hierarchy type, provide an example.
(a) Why naive Bayesian classification called naive? Briefly outline the major ideas of
naive Bayesian classification.
(b) Define regression. Briefly explain about linear, non-linear and multiple regressions.
(a) What is cluster analysis? What are some typical applications of clustering? What are
some typical requirements of clustering in data mining?
(b) Discuss about model based clustering methods.
*****