You are on page 1of 4

What is data warehousing?

1. What is data mart?


2. What are some of the factors that influence an organization’s
satisfaction with its data warehouse?
3. What is data mining and why is it important?
4. In data mining what is false positive? Why should we concerned
about these?
5. As a general rule , what level of correlation is considered “strong”?
Would
you be comfortable making a decision based on data that was
correlated
at r = 0.5?Why?
6. Why would you be interested in a weak correlation?
7. What is data visualization? Why is it important?
8. What is spurious relations in data mining? How can it be identified?
What
are the dangers of not identifying it as spurious?
9. What is meta data and what are its components?
10. Understand the current limitations and challenges to data mining.
11. Compare the two approaches of conducting OLAP analysis and how
do
the two approaches differ?
12. Explain how is data classified?
13. Explain the design of a project plan for the construction of DW.
14. Determine the project sources factors associated with DW
implementations
15. Examine the concepts associated with the economic justification of
the
product.
16. What is text mining? How is it different from typical data mining?
17. What are the some Web mining applications?
18. Identify and discuss the five areas associated with the data staging
process
19. What are the two most important components of a fully specified
enterprise model?
20. Describe the process of defining the project scope for a DW.
21. What are the four steps in developing a dimensional schema ?
22. List and describe several data mining technologies.
23. What is data visualization?
24. What major technical elements should be found in a meta data
catalog?
25. What are the factors associated with DW project success?
26. What is the future of executive decision-making?
27. What can a data warehouse do?
28. What do you understand by data correlation?
29. Explain the different layers in a data warehouse structure?
30. What does the concept of specific structure refer to?
31. What can a data warehouse do?
32. List five common data warehousing applications.
33. List some of the costs and benefits associated with implementing a
data
warehouse.
34. Where are the differences in the steps involved with generating a
report
from a data warehouse and with generating a report from a legacy
system?
35. In data warehousing terms, what kind of a user would be described
as a
“farmer”? What kind of user would be described as an “explorer”?
36. When cost justifying a data warehouse, which type of user should
be used
as the basis of the calculation?
37. What are the main categories of data mining?
38. What is metadata?
39. What does the term “granularity of the data” refer to?
40. What are some of the advantages and disadvantages associated
with
moving to a finer granularity in the data?
41. In data mining, what is a false positive? Why should we be
concerned
about these?
42. What are some of the considerations that a data miner must have
when
interpreting data? Why are these so critical?
43. As a general rule, what level of correlation is considered “strong”?
Would
you be comfortable making a decision based on data that was
correlated
at r =0.5? Why?
44. Why would you be interested in a weak correlation?
45. What is a spurious relationship in data mining? How can it be
identified?
What are the dangers of not identifying it as spurious?
46. 1. Describe the concert of an enterprise model and its importance
to
a successful DW project.
47. What are the two most important components fully specified
enterprise
model?
48. Describe the two basic approaches to designing an enterprise
model.
What are there relative advantages and disadvantages?
49. Compare the contrast horizontal, vertical, and enterprise
integration.
50. What are the five-orgenisartio0nal readiness factors for DW?
51. What methods are available to address DW readiness shortfall?
52. What is scope creep?
53. What are the primary elements of economic feasibility analysis?
54. Why are intangible are so important in constructing the business
justification for DW?
55. Describe the component in a fact table.
56. What are the four steps in developing a dimensional schema?
57. What are the key component areas of DW architecture?
58. Identify and discuss the five areas associated with the data staging
catalog
59. What is the need of data mining? Explain the various techniques
such as
regression, dependency modeling.
60. Explain clustering by using sales data as example.
61. Compare different data mining techniques.
62. Discuss indirect access of data warehouse in retail personalization
system.
63. Building a data warehouse inside the Enterprise resource planning
(ERP)
environment.
64. Explain the criterion to feed from ERP to non ERP system.
65. Explain MOLAP.
66. Describe how ROLAP works.
67. Compare the two approaches of conducting OLAP analysis. How do
the
two approaches differ?
68. Explain how data are classified.
69. How does the association technique apply to data mining?
70. List and brief describe several data mining technologies?
71. List the KDD process and briefly describe the steps of the process.
72. Describe the three measures of association used in market basket
analysis.
73. What are the current limitations and challenges to date mining.
74. What is data visualization?
75. What is text mining? How does it differ from data mining?
76. What are the some Web mining applications?

1. What is data warehouse?


2. Difference between operational database system and data warehouse
3. Explain multidimensional data model?
4. Explain data warehouse architecture?
5. Explain data warehouse implementation?
6. Explain OLAP?
7. Explain metadata repository?
8. Explain data mining primitives?
9. Explain data mining query language?
10. Explain visualization of discovered patterns?
11. Explain association rule mining?
12. Explain multilevel association rules?
13. Explain issues regarding classification and prediction?
14. Explain decision tree induction?
15. Explain Bayesian classification?
16. Explain types of data in cluster analysis?
17. Explain partitioning methods?
18. Explain partition-based methods?
19. Explain hierarchical based methods?
20. Explain density-based methods?
21. What is data cleaning?
22. Explain neural networks approach?
23. Explain statistical approach?
24. Explain data mining application?
25. Explain data mining system products and research prototypes?
26. Explain data mining and intelligent?
27. Explain classification by back propagation?
28. Explain classification based on the concepts from association rule?
29. What Data integration and transformation?
30. Explain data transformation?
31. Explain data reduction?

You might also like