This document discusses various concepts related to data warehousing and data mining including: data marts, factors influencing data warehouse satisfaction, data mining techniques like association rules and decision trees, metadata, data visualization, spurious relationships, OLAP analysis approaches, data classification, dimensional modeling, and data warehousing applications. It provides definitions and comparisons of key terms and explores technical and implementation aspects of data warehousing projects.
This document discusses various concepts related to data warehousing and data mining including: data marts, factors influencing data warehouse satisfaction, data mining techniques like association rules and decision trees, metadata, data visualization, spurious relationships, OLAP analysis approaches, data classification, dimensional modeling, and data warehousing applications. It provides definitions and comparisons of key terms and explores technical and implementation aspects of data warehousing projects.
Copyright:
Attribution Non-Commercial (BY-NC)
Available Formats
Download as DOC, PDF, TXT or read online from Scribd
This document discusses various concepts related to data warehousing and data mining including: data marts, factors influencing data warehouse satisfaction, data mining techniques like association rules and decision trees, metadata, data visualization, spurious relationships, OLAP analysis approaches, data classification, dimensional modeling, and data warehousing applications. It provides definitions and comparisons of key terms and explores technical and implementation aspects of data warehousing projects.
Copyright:
Attribution Non-Commercial (BY-NC)
Available Formats
Download as DOC, PDF, TXT or read online from Scribd
2. What are some of the factors that influence an organization’s satisfaction with its data warehouse? 3. What is data mining and why is it important? 4. In data mining what is false positive? Why should we concerned about these? 5. As a general rule , what level of correlation is considered “strong”? Would you be comfortable making a decision based on data that was correlated at r = 0.5?Why? 6. Why would you be interested in a weak correlation? 7. What is data visualization? Why is it important? 8. What is spurious relations in data mining? How can it be identified? What are the dangers of not identifying it as spurious? 9. What is meta data and what are its components? 10. Understand the current limitations and challenges to data mining. 11. Compare the two approaches of conducting OLAP analysis and how do the two approaches differ? 12. Explain how is data classified? 13. Explain the design of a project plan for the construction of DW. 14. Determine the project sources factors associated with DW implementations 15. Examine the concepts associated with the economic justification of the product. 16. What is text mining? How is it different from typical data mining? 17. What are the some Web mining applications? 18. Identify and discuss the five areas associated with the data staging process 19. What are the two most important components of a fully specified enterprise model? 20. Describe the process of defining the project scope for a DW. 21. What are the four steps in developing a dimensional schema ? 22. List and describe several data mining technologies. 23. What is data visualization? 24. What major technical elements should be found in a meta data catalog? 25. What are the factors associated with DW project success? 26. What is the future of executive decision-making? 27. What can a data warehouse do? 28. What do you understand by data correlation? 29. Explain the different layers in a data warehouse structure? 30. What does the concept of specific structure refer to? 31. What can a data warehouse do? 32. List five common data warehousing applications. 33. List some of the costs and benefits associated with implementing a data warehouse. 34. Where are the differences in the steps involved with generating a report from a data warehouse and with generating a report from a legacy system? 35. In data warehousing terms, what kind of a user would be described as a “farmer”? What kind of user would be described as an “explorer”? 36. When cost justifying a data warehouse, which type of user should be used as the basis of the calculation? 37. What are the main categories of data mining? 38. What is metadata? 39. What does the term “granularity of the data” refer to? 40. What are some of the advantages and disadvantages associated with moving to a finer granularity in the data? 41. In data mining, what is a false positive? Why should we be concerned about these? 42. What are some of the considerations that a data miner must have when interpreting data? Why are these so critical? 43. As a general rule, what level of correlation is considered “strong”? Would you be comfortable making a decision based on data that was correlated at r =0.5? Why? 44. Why would you be interested in a weak correlation? 45. What is a spurious relationship in data mining? How can it be identified? What are the dangers of not identifying it as spurious? 46. 1. Describe the concert of an enterprise model and its importance to a successful DW project. 47. What are the two most important components fully specified enterprise model? 48. Describe the two basic approaches to designing an enterprise model. What are there relative advantages and disadvantages? 49. Compare the contrast horizontal, vertical, and enterprise integration. 50. What are the five-orgenisartio0nal readiness factors for DW? 51. What methods are available to address DW readiness shortfall? 52. What is scope creep? 53. What are the primary elements of economic feasibility analysis? 54. Why are intangible are so important in constructing the business justification for DW? 55. Describe the component in a fact table. 56. What are the four steps in developing a dimensional schema? 57. What are the key component areas of DW architecture? 58. Identify and discuss the five areas associated with the data staging catalog 59. What is the need of data mining? Explain the various techniques such as regression, dependency modeling. 60. Explain clustering by using sales data as example. 61. Compare different data mining techniques. 62. Discuss indirect access of data warehouse in retail personalization system. 63. Building a data warehouse inside the Enterprise resource planning (ERP) environment. 64. Explain the criterion to feed from ERP to non ERP system. 65. Explain MOLAP. 66. Describe how ROLAP works. 67. Compare the two approaches of conducting OLAP analysis. How do the two approaches differ? 68. Explain how data are classified. 69. How does the association technique apply to data mining? 70. List and brief describe several data mining technologies? 71. List the KDD process and briefly describe the steps of the process. 72. Describe the three measures of association used in market basket analysis. 73. What are the current limitations and challenges to date mining. 74. What is data visualization? 75. What is text mining? How does it differ from data mining? 76. What are the some Web mining applications?
1. What is data warehouse?
2. Difference between operational database system and data warehouse 3. Explain multidimensional data model? 4. Explain data warehouse architecture? 5. Explain data warehouse implementation? 6. Explain OLAP? 7. Explain metadata repository? 8. Explain data mining primitives? 9. Explain data mining query language? 10. Explain visualization of discovered patterns? 11. Explain association rule mining? 12. Explain multilevel association rules? 13. Explain issues regarding classification and prediction? 14. Explain decision tree induction? 15. Explain Bayesian classification? 16. Explain types of data in cluster analysis? 17. Explain partitioning methods? 18. Explain partition-based methods? 19. Explain hierarchical based methods? 20. Explain density-based methods? 21. What is data cleaning? 22. Explain neural networks approach? 23. Explain statistical approach? 24. Explain data mining application? 25. Explain data mining system products and research prototypes? 26. Explain data mining and intelligent? 27. Explain classification by back propagation? 28. Explain classification based on the concepts from association rule? 29. What Data integration and transformation? 30. Explain data transformation? 31. Explain data reduction?