Professional Documents
Culture Documents
A data warehouse is a subject oriented, integrated, non-volatile, and time variant collection of derived data to describe an organizations activities and support of managements decisions.
Data Warehouse
A data warehouse is a repository of an organization's electronically stored data. Data warehouses are designed to facilitate reporting and analysis. This classic definition of the data warehouse focuses on data storage. However, the means to retrieve and analyze data, to extract, transform and load data, and to manage the data dictionary are also considered essential components of a data warehousing system. Many references to data warehousing use this broader context. Thus, an expanded definition for data warehousing includes business intelligence tools, tools to extract, transform, and load data into the repository, and tools to manage and retrieve metadata.
Architectre
Data Warehouse
OLTP Vs OLAP
OLTP System Online Transaction Processing (Operational System) Source of data Operational data; OLTPs are the original source of the data. To control and run fundamental business tasks
OLAP System Online Analytical Processing (Data Warehouse) Consolidation data; OLAP data comes from the various OLTP Databases To help with planning, problem solving, and decision support Multi-dimensional views of various kinds of business activities Periodic long-running batch jobs refresh the data
Purpose of data
Queries
Processing Speed
Depends on the amount of data involved; batch data refreshes and complex queries may take many hours; query speed can be improved by creating indexes
Space Requirements
Larger due to the existence of aggregation structures and history data; requires more indexes than OLTP
Database Design
Typically de-normalized with fewer tables; use of star and/or snowflake schemas