Professional Documents
Culture Documents
of the
Data Warehouse
Data Warehouse
Evangelist Program
CHED FDP on Business Analytics
Business Administration Track
Learning Outcomes
By the end of the lesson, participants should be
able to:
Be familiar with the fundamentals of the Data
Warehouse Architecture and its importance
Be familiar with the fundamentals of Backroom
Technical Architecture and its importance
Be familiar with the fundamentals of Frontroom
Technical Architecture and its importance
Appreciate the importance of metadata
Understand the importance of security in a data
warehouse environment
Overview
Why do we need blueprints before
building a house?
Overview
Why do we need blueprints before
building a house?
Architectural blueprints for a house
help the architect and customer
Blueprints
Help the architect and the customer
Communicate about desired results
Communicate about the nature of
construction effort
Blueprints
Help the architect and the customer
Communicate about desired results
Communicate about the nature of
construction effort
Determine resources required
Determines dependencies
Determines timing
Determines cost
The Analogy
Blueprints of a house is an analogy
to the importance of the
Information System Architecture
The Analogy
Blueprints of a house is an analogy
to the importance of the
Information System Architecture
Sad reality:
The Analogy
Blueprints of a house is an analogy
to the importance of the
Information System Architecture
Sad reality:
Common for people to dive into a data
warehousing project without any clear
idea of what they are building (!)
The Analogy
Blueprints of a house is an analogy
to the importance of the
Information System Architecture
Sad reality:
Common for people to dive into a data
warehousing project without any clear
idea of what they are building (!)
Starts with RDBMS running on a
leftover server
Communication
Planning
Flexibility and Maintenance
Learning
Productivity and Reuse
Data
(What)
Backroom
(Technical:
How)
Frontroom
(Technical:
How)
Infrastruct
ure
(Where)
Business
Requiremen
ts
What
information
do we need
to make
better
business
decisions?
How will we
get at the
data,
transform it,
make it
available to
the users?
What are
the major
business
issues we
face? How
do we
measure the
issues? How
do we
analyze the
data?
What HW
and system
level
capabilities
do we need
to be
successful?
Architecture
Models and
Documents
The
dimensional
model: What
are the facts
and
Where is the
data coming
from?
Where is it
going to?
Data
(What)
Backroom
(Technical:
How)
Frontroom
(Technical:
How)
Infrastruct
ure
(Where)
Detailed
Models and
Specs
The logical
and physical
models:
individual
elements?
What
standards and
products
provide
needed
capabilities?
How will we
hook them
together?
Development
standards?
Naming?
What are
the specifics
for the
report
templates?
Who needs
them? How
often?
How do we
interact with
these
capabilities?
What are
the system
utilities?
APIs? Calls?
Implementati
on
Creating the
databases,
indexes,
backup, etc
Write the
extracts and
loads,
Automate
Process
Implement
reporting
and analysis
environment
. Train users.
Install and
test
infrastructur
e
components
Technical Architecture
Overview
Technical Architecture
Overview
Services
Data Stores
Technical Architecture
Overview
Services
Functions needed to accomplish the
required tasks of the warehouse
For example: Copying a table from one
place to another (basic data movement
service)
Data Stores
Temporary or permanent landing places
for data
Technical Architecture:
Architecture for the Backroom
Data Warehouse
Evangelist Program
CHED FDP on Business Analytics
Business Administration Track
Source Systems
The transaction systems are the
obvious sources of interesting
business information
Access to the core operational
systems of the business
Order entry
Production
Shipping
Customer Service
Accounting Systems
Presentation Servers
The target platforms where the data is stored for direct
querying by end users, reporting systems and other
applications.
The Data Warehouse Bus allows parallel development of
business process data marts with the ability to integrate
these data marts ensured by their use of conformed
dimensions.
Ideal to be loading detail and aggregate data into data
marts segmented by business process.
The conformed dimensions used in these data marts will
allow query management software to combine data
across data marts for fully integrated enterprise analysis
and reporting.
Data Marts
Atomic Data Marts
Aggregate Business Process Data
Marts
Data Marts
Atomic Data Marts
Hold data at the lowest level of detail
necessary to meet most of the high-value
business requirements.
Metadata Catalogue
An integral part of the overall
architecture
Represents the set of information
that describes the warehouse and
plays an active role in its creation,
use, and maintenance
Services Involved
Data Staging Services
Extract Services
Data Transformation Services
Data Loading Services
Data Staging Job Control Services
Extract Services
Pulling the data from the source
systems
Probably the largest single effort in
the data warehouse project,
especially if the source systems are
decades-old, mainframe-based,
mystery-house-style systems.
Data Transformation
Services
Once the data is extracted from the
source system, a range of unnatural
acts are performed on it to convert it
into something presentable to the
users and valuable to the business.
Job Definition
Job Scheduling
Monitoring
Logging
Exception/Error Handling
Notifications
Technical Architecture:
The Frontroom.
Intro to Metadata
Data Warehouse
Evangelist Program
CHED FDP on Business Analytics
Business Administration Track
The Frontroom
The front room is the public face of
the warehouse.
Its what the business users see and
work with day-to-day.
For most folks, the user interface is
the data warehouse.
Downstream Systems
As the data warehouse becomes the
authoritative data source for analysis
and reporting, other systems are
drawn to it as the data source of
choice.
The basic purpose of these systems is
still reporting, but they tend to fall
closer to the operational edge of the
spectrum.
Services Included
About Metadata
The back room metadata is process
related, and it guides the extraction,
cleaning, and loading processes.
The front room metadata is more
descriptive, and it helps query tools
and report writers function smoothly.
Process and descriptive metadata
overlap, but it is useful to think about
them separately.
About Metadata
Source System Metadata
Backroom and Frontroom Metadata
Security
Security
Data -> Information -> Knowledge ->
Implementation -> Results
Hardware Security
Software Security
Security involving human resources
Role-based access control
Guidelines, Policies
Accountability