Professional Documents
Culture Documents
INTEGRATION/ETL/INTEGRATION
AN INTRODUCTION
Presented by: Gautam Sinha
BI is not a single computer system, but framework for leveraging data for tactical and
strategic use
Used for:
Extract
Transform
Load
Single
Reporting
Repository
AIMSPC
OLTP
Real-time
Dashboards
Static and
Ad-hoc Reporting
TIMS DW
RECBASS
OLTP
ATRRS
Other Possible Data Sources
RATSS
RFMSS
Graphical
Data Analysis
Components of BI
Data Integration ( Informatica, DataStage)
Data Integration
Data integration involves combining data residing in
different sources and providing users with a unified
view of these data.This process becomes significant in
a variety of situations both commercial (when two
similar companies need to merge their database) and
scientific (combining research results from different
bioinformatics repositories, for example).
Data integration appears with increasing frequency as
the volume and the need to share existing data
explodes It has become the focus of extensive
theoretical work, and numerous open problems remain
unsolved. In management circles, people frequently
refer to data integration as "Enterprise Information
Integration" (EII).
ETL Glossary
Source System
A database, application, file, or other storage facility from which the
data in a data warehouse is derived.
Mapping
The definition of the relationship and data flow between source and
target objects.
Metadata
Data that describes data and other structures, such as objects,
business rules, and processes. For example, the schema design of a
data warehouse is typically stored in a repository as metadata, which
is used to generate scripts used to build and populate the data
warehouse. A repository contains metadata.
Staging Area
A place where data is processed before entering the warehouse
ETL Glossary
Cleansing
The process of resolving inconsistencies and fixing the anomalies in
source data, typically as part of the ETL process.
Transformation
The process of manipulating data. Any manipulation beyond copying
is a transformation. Examples include cleansing, aggregating, and
integrating data from multiple sources.
Transportation
The process of moving copied or transformed data from a source to a
data warehouse.
Target System
A database, application, file, or other storage facility to which the
"transformed source data" is loaded in a data warehouse.
ETL Tools
PowerCenter - Components
PowerCenter - Components
PowerCenter - Domain
Informatica-Power Center
Repository Service
Informatica-Power Center
Integration Service
PowerCenter Repository
Source Analyzer: This is used to either import or create the source definitions.
2.
3.
Mapping Designer: This is used to create mappings that will be run by the
Informatica Server to extract, transform and load data.
4.
5.
2.
3.
Create a Mapping
4.
5.
6.
Informatica Transformations
Informatica Transformations
Informatica Transformations
Aggregator Transformation
Aggregator transformation is an Active and Connected transformation. This
transformation is useful to perform calculations such as averages and Sums
Expression Transformation
Expression transformation is a Passive and Connected transformation. This can
be used to calculate values in a single row before writing to the Target
Filter Transformation
Filter transformation is an Active and Connected transformation. This can be
used to filter rows in a mapping that do not meet the condition.
Joiner Transformation
Joiner Transformation is an Active and Connected transformation. This can be
used to join two sources coming from two different locations or from same
location
Rank Transformation
Rank transformation is an Active and Connected transformation. It is used
to select the top or bottom rank of data
Any Suggestions