Kenilworth, NJ
Prepared By:
4.Filter data, convert codes, perform table lookups, calculate derived values
6.Incremental aggregation & computation of aggregates by the ETL tool in one pass of the source
data
7.Support for Unicode & multi-byte character sets localized for Japanese and other languages
16.Native interfaces to legacy files, relational databases, ERP sources (e.g., SAP R/3 and
PeopleSoft), eBusiness applications, Web log files, IBM MQ-Series, XML sources, etc.
18.Support for an enterprise eBusiness environment, including integration at the metadata level with
BI tools, ERP applications, CRM applications, analytic applications, corporate portals, etc.
22.Support for concurrent processing of multiple source data streams, without writing procedural code
25.Ability to specify complex transformations using only built-in transformation objects. The goal is to
specify transformations without writing any procedural code
27.Metadata exchange architecture that supports automatic synchronization of central metadata with
local metadata for multiple end-user BI tools
28.Central management of distributed ETL engines and metadata using a central console and a
global metadata repository
32.Ability to schedule ETL sessions on time or the occurrence of a specified event, including support
for command-line scheduling using external scheduling programs
37.Support for the analysis of transformations that failed to be accepted by the ETL process
38.Extensive reporting of the results of an ETL session, including automatic notification of significant
failures of the ETL process
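Requirements 4 and 6 above (filtering, code conversion, table lookups, derived values, and aggregation in one pass of the source data) can be illustrated with a minimal Python sketch. It is not taken from any of the tools compared in this document; the field names, the STATUS_LOOKUP table and the sample rows are hypothetical.

```python
from collections import defaultdict

# Hypothetical lookup table: source status code -> warehouse label.
STATUS_LOOKUP = {"A": "ACTIVE", "C": "CLOSED"}


def transform(rows):
    """Yield cleaned rows: filter, convert codes, look up, derive."""
    for row in rows:
        if row["qty"] <= 0:  # filter: drop rows with no quantity
            continue
        yield {
            "region": row["region"].upper(),  # code conversion
            "status": STATUS_LOOKUP.get(row["status"], "UNKNOWN"),  # table lookup
            "amount": row["qty"] * row["unit_price"],  # derived value
        }


def aggregate_one_pass(rows):
    """Total the derived amount per region while reading the source once."""
    totals = defaultdict(float)
    for out in transform(rows):
        totals[out["region"]] += out["amount"]
    return dict(totals)


# Hypothetical source rows.
source = [
    {"region": "nj", "status": "A", "qty": 2, "unit_price": 10.0},
    {"region": "nj", "status": "X", "qty": 0, "unit_price": 99.0},
    {"region": "ny", "status": "C", "qty": 1, "unit_price": 5.0},
    {"region": "nj", "status": "C", "qty": 3, "unit_price": 1.5},
]
totals = aggregate_one_pass(source)
```

Because transform is a generator, the aggregation consumes each source row as it is produced, so the source is read only once and no intermediate table is built.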
Requirement by the Project
1.Ease of Use and Maintenance
4.Targets – Oracle
14.Version Controlling
16.Incremental Load
17.It should be able to work with Unicode multi-byte characters localized for different countries (data
is in different places)
18.Sort, Aggregate, Join Transformations (Look for Inner, Semi Outer, Outer Join)
31.Ability to merge data from more than one source in a single Mapping/Graph/JobStream
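Project requirements 16, 18 and 31 (incremental load, join types, and merging more than one source) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how any of the compared tools implements them: the "updated" column, the ISO date strings, and the sample orders and customers are hypothetical, and "left" stands in for a one-sided outer join.

```python
def incremental_rows(rows, last_loaded):
    """Return rows changed after the previous load, plus the new high-water mark.

    Assumes "updated" holds ISO dates, which sort correctly as strings.
    """
    fresh = [r for r in rows if r["updated"] > last_loaded]
    new_mark = max((r["updated"] for r in fresh), default=last_loaded)
    return fresh, new_mark


def join(left, right, key, how="inner"):
    """Hash join of two lists of dicts on key; how is 'inner', 'left' or 'outer'."""
    index = {}
    for r in right:
        index.setdefault(r[key], []).append(r)
    out, matched = [], set()
    for l in left:
        hits = index.get(l[key], [])
        if hits:
            matched.add(l[key])
            out.extend({**l, **r} for r in hits)
        elif how in ("left", "outer"):
            out.append(dict(l))  # keep the unmatched left row
    if how == "outer":
        for k, rs in index.items():
            if k not in matched:
                out.extend(dict(r) for r in rs)  # keep unmatched right rows
    return out


# Hypothetical sources merged in one job.
orders = [
    {"cust_id": 1, "amount": 50, "updated": "2000-01-05"},
    {"cust_id": 2, "amount": 70, "updated": "2000-01-09"},
    {"cust_id": 3, "amount": 20, "updated": "2000-01-12"},
]
customers = [
    {"cust_id": 1, "name": "Acme"},
    {"cust_id": 2, "name": "Birch"},
    {"cust_id": 4, "name": "Cedar"},
]

fresh, mark = incremental_rows(orders, "2000-01-06")
inner = join(fresh, customers, "cust_id", how="inner")
left = join(fresh, customers, "cust_id", how="left")
outer = join(fresh, customers, "cust_id", how="outer")
```

Storing the returned high-water mark after each run is what makes the next load incremental: only rows with a later "updated" value are picked up.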
Feature / Ascential Software DataStage XE / Informatica PowerCenter / Ab Initio GDE (Graphical Development Environment, Ab Initio Co>Operating System) / Cognos Decision Stream

Development and Maintenance Process Support

Feature: Visual Metaphor
Ascential DataStage XE: Multiple screens for handling development, scheduling and administrative tasks.
Informatica PowerCenter: Has different screens for source import, target import, scheduling and administrative tasks. The mappings become complex when a number of transformations are added, and are easy to handle as the project evolves.
Ab Initio: The GDE (Graphical Development Environment) is present. It is not very intuitive. It has many components, each represented by a square box. The newer version has a GUI for running the graphs and administrative jobs.
Cognos Decision Stream: Two environments: one for development, one for job scheduling.
Feature: Multiple Sources and Targets
Ascential DataStage XE: DataStage XE supports an unlimited number of heterogeneous data sources and targets. Multiple, unlimited targets can be of mixed origin, can have multiple destinations and can receive data through a variety of loading strategies (bulk loader, flat file I/O, direct SQL) in the same job.
Informatica PowerCenter: PowerCenter has access to multiple sources and targets. Within one mapping, however, it can read from heterogeneous sources but cannot write to heterogeneous targets, a limitation that is to be taken care of in a newer version.
Ab Initio: It can read from and write to heterogeneous sources because of its own operating system, which gives it greater flexibility.
Cognos Decision Stream: Though it says that it [...] heterogeneous sources [...] heterogeneous targets [...]
Feature: Native Mainframe Data Extraction
Ascential DataStage XE: It has a separate product, DataStage XE/390, that generates COBOL programs that run on the mainframe, providing native mainframe extraction, transformation and loading capabilities.
Informatica PowerCenter: It has a separate product called Power Connect for mainframe data. It directly accesses DB2 on the mainframe using DB2 Connect.
Ab Initio: On mainframes, Ab Initio can read, write, update and delete rows in DB2 databases and records in VSAM files directly. It can read and write any MVS dataset. Ab Initio reads mainframe IMS data. All data types are supported.
Cognos Decision Stream: Extraction from the mainframe is done through third-party software.
Feature: Built-in functions and routines
Ascential DataStage XE: A lot of built-in functions, plus a script language like Basic or SQL to create your own functions.
Informatica PowerCenter: A lot of built-in functions, plus a script language like SQL to create your own functions; about 50 built-in components for very common jobs in the data warehouse.
Ab Initio: A lot of built-in functions are available for the different jobs, including Compress, Database, Datasets, Departition, FTP, Miscellaneous, Partition, Sort, Transform, and Validate.
Cognos Decision Stream: Yes.

Feature: Advanced transformation support
Ascential DataStage XE: A script language like Basic is available, so the metadata stays within the tool and advanced transformations can be written.
Informatica PowerCenter: The script language uses only SQL and can call external procedures for advanced transformations, but the metadata of those transformations is not integrated.
Ab Initio: Ab Initio includes a full programming language (called DML) that can express if-then-else, case, cascading (prioritized) rules, looping, and much more.
Cognos Decision Stream: Can call C-like functions.
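The idea of cascading (prioritized) rules mentioned for Ab Initio can be illustrated in Python. This is not DML syntax and makes no claim about how DML evaluates rules; it only sketches the general idea that rules are tried in priority order and the first one that applies wins. The rule list, the field names and the segment labels are hypothetical.

```python
def first_match(rules, record, default=None):
    """Try (condition, result) pairs in priority order; the first match wins."""
    for condition, result in rules:
        if condition(record):
            return result(record)
    return default


# Hypothetical prioritized rules for deriving a customer segment.
SEGMENT_RULES = [
    # 1. An explicit override beats everything else.
    (lambda r: bool(r.get("override_segment")), lambda r: r["override_segment"]),
    # 2. Otherwise classify by balance, highest threshold first.
    (lambda r: r["balance"] >= 10000, lambda r: "PREMIUM"),
    (lambda r: r["balance"] > 0, lambda r: "STANDARD"),
]

premium = first_match(SEGMENT_RULES, {"balance": 25000}, "UNCLASSIFIED")
overridden = first_match(
    SEGMENT_RULES, {"balance": 25000, "override_segment": "VIP"}, "UNCLASSIFIED"
)
fallback = first_match(SEGMENT_RULES, {"balance": 0}, "UNCLASSIFIED")
```

Keeping the rules in an ordered list makes the priority explicit: reordering the list changes which rule wins, without touching the evaluation code.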
Feature: Version Control and Configuration Management
Ascential DataStage XE: DataStage XE includes a component for version control that saves the history of all the data integration development. It preserves application components such as table definitions, transformation rules, and source/target column mappings within a 2-part numbering scheme.
Informatica PowerCenter: Version control is implemented at the 'folder' level in PowerCenter. It is not great, but it is to be taken care of in the 6.0 or 7.0 release. Currently Informatica users use third-party version control applications like PVCS.
Ab Initio: The built-in version management system allows users to navigate to old versions of an object, navigate to the version of a graph used in a particular run of a job, view the state of the entire repository at a particular point in time, and find the differences between two versions of a graph, data transformation, or data type definition.
Cognos Decision Stream: Through source control.

Feature: Graphical Job Sequencer
Ascential DataStage XE: Very good tool to visually create the sequence of jobs to be run through the GUI.
Informatica PowerCenter: The newer version 6.0 has this in PowerCenter. Currently it is done through the session manager.
Ab Initio: Not available up to the last version. From what I came to know, the new version has it.
Cognos Decision Stream: Do not know.

Feature: External function support
Ascential DataStage XE: External functions are easily integrated within DataStage. More importantly, the flexibility and completeness of the tools allow the developer to stay inside the tool, using an easy-to-use scripting language like Basic and SQL.
Informatica PowerCenter: Yes, but limited to C++, Java, PL/SQL, etc.; most of the jobs and transformations can be achieved with the built-in functions and transformations. There is no way to import metadata from the external function. This is to be taken care of in a newer version.
Ab Initio: Custom components can be implemented in any language, including COBOL, C, C++, Perl, Java, Forte, PL/SQL, etc.
Cognos Decision Stream: Yes, for C-like functions.
Feature: Canvas Annotation
Ascential DataStage XE: Users can "write notes" or add text onto the canvas or screen as they create DataStage jobs. Users can now add comments, labels or other explanations to the designs.
Informatica PowerCenter: Every mapping and transformation has 2 types of annotation: a name or object description, and a comment box. The name is displayed when you point the mouse at the transformation.
Ab Initio: Component-specific or graph-specific annotations are available.

Feature: Publication of metadata directory
Ascential DataStage XE: Metadata information can be published in XML and/or HTML format, complete with hyperlinks for easy end-user access and navigation.
Informatica PowerCenter: Through Webzine, the Web-based Meta Data Reporter. It is a web-based metadata reporting tool to facilitate publication of metadata.
Ab Initio: Not very clear. But the specified user can show all or any of their properties, such as business area, subject area, steward group, database, modified by, created and last modified date.
Cognos Decision Stream: Do not know.
Feature: Metadata reuse
Ascential DataStage XE: DataStage XE uses a publish and subscribe mechanism to distribute standard metadata from a variety of sources. Other users can subscribe to metadata publications on a one-time or recurring basis. When metadata changes, subscribers are automatically notified.
Informatica PowerCenter: PowerCenter collects (loads) and distributes (unloads) metadata between tools using bridges, i.e. import from Oracle Designer, Erwin or PowerDesigner, or export to BI tools such as BO, MicroStrategy and Impromptu. This is one-directional. There is another tool called Metadata Exchange SDK, which is used for bi-directional use of metadata from Designer tools or OLAP tools and vice versa. Moreover, metadata can be exported and imported through XML files.
Ab Initio: It has a way of integrating with Designer tools and OLAP tools. It is not clear how the reuse of metadata is achieved.
Cognos Decision Stream: Yes.

Feature: Data Lineage - Integration of design and event metadata
Ascential DataStage XE: A way to know from which sources the data has been populated to the target when you have more than one source to extract in a graph.
Informatica PowerCenter: Possible through scripts.
Ab Initio: Possible through scripts.
Cognos Decision Stream: No.

Feature: Integration with Business Intelligence Tools
Ascential DataStage XE: With Business Objects, Brio, Cognos Impromptu, MicroStrategy.
Informatica PowerCenter: Through Power Bridges it has integration with Brio, Cognos, MicroStrategy, Business Objects.
Ab Initio: Not very clear. How is this done?
Cognos Decision Stream: Cognos PowerPlay, through Architect.
Performance

Feature: Parallelism
Ascential DataStage XE: It has advanced parallel processing capabilities, which enable it to use the capabilities of SMP, SMP clusters and MPP platforms effectively and efficiently. In addition, DataStage XE efficiently leverages database parallelism (e.g. DB2 EEE, Oracle Enterprise Edition with parallel tables).
Informatica PowerCenter: The PowerCenter architecture limits the product's ability to perform parallel processing. While PowerCenter can partition a data set and run separate processes, the newer version has greater capabilities for Sort, Join and Aggregate transformations. It is faster than previous versions. Pipeline bulk loading of data, memory caching.
Ab Initio: Because of its parallel architecture (MPP) and the concept of partitioning (component, pipeline and data), it has been able to handle very high volumes of data. It can do parallel and pipeline loading.
Cognos Decision Stream: Hashing techniques for Sort/Join and Aggregate.

Feature: Named Pipe Support
Ascential DataStage XE: Ability to break a large job into smaller jobs that can communicate with each other, making the whole process run faster.
Informatica PowerCenter: The Sort and Aggregate transformations have the same facility.
Ab Initio: When the graphs are running on the same server, they communicate through named pipes or shared memory; when they are on different servers, they communicate through the network. All of this is determined by the Ab Initio Co>Operating System.
Cognos Decision Stream: Do not know.

Feature: Hardware support
Ascential DataStage XE: Unix, NT and mainframe platforms.
Informatica PowerCenter: Unix and NT platforms only.
Ab Initio: Unix, NT and Windows platforms.

Feature: Extensibility
Ascential DataStage XE: Central to DataStage's architecture is the DataStage Plug-in API. This allows Ascential Software engineering, VARs and end-users alike to code their own interfaces to external databases, processes, and bulk loaders.
Informatica PowerCenter: Not aware of any such thing.
Ab Initio: With a simple mechanism of 'wrapping' user programs, the Co>Operating System delivers the ultimate in extensibility.

Feature: Integrated Bulk Loader
Ascential DataStage XE: Bundled within the software and can be invoked within the scripts.
Informatica PowerCenter: Bundled within the software and can be invoked within the scripts for Oracle, Sybase and SQL Server.
Ab Initio: Has features to load and unload bulk data through the Load and Unload components.
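The data partitioning idea behind the Parallelism entries can be sketched in Python. This is a generic illustration, not the mechanism of any product compared here: rows are hash-partitioned on a key, each partition is aggregated independently, and the partial results are merged. The function names and sample rows are hypothetical, and threads are used only to keep the sketch portable; a real engine would run the partitions as separate processes or on separate nodes.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor


def partition_by_key(rows, key, n):
    """Hash-partition rows so that equal keys always land in the same partition."""
    parts = [[] for _ in range(n)]
    for row in rows:
        # crc32 is stable across runs, unlike Python's built-in hash() for strings.
        parts[zlib.crc32(str(row[key]).encode()) % n].append(row)
    return parts


def sum_by_key(rows, key, value):
    """Serial aggregation of one partition."""
    totals = {}
    for row in rows:
        totals[row[key]] = totals.get(row[key], 0) + row[value]
    return totals


def parallel_sum(rows, key, value, n=4):
    """Aggregate each partition concurrently, then merge the partial results."""
    parts = partition_by_key(rows, key, n)
    with ThreadPoolExecutor(max_workers=n) as pool:
        partials = list(pool.map(lambda p: sum_by_key(p, key, value), parts))
    merged = {}
    for partial in partials:
        merged.update(partial)  # safe: a key occurs in exactly one partition
    return merged


# Hypothetical input: 1000 rows spread over 7 stores.
rows = [{"store": f"S{i % 7}", "sales": i} for i in range(1000)]
result = parallel_sum(rows, "store", "sales")
```

Partitioning on the aggregation key is the design choice that makes the merge trivial: no two partitions hold the same key, so partial totals never need to be added together.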
Data Quality and Cleansing

Feature: Ability to assess current data quality
Ascential DataStage XE: The development teams and business users have the ability to audit, monitor and certify the quality of data at key points throughout the data integration lifecycle.
Informatica PowerCenter: Data cleansing can be achieved by building the rules within the logic. Rows can be flagged as problems or routed to a problem file/table.
Ab Initio: This can be achieved through the Data Profiler product, which sits on top of the Ab Initio Co>Operating System. It analyzes the graph and the data and stores the results in the metadata repository, which the developer can then use to look at the quality of the data. Moreover, it has built-in components such as checksum and compare records, which can be incorporated into graphs to assess the data quality.

Feature: Integrate 3rd party data cleansing tools
Ascential DataStage XE: With Trillium and First Logic (how easy it is, I do not know, as I have not tried it out).
Informatica PowerCenter: With Trillium and First Logic (how easy it is, I do not know, as I have not tried it out).
Ab Initio: With Trillium and First Logic (how easy it is, I do not know, as I have not tried it out).
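Two of the data quality ideas in the entries above, routing problem rows to a separate file/table and comparing data sets by checksum, can be sketched in Python. This is a generic illustration and does not reproduce the components of any product named here; the rule names, field names and sample rows are hypothetical.

```python
import hashlib


def route(rows, rules):
    """Split rows into good rows and problem rows flagged with the rules they failed."""
    good, problems = [], []
    for row in rows:
        failed = [name for name, check in rules.items() if not check(row)]
        if failed:
            problems.append({**row, "failed_rules": failed})
        else:
            good.append(row)
    return good, problems


def checksum(rows, fields):
    """Order-sensitive digest of the chosen fields, for comparing source and target."""
    digest = hashlib.sha256()
    for row in rows:
        digest.update("|".join(str(row[f]) for f in fields).encode())
        digest.update(b"\n")
    return digest.hexdigest()


# Hypothetical cleansing rules.
RULES = {
    "id_present": lambda r: r.get("id") is not None,
    "amount_non_negative": lambda r: r.get("amount", 0) >= 0,
}

source = [
    {"id": 1, "amount": 10},
    {"id": None, "amount": 5},
    {"id": 3, "amount": -2},
]
good, problems = route(source, RULES)

# After loading, the target should contain exactly the good rows.
target = [dict(r) for r in good]
loaded_ok = checksum(good, ["id", "amount"]) == checksum(target, ["id", "amount"])
```

Recording which rules failed alongside each problem row keeps the problem file useful for later analysis, rather than just a list of rejects.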
Feature: Product
Ascential DataStage XE: Component-wise product; buy what you use. However, DataStage XE comes with all the features that we require, like communicating with Designer tools and OLAP tools.
Informatica PowerCenter: Component-wise product; buy what you use (it now has its own OLAP tools). You have to buy the products for transferring metadata from the Designer tool and transferring metadata to OLAP tools.
Ab Initio: One product, with add-on products for transferring metadata (but it comes with access to mainframe and other sources).

Feature: Cost
Ascential DataStage XE: Cost can be spread because of its component-based nature. Per CPU.
Informatica PowerCenter: Cost can be spread because of its component-based nature. It is based on per server, per repository (we can have many servers pointing to the repository) and also depends on the number of sources and targets. The licensing is done by number of CPUs, such as 4 CPUs or 6 CPUs.
Ab Initio: One-time cost. Per CPU and per developer.

Feature: Ease of Use and Maintenance
Ascential DataStage XE: It has improved a lot over time.
Informatica PowerCenter: Very easy to use and maintain (own experience).
Ab Initio: Users say the new GUI is good, but have no experience with it.
Also noted: had a demo of the tool over the web.

Feature: New Releases and Patches (How much cost is involved?)
Ascential DataStage XE: 18% to 22% of the purchase price every year.
Informatica PowerCenter: 20% of the purchase price every year.
Ab Initio: Do not know.

Feature: Customer Support and Services (Can we have a dedicated person for our questions to be solved?)
Ascential DataStage XE: Do not know.
Informatica PowerCenter: Do not know.
Ab Initio: Do not know.

Feature: Case Studies and Access to Site under Production Environment using the ETL tool (Can we have the client number and visit their site using the ETL tool for their warehousing?)
Ascential DataStage XE: Yes.
Informatica PowerCenter: Yes.
Ab Initio: Yes.

Feature: Availability of Manpower (Price & Consultancy)
Ratings, in source order: Good; Very Good (Low Price and High Quality).