You are on page 1of 98

IBM Metadata Workbench

delivering on metadata

Marc Haber
Course Instructor

IBM Confidential © 2009 IBM Corporation


Introduction

IBM Confidential © 2009 IBM Corporation


Information Management Software

Objectives

• Enablement
• Learn how to benefit from the Metadata Workbench,
specifically its ability to deliver Lineage and Analysis in
addition to a holistic view of the Information Assets

• Understanding
• Comprehend the metadata offerings of Metadata Workbench
and Business Glossary, and their unique value to the
Information Server platform.

• Proficiency
• Demonstrate our metadata capabilities, and perform
standard user and administrative actions in each tool.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Agenda
• Day 1
• Course Introduction
• Introduction to IBM Information Server
• Metadata Workbench Demonstration
• Getting Started
• Import of Database Tables as shared metadata
• Import of Files as shared metadata
• Import of BI Reports
• Standard Practice for ETL Development

• Metadata Workbench Administration

IBM Confidential © 2009 IBM Corporation


Information Management Software

Agenda
• Day 2
• Metadata Workbench Administration
• Automated Services
• Manual Services
• Database Alias Binding

• Metadata Workbench Usability


• Lineage and Analysis
• Queries and Linear Reporting
• Search and Display Information Assets

Continue

IBM Confidential © 2009 IBM Corporation


Information Management Software

Agenda
• Day 3
• Metadata Workbench User Excercises
• Metadata Workbench
• Logging and Error Information
• Installation Considerations
• System Requirements
• Business Glossary Introduction
• Business Glossary Administration
• Identifying, Import and Create Sources of a Glossary
• Managing and Extending a Glossary
• Business Glossary Usability
• Glossary Browser and Glossary Anywhere
• Metadata Workbench 8.1.1 Preview
• Summary and Conclusions Continue

IBM Confidential © 2009 IBM Corporation


Information Management Software

Capacity Planning for Metadata Workbench

• CPU -
• The calculation is one core for every 5 concurrent users

• RAM -
• Requires 150 MB of RAM plus 100 MB for each concurrent user.

• Metadata Workbench with an expected capacity for 2 concurrent


users would require 450MB of extra RAM and an extra one
core

IBM Confidential © 2009 IBM Corporation


Information Server

IBM Confidential © 2009 IBM Corporation


Information Management Software

Information Server
• Installation
• Common Suite Installer
• Metadata Workbench requires separate Licence Key
• Business Glossary requires separate Licence Key

• Common Repository
• All Information Assets are shared across suite components
• Edits and Additions are immediately available for Analysis and
Reporting

• Common Services
• Security and User Administration
• Logging and System Administration

IBM Confidential © 2009 IBM Corporation


Information Management Software

Information Server
• Suite Components

• DataStage and QualityStage Designer


• FastTrack
• Information Analyzer
• Informaiton Services Director
• Information Server Web Console
• Business Glossary
• Metadata Workbench

• Import Export Manager

IBM Confidential © 2009 IBM Corporation


Information Management Software

Information Server: Optimizing Application Development


Import Industry Exchange Data Structures
Data Models Services Oriented
Architecture

Rational Data Information


Architect Services Director
Link
Populate Deploy

Common Enterprise Search and Profile Map Sources to Transform and


Vocabulary Source Data Target Model Cleanse

DataStage and
Business Glossary Information Analyzer FastTrack QualityStage

Share Share Share Share

Unified metadata for active administration, management and reporting


Metadata Workbench offers complete visibility and control of metadata
IBM Confidential © 2009 IBM Corporation
What is Metadata?

IBM Confidential © 2009 IBM Corporation


Information Management Software

With a trademark
Stored in
So what exactly is Metadata? label
a can

Made by

Since

Type of
food

With a
special
ingredient
With
many
varieties

Metadata enables you to put context and meaning to things.


It is generated and consumed by every organization and software product.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Primer for Business

B • Business Metadata
• Business rules, Stewardship, Business Definitions, Auditing Terminology,
Glossaries, Algorithms and Lineage using business language. Audience:
Business users.

T • Technical Metadata
• Defines Source and Target systems, their Table and Fields structures and
attributes, Documentation for Auditing Derivations and Dependencies.
Audience: Specific Tool Users – BI, ETL, Profiling, Modeling.

O • Operational Metadata
• Information about application runs: their frequency, record counts,
component by component analysis and other statistics for auditing purposes.
Audience: Operations, Management and Business Users.

Literally, “data about data” that describes your company’s


information from both a business and a technical perspective

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Business Drivers

 Governance and Compliance Regulations are increasing

- How do organizations comply and meet documentation requirements?


- How can organizations ensure accountability and responsibility?

 Business Competition continues to grow

- How do organizations individualize their customer experience?


- How can organizations get access to information to make correct decisions?

 Costs and system complexities are expanding

- How can organizations drive optimization with integration?


- How do organizations manage complex software environments?

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Business Challenges


Metadata is naturally a very complex subject which virtually all organizations
address at some point and time. Some successfully and some unsuccessfully.
Key challenges:
• Obtaining agreement on what metadata means
• What does metadata mean to a specific organization or division?
• What metadata is important to track and manage?
• How does each group use metadata for their particular job?

• Selecting the correct metadata strategy for particular business


requirements
• How many and what kinds of silos of metadata exist today in
organization?
• Are there revenue $ at risk, compliance issues, regulatory rules
which must be addressed?
• Do we have the flexibility to assess the impact of changes with our
current architecture?

• Promoting adoption of a metadata strategy and associated


technology
• How does our approach address our different user’s needs?
• Is our approach one that easy to use and facilitates adoption
rather than hinders it?
• Do we have business and technical sponsorship?

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata helps answer important questions such as:

• What data or information exists ?


What is “Profit Amount” ?
• Where is it being used ? How is it defined ?
How is it calculated ?
• What is its business definition ? Where is it stored or used ?
Is it reliable ? Accurate ?
• What other names has it been called or If I make a change to “Profit Amount”
is being called ? – what will be impacted ?

• How is it inter-related to other


information ?

• Who is using it ?

• Why do we need it ?

• When was it last updated ?

IBM Confidential © 2009 IBM Corporation


Information Management Software

What does Information Server help achieve?


We use it to manage the
We use it to figure
impact of change
out and prove what
happened

Business Subject Matter Architects Data Developers DBAs


Users Experts Analysts

We use it to document & We use it to accelerate


link business concepts information integration
design

We use it to understand and We use it to accelerate


document information sources information integration
development

IBM Information Server and its unified metadata enables these tasks!
IBM Confidential © 2009 IBM Corporation
Information Management Software

What does Information Server help achieve?

Business Subject Matter Architects Data Developers DBAs


Users Experts Analysts

Unified Metadata Management


 Simplify Integration  Increase trust and
confidence in information
 Facilitate change  Increase compliance to
Design Operational management & reuse standards

IBM Confidential © 2009 IBM Corporation


Information Management Software

The IBM InfoSphere Vision

An Industry Unique Information Platform


• Simplify the delivery of Trusted Information

• Accelerate client value

• Promote collaboration

• Mitigate risk

• Modular but Integrated

• Scalable – Project to Enterprise

IBM Confidential © 2009 IBM Corporation


Information Management Software

The IBM Solution: InfoSphere Information Server

IBM InfoSphere Information Server

Unified Deployment

Discover, model, Standardize, merge, Combine and Synchronize, virtualize


define, and govern and correct information restructure information and move information
information structure for new uses for in-line delivery
and content

Unified Metadata Management

Delivering information you can trust


IBM Confidential © 2009 IBM Corporation
Information Management Software

What IBM InfoSphere Information Server Does with Metadata

Modeling BI ERP BPEL Engine SOA Service


• We dynamically manage
all Information Server
Miti Bridges, XML, EJB, Web Service, JMS
module metadata

Services Metadata Types of MD


• We support active, role-
Access Server Technical based metadata
Interchange Business
Integration Project collaboration
Impact Analysis Operational
Data Lineage • We support Information
Server technical,
operational and
Business Glossary
FastTrack
DataStage/QualityStage/ISD
business metadata
• We share 3rd party
Information Analyzer Metadata Workbench
Information Server Modules modeling and BI
metadata via MITI

IBM Confidential © 2009 IBM Corporation


Information Management Software

InfoSphere Information Server: Unified Metadata Management

Unified Metadata Management


Business | Technical | Operational

Active, centrally managed repository Define relationships, control Share and deliver relevant information
with secure access via services layer extensibility, and link 3rd party metadata across the organization

Includes Definitions, Terms, Abbreviations, Glossaries, Classification, Categories,


Examples, Stewards and Owners which are described using the business language

Category Customer – Someone who has purchased product within the last 12 months

Sub-Category High-Value Low-Value


minimum of $500,000 assets average less than $50,000 assets
Attributes  Avg $ value rounded to nearest $1,000  Most recent credit score
 Address  Maximum balance

IBM Confidential © 2009 IBM Corporation


Information Management Software

InfoSphere Information Server: Unified Metadata Management

Unified Metadata Management


Business | Technical | Operational

Active, centrally managed repository Define relationships, control Share and deliver relevant information
with secure access via services layer extensibility, and link 3rd party metadata across the organization

Includes Host Server, Database Type, Database Schemas, Table Name, Column
Names, Data Types which are described in technical detail

Schema Bank_Customer_Schema

Tables Master_Cust Master_Addr

Columns  Name: varchar (30)  Street: varchar (40)


 Active: varchar (1)  City: varchar (30)

IBM Confidential © 2009 IBM Corporation


Information Management Software

InfoSphere Information Server: Unified Metadata Management

Unified Metadata Management


Business | Technical | Operational

Active, centrally managed repository Define relationships, control Share and deliver relevant information
with secure access via services layer extensibility, and link 3rd party metadata across the organization

Includes Job Name, Job Execution Times, Number of Rows Processed, Error or
Success Status, Time Started, Time Completed which are described in sequence

Job Name DS_Load_Bank_Data

Start April 15, 2008 at 2 AM

Details  Processed 1 million rows


 With no errors
End April 15, 2008 at 2:15 AM

IBM Confidential © 2009 IBM Corporation


Information Management Software

InfoSphere Information Server: Unified Metadata Management

Unified Metadata Management


Business | Technical | Operational

Active, centrally managed repository Define relationships, control Share and deliver relevant information
with secure access via services layer extensibility, and link 3rd party metadata across the organization

Business Glossary DataStage Operational Metadata (Job Run Information)

Custom Attributes Physical Schemas


(RBDMS & Modeling Tools)

BI Reports

ETL Job Design

IBM Confidential © 2009 IBM Corporation


Our Mission - Lineage

IBM Confidential © 2009 IBM Corporation


Information Management Software

Understand: Using the MetaData Workbench to understand your business.

Database Schema’s and Host Servers DataStage Jobs and Operational Metadata BI Reports

IBM Confidential © 2009 IBM Corporation


Information Management Software

Preparing Metadata for Metadata Workbench Lineage

• Capturing the metadata you wish to include in such reports

• Import metadata about Database Tables and Files that are used in
Job Design and Production
• Import metadata about BI Reports used to publish information
• Publish shared metadata as necessary
• Generate and import operational metadata from job runs
• Invoke Metadata Workbench administrative services

Did you know? Design metadata for DataStage and QualityStage jobs is automatically stored in the
metadata repository as well as metadata from all other suite tools.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Create database collateral and


connection information.

Metadata Server
(Common Model)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Information Server Authoring.

Business Glossary Categories


Business Glossary Terms
Data Stewards
Information Server users

Metadata Server
(Common Model)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Import DataStage Jobs and Table


Definitions.

DataStage Job
DataStage Data Collection
DataStage Table Definition
Classifying Business Glossary Terms

Metadata Server
(Common Model)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Import and configure Operational


Metadata (OMD)

Generate OMD
Configure OMD
Import OMD
View OMD

Generating OMD, creates an XML file that traces


the events that took place (read, write, failure) and Metadata Server
the objects it referenced. The pointer to the object (Common Model)
is known as a locator. By design, OMD is dis-
connected from the Metadata Server.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Job Sequencing and Integration

Operational Metadata Integration


Job Sequence Wizard
Analysis Report: Data Flow - Operational

OMD Integration and Job Sequencing is the process Metadata Server


by which we connect OMD data to the Metadata (Common Model)
Server, so that the MetaData Workbench can
analyze information as it really happened.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Import Export Wizard

•ERWin Model
Business Objects Report Definition
DataSource Identity

Metadata Server
Loading of external 3rd Party Tools will create an
(Common Model)
independent copy of the Database / Schema / Table
objects. As a user, you would like to connect this
copy to pre-existing Table Definitions created in
DataStage.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server

Navigate, Analyze, Utilize and Manage


Information Assets

Information Asset editing capability


Information Asset report capability
Understanding and Visualizing the content
created and how it works together

Metadata Server
(Common Model)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Adding Content to the MetaData Server


Navigate, Analyze, Utilize and Manage Information Assets

Business
Process Job Summary Job
Meaning

BI Report

Metadata Server
(Common Model)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Workbench Business Benefit

• Good exploitation and coordination of metadata across tools is very compelling

• Tool Integration & metadata collaboration saves time and money, and
improves Quality of results.
• An organization’s ability to Govern Data is significantly improved.
• A organization is better able to Manage Change, more Agile.
• A Shared, Common, Vocabulary saves time and effort, helps development,
and makes data more accessible & more understandable.

The key is making the metadata capture and


share process effortless

IBM Confidential © 2009 IBM Corporation


Information Management Software

Data Flow Analysis Report

 “Where has my data come from”. The Golden Source.


 Operational Events of a Job or Database
 Design Elements of a Job
 There are known limitations for Data Flow Analysis

IBM Confidential © 2009 IBM Corporation


Information Management Software

Data Flow Analysis Report

IBM Confidential © 2009 IBM Corporation


Information Management Software

Data Flow Analysis Report

IBM Confidential © 2009 IBM Corporation


Information Management Software

Data Flow Analysis Report

Lineage from a BI Report to the true data source

IBM Confidential © 2009 IBM Corporation


Metadata Workbench

IBM Confidential © 2009 IBM Corporation


Information Management Software

How does it work?


Metadata Workbench allows
users to explore, analyze, and
Information Server enrich the metadata in the
Components Metadata Server

WebSphere IBM WebSphere

Business Glossary Metadata Workbench

Metadata
WebSphere
Workbench
DataStage &
QualityStage Metadata Services
Import/Export

IBM WebSphere Metadata Server Manager


Information
Analyzer

Enterprise Information
WebSphere Information Server
components generate design Assets,
Federation Server
time and runtime metadata, e.g. Business Reports
automatically storing it in the
Metadata Server

IBM Confidential © 2009 IBM Corporation


Information Management Software

Tasks in Metadata Workbench


• Explore
• Jobs, reports, databases, files, tables, columns, terms, stewards, servers
• Simple and advanced search
• Robust Queries
• Graphical or report view of asset relationships
• Analyze
• Understand impact analysis
• Trace lineage from a BI report back to its source
• Understand what really happened and why?
• Manage
• Create and edit descriptions of assets
• Reconcile duplicate assets
• Map databases to database aliases
• Access run time information to enrich reporting

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Workbench Feature Overview

MANAGE EXPLORE ANALYZE

Manage Integration Assets to Explore key Integration Analyze dependencies and


enable in-depth analysis Assets: relationships between key
Integration assets, Business
Jobs, Reports, Databases, Intelligence Reports and data
Assign security roles Models, Terms, Stewards, models
Systems, Specifications,
Link together multiple
Quality Rules
viewpoints of design assets
from ETL, business, BI and  Trace data movement to
modeling with operational  Easy navigation of key and from databases, jobs
metadata Integration Assets and reports for full lineage
Edit names and  Simple and advanced  Understand business
descriptions of Integration search meaning of columns, tables,
Assets and other assets
 Integrated cross-view of
Access runtime information Information Server and 3rd  Assess the impact of
to enrich reporting party assets change across Integration
assets
Import export manager for  Graphical view of Asset
3rd party integration Relationships Robust query builder

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Workbench Integration Asset Categories

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Workbench User Roles


• Administrator (Explore, Analyze, Manage)
• Prepares the metadata for use in the metadata workbench
• Schedules automated analysis services
• Runs manual analysis services
• Completes stitching activities when appropriate
• Creates log views
• Publishes queries
• Explores metadata models
• Includes all User tasks and abilities
• User (Explore, Analyze)
• Finds and explores information assets
• Runs analysis reports
• Creates, saves and runs queries

IBM Confidential © 2009 IBM Corporation


Information Management Software

Homepage – Welcome to Metadata Workbench 8.1

The homepage offers direct access to Discover key Information Assets, Find any Information
Asset or execute Queries.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Browse – Engines or Data Servers

Browse DataStage or QualityStage Jobs, Table


Definitions or other objects within their Folder and
Project placement.

Browse Databases and Files and their content.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Find – Finding Information Assets

Whats New:
Information can be matched in any of six ways
Information can be searched by Name or also Description
Information can be restricted by adding Context (the Parent)

IBM Confidential © 2009 IBM Corporation


Information Management Software

Find – Results

Whats New:
Save Results in Data or Report Format (CSV Files)
Jump to a particular item using Type-Ahead
Information is listed including its Short Description

IBM Confidential © 2009 IBM Corporation


Information Management Software

Query – Querying the Repository

Whats New:
Enhanced Selection
Enhanced and more advanced Criteria Selection
Save, Import, Export and Edit Queries
Publish Queries and share them with all Workbench Users

IBM Confidential © 2009 IBM Corporation


Information Management Software

Query – Results

Whats New:
Save Results in Data or Report Format (CSV Files)
Jump to a particular item using Type-Ahead
Information is listed in Tabular Format
Related Information Assets may be clicked to open their Display Page

IBM Confidential © 2009 IBM Corporation


Information Management Software

List – Display Information Assets

Select any Information Asset Type, and display a list of all its content

IBM Confidential © 2009 IBM Corporation


Information Management Software

Display of DataStage Jobs

Whats New:
Images are displayed automatically
Sections may be collapsed
Now includes Project and Folder data
Now includes advanced Job Run data

IBM Confidential © 2009 IBM Corporation


Information Management Software

Display of BI Reports

Whats New:
Automatic reconciliation with OLAP
Enhanced Display of data

IBM Confidential © 2009 IBM Corporation


Information Management Software

Automated Services

Whats New:
Singular action for all analysis services
Ability to include or exclude Projects
Ability to schedule Analysis Services
Ability to map Database Aliases to sources
Enhanced and extended support for Stages

IBM Confidential © 2009 IBM Corporation


Information Management Software

Graph View
What Objects does this User Own

• Shows Asset relationships


• Shows Asset associations

IBM Confidential © 2009 IBM Corporation


Information Management Software

Other Enhancements

Right Click
Right-click any link, and the Workbench
presents an action menu specific to the
Information Asset Type selected.

Actions include standard Copy, Edit and Add


to Favorites, but also links to Analysis Reports
and Graph View.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Other Enhancements

Context
Enhanced display of the context, parentage,
of Information Assets, including the ability to
click on the items to open their Display page.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Other Enhancements

Assign Term
Classify Information Assets to a Business
Glossary Term directly from the Workbench.
From the Right-Click or Right-Navigation task
menu, select the action.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Pre-reqs for Accessing the Metadata Workbench

1. To view graphical reports and to display assets in the graph view, you
must download and install Adobe SVG Viewer from
http://www.adobe.com/svg/viewer/install/
2. Supported web browsers are IE 6 & 7, and Mozilla Firefox 2
3. You must have the role of Metadata Workbench Admin or User.
4. Enable java script in the browsers
5. Enable cookies for the site
6. Screen resolution set to 1024 by 768 or greater.

IBM Confidential © 2009 IBM Corporation


Metadata Workbench Administrator

IBM Confidential © 2009 IBM Corporation


Information Management Software

Purpose and Definition

The IBM MetaData Workbench analysis services provides advanced


algorithms and data mining techniques in determining existing
relationships that have yet to be formalized. These services provide the
added value of the IBM MetaData Workbench and its reporting, querying
and analysis capabilities.

Purpose and Definition


Learn and understand the management tasks of the MetaData
Administrator, including how to execute analysis services, import
Operational MetaData Logs, view Analysis Service logging and reconcile
Database Aliases.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Benefit

By dedicating a MetaData Administrator to ensuring accuracy within


analysis, users benefit from current and precise lineage reports.
Furthermore, users may search, view and browse DataStage and Quatlity
Stage Jobs and Physical Data Sources to understand their usages and
status.

IBM Confidential © 2009 IBM Corporation


Information Management Software

Tasks

• Define and Manage user credentials of the IBM MetaData Workbench


• Recommend and implement process for Naming of Host Sytems,
Databases and Data Files. Ensure all requisite metatada appears in the
Repository.
• Define Process for Operation Metadata Management, including import
and purge of Operational Data Files
• Import DataStage and QualityStage Project Environment Variables
• Reconcile Database Aliases used within ETL Development Jobs
• Reconcile duplicate Data Sources

IBM Confidential © 2009 IBM Corporation


Information Management Software

Tasks

• Define, Schedule and execute MetaData Workbench Automated


Analysis Services
• Review, maintain and react to the Logging Errors of the Automated
Analysis Service
• Determine requirement and need for Manual Linking Services
• Support and provide assistance to users of the Metadata Workbench as
required, for purposes of Analysis Reporting, Querying or Preview
Information Assets
• Author and Publish Metadata Workbench repository queries

IBM Confidential © 2009 IBM Corporation


Information Management Software

User Access Roles

 Metadata Workbench User finds and explores


assets, runs analysis reports, and creates,
Security roles give
saves, and runs queries. organizations the
flexibility to customize
 Metadata Workbench Administrator runs the management of metadata
analysis linking services, publishes queries, for their specific business
explores metadata models plus all User tasks. requirements

IBM Confidential © 2009 IBM Corporation


Information Management Software

Automated Services

•Ability to include or exclude Projects


Allows administrators to minimize time
• Intelligent metadata linkage delta processing
maintaining and managing
• Ability to schedule Analysis Services
metadata assets as well as reduce
• Ability to map Database Aliases to sources
the numbers of errors introduced
• Enhanced and extended support for Stages
from manual reconciliation
processes.

IBM Confidential © 2009 IBM Corporation


Installation Considerations

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata – Where does it exist


Development

•FastTrack
•Business Glossary Operational
•Database and File Data
•User Authorization
•DataStage Jobs Central
Metadata

•Operational Data
Reporting

•QualityStage Jobs MDR BI Target


Development
Platform IMI
•Analaysis Projects
•BI Reports

Operational

IBM Confidential © 2009 IBM Corporation


Information Management Software

Metadata Workbench

• Metadata Workbench deployed at the Data Centers


• Primary use is to browse and query the metadata of the local data
center
• Provides real-time representation of the Data Center
• Metadata Workbench deployed at the Hub
• Browse and query the metadata across all the data centers
• Enrich metadata for accurate data lineage
• Investigate data lineage, displays lineage across all the Data Centers

IBM Confidential © 2009 IBM Corporation


Information Management Software

Business Glossary

• Business Glossary is deployed at each Data Center and the


Hub
• The glossary is managed at the Hub and is published to each
data center via BG asset interchange
• The Data Centers do not change the content of the glossary
• Each Data Center can assign assets to Terms in the glossary.
These assignments will be migrated to the Hub together with
the assets

IBM Confidential © 2009 IBM Corporation


Information Management Software

Capacity Planning for Metadata Workbench

• CPU -
• The calculation is one core for every 5 concurrent users

• RAM -
• Requires 150 MB of RAM plus 100 MB for each concurrent user.

• Metadata Workbench with an expected capacity for 2 concurrent


users would require 450MB of extra RAM and an extra one
core

IBM Confidential © 2009 IBM Corporation


Information Management Software

Capacity Planning for Metadata Workbench

IBM Confidential © 2009 IBM Corporation


Import Data Sources and Reports

IBM Confidential © 2009 IBM Corporation


Information Management Software

Import Export Manager for Information Server IT


Developers
IT
Administrators

Expand visibility of metadata touch-points in support of data integration projects


Features
• Security enforced via Information Server common security
layer as well as the 3rd party application security layer
• Metadata Bridges interchange metadata with each specific
application a consist of a model, a decoder, and an
encoder which require no coding.
• Import capabilities for 3rd party BI tools (Cognos, Business
Objects, MicroStrategy), data modeling tools (ERwin,
RDA) and databases (ODBC connections to all major
RDBMS)
• Support a variety of import formats including XMI, XML,
UML, CWM and CSV metadata exchange formats

Benefits
• No manual interface coding required for 3rd party
metadata visibility
• Visibility of data modeling to ETL to report layer
minimizes risks of overlooking critical dependencies
• Leverage common metadata exchange environment
for application development consistency

IBM Confidential © 2009 IBM Corporation


Information Management Software

Meta Integration Technologies, Inc. (MITI)

• OEM of 3rd party metadata bridges for import


- More than a dozen major vendors OEM MITI bridges*
• IBM and MITI jointly certified and tested bridges
• Additional (MITI) bridges
- Many bridges are available “as-is” and can be easily enabled
post installation.
* http://www.metaintegration.net/Partners/Directory.html

IBM Confidential © 2009 IBM Corporation


Information Management Software

The Areas of Metadata

Business Glossary & IS Users ETL Operational Metadata (Job Run


Information)

BI Reports

Physical Schemas

ETL Job Design

IBM Confidential © 2009 IBM Corporation


Information Management Software

The Areas of Metadata Connected


Business Glossary & IS Users ETL Operational Metadata (Job Run
Information)

BI Reports

Physical Schemas

ETL Job Design

IBM Confidential © 2009 IBM Corporation


Information Management Software

Sample Import Process


with CA ERwin

IBM Confidential © 2009 IBM Corporation


Information Management Software

CA ERwin

Sample file from ERwin V4.1.4

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 1

Select the
bridge type
for import

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 2

Set
parameters
for import
including
location of
file

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 3

Import status
screen

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 4

Select
objects for
import

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 4 (continuation)

Select objects
for import
(continuation)

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 5

Logon
information
for
Information
Server to
complete
import

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

ERwin Import Process – Step 6

Final status
of import

Importing the sample file Erwin v4.1.4 via ERwin bridge

IBM Confidential © 2009 IBM Corporation


Information Management Software

MetaData Workbench – Business Reports

 Business Reports are imported from 3rd Party Reporting Tools, as Business Objects
 The Import Manager allows for import of various 3 rd Party BI Vendor Applications
 Reports contain Report Fields. Report Fields are bound to Database Fields.
 Browsing a BI Report Field will display the Database Field to which it is bound.
 Flow of information may be Analyzed at an Operational Level

IBM Confidential © 2009 IBM Corporation


Information Management Software

MetaData Workbench – Import Business Reports

 Import Manager needs to be installed where DataStage Client has been installed
 Import Manager may require 3rd Party Client Application
 Import Manager allows the Common Repository to include Models and Reports that do not
have native support within DataStage or Information Analyzer
 Import of Business Intelligence Reports creates Report Definition, OLAP Model and Database

IBM Confidential © 2009 IBM Corporation


Information Management Software

MetaData Workbench – Business Reports Limitations

 Databases are loaded into “Unknown” or “Uninitialized” Servers and Schemas


 Report Fields may not be bound to the Database Fields, requiring a user to manually correct
this relationship
 Display of Report and OLAP Model may not be representatitive of the Native Tool. This is
due to the lack of Vendor Idenfication during the import process
 The Import process may not distinguish between User Defined Reports and Templates or
Samples.

IBM Confidential © 2009 IBM Corporation


Business Glossary

IBM Confidential © 2009 IBM Corporation


Information Management Software

Applications of a Business Glossary


Simply put, a Business Glossary is created to represent the
language of the business, independent of technology

Three Primary Applications:


1. Ownership
 Identifying stewards
 Managing content
2. Collaboration
 Common, approved vocabulary
 Sharing domain expertise - Business & IT
3. Auditability
 Evolution of language
 Centralized management

All key enablers to regulatory compliance and


support the IBM Data Governance Maturity Model

IBM Confidential © 2009 IBM Corporation


Information Management Software

Understanding the value of Business Metadata


• In the language of the business, independent of
technology
• Documents the business meaning of data & related
technology assets
• Used to
• define a shared meaning
• standardize names
• establish responsibility, accountability, and traceability
• govern access
• share insights & experiences among users
• represent business hierarchies
• document business descriptions, examples,
abbreviations and synonyms
• Must be managed by those that understand the
meaning and importance of the information
assets to the business
• Better aligns the efforts of IT with the goals of the
business

IBM Confidential © 2009 IBM Corporation


Information Management Software

InfoSphere Business Glossary Steward Business


Users
Create and manage business vocabulary and relationships

Steward
Features
Console
• Facilitate business & IT communications by
creating & managing a common business vocabulary
• Web based interface shared across enterprise
business teams
• Allows creation of stewards & assignment of their
responsibilities for terms & assets.
• Link business terms to information assets

Benefits
• Aligns the efforts of IT with the goals of the
business
• Provides business context to information
technology assets
Business
• Establishes responsibility and accountability in Interface
accordance with data governance policies

IBM Confidential © 2009 IBM Corporation


Information Management Software

Business Glossary Lifecycle

 Import
 Create
 Assign
 Customize

Populate

Access Manage
 Browse
 Instant find  Relate
 Advanced search  Status
 Reporting  Enrich
 Feedback

IBM Confidential © 2009 IBM Corporation

You might also like