Professional Documents
Culture Documents
1.1
Basic Concepts
Q. No.
1.2
Question
1.
2.
3.
4.
5.
What are conformed dimensions? Explain the need for conformed dimensions? What is bus architecture?
6.
What are slowly changing dimensions? What are the various methods of handling them?
7.
8.
9.
What is an Operational Data Store (ODS)? How different it is from a data warehouse?
10.
11.
12.
13.
14.
15.
Explain the role of metadata in data warehousing environment? Who are the users of metadata?
16.
What are the popular metadata interchange standards currently available in the market? Explain them in brief?
17.
What are the different phases involved in data warehousing development lifecycle? How different it is
compared to any other OLTP development life cycle models?
Q. No.
Question
1.
2.
3.
4.
What is the difference between a dimensional data model and a normal data model?
Q. No.
5.
6.
7.
1.3
How do you convert a logical model into a physical model? What design considerations will you generally
employ?
What are type 1, 2 and 3 changes?
8.
What do you mean by snow flaking a dimension? Is it always essential to snow flake a dimension? What are
the tradeoffs (Performance Vs Ease of maintaining a dimension)
What is a conformed dimension?
9.
10.
11.
12.
ETL
Q. No.
1.4
Question
Question
1.
2.
What are your views on the usage of ETL tool vs. Custom built code?
3.
What are first generation and second generation ETL tools - give a few examples
4.
5.
What are the various approaches to handle data refresh into the DWH?
6.
7.
24x7 availability of the DWH is becoming a more common requirement these days. Given that there is a data
refresh time window for the DWH, how will you ensure 100% uptime and end user data access when the actual
data refresh takes place?
OLAP
Q. No.
Question
1.
2.
3.
What is the difference between ROLAP, MOLAP, HOLAP and DOLAP? How will you choose which
technology to use?
Explain drill-up, drill-down, drill across - with examples
4.
5.
Q. No.
Question
6.
7.
1.5
Metadata Management
1.6
1.7
Analytical CRM
1.8
Analytical SCM
1.9
E-Business Intelligence
Question
Answer
1.
2.
Q. No.
3.
4.
5.
6.
7.
Question
with an example.
Why is a wide key in the
index not good from
performance point of
view? How do you get
performance
enhancements from a
surrogate key?
How do you insert text
data or an image in a
SQL server table?
Does dropping of a table
get logged and can you
recreate a dropped table
with the data?
In what scenarios the
scripting of a database
job/task required?
Whats a bitmapped
index and whether SQL
server has it?
Answer
Wider the key, the less the actual data pertaining to the key on the
database page. Hence more I/O and poorer performance. Also it wont
be that efficient.
Q. No.
Question
Answer
By reducing the number of read operations to data, bitmapped indexes
offer better response time than traditional indexing methods such as
B-tree indexes. The idea behind a bitmapped index is that one bit
associates a specific value for an attribute with a row. For example,
each distinct value in a column can have a bitmapped index consisting
of 5 million bits--one for each record in the database. When a bit is on
(1), the value occurs in the record; when the bit is off (0), the value
doesn't occur. The index can identify records through their bitmap
position, so bitmaps don't need pointers.
8.
9.
Data Warehousing
Q.
No.
1.
2.
Question
Answer
Q.
No.
3.
Question
Answer
4.
5.
6.
7.
8.
Which is recommended;
going for data driven
queries and using lookups
for loading the datamart or
putting the lookup table in
a flat file and doing the
load.
9. Why a bit map index
becomes so crucial in a
data warehouse
environment?
10. What is indexed view in
SQL 2000?
Its a query that is fired based on the data value. For example, if the
product = coke which is already there in dimension table but the
package size has been changed then it updates the record in the
dimension table and an UPDATE query is fired or else a new record
is inserted (a insert query is fired) in the dimension table depending
on how we are managing the slowly changing dimensions.
A virtual cube, like a view in a relational database, is a logical
construct that itself contains no data. Just as a view is a join of
multiple relations, a virtual cube is a join of multiple cubes.
Through DTSRUN command which is scheduled from the
MSOLAP EM itself or through the NT scheduler.
This may cause the load process of the data mart slow. Moreover
more indexes result in increased space requirements that make the
database difficult to maintain and backup also.
Interviewing Top managers/business analysts.
KPI's (key performance indicators) of the business.
Sales /other Reports currently used for analyzing the business.
Understanding the business.
The second option is recommended, as its too fast especially in
scenarios where there is too much data to be loaded and the
window is quite small.
Oracle
Q. No.
1.
Question
What special features does oracle 8i offer for DW?
Answer
Q. No.
2.
3.
4.
5.
Question
What are the differences between table, normal
view and materialized view?
How do you decide on the rollback segment (size
& numbers) in DW vis--vis the OLTP systems?
How are bitmap indexes useful for a DW?
7.
8.
9.
6.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
Answer
Q. No.
Question
Answer
unrecoverable?
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
What is a SGA?
42.
43.
44.
45.
Q. No.
Question
Answer
46.
47.
48.
49.
1.12 Informatica
Q. No.
1.
Question
What are the various
types
of
sources
supported
by
Informatica?
Answer
2.
3.
Does
Informatica
generate
code
compatible with source?
What is PowerChannel?
4.
DB2
Adabas
PeopleSoft
Siebel
MF
Cobol/FF
PowerCenter uses either native connectivity or ODBC to extract from
source systems so code is always compatible with source
Informatica PowerChannel is a product that greatly improves the
movement of data across slow speed networks, for example a WAN or
the Internet. It is typically used for the movement of data between
geographically dispersed locations, branch offices for example, or for
sending or receiving data from external agencies. PowerChannel
provides a management environment for the defining and executing of
these data movements using XML control constructs, allowing the
data to be compressed and encrypted using RSA security algorithms.
Q. No.
Question
5.
6.
Answer
It ensures complete, secure and reliable delivery of the data files
including the ability to restart failed jobs from the point of failure.
Incremental loads are supported using PowerCenter mapping variables
and parameters.
Event driven loads are also supported within PowerCenter either using
mapping variable/parameter files or using the event driven option in
the scheduler. Once an event is recognized the during a mapping flow
the type of load (new, update or delete) can be determined and then an
update strategy transformation object is used to apply this
Entire copy of source table (refresh) is supported within PowerCenter
F - Provided with PowerCenter are a number of functions that allow
many different conversion operations. These are listed below:
Aggregate: AVG, COUNT, FIRST, LAST, MAX, MEDIAN, MIN,
PERCENTILE, STDDEV, SUM and VARIANCE
Character: ASCII, CHR, CONCAT, INITCAP, INSTR, LENGTH,
LOWER, LPAD, LTRIM, RPAD, RTRIM, SUBSTR and UPPER
Conversion: TO_CHAR, TO_DATE, TO_DECIMAL, TO_FLOAT,
TO_INTEGER and TO_NUMBER
Date:
ADD_TO_DATE, DATE_COMPARE, DATE_DIFF,
GET_DATE_PART,
LAST_DAY,
MAX,
MIN,
ROUND,
SET_DATE_PART and TRUNC
Numeric: ABS, CEIL, CUME, EXP, FLOOR, LN, LOG, MOD,
MOVINGAVG, MOVINGSUM, POWER, ROUND, SIGN, SQRT
and TRUNC
Scientific: COS, COSH, SIN, SINH, TAN and TANH
Special: ABORT, DECODE, ERROR, IIF and LOOKUP
Conversion: ISNULL, IS_DATE, IS_NUMBER and IS_SPACES
7.
Variable:
SETCOUNTVARIABLE,
SETMAXVARIABLE,
SETMINVARIABLE, SETVARIABLE
EBCDIC to ASCII conversion is handled automatically by
PowerCenter
Slowly changing dimensions are supported within PowerCenter.
Changes can be identified and keys generated using a combination of
the transformation objects supplied as standard. PowerCenter also
includes a wizard to allow the following mappings to be created:
Type 1 Dimension mapping. Loads a slowly changing dimension
table by inserting new dimensions and overwriting existing
dimensions. This mapping is used when you do not want a history of
previous dimension data.
Type 2 Dimension/Version Data mapping. Loads a slowly changing
dimension table by inserting new and changed dimensions using a
version number and incremented primary key to track changes. This
mapping is used when you want to keep a full history of dimension
data and to track the progression of changes.
Type 2 Dimension/Flag Current mapping. Loads a slowly changing
Q. No.
Question
Answer
dimension table by inserting new and changed dimensions using a flag
to mark current dimension data and an incremented primary key to
track changes. This mapping is used when you want to keep a full
history of dimension data, tracking the progression of changes while
flagging only the current dimension.
Type 2 Dimension/Effective Date Range mapping. Loads a slowly
changing dimension table by inserting new and changed dimensions
using a date range to define current dimension data. This mapping is
used when you want to keep a full history of dimension data, tracking
changes with an exact effective date range.
8.
Optimization supported
by the tool?
9.
Q. No.
Question
Answer
PowerCenter uses work areas called folders. These can be restricted to
owners, groups and others with read, write and execute permissions
able to be set for each. This security model enables single and multiuser development environments.
Q. No.
Question
Answer
Written into server properties .. instead of hard coding every time use
them .. in case change in the paths, no need to change the code
on
the
source
..
parallel
Used for COBOL sources. The data in COBOL sources is highly denormalized.. to make it normalized & use with relational DBs use
Normalizer Transformation.
Similar to a filter transformation .. used when same data is to be tested
against multiple conditions.
Cache Funda
Formula
Question
Answer
Q.
No.
Question
Answer
2.
3.
Invalid syntax
Loops
Isolated tables
Isolated joins
Loops within contexts
Missing or incorrect cardinalities
Equi join
Outer Join
Theta Join
A Loop is caused by a circular set of joins which defines a closed path
thro a set of tables.
Graphical representation of Universe domain, security domain and
document domain
General supervisor
Supervisor
Designer
Supervisor Designer
End user
Versatile User
Set up and maintain architecture
Define user and groups
Assign them appropriate security profile
Customize user and group profiles
Security management
Three products Business Object Client, Supervisor, Designer.
4.
5.
6.
What is a BO
supervisory module?
What are all the
profiles that can be
created by a
supervisor?
7.
8.
9.
Q.
No.
Question
Answer
supervisor?
12. Can the same user exist
in two different groups
with different profiles?
13. What are a universe, a
class and an object?
How are they related?
14. What are calculationcontexts? What are
Input and Output
Contexts?
Q.
No.
Question
Answer
In DESIGNER you can specify two types of strategies: built-in
strategies and external strategies.
Built In Strategies:
DESIGNER uses the following built-in strategies for creating the
components of universes:
The Objects Creation strategy, which tells DESIGNER how
to define classes and objects automatically from the databases
tables and columns
The Joins Creation strategy, which tells DESIGNER how to
define joins automatically from the databases tables and
columns
The Table Browser strategy, which tells DESIGNER how to
read the table and column structures from the databases data
dictionary
You can view them in the Strategies tab of the Universe Parameters
dialog box.
External Strategies:
External strategy files are declared in the STG section of .PRM files
located in the various RDBMS folders. All external strategy files contain
a number of existing strategies delivered with Business Objects
products. For example, a file may contain one object strategy, one join
strategy, and one table browser strategy, or multiple strategies of each
type. In this file you can customize an existing strategy or create your
own. Each external strategy file is specific to one RDBMS.
DESIGNER lets you save universes in either enterprise or workgroup
mode.
Enterprise mode means that you are working in an environment with a
repository.
Workgroup mode means that you are working without a repository.
The mode in which you save your universes determines whether other
designers are able to access them.
A universe is identified by:
A file name that consists of up to 8 characters and a .unv
extension.
A long name that consists of up to 35 characters. It may be a
name that more fully describes the purpose of the universe.
This is the name by which end users identify the universe in
BUSINESSOBJECTS or WEBINTELLIGENCE.
A unique system identifier. This is the identifier assigned by
the repository when you export the universe. This identifier is
null if you have never exported the universe.
A concatenated object is an object you create by combining two existing
objects.
For example, lets say that you wish to create an object called Full
Name, which is a concatenation of the objects Last Name and First
Name in the Customer class.
You can resolve loops in two ways:
Using aliases
Using contexts
Q.
No.
Question
Answer
When you create aliases and you end up with object names that sound
very different, aliases are probably the right solution. If you end up with
object names that sound very similar, you should consider using
contexts.
You would have to use Data Synchronization and link 2 data providers
by City and then define a variable say Revenue Country, the
formula for this variable will be using Multicube() function as
Multicube(<Revenue>). Here without Multicube function you will get
erroneous results.
You can create reports using SQL only in Full Client (C/S mode) and
not in Infoview and yes you can push the reports in the repository and
view and refresh them in Infoview and cannot edit it their.
Aggregate Aware is the main answer to look for in this question. Can
ask to explain the concept of Aggregate Awareness?
NO. You cannot have Free Hand SQL or Stored Procedures or OLAP
sources as data providers for creating reports from ZABO Full Client.
NO. Only full client users can do this.
The answer is, no. However, you can format your objects in Designer,
including font styles, colors, borders, etc. These formats will work for
your Webi reports. However, this formating only works for the data
returned by the query. It does not work for the column headers, or, of
course, your report title, etc.
You are going to have to use a VBA macro to do this. The macro would
calculate the time periods and pass the data to the report prompt. Or the
time periods would have to come from a table instead of a prompt.
OR
If the prompt refers to a constant such as week number you could
remove the prompt and add a condition such as; sales week = (max
(sales week) where the max (sales week) could either be an additional
object or a complex condition. This would only bring the results for the
maximum week number.
Q.
No.
Question
Answer
prompts?
32. What is Short-Cut Join
and what is its use?
33. What are the major
differences between
BO Full Client and
Infoview?
Q.
No.
Question
Answer
Objects?
certain database structures, you may need to use contexts rather than
aliases to resolve loops. A situation where this commonly occurs is a
transactional database with multiple fact tables (multiple stars) that
share lookup tables. For example, the Club database contains statistical
information both about sales and reservations. The statistics relating to
each type of transaction are stored in distinct fact tables. However,
because these fact tables share common dimensions, such as Resorts and
Customers, the schema contains a loop. The only way to resolve this
loop is to ensure that queries answer questions for one transaction or the
other, such as: Is the customer information needed from the perspective
of sales or reservations? The method for specifying the appropriate
perspective is called a context. When a user runs a query from a
universe containing contexts, BUSINESSOBJECTS or
WEBINTELLIGENCE prompts the user to indicate the correct
perspective for the query.
The Chasm trap occurs when two many to one joins converge on a
single table. You will get incorrect results if you run a query when the
following circumstances exist:
A many to one to many relationship exists among three
tables in the universe structure.
The query includes objects based on two tables both at the
many end of their
respective joins.
There are multiple rows returned for a single dimension.
In the example below a customer can place many orders and/or
place many loans:
ORD
ERS
CUS
TO
ME
R
LOA
NS
If you want to run a query that returns the total order and loan values for
a customer Paul, you would get the following results:
Customer Name Order Date
Loans Value
Paul
12/01/99
50.00
Paul
14/04/99
50.00
Paul
20/09/99
50.00
Paul
12/01/99
100.00
Paul
14/04/99
100.00
Paul
20/09/99
100.00
Order Value
Loans Date
100.00
05/08/97
150.00
05/08/97
150.00
05/08/97
100.00
03/06/97
150.00
03/06/97
150.00
03/06/97
Q.
No.
Question
Answer
Sum = 800
Sum =
450
The total order value returned is 800, and the total loan value is 450.
This is obviously an incorrect result. A Cartesian product of the
CUSTOMER, ORDERS, and LOAN tables has been returned. The
correct results should be:
Total orders value for Paul is 400.
Total loans value for Paul is 150.
To RESOLVE a Chasm trap you need to make two separate queries and
then combine the results. Depending on the type of objects defined for
the fact tables, and the type of end user environment, you can use the
following methods to resolve a Chasm trap:
Create a context for each fact table. This solution works in all
cases for BUSINESSOBJECTS universes.
Modify the SQL parameters for the universe so you can
generate separate SQL queries for each measure. This solution only
works for measure objects. It does not generate separate queries for
dimension or detail objects.
Break the universe into multiple universes. One universe for
each fact table. This solution only applies to WEBINTELLIGENCE
universes when there are dimension objects in one or both fact tables so
two SELECT statements are synchronized and not joined.
To create Running Sum use the formula: RunningSum (<Revenue>)
and to reset this when Country changes make the formula as
RunningSum (<Revenue> ;< Country>) and a reset context with more
than one dimension looks like this: ;< Year>, <Region>
Q.
No.
Question
40.
41.
42.
43.
Answer
The Count function counts values of a dimension object that are the
same only one time. This is called a distinct count. The Count All
function counts all rows including empty and duplicate rows.
<Max Rev Per Year> = Max (<Revenue> ForEach <Year>)
Yes you can have multiple security domains, but currently They
CANNOT interact between themselves. To replicate users between
security domains use Import Export users command from supervisor.
Yes you can run BCA on UNIX, but you cannot run documents with
VBA macros on them, for that you would need it to be running on NT.