
Data warehousing concepts:

1.

What is a data warehouse?
A data warehouse is a database that:
1. Maintains a history of data
2. Contains integrated data (data from multiple business lines)
3. Contains heterogeneous data (data from different source formats)
4. Contains aggregated data
5. Allows only SELECT (read-only) access, which restricts data manipulation
6. Stores data in a denormalized format
Defining characteristics of a data warehouse:
1. Subject-oriented
2. Integrated
3. Non-volatile
4. Time-variant
Main uses of a data warehouse:
1. Data analysis
2. Decision making
3. Planning or forecasting

2.

What is a dimension?
A dimension table contains only non-quantifying (descriptive) data and categories of
information that are key for analysis. A dimension table contains a primary key and
non-quantifying columns. If the source table has no primary key, a surrogate key is
generated to serve as one.
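The surrogate key idea above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the column name `customer_sk`, the helper `assign_surrogate_keys`, and the sample rows are hypothetical, and a real warehouse would normally generate the key with a database sequence or identity column.

```python
from itertools import count


def assign_surrogate_keys(rows, start=1):
    """Return copies of the rows with an integer surrogate key added."""
    next_key = count(start)
    # The key carries no business meaning; it only identifies the row.
    return [{"customer_sk": next(next_key), **row} for row in rows]


# Hypothetical source rows that have no primary key of their own.
source_rows = [
    {"customer_name": "Asha", "city": "Chennai"},
    {"customer_name": "Ben", "city": "London"},
]

dim_customer = assign_surrogate_keys(source_rows)
for row in dim_customer:
    print(row)
```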

3.

What are the types of dimension?
Based on the type of data stored, there are two major types of dimension table:
1. Conformed dimension
2. Junk dimension
Based on where the dimension is derived from, there is one category:
3. Degenerate dimension
Based on how frequently the data changes, dimensions can be divided into two types:
4. Rapidly Changing Dimension (RCD)
5. Slowly Changing Dimension (SCD)

4.

What is a fact and what are the types of fact?
A fact is a column or attribute that is quantifiable (measurable) and is used as a key
analysis factor. It is also called a measure.
Types of fact:
1. Additive
2. Semi-additive
3. Non-additive
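The three fact types can be illustrated with a small Python sketch. It assumes the usual dimensional-modeling meaning of the terms: an additive fact can be summed across every dimension, a semi-additive fact across some dimensions only (typically not time), and a non-additive fact (such as a ratio) across none. All rows and column names below are hypothetical.

```python
# Hypothetical daily rows for two bank accounts.
account_rows = [
    {"day": 1, "account": "A", "deposits": 100.0, "balance": 100.0},
    {"day": 1, "account": "B", "deposits": 50.0, "balance": 50.0},
    {"day": 2, "account": "A", "deposits": 20.0, "balance": 120.0},
    {"day": 2, "account": "B", "deposits": 0.0, "balance": 50.0},
]

# Additive: deposits can be summed across accounts and across days.
total_deposits = sum(r["deposits"] for r in account_rows)

# Semi-additive: balances can be summed across accounts for one day,
# but summing them across days would double count the same money.
balance_on_day_2 = sum(r["balance"] for r in account_rows if r["day"] == 2)

# Hypothetical sales rows carrying a ratio (margin) next to its components.
sales_rows = [
    {"revenue": 100.0, "cost": 80.0},
    {"revenue": 300.0, "cost": 150.0},
]

# Non-additive: a ratio cannot be summed; recompute it from additive parts.
total_revenue = sum(r["revenue"] for r in sales_rows)
total_cost = sum(r["cost"] for r in sales_rows)
overall_margin = (total_revenue - total_cost) / total_revenue

print(total_deposits, balance_on_day_2, overall_margin)
```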

5.

What does a fact table contain?
A table that contains facts is called a fact table. Typically, a fact table holds facts
and the foreign keys of dimension tables.
Fact table structure:
Foreign_key1
Foreign_keyN
Fact1
FactN
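The structure above can be sketched with Python's built-in sqlite3 module. This is a minimal illustration, not a reference design: the table names (`dim_product`, `dim_date`, `fact_sales`), the columns, and the sample data are all assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, product_name TEXT);
CREATE TABLE dim_date (date_key INTEGER PRIMARY KEY, calendar_date TEXT);

-- The fact table holds foreign keys to the dimensions plus the facts.
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    date_key    INTEGER REFERENCES dim_date(date_key),
    quantity    INTEGER,
    amount      REAL
);

INSERT INTO dim_product VALUES (1, 'Pen'), (2, 'Notebook');
INSERT INTO dim_date VALUES (20240101, '2024-01-01'), (20240102, '2024-01-02');
INSERT INTO fact_sales VALUES
    (1, 20240101, 10, 15.0),
    (2, 20240101, 2, 8.0),
    (1, 20240102, 4, 6.0);
""")

# Analysis joins the fact table to a dimension and aggregates the facts.
totals = conn.execute("""
    SELECT p.product_name, SUM(f.quantity), SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_key = f.product_key
    GROUP BY p.product_name
    ORDER BY p.product_name
""").fetchall()

print(totals)
```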

6.

What are the types of a fact table?


Transactional
The fact table contains data at the most detailed level, without any rollup or
aggregation, in the same way a transactional database stores it.
Accumulating
An accumulating fact table tracks a single record through its workflow: the row holds
an entry for each milestone and is updated as the record progresses.
Periodic snapshot
Data is extracted and loaded for a specific period of time. Each row describes the
state of the record during that period.
Factless fact table
A fact table that does not contain any facts is called a factless fact table. It has
only the foreign keys of dimension tables.
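A factless fact table is still useful for analysis because its rows can be counted. The sketch below assumes a hypothetical attendance table whose rows are (student_key, class_key, date_key) tuples; none of these names come from a real system.

```python
from collections import Counter

# Hypothetical factless fact rows: only dimension keys, no measures.
# Each tuple is (student_key, class_key, date_key).
attendance = [
    (1, 101, 20240101),
    (2, 101, 20240101),
    (1, 102, 20240101),
    (1, 101, 20240102),
]

# Counting rows per class answers "how many attendances did each class have?"
attendance_per_class = Counter(class_key for _, class_key, _ in attendance)

print(attendance_per_class)
```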

7.

What is granularity?

8.

Why is a staging table required?

9.

What is a surrogate key?

10.

OLTP vs DW database

11.

Operational Data Store (ODS) vs Staging database

12.

Explain the star schema.

13.

Explain the snowflake schema.

14.

What is the difference between star and snowflake schemas?

15.

What is data cleansing?

16.

What is data masking?

17.

Can you list the challenges you have faced?

18.

Data mart vs data warehouse

19.

What is data purging and archiving?

20.

What are the types of SCD?

21.

What type of schema and which SCD type are used in your project?

ETL:
1.

What is ETL?

2.

Explain your current project.

3.

What are the transformation types?

4.

What are the types of load?

5.

Name some ETL tools

6.

Explain the scenarios for testing a load from source to a staging table.

7.

How do you ensure that all source table data is loaded into the target table?

8.

Give an example of a low-severity, high-priority defect.

9.

What are the components of Informatica?

10.

What tasks are available in Informatica?

11.

Database testing vs ETL testing

12.

What is partitioning?

13.

What are the responsibilities of an ETL tester?

14.

What does a mapping document contain?

15.

What kinds of defects can you expect?

16.

1000 records are in the source table, but only 900 records are loaded into the
target table. How do you find the missing 100 records?

17.

Can you give a few test cases for testing an incrementally loaded table?

18.

How do you compare a flat file and a database table?

19.

What are connected and unconnected lookups?

SQL:
1.

Write a query to find duplicate records.

2.

Write a query to delete duplicate records.

3.

What is the difference between DROP, TRUNCATE, and DELETE?

4.

What are constraints?

5.

Unique key vs Primary key

6.

UNION vs UNION ALL

7.

Write a query to find cumulative sum at month level.

8.

Write a query to find the nth highest value.

9.

What is the difference between RANK and DENSE_RANK?

10.

What is a correlated subquery?

11.

What is the normalization process?

12.

What is the difference between EXISTS and IN?

13.

Primary key vs Foreign key?

14.

Subquery vs Join?

15.

View vs Materialized view?

16.

Function vs Procedure?

17.

Explain the different types of joins.

18.

What is the use of the NVL function?

19.

What is the order of the clauses HAVING, GROUP BY, WHERE, and ORDER BY?

20.

DML vs DDL vs DCL

21.

Clustered index vs Non-clustered index?

22.

How do you fetch the unique records from a table?

23.

Questions on SQL functions (MIN, MAX, SUM, COUNT, TO_DATE, TO_CHAR, etc.)

TESTING PROCESS:
1.

Explain the testing life cycle.

2.

Explain the defect life cycle.

3.

What is the difference between a test plan and a test strategy?

4.

Test Scenarios vs Test Cases

5.

Regression Testing vs Retesting

6.

Can you explain the Agile process?
