Definition 1 :
ETL stands for Extract, Transform and Load: data is extracted from multiple heterogeneous data sources,
transformed, and loaded into one or more target systems.
Definition 2 :
Extraction : Take data from an external source and move it to the warehouse pre-processor database.
Transformation : The transform data task generates, modifies and converts data point-to-point between source and target.
Loading : The load data task adds records to a database table in the warehouse.
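The three steps in Definition 2 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the CSV feed, the sales table and the function names are hypothetical, and an in-memory SQLite database stands in for the warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical source feed; in practice this would be a file or an external database.
SOURCE_CSV = "id,name,amount\n1, alice ,10.50\n2,BOB,20\n"

def extract(text):
    """Extract: read raw rows from the source into dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim and normalise names, convert ids and amounts to numbers."""
    return [(int(r["id"]), r["name"].strip().title(), float(r["amount"])) for r in rows]

def load(conn, records):
    """Load: add the transformed records to a warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")  # stand-in for the warehouse database
load(conn, transform(extract(SOURCE_CSV)))
loaded = conn.execute("SELECT id, name, amount FROM sales ORDER BY id").fetchall()
print(loaded)
```

Keeping each phase in its own function makes it possible to test the phases separately, which is what an ETL tester does at a larger scale.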
Question 3 : What is the difference between Manual Testing and ETL Testing? (90% asked ETL Testing
Interview Questions)
Answer :
1. The main difference is that manual testing is concerned with the functionality of the program, while
ETL testing is concerned with the databases and their record counts.
2. ETL testing is a largely automated process: once you know the tool, little other technical knowledge is
needed. ETL testing is also much faster and more systematic, and gives the reliable results the business
needs.
3. Manual testing is highly time-consuming and requires technical knowledge to write the test cases
and the scripts. It is slow, effort-intensive and highly prone to errors.
Question 4 : Explain the need for ETL Testing. (100% asked ETL Testing Interview Questions)
Answer :
Nowadays many systems are being migrated from old technology to new technology. During a migration
the data also has to be moved from the old DBMS to the new DBMS, so there is a strong need to test that
the data on the target side is correct. The following points explain why ETL testing is necessary :
To keep a check on the data being transferred from one system (the old system) to the other (the new
system).
To keep track of the efficiency and speed of the process.
To be well acquainted with the ETL process before it is implemented in your business and in production.
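The first point, checking the data moved from the old system to the new one, is often done by comparing row counts and column totals and by looking for keys that never arrived. A minimal sketch follows, assuming hypothetical tables old_orders and new_orders held in one in-memory SQLite database; in a real migration the two sides would normally be separate databases.

```python
import sqlite3

def table_profile(conn, table):
    """Return (row count, sum of amount) for a table as simple reconciliation figures."""
    # The table name is interpolated only because this sketch controls it;
    # never build SQL this way from user input.
    return conn.execute(
        f"SELECT COUNT(*), COALESCE(SUM(amount), 0) FROM {table}"
    ).fetchone()

def missing_keys(conn, source, target):
    """Return ids present in the source table but absent from the target table."""
    rows = conn.execute(
        f"SELECT s.id FROM {source} s LEFT JOIN {target} t ON s.id = t.id "
        "WHERE t.id IS NULL ORDER BY s.id"
    ).fetchall()
    return [r[0] for r in rows]

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE old_orders (id INTEGER PRIMARY KEY, amount REAL);
CREATE TABLE new_orders (id INTEGER PRIMARY KEY, amount REAL);
INSERT INTO old_orders VALUES (1, 10.0), (2, 20.0), (3, 30.0);
INSERT INTO new_orders VALUES (1, 10.0), (2, 20.0);
""")

source_profile = table_profile(conn, "old_orders")
target_profile = table_profile(conn, "new_orders")
dropped = missing_keys(conn, "old_orders", "new_orders")
print(source_profile, target_profile, dropped)
```

A mismatch between the two profiles shows that something was lost or changed, and the missing-key query points at the affected rows.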
Question 5 : Where can ETL concepts be used? Give some examples.
Answer :
Before ETL tools existed, users had to write long programs to handle everything from data transformation to data loading.
ETL tools make this simpler: one tool manages all the transformation and loading scenarios.
The following are examples of where ETL is used :
Example 1 : Data warehousing :
ETL is used in data warehousing. The user needs to fetch data from multiple
heterogeneous systems and load it into the data warehouse database. ETL is mainly used here to
extract the data from the sources, transform it and load it into the target systems.
Example 2 : Data migration :
Data migrations are difficult if you implement them in PL/SQL or T-SQL. If you want to
migrate the data in a simpler way, use an ETL tool.
Example 3 : Mergers and acquisitions :
Nowadays many companies are merging into larger MNCs. ETL is used to move the data from one
company to the other.
Question 6 : Explain how ETL is used in third party data management. (100% asked ETL Testing Interview
Questions)
Answer :
Big organizations usually give different application development work to different vendors, which means
no single vendor manages everything. Take the example of a telecommunication project where
billing is managed by one company and CRM is managed by another. If the CRM company needs
some data from the company managing billing, it receives a data feed from that
company. An ETL process is used to load the data from the feed.
As the definition suggests, ETL is the extraction, transformation and loading of data, and this
process is used widely in data warehousing. A simple example is managing sales data
in a shopping mall. If the user wants historical as well as current data for the mall, the first step is
always to run the ETL process. The loaded data is then used for reporting.
The most common ETL tools include : SAP BO Data Services (BODS), Informatica, Microsoft SSIS, Oracle
Data Integrator (ODI), Talend Open Studio, CloverETL (open source), etc.
The most common BI tools include : SAP Business Objects, SAP Lumira, IBM Cognos, JasperSoft, Microsoft
BI Platform, Tableau, Oracle Business Intelligence Enterprise Edition, etc.
Question 9 : What is the difference between ETL Testing and Database Testing? (80% asked ETL Testing
Interview Questions)
Answer :
The following points distinguish ETL testing from database testing :
ETL testing works in a business flow environment based on earlier data; database testing is applicable to business flow systems.
For ETL testing, Informatica, Cognos and QuerySurge can be used; for database testing, QTP and Selenium are used for automation.
Question 13 : How is ETL used in data migration projects? Explain with an example. (60% asked ETL Testing
Interview Questions)
Answer :
ETL tools are widely used in data migration projects. If an organization previously managed its data in
Oracle 10g and now wants to move to a SQL Server cloud database, the data has to be
migrated from the source to the target. ETL tools are very useful for this kind of migration. Writing
the ETL code by hand is a very time-consuming process, whereas in an ETL tool the development is
simple compared to PL/SQL or T-SQL code. That is why the ETL process is very
useful in data migration projects.
Question 14 : Explain the steps for choosing an ETL tool. (90% asked ETL Testing Interview Questions)
Answer :
Choosing an ETL tool is difficult. You need to consider many factors to pick the
right tool for the project. The choice is a strategic move
even for a small project, because migrating from one ETL tool to another later is no small effort.
The following points should be considered while choosing your ETL tool.
1. Data Connectivity :
The ETL tool should be able to communicate with any source of data, no matter where it comes from. This is
critical.
2. Performance :
Moving and changing data requires serious processing power, so you need to check the
performance of the tool.
3. Transformation Flexibility :
Matching, merging and changing the data is critical. The ETL tool should provide these and many other
transformation packages, which allow the data to be modified in the transformation phase with simple drag
and drop.
4. Data Quality :
Source data is rarely clean, and you can only leverage your data when it is consistent and clean.
1.Source bugs
2.Calculation bugs
Question 19 : What is a fact and what are its types? (100% asked ETL Testing Interview Questions)
Answer :
A fact is the central component of a multi-dimensional model and contains the measures to be analyzed.
Facts are related to dimensions.
Semi additive : A measure that can participate in arithmetic calculations using some dimensions, but not all of them.
Ex: Account balance (it can be summed across accounts, but not across time)
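A commonly cited semi-additive measure is an account balance: it can be added up across accounts, but adding it up across days gives a meaningless figure. The sketch below illustrates this with hypothetical daily snapshots; the data and variable names are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical daily balance snapshots: (day, account, balance)
snapshots = [
    ("2024-01-01", "A", 100), ("2024-01-01", "B", 50),
    ("2024-01-02", "A", 120), ("2024-01-02", "B", 50),
]

# Valid: sum across the account dimension, one total per day.
total_per_day = defaultdict(int)
for day, account, balance in snapshots:
    total_per_day[day] += balance

# Across the time dimension, take the latest snapshot instead of summing.
latest_day = max(day for day, _, _ in snapshots)
closing_total = total_per_day[latest_day]

# Summing over days as well double counts the same money.
naive_total = sum(balance for _, _, balance in snapshots)

print(dict(total_per_day), closing_total, naive_total)
```

The closing total is the figure a report should show; the naive total is what a tester should expect to catch when a semi-additive measure is aggregated over the wrong dimension.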
Question 20 : Explain the data extraction phase in ETL and its types. (90% asked ETL Testing Interview
Questions)
Answer :
Data extraction means extracting the data from multiple heterogeneous sources using
ETL tools.
There are 2 types of data extraction :
1. Full Extraction : All the data from the source systems or operational systems is extracted to the staging
area. (Initial load)
2. Partial Extraction : Sometimes the source system notifies us that specific data has been updated, and only
that changed data is extracted. This is called a delta load.
Source System Performance : The extraction strategy should not affect source system performance.
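The two extraction types can be contrasted in a small sketch. It assumes a hypothetical customers table with an updated_at column and uses the highest timestamp already extracted as a watermark; real source systems may signal changes differently, for example through notifications or change logs.

```python
import sqlite3

def full_extract(conn):
    """Full extraction: read every row from the source (initial load)."""
    return conn.execute(
        "SELECT id, name, updated_at FROM customers ORDER BY id"
    ).fetchall()

def delta_extract(conn, last_watermark):
    """Partial extraction: read only rows changed after the previous run."""
    return conn.execute(
        "SELECT id, name, updated_at FROM customers WHERE updated_at > ? ORDER BY id",
        (last_watermark,),
    ).fetchall()

conn = sqlite3.connect(":memory:")  # stand-in for the source system
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, updated_at TEXT)")
conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", [
    (1, "Asha", "2024-01-01"), (2, "Ben", "2024-01-05"), (3, "Chen", "2024-01-09"),
])

initial = full_extract(conn)
watermark = max(row[2] for row in initial)  # ISO dates sort correctly as text

# A change arrives in the source after the initial load.
conn.execute("UPDATE customers SET name = 'Benjamin', updated_at = '2024-01-12' WHERE id = 2")
delta = delta_extract(conn, watermark)
print(len(initial), delta)
```

Because the delta query touches only the changed rows, it also puts less load on the source system than repeating the full extraction.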
Question 21 : What are dimensions? Explain with an example.
Answer :
A dimension table describes the business entities of an enterprise, that is, the
objects referred to in a fact table. A dimension table has a primary key which uniquely identifies each dimension
row. It is sometimes called a lookup or reference table. The primary key of the dimension table
is used to relate it to the fact table, which contains the matching foreign key. Dimension tables are
normally in de-normalized form because they are used only to analyse the data and not to
execute transactions.
Dimension attributes are typically used for :
1. Query constraining
2. Grouping / Filtering
3. Report labeling
The following are examples of dimensions :
1. Time
2. Location
3. Item
4. Branch
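The relationship between a dimension table and a fact table can be sketched with a Location dimension. The table and column names (dim_location, fact_sales) and the sample rows are hypothetical, and SQLite is used only because it needs no setup.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension table: the primary key identifies each dimension row.
CREATE TABLE dim_location (location_id INTEGER PRIMARY KEY, city TEXT);
-- Fact table: holds the measure plus a foreign key to the dimension.
CREATE TABLE fact_sales (
    sale_id INTEGER PRIMARY KEY,
    location_id INTEGER REFERENCES dim_location(location_id),
    amount REAL
);
INSERT INTO dim_location VALUES (1, 'Pune'), (2, 'Mumbai');
INSERT INTO fact_sales VALUES (1, 1, 100.0), (2, 1, 50.0), (3, 2, 75.0);
""")

# The dimension attribute (city) is used for grouping and as the report label.
report = conn.execute("""
    SELECT d.city, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_location d ON d.location_id = f.location_id
    GROUP BY d.city
    ORDER BY d.city
""").fetchall()
print(report)
```

Adding a WHERE clause on d.city would be an example of using the same dimension to constrain the query.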
1. Standardizing data : Data is fetched from multiple sources, so it needs to be standardized as per the
target system.
2. Character set conversion : The character sets need to be converted as per the target system (for example,
in first name and last name values).
3. Calculated and derived values : The source system holds a first value and a second value, and the target needs
a value calculated from the two.
4. Data conversion between formats : If the source system stores a date in DDMMYY format and the target
stores it in DDMONYYYY format, the conversion needs to be done in the transformation phase.
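Each of the four transformation types above can be sketched as a small Python function. The country mapping, the Latin-1 source encoding and the choice of addition for the derived value are assumptions made for the example; the text does not specify them.

```python
from datetime import datetime

MONTHS = ("JAN", "FEB", "MAR", "APR", "MAY", "JUN",
          "JUL", "AUG", "SEP", "OCT", "NOV", "DEC")

def standardize_country(value):
    """Standardizing: map source spellings to one code (hypothetical mapping)."""
    mapping = {"india": "IN", "in": "IN", "united states": "US", "usa": "US"}
    cleaned = value.strip()
    return mapping.get(cleaned.lower(), cleaned.upper())

def to_utf8(raw, source_encoding="latin-1"):
    """Character set conversion: re-encode source bytes as UTF-8 for the target."""
    return raw.decode(source_encoding).encode("utf-8")

def derive_total(first_val, second_val):
    """Derived value: the sum is assumed here; the real rule comes from the mapping document."""
    return first_val + second_val

def convert_date(ddmmyy):
    """Format conversion: DDMMYY in the source to DDMONYYYY in the target."""
    # %y follows Python's pivot: 00-68 map to 2000-2068, 69-99 map to 1969-1999.
    parsed = datetime.strptime(ddmmyy, "%d%m%y")
    return f"{parsed.day:02d}{MONTHS[parsed.month - 1]}{parsed.year:04d}"

print(standardize_country(" India "), derive_total(2, 3), convert_date("050324"))
```

The month names come from a fixed tuple rather than strftime("%b") so that the output does not depend on the locale of the machine running the job.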
Question 24 : Explain partitioning in ETL.
Answer :
Transactions are divided into parts for better performance. This process is
known as partitioning. It ensures that the server can access the sources directly through
multiple connections.
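The idea of partitioning can be sketched by splitting a key range into chunks and processing the chunks in parallel. The in-memory SOURCE dictionary stands in for a source table and each worker thread stands in for one connection; both are assumptions for illustration, and ETL tools implement partitioning in their own ways.

```python
from concurrent.futures import ThreadPoolExecutor

def make_partitions(min_id, max_id, parts):
    """Split the inclusive key range [min_id, max_id] into contiguous chunks."""
    size = -(-(max_id - min_id + 1) // parts)  # ceiling division
    return [(lo, min(lo + size - 1, max_id)) for lo in range(min_id, max_id + 1, size)]

# Stand-in for a source table: id -> amount
SOURCE = {i: i * 10 for i in range(1, 11)}

def process_partition(bounds):
    """Read and aggregate one partition; in a real job this would use its own connection."""
    lo, hi = bounds
    return sum(SOURCE[i] for i in range(lo, hi + 1))

partitions = make_partitions(1, 10, 3)
with ThreadPoolExecutor(max_workers=3) as pool:
    partial_sums = list(pool.map(process_partition, partitions))  # results keep partition order
total = sum(partial_sums)
print(partitions, partial_sums, total)
```

A useful test for any partitioned job is the one implied here: the combined result of all partitions must equal the result of processing the whole range in one pass.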
Initial Load : Populates all the data warehouse tables from the source system for the first time.
Incremental Load : Applies the ongoing changes periodically, as required.
Full Refresh : Completely erases the data from one or more tables and reloads fresh data.
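The three load types can be compared on one small table. The dw_product table and its rows are hypothetical, and SQLite's INSERT OR REPLACE is used as a simple stand-in for the update-or-insert logic of an incremental load.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the data warehouse
conn.execute("CREATE TABLE dw_product (id INTEGER PRIMARY KEY, price REAL)")

def snapshot():
    return conn.execute("SELECT id, price FROM dw_product ORDER BY id").fetchall()

def initial_load(rows):
    """Initial load: populate the empty warehouse table from the source."""
    conn.executemany("INSERT INTO dw_product VALUES (?, ?)", rows)

def incremental_load(changes):
    """Incremental load: apply only new or changed rows, keeping the rest."""
    conn.executemany("INSERT OR REPLACE INTO dw_product VALUES (?, ?)", changes)

def full_refresh(rows):
    """Full refresh: erase the table and reload it with fresh data."""
    conn.execute("DELETE FROM dw_product")
    conn.executemany("INSERT INTO dw_product VALUES (?, ?)", rows)

initial_load([(1, 10.0), (2, 20.0)])
after_initial = snapshot()
incremental_load([(2, 25.0), (3, 30.0)])  # one changed row, one new row
after_incremental = snapshot()
full_refresh([(9, 99.0)])
after_refresh = snapshot()
print(after_initial, after_incremental, after_refresh)
```

After the incremental load the untouched row 1 is still present, while after the full refresh only the newly supplied data remains.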
Question 26 : What are the different ETL tools?
Answer :
The following are some of the ETL tools :
Informatica
IBM DataStage
Ab Initio
CloverETL
1. Pentaho
2. Kettle
These are the most important ETL testing interview questions for testers. I hope you like this article on ETL Testing
Interview Questions. If you find it useful or have any suggestions, kindly leave them in the
comment box.