Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
BI system
After understanding what the BI system is, it's time to discover more about its components and
understand how these components work with each other. There are also some BI tools that help to
implement one or more components. The following diagram shows an illustration of the
The BI architecture and components differ based on the tools, environment, and so on. The
architecture shown in the preceding diagram contains components that are common in most of the
BI systems. In the following sections, you will learn more about each component.
purpose of data analysis and reporting. This purpose changes the design of this database as well.
As you know, operational databases are built on normalization standards, which are efficient for
transactional systems, for example, to reduce redundancy. As you probably know, a 3NF-designed
database for a sales system contains many tables related to each other. So, for example, a report
on sales information may consume more than 10 joined conditions, which slows down the
response time of the query and report. A data warehouse comes with a new design that reduces
the response time and increases the performance of queries for reports and analytics. You will
learn more about the design of a data warehouse (which is called dimensional modeling) later in
this chapter.
It is very likely that more than one system acts as the source of data required for the BI system. So
there is a requirement for data consolidation that extracts data from different sources and
transforms it into the shape that fits into the data warehouse, and finally, loads it into the data
warehouse; this process is called Extract Transform Load (ETL). There are many challenges in
the ETL process, out of which some will be revealed (conceptually) later in this chapter.
According to the definition of states, ETL is not just a data integration phase. Let's discover more
about it with an example; in an operational sales database, you may have dozen of tables that
provide sale transactional data. When you design that sales data into your data warehouse, you
can denormalize it and build one or two tables for it. So, the ETL process should extract data from
the sales database and transform it (combine, match, and so on) to fit it into the model of data
warehouse tables.
There are some ETL tools in the market that perform the extract, transform, and load operations.
The Microsoft solution for ETL is SQL Server Integration Service (SSIS), which is one of the best
ETL tools in the market. SSIS can connect to multiple data sources such as Oracle, DB2, Text
Files, XML, Web services, SQL Server, and so on. SSIS also has many built-in transformations to
transform the data as required. Chapter 4, ETL with Integration Services, is about SSIS and how
A data warehouse is designed to be the source of analysis and reports, so it works much faster
than operational systems for producing reports. However, a DW is not that fast to cover all
requirements because it is still a relational database, and databases have many constraints that
reduce the response time of a query. The requirement for faster processing and a lower response
time on one hand, and aggregated information on another hand causes the creation of another
layer in BI systems. This layer, which we call the data model, contains a file-based or memory-
based model of the data for producing very quick responses to reports.
Microsoft's solution for the data model is split into two technologies: the OLAP cube and the In-
memory tabular model. The OLAP cube is a file-based data storage that loads data from a data
warehouse into a cube model. The cube contains descriptive information as dimensions (for
example, customer and product) and cells (for example, facts and measures, such as sales and
and Time. Each cell in the cube shows a junction of these three dimensions. For example, if we
store the sales amount in each cell, then the green cell shows that Devin paid 23$ for a Hat on
June 5. Aggregated data can be fetched easily as well within the cube structure. For example, the
orange set of cells shows how much Mark paid on June 1 for all products. As you can see, the
cube structure makes it easier and faster to access the required information.
Microsoft SQL Server Analysis Services 2012 comes with two different types of modeling:
multidimensional and tabular. Multidimensional modeling is based on the OLAP cube and is fitted
with measures and dimensions, as you can see in the preceding diagram. The tabular model is
based on a new In-memory engine for tables. The In-memory engine loads all data rows from
tables into the memory and responds to queries directly from the memory. This is very fast in
terms of the response time. You will learn more about SSAS Multidimensional in Chapter 2, SQL
Server Analysis Services Multidimensional Cube Development, and about SSAS Tabular
in Chapter 3, Tabular Development of SQL Server Analysis Services, of this book. The BI
Multidimensional solutions.
Data visualization
The frontend of a BI system is data visualization. In other words, data visualization is a part of
the BI system that users can see. There are different methods for visualizing information, such as
strategic and tactical dashboards, Key Performance Indicators (KPIs), and detailed or
consolidated reports. As you probably know, there are many reporting and visualizing tools on the
market.
Microsoft has provided a set of visualization tools to cover dashboards, KPIs, scorecards, and
dashboard tool that performs best when connected to SSAS Multidimensional OLAP cube. You
will learn about PerformancePoint in Chapter 10, Dashboard Design. Microsoft's SQL Server
Reporting Services (SSRS) is a great reporting tool for creating detailed and consolidated
reports. SSRS is a mature technology in this area, which will be revealed in Chapter 9, Reporting
Services. Excel is also a great slicing and dicing tool especially for power users. There are also
components in Excel such as Power View, which are designed to build performance dashboards.
You will learn more about Power View in Chapter 9, Reporting Services, and about Power BI
features of Excel 2013 in Chapter 11, Power BI. Sometimes, you will need to embed reports and
dashboards in your custom written application. Chapter 12, Integrating Reports in Application,
Every organization has a part of its business that is common between different systems. That part
of the data in the business can be managed and maintained as master data. For example, an
organization may receive customer information from an online web application form or from a retail
Master Data Management (MDM) is the process of maintaining the single version of truth for
master data entities through multiple systems. Microsoft's solution for MDM is Master Data
Services (MDS). Master data can be stored in the MDS entities and it can be maintained and
changed through the MDS Web UI or Excel UI. Other systems such as CRM, AX, and even DW
can be subscribers of the master data entities. Even if one or more systems are able to change the
master data, they can write back their changes into MDS through the staging architecture. You will
The quality of data is different in each operational system, especially when we deal with legacy
systems or systems that have a high dependence on user inputs. As the BI system is based on
data, the better the quality of data, the better the output of the BI solution. Because of this fact,
working on data quality is one of the components of the BI systems. As an example, Auckland
might be written as "Auck land" in some Excel files or be typed as "Aukland" by the user in the
input form.
As a solution to improve the quality of data, Microsoft provided users with DQS. DQS works based
on Knowledge Base domains, which means a Knowledge Base can be created for different
domains, and the Knowledge Base will be maintained and improved by a data steward as time
passes. There are also matching policies that can be used to apply standardization on the data.
You will learn more about DQS in Chapter 6, Data Quality and Data Cleansing.
What Is BI Architecture?
Business intelligence architecture is a term used to describe standards and policies for
organizing data with the help of computer-based techniques and technologies that create
business intelligence systems used for online data visualization, reporting, and analysis.
One without the other wouldn’t function, and we will now explain premises that surround
their framework by using a BI architecture diagram to fully understand how data
warehouse enhances the BI processes.
1. Collection of data
2. Data integration
3. Storage of data
4. Data analysis
5. Distribution of data
6. Reaction based on insights
**click to enlarge**
We can see in our BI architecture diagram how the process flows through various layers,
and now we will focus on each.
1. Collection of data
The first step in creating a stable architecture starts in gathering data from various data
sources such as CRM, ERP, databases, files or APIs, depending on the requirements
and resources of a company. Modern BI tools offer a lot of different, fast and easy data
connectors to make this process smooth and easy by using smart ETL engines in the
background. They enable communication between scattered departments and systems
that would otherwise stay disparate. From a business point of view, this is a crucial
element in creating a successful data-driven decision culture that can eliminate errors,
increase productivity, and streamline operations. You have to collect data in order to be
able to manipulate with it.
2. Data integration
When data is collected through scattered systems, the next step continues in extracting
data and loading it to a data warehouse. This process is called ETL (Extract-Transform-
Load).
With an increasing amount of data generated today and the overload on IT departments
and professionals, ETL as a service comes as a natural answer to solve complex data
requests in various industries. The process is simple; data is pulled from external sources
(from our step 1) while ensuring that these sources aren’t negatively impacted with the
performance or other issues. Secondly, data is conformed to the demanded standard. In
other words, this (transform) step ensures data is clean and prepared to the final stage:
loading into a data warehouse.
3. Data storage
Now we approach the data warehousing and business intelligence concepts. While both
terms are often used interchangeably, there are certain differences that we will focus on
to get a more clear picture on this topic.
**click to enlarge**
The main differences, as we can also see in the visual, between business intelligence
and data warehousing are indicated in these main questions:
Business intelligence and data warehousing have different goals. While they are
connected and cannot function without each other, as mentioned earlier, BI is mainly
focused on generating business insights, whether operational or strategic efficiency such
as product positioning and pricing to goals, profitability, sales performance, forecasting,
strategic directions, and priorities on a broader level. The point is to access, explore, and
analyze measurable aspects of a business. On the other hand, a data warehouse (DWH)
has its significance in storing all the company’s data (from one or several sources) in a
single place. In a nutshell, BI systems and tools make use of data warehouse while data
warehouse acts as a foundation for business intelligence.
Exclusive Bonus Content: Wondering about BI & DWH? Learn right here!
Get the short and sweet version with our completely free summary!
b) What is the output?
The output data of both terms also vary. While BI outputs information through data
visualization, online dashboards, and reporting, data warehouse outlines data in
dimension and fact tables for upstream applications (or BI tools). Data cleansing,
metadata management, data distribution, storage management, recovery, and backup
planning are processes conducted in a data warehouse while BI makes use of tools that
focus on statistics, visualization, and data mining.
The output difference is closely interlaced with the people that can work with either BI or
data warehouse. But let’s see this through our next major aspect.
To expand our previous point, the people involved in managing the data are quite
different. C-level executives or managers use modern BI tools in the form of a real-time
dashboard since they need to derive factual intelligence, create effective sales reports or
forecast strategic development of the department or company. CEOs or sales managers
cannot manage data warehouse since it’s not their area of expertise; they need a tool that
will translate the heavy IT data into insights that an average business user can fully
understand. That’s where business intelligence creates a solid bridge between DWH and
BI. On the other hand, a data warehouse is usually dealt with by data (warehouse)
engineers and back-end developers. They are the technical chain in a BI architecture
framework that design, develop, and maintain systems for future data analysis and
reporting a business might need.
With the expansion of data processed and created in our digital age, the tools and
software needed to perform analysis expanded and developed in recent years in ways
we could not have imagined. In this context, the need for utilizing a proper tool, a
stable business intelligence dashboard and data warehouse increased exponentially. In
such environment, the data warehouse processes can be managed with a product such
as Amazon Redshift while the full support for BI insights needed to effectively generate
and develop sustainable business acumen with tools such as datapine. Visualization of
data is the core element that enables managers, professionals, and business users to
perform analysis on their own, without the need for heavy IT support or work.
Now that we have expounded what is data warehousing and business intelligence, we
continue with our next step: analyzing the BI architecture layers needed for establishing
a sustainable business development.
4. Analysis of data
In this step of our compact BI architecture, we will focus on the analysis of data after it’s
handled, processed, and cleaned in former steps with the help of data warehouse(s). The
ubiquitous need for successful analysis for empowering businesses of all sizes to grow
and profit is done through BI application tools. Especially when it comes to ad hoc
analysis that enables freedom, usability, and flexibility in performing analysis and helping
answer critical business questions swiftly and accurately.
This visual above represents the power of a modern, easy-to-use BI user interface.
Modern BI tools like datapine empower business users to create queries via drag and
drop, and build stunning data visualizations with a few clicks, even without profound
technological knowledge. This simplifies the process of creating business dashboards, or
an analytical report, and generate actionable insights needed for improving the
operational and strategic efficiency of a business. The data warehouse works behind this
process and makes the overall architecture possible.
5. Data distribution
Data distribution comes as one of the most important processes when it comes to sharing
information and providing stakeholders with indispensable insights to obtain sustainable
business development. Distribution is usually performed in 3 ways:
a) Reporting via automated e-mails: Created reports can be shared with selected
recipients on a defined schedule. The dashboards will be automatically updated on a
daily, weekly or monthly basis which eliminates manual work and enables up to date
information.
c) Embedding: This form of data distribution is enabled through embedded BI. Your own
application can use dashboards as a mean of analytics and reporting without the need
for labeling the BI tool in external applications or intranets.
**click to enlarge**
This dashboard is the final product on how data warehouse and business intelligence
work together. The processes behind this visualization include the whole architecture
which we have described, but it would not be possible to achieve without a firm data
warehouse solution. Ultimately, this enables a high-level manager to get a
comprehension of the strategic development and potential decisions for creating and
maintaining a stable business.
On this particular dashboard, you can see the total revenue, as well as on a customer
level, adding also the costs. The targets are also set so that the dashboard immediately
calculates if they have been met or additional adjustments are needed from a
management point of view. As revenue is one of the most important factors when
evaluating if the business is growing, this management dashboard ensures all the
essential data is visualized and the user can easily interact with each section, on a
continual basis, making the decision processes more cohesive and, ultimately, more
profitable.
Exclusive Bonus Content: Wondering about BI & DWH? Learn right here!
Get the short and sweet version with our completely free summary!
Although the terms have been used as synonyms in recent years, today they function on
diverse levels, but the perspective is the same: analyze, clean, monitor, and evaluate the
data in the finest and most productive way possible.