Sei sulla pagina 1di 19

OLAP and BIG DATA

Challenges and Solutions


Presented By:
Contents

• Introduction
• OLAP
• Big Data
• OLAP AND BIG DATA Today
• Current problems
• Possible solutions and future research
OLAP

• OLAP stand for Online Analytical Processing.


• OLAP provides analysis of data stored in a database.
• User easily & selectively extract data
• OLAP tools enable users to analyze different dimensions of
multidimensional data.
• Analyze DBI from multiple DB at a same time
• Data aggregation in many different ways
• Chief component
Different forms of OLAP

Three ways of storing data:


• Multidimensional OLAP (MOLAP)
• Best Query Performance
• Relational OLAP (ROLAP)
• Ideal for large databases
• Hybrid OLAP (HOLAP)
• Best of both worlds!
BIG DATA

Big data describes the large volume of data for both structured and unstructured
data.
• Big Data may will be the Next Big Thing in the IT world.
• Firms like Google, eBay, LinkedIn, and Facebook were built around big data from
the beginning.
• Like many new information technologies, big data can bring about dramatic cost
reductions, improvements in the time required to perform a computing task.
Cont.…

• ‘Big Data’ is similar to ‘small data’, but bigger in size but Big data has
requires different approaches techniques, tools and architecture.
• Main aim to solve new problems or old problems in a better way
• Big Data generates value from the storage and processing of very large
quantities of digital information that cannot be analyzed with traditional
computing techniques.
Characteristics of Big Data
• Big Data Volume
A typical PC might have had 10 gigabytes of storage in 2000.
• Facebook store 500 terabytes of new data every day.
• Boeing 737 will generate 240 terabytes of flight data during a single flight across the US.
• Excel used to hold up to 65k rows in a single spreadsheet. Now it holds over a million

• Big Data Velocity


Data comes extremely fast
• Twitter Firehose (6,000 tweets per second)
• Infrastructure and sensors generate massive log data in real-time.
• Online gaming systems support millions of current users, each producing multiple inputs per second.
Cont.

• Big Data Variety


• Big Data isn't just numbers, dates, and strings. Big Data is also 3D data, audio and
video, and unstructured text, including log files and social media.
• Estimated 80% of all enterprise data is unstructured
• Traditional database systems were designed to address smaller volumes of structured
data, fewer updates or a predictable, consistent data structure.
• noSql Databases
What is OLAP on Big Data?

• OLAP on Big Data is a powerful concept that involves pre-aggregation of


massive volumes of data into multidimensional cubes and then querying
them to get faster results.
• The process involves building OLAP cubes with all the dimensions and
measures on which the user would want to explore the data and then
enabling BI tools to fetch data directly from these cubes.
• Cubes serve the queries.
• There is no need to connect the Big Data platform every time the user asks a
question and responses are instant.
Cont.…

• Users do not need to move away from their traditional BI environment to


perform Big Data analytics.
• They can continue using their existing tools and work seamlessly, without
worrying about the size and scale of the data.
OLAP is not new, will it work on Big Data?

OLAP is not a new concept and is used widely to enable easy, interactive
business intelligence on enterprise data.
When traditional OLAP solutions are brought into the world of Big Data,
they fail to perform as they cannot deal with the
i. massive increase in the data volume,
ii. the explosion of cardinality and dimensions,
iii. and the large variety of data sources.
Question is how to achieve OLAP on Big Data?

• To be able to deal with the scale of Big Data, the OLAP layer needs to be
built directly on the Big Data platform, utilizing its build and store capacity.

• This will also ensure that the cubes can handle many dimensions to deal
with the variety of data as well as accommodate the high volume and
velocity of Big Data.
What OLAP on Big Data can do
for you?

By creating an OLAP layer on your Big Data platform,


you can interact with your Big Data visually using a BI tool of your choice.
There is no need to wait for insights and you can access massive volumes of
data to get quick answers to all the questions that you may have.
You can slice and dice, drill-down, and explore all aspects of your data to
get deeper insights and make smarter, more informed decisions
Possible solutions and future research

1. innovative methodologies for designing OLAP data cubes over Big Data:
there is a strong need for methodologies capable of dealing with
requirements posed by designing and modeling OLAP data cubes over Big
Data
2. Complex OLAP data cubes over Big Data due to the intrinsic complexity of
Big Data sets, it follows the need for defining and exploiting complex
OLAP data cubes over Big Data, tailored to support advanced data-
intensive large-scale scientific applications
Bibliography
• https://www.m-brain.com/home/technology/big-data-with-8-vs/
• https://www.techopedia.com/definition/14647/data-aggregation
• https://www.webopedia.com/TERM/O/OLAP.html
• https://datascience.stackexchange.com/questions/8384/examples-of-the-three-vs-of-
big-data
• https://searchsqlserver.techtarget.com/definition/MOLAP
• https://searchoracle.techtarget.com/definition/relational-online-analytical-processing
• https://searchsqlserver.techtarget.com/definition/hybrid-online-analytical-processing
• https://www.kyvosinsights.com/understanding-olap-on-big-data-why-do-you-need-it/

Potrebbero piacerti anche