Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
OLAP
Hector Garcia-Molina
Stanford University
Warehousing
● Growing industry: $8 billion in 1998
● Range from desktop to huge:
◆ Walmart: 900-CPU, 2,700 disk, 23TB
Teradata system
● Lots of buzzwords, hype
◆ slice & dice, rollup, MOLAP, pivot, ...
Metadata Warehouse
Integration
Source Source
Client Client
Mediator
● Operators
◆ slice & dice
◆ roll-up, drill down
◆ pivoting
◆ other
sale
orderId
date customer
product
custId custId
prodId
prodId name
name
storeId address
price
qty city
amt
store
storeId
city
store
storeId
city
snowflake schema
constellations region regId name
north cold region
south warm region
dimensions = 2
dimensions = 3
rollup
drill-down
c1 c2 c3
sum 67 12 50
c1 c2 c3
p1 56 4 50
p2 11 8
129
sum
rollup p1 110
p2 19
drill-down
c1 c2 c3
day 2 ...
p1 44 4
p2 c1 c2 c3
day 1
p1 12 50
p2 11 8 sale(c1,*,*)
c1 c2 c3
sum 67 12 50
c1 c2 c3
p1 56 4 50
p2 11 8
129
sum
sale(c2,p2,*) p1 110
p2 19 sale(*,*,*)
* c1 c2 c3 *
p1 56 4 50 110
p2 11 8 19
day 2 c1* c267 c312 * 50 129
p1 44 4 48
p2
c1 c2 c3 *
day 1
p1 *
12 44 4
50 62 48 sale(*,p2,*)
p2 11 8 19
* 23 8 50 81
c1 c2 c3
day 2
p1 44 4
customer
p2 c1 c2 c3
day 1
p1 12 50 region
p2 11 8
country
region A region B
p1 56 54
p2 11 8
(customer c1 in Region A;
customers c2, c3 in Region B)
c1 c2 c3
p1 56 4 50
p2 11 8
● Data transformation
◆ convertdata to uniform format
◆ remove & add fields (e.g., add date to get history)
Metadata Warehouse
Integration
Metadata Warehouse
Integration
tools
relational
DBMS
ty
B
Ci
A
milk
Product
M.D. tools soda
eggs
soap
1 2 3 4
Date
utilities
multi- could also
dimensional sit on
relational
server DBMS
...
inverted data
age
lists records
index
...
1
1
age bit data
index maps records
c1 c2 c3
p1 67 12 50
c1 c2 c3
p1 56 4 50
p2 11 8
129
c1
materialize p1 110
p2 19
c1 c2 c3
p1 67 12 50
city product date
use greedy
day 2
c1 c2 c3
city, product, date algorithm to
day 1
p1
p2 c1
44
c2
4
c3 decide what
to materialize
p1 12 50
p2 11 8
all
city
state
city, product, date
state, date
state, product
months conceptual
dimension table
days