Milomir Vojvodic,
Business Development Manager, EMEA DIS
Social Data
Machine-Generated Data
Oracle Data Integration Solutions for Big Data
Architecture Principles and Best Practices
Integrated Architecture
Data types: Structured, Semistructured, Unstructured
Data sources: Master & Ref Data; Transaction Data; Machine Generated; Social Media; Text, Image, Video, Audio
Phase labels: Capture, Stream, Acquire, Store/Process, Organize, Integrate, Analyze, Govern
Integration mechanisms: CDC, DB Replication, ETL/ELT, Message-Based, Real-Time Streaming (CEP Engine)
Data stores: DBMS (OLTP), ODS, Data Warehouse, Data Marts, Key-Value Data Store, Hadoop Cluster with MapReduce
Analysis and delivery: Alerting, EPM, BI Applications, Reporting & Dashboards, Text Analytics and Search, In-Database Analytics, Advanced Analytics, Visual Discovery
Management, Security, Governance
Platforms: Oracle Exadata, Oracle Exalytics
Load the results of big data processing into your data warehouse for further analysis.
Access your customer information while you process your big data, so that you can look for patterns.
Legacy
Sources
Oracle GoldenGate
DB Replica and CDC within Data Integration Layer
Architecture Principles and Best Practices
OGG: Source DB to Target DB
Second OGG Differentiator: Moving only committed transactions
OGG, OGG ADG
Zero Downtime Migrations & Upgrades
Active/Active DB Deployment
Disaster Recovery
Reporting Database
New DB/HW/OS/APP
DW Synchronization
Data Warehouse
Chart: Hours (axis 0 to 150) by year, Year1 to Year5
Chart: No. of Required CPUs (axis 0 to 120) for Primary Site, Disaster Recovery, and Test and Development. Required no. of CPUs can be doubled.
Chart: Oracle License Costs in millions (axis $0 to $3). Costs can be doubled.
Diagram: source log operations across transactions, with Capture, Pump, and Delivery checkpoints
TX 1: Begin, Insert, Update
TX 2: Begin, Insert, Commit
TX 3: Begin, Insert, Commit
TX 4: Begin, Delete
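The log sequence on this slide illustrates the differentiator stated earlier: operations from several transactions interleave in the source log, but only committed ones move on. A minimal Python sketch of that idea follows. It is not GoldenGate's implementation; the record format and the function name `committed_only` are assumptions made for illustration.

```python
from collections import defaultdict

def committed_only(log):
    """Yield (tx_id, operations) for each transaction, in commit order.

    `log` is an iterable of (operation, tx_id) pairs (hypothetical format).
    Operations are buffered per transaction until its Commit record is
    seen; transactions that never commit are never emitted.
    """
    pending = defaultdict(list)   # tx_id -> buffered operations
    for op, tx in log:
        if op == "Begin":
            pending[tx] = []
        elif op == "Commit":
            yield tx, pending.pop(tx, [])
        elif op == "Rollback":
            pending.pop(tx, None)
        else:
            pending[tx].append(op)

# Interleaved log modeled on the slide: TX 1 and TX 4 have no Commit yet.
log = [
    ("Begin", 1), ("Insert", 1), ("Begin", 2), ("Update", 1),
    ("Insert", 2), ("Commit", 2), ("Begin", 3), ("Insert", 3),
    ("Commit", 3), ("Begin", 4), ("Delete", 4),
]
shipped = list(committed_only(log))
print(shipped)
```

In this run only TX 2 and TX 3 are emitted; the work of TX 1 and TX 4 stays buffered because no Commit record has been read for them.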
ETL and Data Quality within Data Integration Layer
Architecture Principles and Best Practices
Data Silos
Uses: Custom Reporting, Data Migration, Data Replication, Business Intelligence, Enterprise Performance, Data Warehousing, Data Marts, Data Hubs, Data Federation
Custom integration: Batch Scripts, Data Access, SQL, Java, Custom
Data Warehouse, Data Mart
Sources: Oracle; PeopleSoft, Siebel, SAP; Custom Apps; Files; Excel; XML; OLAP
First ODI Differentiator
Transformations using the power of the Target Database, no staging server
ODI E-LT
Staging Server
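To make the E-LT idea concrete: source rows are loaded into the target database unchanged, and the transformation then runs there as set-based SQL instead of on a separate staging server. The sketch below uses Python's built-in sqlite3 module as a stand-in for the target database. The table and column names are hypothetical, and this is not code that ODI generates.

```python
import sqlite3

# In-memory SQLite stands in for the target database (assumption).
target = sqlite3.connect(":memory:")
target.executescript("""
    CREATE TABLE stg_orders (order_id INTEGER, country TEXT, amount_cents INTEGER);
    CREATE TABLE dw_sales   (country TEXT PRIMARY KEY, total_amount REAL);
""")

# Extract + Load: rows arrive untransformed in a table inside the target.
source_rows = [(1, "us", 1250), (2, "US", 750), (3, "de", 4000)]
target.executemany("INSERT INTO stg_orders VALUES (?, ?, ?)", source_rows)

# Transform: one set-based statement executed by the target engine itself.
target.execute("""
    INSERT INTO dw_sales (country, total_amount)
    SELECT UPPER(country), SUM(amount_cents) / 100.0
    FROM stg_orders
    GROUP BY UPPER(country)
""")
target.commit()

result = target.execute(
    "SELECT country, total_amount FROM dw_sales ORDER BY country"
).fetchall()
print(result)
```

The design point is that the cleanup and aggregation are expressed once, in SQL, and executed where the data already sits.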
ODI
OGG
Data Warehouse
Journalize: Read from CDC Source
Load: From Sources to Staging
Check: Constraints before Load
Integrate: Transform and Move to Targets
Service: Expose Data and Transformation Services
CDC Sources, Staging Tables, Error Tables, Target Tables, Services
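The Check step named on this slide can be sketched as follows: staged rows are validated against declared constraints before integration, and violations are diverted to an error table instead of reaching the target. This is a plain-Python illustration only; the constraint names, the row layout, and the `check_and_integrate` function are hypothetical.

```python
def check_and_integrate(staging_rows, constraints):
    """Split staged rows into target rows and error rows.

    `constraints` maps a constraint name to a predicate over a row.
    A row that fails any predicate goes to the error list together
    with the names of the constraints it violated.
    """
    target, errors = [], []
    for row in staging_rows:
        violated = [name for name, ok in constraints.items() if not ok(row)]
        if violated:
            errors.append({"row": row, "violations": violated})
        else:
            target.append(row)
    return target, errors

# Hypothetical constraints and sample rows.
constraints = {
    "customer_id_not_null": lambda r: bool(r.get("customer_id")),
    "zip_length_5_or_9": lambda r: len(r.get("zip", "").replace("-", "")) in (5, 9),
}
staging = [
    {"customer_id": "A1", "zip": "22031-4001"},
    {"customer_id": "", "zip": "18704"},
    {"customer_id": "A3", "zip": "3056"},
]
target_rows, error_rows = check_and_integrate(staging, constraints)
print(len(target_rows), [e["violations"] for e in error_rows])
```

Keeping the rejected rows together with the violated constraint names is what makes the error table useful for later correction and reload.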
Benefits
SQL Server Triggers
Log Miner
DB2 Journals
Oracle DBLink
JMS Queues
Oracle SQL*Loader
DB2 Exp/Imp
TPump/Multiload
Check MS Excel
Check Sybase
Oracle Merge
Siebel EIM Schema
Type II SCD
Oracle Web Services
DB2 Web Services
Automatically Generate Dataflow
OGG, OGG ADG, ODI, ODI EDQ
Zero Downtime Migrations & Upgrades
Active/Active High Availability
New DB/HW/OS/APP
Query Off-Loading and Disaster Recovery
BI&DW Synchronization and Loading
Data Warehouse
Columns: Customer ID, Customer Name, Address 1, Address 2, City, State, Zip, Country, Birth Date, Gender
Customer ID: AD23298, VS38611, DC18223, CO9387A, TZ35019, CB27843, OX80306, JP70210, RD48107
Customer Name: Mr Peter Mayhew; Mr Zachary P Jahn
Address 1: 9407 Main St; 144 E Grove St; 4912 E 41st N; 14 Oxbridge Way; 57 Hadleigh Close; 14 Oxbridge Wy
Remaining column values, in slide order:
Fairfax, VA, 22031-4001, USA, 02/23/61, M
Kingston, PA, 18704, US, 07/12/57
Kansas City, MO, 64111-3349, USA, 02/23/63
Idaho Falls, ID, 83401, USA, 31/03/2007, N/A
Aiea, Hawaii, 96701, 1710, Male
Webster, NY, USA, 11/17/1971
Milfrod, NH, 03055-4614, US, 05/28/67
MA, NH, 3056, USA, USA, 01/01/01, Y, M
Apt 205, Westlea
Data quality issues highlighted:
Attributes non-standard, missing or invalid
Abbreviations (often ambiguous)
Inconsistent formats
Compound Names
Mis-Fielded Data
Erroneous Data
Multiple Names
Widespread duplication (often hard to spot)
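Two of the issues called out on this slide, inconsistent formats and hard-to-spot duplication, can be illustrated with a small sketch. It does not describe how any particular data quality product works; the list of date formats, the abbreviation table, and both function names are assumptions. Two-digit years such as 02/23/61 are deliberately left unresolved (the function returns None), because choosing a century is a business rule.

```python
from datetime import datetime

# Assumed formats, tried in this order; four-digit years only.
DATE_FORMATS = ("%m/%d/%Y", "%d/%m/%Y")

def normalize_date(text):
    """Return an ISO date string, or None if no assumed format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None

# Hypothetical abbreviation table; real ones are larger and often ambiguous.
ABBREVIATIONS = {"wy": "way", "st": "street"}

def address_key(address):
    """Lower-case an address and expand known abbreviations for matching."""
    words = address.lower().replace(".", "").split()
    return " ".join(ABBREVIATIONS.get(w, w) for w in words)

print(normalize_date("11/17/1971"))
print(normalize_date("31/03/2007"))
print(normalize_date("N/A"))
print(address_key("14 Oxbridge Way") == address_key("14 Oxbridge Wy"))
```

Comparing normalized keys instead of raw strings is what lets the two Oxbridge addresses surface as likely duplicates.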
Item: Motor
Classification: 26101600
Power: 10 horsepower
Voltage: 115
Mounting: Yoke
Validate
300 Berry: 300 Berry St
SubPremise: #1210, Unit 1210
Locality: SF, San Francisco
AdministrativeArea: California, CA
PostCode: 94158-1670
Oracle Data Integrator for Big Data
Architecture Principles and Best Practices
Oracle Data Integrator: Loads the Hadoop Cluster and Transforms Via MapReduce
Oracle Data Integrator: Activates Oracle Loader for Hadoop
Oracle Loader for Hadoop: Loads Oracle Database, Oracle Exadata
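The flow on this slide (transform inside the Hadoop cluster via MapReduce, then load the result into the database) can be pictured with a single-process sketch. It imitates the map, shuffle, and reduce phases in plain Python and uses sqlite3 as a stand-in for the target database. It does not use Hadoop, Oracle Loader for Hadoop, or ODI, and the log format and all names are hypothetical.

```python
import sqlite3
from collections import defaultdict

def map_phase(line):
    """Emit (page, 1) for one 'user,page' log line (hypothetical format)."""
    user, page = line.split(",")
    yield page, 1

def reduce_phase(key, values):
    """Sum the counts collected for one key."""
    return key, sum(values)

def run_mapreduce(lines):
    groups = defaultdict(list)          # shuffle: group values by key
    for line in lines:
        for key, value in map_phase(line):
            groups[key].append(value)
    return [reduce_phase(k, v) for k, v in sorted(groups.items())]

weblog = ["u1,/home", "u2,/home", "u1,/cart"]
aggregated = run_mapreduce(weblog)

# Load the reduced output into the stand-in warehouse table.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE page_hits (page TEXT PRIMARY KEY, hits INTEGER)")
db.executemany("INSERT INTO page_hits VALUES (?, ?)", aggregated)
db.commit()
loaded = db.execute("SELECT page, hits FROM page_hits ORDER BY page").fetchall()
print(loaded)
```

Only the small, aggregated result crosses into the database, which is the reason for doing the heavy transformation where the raw data lives.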