
Database scalability

Jonathan Ellis

Classic RDBMS persistence


[diagram: index and data on disk]

Disk is the new tape*


~8ms to seek
~4ms on expensive 15k rpm disks

What scaling means

[graph: performance vs. money]

Performance
Latency Throughput

Two kinds of operations


Reads Writes

Caching
Memcached, Ehcache, etc.

[diagram: cache in front of the DB]

Cache invalidation
Implicit Explicit

Cache set invalidation


get_cached_cart(cart=13, offset=10, limit=10)
get('cart:13:10:10') ?

Set invalidation 2
prefix = get('cart_prefix:13')
get(prefix + ':10:10')
del('cart_prefix:13')

http://www.aminus.org/blogs/index.php/2007/1
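A minimal sketch of the prefix (generation-key) trick above, assuming a memcached-style get/set/delete API. A plain dict stands in for memcached, and load_cart_page is a made-up stand-in for the real database query.

import time

cache = {}   # a dict standing in for memcached

def load_cart_page(cart_id, offset, limit):
    # Made-up stand-in for the real database query.
    return ['item-%d' % i for i in range(offset, offset + limit)]

def cart_prefix(cart_id):
    # The prefix itself is cached; every page key is built from it.
    key = 'cart_prefix:%d' % cart_id
    if key not in cache:
        cache[key] = '%d:%f' % (cart_id, time.time())
    return cache[key]

def get_cached_cart(cart_id, offset=10, limit=10):
    key = '%s:%d:%d' % (cart_prefix(cart_id), offset, limit)
    if key not in cache:
        cache[key] = load_cart_page(cart_id, offset, limit)
    return cache[key]

def invalidate_cart(cart_id):
    # del('cart_prefix:13'): dropping the prefix orphans every cached page
    # of the set at once; the stale entries just expire unused.
    cache.pop('cart_prefix:%d' % cart_id, None)

print(get_cached_cart(13))   # miss: loads from the "DB", then caches
invalidate_cart(13)
print(get_cached_cart(13))   # new prefix, so a fresh miss and reload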

Replication

Types of replication
Master → slave
Master → slave → other slaves

Master ↔ master
Multi-master

Types of replication 2
Synchronous Asynchronous

Synchronous
Synchronous = slow(er)
Complexity (e.g. 2PC)
PGCluster, Oracle
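Why synchronous replication is slower: a rough two-phase commit (2PC) sketch, in which the coordinator waits for every replica in the prepare round and again in the commit round. The Replica class and method names are invented for illustration.

class Replica:
    def prepare(self, txn):
        return True     # vote yes if the write can be applied
    def commit(self, txn):
        pass            # make the write durable
    def abort(self, txn):
        pass

def two_phase_commit(replicas, txn):
    # Phase 1: a full round trip; every replica must vote before we proceed.
    if all(r.prepare(txn) for r in replicas):
        # Phase 2: another full round trip to every replica.
        for r in replicas:
            r.commit(txn)
        return True
    for r in replicas:
        r.abort(txn)
    return False

print(two_phase_commit([Replica(), Replica()], {'id': 1}))   # True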

Asynchronous master/slave
Easiest
Failover
MySQL replication
Slony, Londiste, WAL shipping
Tungsten

Asynchronous multi-master
Conflict resolution
O(N³) or O(N²) as you add nodes

http://research.microsoft.com/~gray/replicas.ps

Bucardo, MySQL Cluster

Achtung!
Asynchronous replication can lose data if the master fails

Architecture
Primarily about how you cope with failure scenarios

Replication does not scale writes

Scaling writes
Partitioning aka sharding
Key / horizontal
Vertical
Directed

Partitioning

Key based partitioning


PK of root table controls destination
e.g. user id

Retains referential integrity
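A hypothetical sketch of key-based partitioning: hash the root PK (the user id) to pick a shard, so all of one user's rows, as in the blogger.com example below, land on the same node and referential integrity is kept locally. The node names and hashing choice are made up.

import hashlib

SHARDS = ['db01', 'db02', 'db03', 'db04']   # made-up node names

def shard_for_user(user_id):
    # Hash the root PK so the same user always maps to the same node.
    digest = hashlib.md5(str(user_id).encode()).hexdigest()
    return SHARDS[int(digest, 16) % len(SHARDS)]

# Users, Blogs, and Comments rows for user 42 all route to one node,
# so foreign keys between them stay local to that shard.
print(shard_for_user(42))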

Example: blogger.com (Users, Blogs, Comments)

Example: blogger.com (Users, Blogs, Comments, Comments')

Vertical partitioning
Tables on separate nodes
Often a table that is too big to keep with the other tables eventually gets too big for a single node, too

Growing is hard

Directed partitioning
Central DB that knows what server owns a key
Makes adding machines easier
Single point of failure
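A hypothetical sketch of directed partitioning: a central directory records which node owns each key. The dict stands in for that central database (the single point of failure noted above); node names and the assignment policy are made up.

directory = {}            # key -> owning node, kept in the central DB
nodes = ['db01', 'db02']

def locate(key):
    # Assign previously unseen keys to the least-loaded node.
    if key not in directory:
        counts = {n: 0 for n in nodes}
        for owner in directory.values():
            counts[owner] += 1
        directory[key] = min(nodes, key=lambda n: counts[n])
    return directory[key]

def add_node(name):
    # Adding capacity only means registering the node; new (or migrated)
    # keys can then be directed to it without rehashing everything.
    nodes.append(name)

print(locate('user:42'))   # e.g. 'db01'
add_node('db03')
print(locate('user:99'))   # new keys are spread across all three nodes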

Partitioning

Partitioning with replication

What these have in common


Ad hoc
Error-prone
Manpower-intensive

To summarize
Scaling reads sucks
Scaling writes sucks more

Distributed databases
Data is automatically partitioned
Transparent to application
Add capacity without downtime
Failure tolerant

*Like Bigtable, not Lotus Notes

Two famous papers


Bigtable: A Distributed Storage System for Structured Data (2006)
Dynamo: Amazon's Highly Available Key-value Store (2007)

The world doesn't need another half-assed key/value store


(See also Olin Shivers' 100% and 80% solutions)

Two approaches
Bigtable: How can we build a distributed database on top of GFS?
Dynamo: How can we build a distributed hash table appropriate for the data center?

Bigtable architecture

Lookup in Bigtable

Dynamo

Eventually consistent

Amazon: http://www.allthingsdistributed.com/2008/12

eBay: http://queue.acm.org/detail.cfm?id=139412

Consistency in a BASE world


If W + R > N, you are 100% consistent
W=1, R=N
W=N, R=1
W=Q, R=Q, where Q = N/2 + 1
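A worked example of the W + R > N rule for N = 3, sketched in Python; is_consistent and the constants are illustrative, not a client API.

N = 3   # replicas

def is_consistent(w, r, n=N):
    # A read of R replicas must overlap a write acknowledged by W replicas.
    return w + r > n

Q = N // 2 + 1   # quorum: N/2 + 1 with integer division, so 2 when N = 3

print(is_consistent(1, N))   # W=1, R=N -> True (read everything)
print(is_consistent(N, 1))   # W=N, R=1 -> True (write everywhere)
print(is_consistent(Q, Q))   # W=Q, R=Q -> True (2 + 2 > 3)
print(is_consistent(1, 1))   # W=1, R=1 -> False: stale reads are possible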

Cassandra

Memtable / SSTable

[diagram: commit log on disk; in-memory memtable flushed to SSTables on disk]

ColumnFamilies
keyA → column1, column2, column3
keyC → column1, column7, column11

Column: byte[] name, byte[] value, i64 timestamp
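An illustrative sketch of the data model as nested dicts: a ColumnFamily maps row keys to columns, and each column is (name, value, timestamp). The literal keys and values below are invented.

column_family = {
    b'keyA': {
        b'column1': (b'valueA1', 1257894000000),
        b'column2': (b'valueA2', 1257894000001),
        b'column3': (b'valueA3', 1257894000002),
    },
    b'keyC': {
        b'column1':  (b'valueC1', 1257894000000),
        b'column7':  (b'valueC7', 1257894000003),
        b'column11': (b'valueC11', 1257894000004),
    },
}

# Look up one column of one row: (value, timestamp)
print(column_family[b'keyA'][b'column2'])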

LSM write properties


No reads
No seeks
Fast
Atomic within ColumnFamily
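A minimal sketch of the LSM-style write path the last few slides describe: append to a commit log, update an in-memory memtable, and flush full memtables to sorted, immutable SSTables. This is illustrative pseudocode following the slides, not Cassandra's implementation.

class Memtable:
    def __init__(self):
        self.rows = {}
    def put(self, row_key, column, value, timestamp):
        self.rows.setdefault(row_key, {})[column] = (value, timestamp)

commit_log = []   # append-only file in the real system: sequential writes only
memtable = Memtable()
sstables = []     # immutable, sorted runs on disk in the real system

def write(row_key, column, value, timestamp):
    # 1. Append to the commit log (no reads, no seeks).
    commit_log.append((row_key, column, value, timestamp))
    # 2. Apply to the in-memory memtable.
    memtable.put(row_key, column, value, timestamp)

def flush():
    # When the memtable is full, dump it as a sorted SSTable and start fresh.
    global memtable
    sstables.append(sorted(memtable.rows.items()))
    memtable = Memtable()

write(b'keyA', b'column1', b'valueA1', 1)
write(b'keyC', b'column7', b'valueC7', 2)
flush()
print(sstables[0])   # sorted, immutable run of rows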

vs MySQL with 50GB of data


MySQL: ~300ms write, ~350ms read
Cassandra: ~0.12ms write, ~15ms read

Achtung!

Classic RDBMS persistence


[diagram: index and data on disk]

Questions
