Coding For Modern Distributed Storage Systems: Part 1.: Locally Repairable Codes

Coding for Modern Distributed
Storage Systems: Part 1.
Locally Repairable Codes
Parikshit Gopalan
Windows Azure Storage, Microsoft.
The data deluge …
Problem Statement:
We generate insane amounts of digital data.
We expect it to be stored reliably and accessible anytime, anywhere.
And for free!
 Total data in the cloud is of the order of few hundred Exabytes

 A terabyte hard drive costs $100.
 Even storing raw data costs hundreds of millions.
Hardware is no longer cheap.
Currently, data centers consume up to 3 percent of all global electricity

production while producing 200 million metric tons of carbon dioxide.
Data Storage: the basics
Goal: Tolerate one disk failure.
 Duplication
2X overhead.
Quick recovery.
 Simple XOR [RAID5]

Treat each disk as a bit vector.
1.2X overhead.
Slower recovery.
Data Storage: the basics
Goal: Tolerate two disk failures.
 Triplication
3X overhead.
Quick recovery.
 [6,4] Reed Solomon Code [RAID6]

1.5X overhead.
Slower recovery.
Need a larger field: each disk is a byte-vector.
Data storage: the basics
Reed Solomon codes
 𝑘 data symbols, 𝑛 − 𝑘 parity checks.
 Field size O(𝑛).
 Any k symbols suffice for full data recovery (MDS).
How many parity checks do you need?

o(k) redundancy seems to be sufficient.
[G.-Huang-Jenkins-Yekhanin’13]
 Failure rate 𝑝 is tiny (assume 𝑘 ⋅ 𝑝 < 0.5).
 Goal is (only) to be as reliable as 3-way replication.
Should be getting overheads close to 0!

Data storage: the basics
Reed Solomon codes
 𝑘 data symbols, 𝑛 − 𝑘 parity checks.
 Field size O(𝑛).
 Any k symbols suffice for full data recovery (MDS).
How many parity checks do you need?

o(k) redundancy seems to be sufficient.
Should be getting overheads close to 0!
Recovery cost would be prohibitively high:

 Need to read 𝑘 other disks (MDS).
 Limits us to small values of 𝑘 (< 25).
Degraded Reads
Typical failure scenario: a single disk fails or is prohibitively slow.

Degraded Reads
Typical failure scenario: a single disk fails or is prohibitively slow.
Reed Solomon:
• Need to read 𝑘 disks.
• Any 𝑘 suffice.
Disk Failure
Typical failure scenario: a single disk fails and needs to be replaced.
Reed Solomon:
• Need to read 𝑘 disks.
• Any 𝑘 suffice.
Can we do better?
Regenerating Codes [Dimakis-Godfrey-Wu-Wainwright-Ramachandran’10]

Metric: Network bandwidth.
Optimize the amount of data communicated to repair a single lost node.
Locally Repairable Codes [G.-Huang-Simitci-Yekhanin’12]

Metric: Number of disk reads.
Optimize the number of disk reads needed to repair a single lost node.
Part 1 of this Tutorial: LRCs
Part 1.1: Locality

1. Locality of codeword symbols.
2. Rate-distance-locality tradeoffs.
Part 1.2: Reliability

1. Beyond minimum distance: Maximum recoverability.
2. Constructions of Maximally Recoverable LRCs.
Locality
[Chen-Huang-Li’07, Oggier-Datta’11, G.-Simitci-Huang-Yekhanin’12,
Papailiopoulos-Luo-Dimakis-Huang-Li’12]
[G.-Simitci-Huang-Yekhanin’12] A coordinate in a linear code has locality 𝒓 if

it can be expressed as a linear combination of 𝑟 other coordinates.
If 𝑐𝑖 is lost, it can be recovered by reading just 𝑟 other symbols.

 Data locality r: all data symbols have locality r.
 All-symbol locality r: all symbols have locality r.
Decouples typical decoding complexity 𝑟 from length 𝑛.

 𝑟 reads for single failures, degraded reads.
 No guarantees for more worst-case failures.
Locally Decodable/Testable Codes
Locally Decodable Codes [Goldreich-Levin’89, Katz-Trevisan’00, Yekhanin’12]
 Implicit in early work on Majority Logic Decoding [Reed’52].
 Aims for locality up to the minimum distance.
 Super-linear length lower bounds known for 𝑟 = 𝑂 1 and constant relative
distance [Katz-Trevsian’00].
Such codes would have given fault-tolerant storage with unlimited
scalability.
 Best constructions are super-polynomial [Yekhanin’07, Efremeko’09].
 Can get high rate with larger locality [Kopparty-Saraf-Yekhanin’11, …
Meir’14].
Locally Testable Codes [Blum-Luby-Runbinfeld’90, Rubinfeld-Sudan’92].

Codes with data locality
Def: A 𝑛, 𝑘, 𝑑 𝑞 code has data locality 𝒓 if each information symbol
can be expressed as a linear combination of 𝑟 other coordinates.
Pyramid Codes [Chen-Huang-Li’07]:

Take an [𝑘 + 𝑑 − 1, 𝑘, 𝑑] Reed-Solomon code, split the first parity.
1
This gives 𝑛 = 𝑘 1 + 𝑟 + 𝑑 − 2.
Is the linear overhead necessary?

Codes with all-symbol locality
Def: An 𝑛, 𝑘, 𝑑 𝑞 linear code has locality 𝒓 if each co-ordinate can be
expressed as a linear combination of 𝑟 other coordinates.
Add a local parity to every group of parity symbols of size r.
1
This gives 𝑛 = (𝑘 + 𝑑 − 2)(1 + 𝑟 ).
Rate-distance-locality tradeoffs
What are the tradeoffs between 𝑛, 𝑘, 𝑑, 𝑟?
[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality r,

𝑘
𝑛 −𝑘 ≥ + 𝑑 − 2.
𝑟
[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality 𝑟,

𝑟+1
𝑛 ≥ 𝑘 + 𝑑 − 2.
𝑟
[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality r,

𝑟+1
𝑛 ≥ 𝑘 + 𝑑 − 2.
𝑟
Moreover, any code achieving this bound for 𝑟|𝑘 and small 𝑑 looks like
[G.-Huang-Simitci-Yekhanin’12]: In any linear code with all-symbol locality r,

𝑟+1
𝑛 ≥ 𝑘 + 𝑑 − 2.
𝑟
Let 𝑟|𝑘. For equality, (𝑟 + 1)|𝑛. Hence 𝑑 ≥ 𝑟 + 3.
Codes that achieve equality could look like
Such codes do exist for 𝑞 ≥ 𝑘𝑛𝑘 (non-explicit construction).

Explicit codes with all-symbol locality.
[Tamo-Papailiopoulos-Dimakis’13]
 Optimal length codes with all-symbol locality for 𝑞 = exp(𝑘).
 Construction based on RS code, analysis via matroid theory.
[Silberstein-Rawat-Koyluoglu-Vishwanath’13]
 Optimal length codes with all-symbol locality for 𝑞 = 2𝑛 .
 Construction based on Gabidulin codes (aka linearized RS codes).
[Barg-Tamo’ 14]
 Optimal length codes with all-symbol locality for 𝑞 = 𝑂(𝑛).
 Construction based on Reed-Solomon codes.
[G.-Huang-Simitci-Yekhanin’12]: In any linear code with all-symbol locality r,

𝑟+1
𝑛 ≥ 𝑘 + 𝑑 − 2.
𝑟
For equality, (𝑟 + 1)|𝑛. Hence 𝑑 ≥ 𝑟 + 3.
 Algorithmic proof using linear algebra.

 [Papailiopoulus-Dimakis’12] Replace linear algebra with information theory.
 [Prakash-Lalitha-Kamath-Kumar’12] Generalized Hamming weights.
 [Barg-Tamo’13] Graph theoretic proof.
Generalizations
 Non-linear codes
[Papailiopoulos-Dimakis, Forbes-Yekhanin].
 Vector codes
[Papailoupoulos-Dimakis, Silberstein-Rawat-Koyluoglu-Vishwanath, Kamath-
Prakash-Lalitha-Kumar]
 Codes over bounded alphabets
[Cadambe-Mazumdar]
 Codes with short local MDS codes
[Prakash-Lalitha-Kamath-Kumar, Silberstein-Rawat-Koyluoglu-Vishwanath]
 Codes with local Regeneration
[Silberstein-Rawat-Koyluoglu-Vishwanath, Kamath-Prakash-Lalitha-Kumar…]
Towards locally decodable codes
 Codes with short local MDS codes [Prakash-Lalitha-Kamath-Kumar,

Silberstein-Rawat-Koyluoglu-Vishwanath]
Avoids the slowest node bottleneck [Shah-Lee-Ramachandran]
 Sequential local recovery [Prakash-Lalitha-Kumar]
 Multiple disjoint local parities [Wang-Zhang, Barg-Tamo]
Can serve multiple read requests in parallel.
Problem: Consider an 𝑛, 𝑘 𝑞 linear code where even after 𝑑 arbitrary failures,

every (information) symbol has locality 𝑟. How large does 𝑛 need to be?
[Barg-Tamo’14] bound might be a good starting point.

Coding For Modern Distributed Storage Systems: Part 1.: Locally Repairable Codes

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Coding For Modern Distributed Storage Systems: Part 1.: Locally Repairable Codes

Caricato da

Copyright:

Formati disponibili

Coding for Modern Distributed

Storage Systems: Part 1.

Locally Repairable Codes

 Total data in the cloud is of the order of few hundred Exabytes

Currently, data centers consume up to 3 percent of all global electricity

Goal: Tolerate one disk failure.

 Simple XOR [RAID5]

Goal: Tolerate two disk failures.

 [6,4] Reed Solomon Code [RAID6]

How many parity checks do you need?

Should be getting overheads close to 0!

How many parity checks do you need?

Recovery cost would be prohibitively high:

Typical failure scenario: a single disk fails or is prohibitively slow.

Typical failure scenario: a single disk fails or is prohibitively slow.

Typical failure scenario: a single disk fails and needs to be replaced.

Regenerating Codes [Dimakis-Godfrey-Wu-Wainwright-Ramachandran’10]

Locally Repairable Codes [G.-Huang-Simitci-Yekhanin’12]

Part 1.1: Locality

Part 1.2: Reliability

[G.-Simitci-Huang-Yekhanin’12] A coordinate in a linear code has locality 𝒓 if

If 𝑐𝑖 is lost, it can be recovered by reading just 𝑟 other symbols.

Decouples typical decoding complexity 𝑟 from length 𝑛.

Locally Testable Codes [Blum-Luby-Runbinfeld’90, Rubinfeld-Sudan’92].

Pyramid Codes [Chen-Huang-Li’07]:

Is the linear overhead necessary?

Add a local parity to every group of parity symbols of size r.

[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality r,

[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality 𝑟,

[G.-Huang-Simitci-Yekhanin’12]: In any linear code with information locality r,

[G.-Huang-Simitci-Yekhanin’12]: In any linear code with all-symbol locality r,

Such codes do exist for 𝑞 ≥ 𝑘𝑛𝑘 (non-explicit construction).

[G.-Huang-Simitci-Yekhanin’12]: In any linear code with all-symbol locality r,

 Algorithmic proof using linear algebra.

 Codes with short local MDS codes [Prakash-Lalitha-Kamath-Kumar,

Problem: Consider an 𝑛, 𝑘 𝑞 linear code where even after 𝑑 arbitrary failures,

Potrebbero piacerti anche