Parikshit Gopalan
Windows Azure Storage, Microsoft.
The data deluge …
Problem Statement:
We generate insane amounts of digital data.
We expect it to be stored reliably and accessible anytime, anywhere.
And for free!
Duplication
2X overhead.
Quick recovery.
Triplication
3X overhead.
Quick recovery.
Reed-Solomon:
• Need to read 𝑘 disks.
• Any 𝑘 suffice.
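The "any 𝑘 suffice" property can be illustrated with a toy Reed-Solomon-style code. This is a minimal sketch under stated assumptions, not a production erasure coder: it works over the small prime field GF(257) rather than the binary fields used in practice, and the names `encode`, `recover` and `lagrange_eval` are hypothetical helpers introduced here. The 𝑘 data symbols are treated as values of a polynomial of degree below 𝑘, each of the 𝑛 disks stores one evaluation, and any 𝑘 evaluations determine the polynomial.

```python
from itertools import combinations

P = 257  # toy prime field (assumption); real systems typically use GF(2^m)


def lagrange_eval(points, x):
    """Evaluate at x the unique polynomial of degree < len(points) through points, mod P."""
    total = 0
    for i, (xi, yi) in enumerate(points):
        num, den = 1, 1
        for j, (xj, _) in enumerate(points):
            if i != j:
                num = num * (x - xj) % P
                den = den * (xi - xj) % P
        # pow(den, -1, P) is the modular inverse (Python 3.8+)
        total = (total + yi * num * pow(den, -1, P)) % P
    return total


def encode(data, n):
    """Systematic encoding: disk i stores the polynomial's value at x = i."""
    k = len(data)
    points = list(enumerate(data))  # data symbols are the values at x = 0..k-1
    return data + [lagrange_eval(points, x) for x in range(k, n)]


def recover(surviving, k):
    """Rebuild the k data symbols from any k surviving (disk_index, symbol) pairs."""
    points = surviving[:k]
    return [lagrange_eval(points, x) for x in range(k)]


data = [72, 105, 33, 7]        # k = 4 data symbols
disks = encode(data, n=7)      # n = 7 disks in total

# Every choice of 4 surviving disks out of 7 recovers the original data.
for subset in combinations(range(7), 4):
    assert recover([(i, disks[i]) for i in subset], 4) == data
```

Note that rebuilding even a single lost disk this way still reads 𝑘 surviving disks, which is the repair cost the slides go on to question.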
Disk Failure
Reed-Solomon:
• Need to read 𝑘 disks.
• Any 𝑘 suffice.
Can we do better?
This gives 𝑛 = 𝑘(1 + 1/𝑟) + 𝑑 − 2.
This gives 𝑛 = (𝑘 + 𝑑 − 2)(1 + 1/𝑟).
Rate-distance-locality tradeoffs
What are the tradeoffs between 𝑛, 𝑘, 𝑑, 𝑟?
[Tamo-Papailiopoulos-Dimakis’13]
Optimal length codes with all-symbol locality for 𝑞 = exp(𝑘).
Construction based on RS code, analysis via matroid theory.
[Silberstein-Rawat-Koyluoglu-Vishwanath’13]
Optimal length codes with all-symbol locality for 𝑞 = 2^𝑛.
Construction based on Gabidulin codes (aka linearized RS codes).
[Barg-Tamo’14]
Optimal length codes with all-symbol locality for 𝑞 = 𝑂(𝑛).
Construction based on Reed-Solomon codes.
Non-linear codes
[Papailiopoulos-Dimakis, Forbes-Yekhanin].
Vector codes
[Papailiopoulos-Dimakis, Silberstein-Rawat-Koyluoglu-Vishwanath, Kamath-Prakash-Lalitha-Kumar]
Codes over bounded alphabets
[Cadambe-Mazumdar]
Codes with short local MDS codes
[Prakash-Lalitha-Kamath-Kumar, Silberstein-Rawat-Koyluoglu-Vishwanath]
Codes with local regeneration
[Silberstein-Rawat-Koyluoglu-Vishwanath, Kamath-Prakash-Lalitha-Kumar…]
Towards locally decodable codes