Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Overview
NFS
Introduction-Design Overview
Architecture
System Interactions
Master Operations
Fault tolerance
Conclusion
NFS
Is
build RPCs
Low
performance
Security
Issues
Introduction
Need For GFS:
Large
Data Files
Scalability
Reliability
Automation
Replication
Fault
of data
Tolerance
Design Overview:
Assumptions:
Components Monitoring
Importance of Bandwidth
Interface:
Additional operations
o
Snapshot
Record append
Architecture:
Cluster Computing
Single
Master
Multiple
Chunk Servers
Multiple
clients
Master Load.
Fixed chunk Size.
The master also predicatively provide chunk
locations immediately following those requested by
unique id.
Chunk Size :
64 MB size.
Read and write operations on same chunk.
Reduces network overhead and size of metadata in
the master.
Metadata :
Types of Metadata:
o
Chunk Locations:
o
Operation Log:
o
It is central to GFS.
System Interactions:
Leases
Atomic
Record Appends:
Snapshot:
o
Duplicate metadata
Master Operation
Namespace
Replica
Placement:
Re-replication:
o
Rebalancing:
Garbage
Collection:
Stale
Replica detection:
Fault Tolerance
High
availability:
Fast recovery.
Chunk replication.
Shadow Masters.
Data Integrity:
Conclusion
GFS meets Google storage requirements:
Incremental
Regular
Data
Simple
Fault
growth
architecture
Tolerance