Data Placement On Tertiary Storage: Sunil Prabhakar Jiangtao Li Purdue University

Data Placement on Tertiary
Storage
Sunil Prabhakar
Joint work with Jiangtao Li
Purdue University
Introduction
■ With current hardware, performance for data-
intensive applications is constrained by I/O
■ Random I/O performance is largely
determined by the latency.
■ For tertiary storage the latency is even more
critical than for secondary storage.
■ Given the current trends this problem will be
aggravated in the foreseeable future.
October 5, 2010 Sunil Prabhakar 2
Reducing Latency
■ Caching
■ Prefetching
■ Scheduling
■ Parallel I/O
■ Compression

■ Placement (replication & prefetching)
■…

Tertiary Storage Placement
■ Goal:
◆ Reduce switching of media
◆ Reduce seek latency
■ Two sub-problems:
◆ Medium allocation
◆ Intra-medium placement

Related Work
■ Specific domains (arrays, RDBMS, Images)
■ Most placement work has focused on intra-medium
placement.
■ Recent work for tertiary storage has addressed the
allocation problem, but under the assumption of
independent access probabilities.
■ This is not always a valid assumption (e.g. web
pages, online manuals, multimedia databases)

Problem addressed
■ Design of placement schemes with non-independent
access patterns.
■ Initially assume that the access pattern is known
■ Focus on the allocation problem -- existing techniques
for intra-medium placement
■ Additional issues addressed:
◆ Replication
◆ Impact of secondary storage
◆ prefetching

Access Patterns
■ Use the notion of a browsing graph
◆ Nodes represent objects
◆ Node labels give the probability that an object is
independently accessed
◆ Directed edges between nodes have labels giving
the probability that the edge will be traversed.
E.g. edge witha probability
® b pab represents
the fact that object b will be accessed following
an access for object a with probability pab .

Browsing Graph
■ Birth Probability
■ Edge Probability
■ Death Probability
0.2
0.1 0.1 0.3
0.4
0.4 0.1
0.3 0.1
Data Placement Schemes
1. Birth Probability Scheme
◆ Place objects in decreasing order of birth
probability (independent placement)
◆ This is known to be optimal if we ignore
relationships between objects [2].
1. Static Probability Scheme
◆ Same as above, except that we use the static
probability for determining placement (cf Google).

3. Edge Merge Scheme
◆ Place strongly connected neighbors on the same
medium
◆ Edges are merged in decreasing order of probability.
◆ Birth probability and edge probabilities of merged
object are calculated.
◆ Merge edges as long as the total size of the merged
objects is smaller than medium capacity.
◆ Merged objects are now allocated to media in
decreasing order of the cumulative static probability.

5. Hot Edge Merge Scheme
◆ Identical to Edge Merge, except that only edges that
have more than a threshold probability are merged.
◆ Objective is to produce media with very high
probability of being loaded permanently.

6. Birth Hop Scheme
◆ Initially, place highest birth probability object on an
empty medium.
◆ Repeatedly add the object with the highest birth
probability or edge probability from objects already on
that medium.
◆ Once the medium is full, repeat the above steps for the
remaining objects.
7. Static Hop Scheme
◆ Identical to above scheme, except that we use static
probability instead of birth probability.
Other Issues
■ Adaptive Placement
◆ Use observed pattern of access to periodically reorganize data
placement.
■ Impact of Secondary Storage
◆ Handle as above -- “observe” pattern at tertiary level.
■ Replication
◆ No cost replication when objects belong to multiple clusters
-- use available free space.
■ Prefetching
◆ Once a medium is loaded on a drive, some high probability
objects are prefetched.

Experimental Results
■ Simulation (CSIM) of Ampex DST drives.
■ 10,000, objects (100MB each)
■ 2000 tapes of 2GB each -- 4TB total
■ 4, 5GB disks -- 20 GB total
■ Birth probability follows a Zipf distribution
■ Objects divided into clusters (5 and 20 per cluster)
■ 5% of objects are outliers
■ Death probability uniformly chosen (0.05 -- 0.2)
■ Edge probabilities are uniformly distributed.
■ Average response time for 1000 requests.

Performance
40
35
30 B irth
25 Static
E d ge M erge
20
H ot E dge M erge
15 B irth H op
10 Static H op
5
A verage R espon se T im e
0
1 2 3 4
N um ber of D rives
Sensitivity to Access Pattern
■ Study the impact of variations in the access
pattern.
■ Consider variations in:
◆ Node probabilities
◆ Edge probabilities
◆ Cluster compositions
■ Test with original placement and also with
modified placement (based upon observed
pattern).
Variations in Edge Probabilities
40
35
Org. Edge Merge 30

Org. Static Hop 25
Org. Static 20
Mod. Static Hop 15
Mod. Static
10
5
Average Response Time
0
0 10 20 30 40 50
Percentage Change
Variations in Node Probabilities
40
35
30 O rg. Edge M erge
25 O rg. Static H op
20 O rg. Static
15 M od. Static H op
10 M od. Static
5
Average Response Tim e
0
0 10 20 30 40 50
Percentage Change
Variations in Node Clusters
40
35
O rg. Edge M erge
30
O rg. Static H op
25 O rg. Static
20 M od. Edge M erge
15 M od. Static H op
M od. Static
10
Average
5 Response Tim e
0
0 5 10 15 20
Percentage Change
Access Pattern Variations
■ The node and edge probabilities are less
critical than the cluster composition!
■ Therefore, it is important to be able to
recognize the related objects.
■ Changes in node and edge probabilities
should not trigger re-organization -- Edge
Merge is especially insensitive to these.

Impact of Secondary Storage
■ The presence of a secondary storage buffer
can have a significant impact on placement.
■ High probability objects are likely to be
cached on disk.
■ We handle this situation by simply placing
objects based upon the “effective” access
pattern at the tertiary level.
■ Experiment with various cache sizes.

Secondary Storage
35
30
25
O rg. Edge M erge
20 O rg. Static
15 M od. Edge M erge
M od. Static
10
5
0
0.4 4 8 12 16 20
Percentage Change
Replication
■ Objects that belong to more than one cluster cause
problems.
■ We propose to make replicas of objects (one for
each cluster that the object belongs to).
■ Since large clusters of related objects are placed
placed together, it is quite likely that extra space is
left over.
■ Storage overhead is also likely to be small.

Replication
8
7.9
7.8
7.7
7.6
7.5 No Replication
7.4 W ith Replication
7.3
7.2
7.1
Average
7 Response Tim e
6.9
1 2 3 4
Num ber of Drives
Prefetching
■ Since related objects are clustered on the
same medium -- prefetching is very
promising.
■ Trade-off.
■ Experiment 1:
◆ Always prefetch data not on disk
◆ Vary max prefetch size per medium

Prefetching
30
25
Birth
20 Static
Edge M erge
15 H ot Edge M erge
Birth H op
10
Static H op
5
0
0 100 200 300
Num ber of Drives
Prefetching (contd.)
■ Edge merge is best suited for prefetching.
■ Experiment 2:
◆ Prefetch only if suitable object exists
◆ Object has strong edge from current object, or
◆ Object has high static
◆ Set bounds for each:
✦ ME - minimum edge probabilty
✦ MS - minimum static probability

Prefetching
11.3
11.1
M E=0.5;M S=0.005
10.9
10.7 M E=0.3;M S=0.0003
10.5
M E=0.05;M S=0.005
10.3
10.1 M E=0.05;M S=0.000
9.9 3
Average
9.7 Response Tim e
9.5
0 100 200 300
Num ber of Drives
Conclusion
■ If objects are accessed in a related fashion -- this
information is valuable for placement.
■ Proposed schemes (esp. Edge Merge) significantly
outperform “optimal” schemes based upon
independent access assumptions (77% better)
■ Exact knowledge of access pattern is not critical --
only the relationship information is important.
■ Adapting to changes in access patterns and
handling unknown access patterns is easily
achieved.
Conclusion (contd).
■ Incorporating disk cache effects is handled
in the same manner as adapting to changes
in access patterns.
■ Selective replication and prefetching are
effective for the proposed schemes,
resulting in significantly improve
performance.

Data Placement On Tertiary Storage: Sunil Prabhakar Jiangtao Li Purdue University

Caricato da

Informazioni sul documento

Descrizione originale:

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Data Placement On Tertiary Storage: Sunil Prabhakar Jiangtao Li Purdue University

Caricato da

Copyright:

Formati disponibili

Data Placement on Tertiary

October 5, 2010 Sunil Prabhakar 3

October 5, 2010 Sunil Prabhakar 4

October 5, 2010 Sunil Prabhakar 5

October 5, 2010 Sunil Prabhakar 6

October 5, 2010 Sunil Prabhakar 7

October 5, 2010 Sunil Prabhakar 9

October 5, 2010 Sunil Prabhakar 10

October 5, 2010 Sunil Prabhakar 11

October 5, 2010 Sunil Prabhakar 13

October 5, 2010 Sunil Prabhakar 14

Org. Edge Merge 30

October 5, 2010 Sunil Prabhakar 20

October 5, 2010 Sunil Prabhakar 21

October 5, 2010 Sunil Prabhakar 23

October 5, 2010 Sunil Prabhakar 25

October 5, 2010 Sunil Prabhakar 27

October 5, 2010 Sunil Prabhakar 30

Potrebbero piacerti anche