Availability-Based Methods for Distributed Storage Systems

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Oct 2012 France Publisher:IEEEJournal:2012 IEEE 31st Symposium on Reliable Distributed SystemsFunded by:EC | GOSSPLE

Authors: Anne-Marie Kermarrec; Erwan Le Merrer; Gilles Straub; Alexandre van Kempen;

doi: 10.1109/srds.2012.10

Availability-Based Methods for Distributed Storage Systems

- Summary
- Subjects
- Metrics

Abstract

Distributed storage systems rely heavily on redundancy to ensure data availability as well as durability. In networked systems subject to intermittent node unavailability, the level of redundancy introduced in the system should be minimized and maintained upon failures. Repairs are well- known to be extremely bandwidth-consuming and it has been shown that, without care, they may significantly congest the system. In this paper, we propose an approach to redundancy management accounting for nodes heterogeneity with respect to availability. We show that by using the availability history of nodes, the performance of two important faces of distributed storage (replica placement and repair) can be significantly improved. Replica placement is achieved based on complementary nodes with respect to nodes availability, improving the overall data availability. Repairs can be scheduled thanks to an adaptive per-node timeout according to node availability, so as to decrease the number of repairs while reaching comparable availability. We propose practical heuristics for those two issues. We evaluate our approach through extensive simulations based on real and well-known availability traces. Results clearly show the benefits of our approach with regards to the critical trade-off between data availability, load-balancing and bandwidth consumption.

Country

France

Related Organizations

French Institute for Research in Computer Science and Automation
France
Institut de Recherche en Informatique et Systèmes Aléatoires
France
University of Rennes 1
France
University of Southern Brittany
France
Université de Rennes 1
France

View all View all

Keywords

Distributed storage systems, [INFO.INFO-DC] Computer Science [cs]/Distributed, Parallel, and Cluster Computing [cs.DC], timeout, Availability, [INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS]

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	16
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%