Skip to main navigation Skip to search Skip to main content

Elastic data routing in cluster-based deduplication systems

  • Temple University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

As a space-efficient approach to data archive and backup, data deduplication is becoming increasingly popular in storage systems. However, as the data growing rapidly in data centers, single-node storage node is no longer be able to provide the corresponding throughput and capacities as expected. Building deduplication clusters is considered as a promising strategy to leverage such bottle-neck on single-node system. However, deduplication relies on how much the system knows about information of previous stored data. The single-node system obviously obtains all such information and is able to detect duplicate data there; however storage nodes in cluster-based system cannot know information on other nodes. It is nontrivial to route data intelligently enough so that the system could support deduplication performance comparable to that of a single-node system, while also at a trivial cost. In this paper, we propose an elastic data routing strategy, aiming to achieve deduplication performance comparable to state-of-the-art, while require much less computation resources.

Original languageEnglish
Title of host publication2014 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages117-118
Number of pages2
ISBN (Print)9781479930883
DOIs
StatePublished - 2014
Event2014 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2014 - Toronto, ON, Canada
Duration: Apr 27 2014May 2 2014

Publication series

NameProceedings - IEEE INFOCOM
ISSN (Print)0743-166X

Conference

Conference2014 IEEE Conference on Computer Communications Workshops, INFOCOM WKSHPS 2014
Country/TerritoryCanada
CityToronto, ON
Period04/27/1405/2/14

Fingerprint

Dive into the research topics of 'Elastic data routing in cluster-based deduplication systems'. Together they form a unique fingerprint.

Cite this