TEA: A Traffic-efficient Erasure-coded Archival Scheme for
In-memory Stores
Bin Xu
binxu@hust.edu.cn
Huazhong University of Sci. & Tech.
Wuhan, Hubei, China
Jianzhong Huang∗
Qiang Cao∗
Huazhong University of Sci. & Tech.
Wuhan, Hubei, China
Xiao Qin
xqin@auburn.edu
Auburn University
Auburn, AL 36849, USA
ABSTRACT
To achieve a good trade-off between access performance and memory efficiency, it is appropriate to adopt replication and erasure coding to keep popular and unpopular in-memory datasets, respectively. An issue of redundancy transition from replication to erasure coding (a.k.a. erasure-coded archival) must be addressed for unpopular in-memory datasets, since caching workloads exhibit long-tail distributions and most in-memory data are unpopular. In this paper, we propose an encoding-oriented replica placement policy, ERP, by incorporating an interleaved declustering mechanism, and design a traffic-efficient erasure-coded archival scheme, TEA, for ERP-powered in-memory stores. With ERP in place, TEA embraces three salient features: (i) it alleviates cross-rack traffic raised by retrieving data-block replicas, (ii) it improves rack-level load balancing by distributing replicas via a load-aware primary-rack-selection approach, and (iii) it mitigates block-relocation operations launched to sustain rack-level fault tolerance. The empirical results show that TEA not only incurs lower cross-rack traffic than four candidate encoding schemes, but also exhibits superb archival-throughput and rack-level-balancing performance. In particular, TEA accelerates archival throughput by at least 70.8%, and improves rack-level load balancing by a factor of more than 1.58x relative to the four competitors.
CCS CONCEPTS
• Information systems → Distributed storage; • Computer systems organization → Redundancy.
KEYWORDS
In-memory store, Erasure encoding, Replication, Archival
ACM Reference Format:
Bin Xu, Jianzhong Huang, Qiang Cao, and Xiao Qin. 2019. TEA: A Traffic-efficient Erasure-coded Archival Scheme for In-memory Stores. In 48th
International Conference on Parallel Processing (ICPP 2019), August 5–8, 2019,
Kyoto, Japan. ACM, New York, NY, USA, 10 pages. https://doi.org/10.1145/
3337821.3337826
∗Jianzhong Huang (hjzh@hust.edu.cn) and Qiang Cao (caoqiang@hust.edu.cn) are the
joint corresponding authors.
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for profit or commercial advantage and that copies bear this notice and the full citation
on the first page. Copyrights for components of this work owned by others than ACM
must be honored. Abstracting with credit is permitted. To copy otherwise, or republish,
to post on servers or to redistribute to lists, requires prior specific permission and/or a
fee. Request permissions from permissions@acm.org.
ICPP 2019, August 5–8, 2019, Kyoto, Japan
© 2019 Association for Computing Machinery.
ACM ISBN 978-1-4503-6295-5/19/08... $15.00
https://doi.org/10.1145/3337821.3337826
1 INTRODUCTION
1.1 Motivation
The following three aspects motivate us to delve into the development of an erasure-coded archival scheme for in-memory stores.
Aspect #1 – low access latency in in-memory stores. We are in an era of a data-driven business world. For example, data-intensive analytics has become indispensable since enterprises want to gain insights into products, services, and marketing strategies from increasing volumes of data. Commonly, a data-intensive application is supported by a cluster consisting of hundreds of nodes and petabytes of data. It is a technology trend to build an in-memory store atop such a cluster to achieve low-latency performance. A representative case is that Facebook leverages Memcached as a building block to construct a distributed key-value store facilitating the world's largest social network [19].
Aspect #2 – demands for redundancy strategies. Since volatile DRAM only maintains data while it is powered, existing in-memory stores accomplish memory-level fault tolerance by applying replication and/or erasure coding. Replication is a simple yet effective redundancy scheme. For instance, Repcached [15] and Bigmemory [28] keep two replicas in memory among nodes. Compared to replication, erasure coding achieves higher space efficiency, defined as the ratio of user data to the combination of user data and redundancy data [12]. Unsurprisingly, space-efficient erasure codes are also adopted by in-memory stores, e.g., EC-cache [21], Cocytus [31], MemEC [30], and Ring [24].
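To make the space-efficiency definition above concrete, consider a minimal sketch (the coding parameters below are illustrative, not taken from the paper) comparing 3-way replication against a Reed-Solomon layout with k = 4 data blocks and m = 2 parity blocks:

```python
def space_efficiency(user_blocks: int, redundancy_blocks: int) -> float:
    """Space efficiency = user data / (user data + redundancy data)."""
    return user_blocks / (user_blocks + redundancy_blocks)

# 3-way replication: 1 user block plus 2 extra copies.
replication = space_efficiency(1, 2)        # 1/3, about 0.333

# Reed-Solomon RS(k=4, m=2): 4 data blocks plus 2 parity blocks.
erasure_coding = space_efficiency(4, 2)     # 4/6, about 0.667

print(f"replication:    {replication:.3f}")
print(f"erasure coding: {erasure_coding:.3f}")
```

Both layouts tolerate the loss of two blocks, yet the erasure-coded layout stores twice as much user data per byte of memory, which is precisely why archiving unpopular data with erasure codes pays off.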
Aspect #3 – necessity of erasure-coded archival. An analysis of traces collected from Facebook's Memcached deployment shows that caching workloads exhibit long-tail distributions, in which a small percentage of keys appear in most of the requests whereas most keys repeat only a handful of times [3]. Therefore, it is not economical to employ a single redundancy scheme for the entire in-memory data (i.e., metadata, keys, and values in in-memory key-value stores) within the data lifetime. Nowadays, some key-value stores (e.g., Memcached at Facebook [19], Ring [24]) adopt a hybrid redundancy strategy, where replication is applied to popular data, while erasure codes are employed for unpopular data.
Generally, newly loaded data is kept in a replicated manner to support high access parallelism. Since most of the new data are infrequently accessed, it is wise to encode unpopular data replicas using erasure codes to achieve high space efficiency. We refer to such an encoding process as 'erasure-coded archival'.
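The archival transition can be sketched with a toy single-XOR-parity code standing in for a real erasure code (a simplification; production systems such as those cited above use Reed-Solomon codes, and all names below are illustrative):

```python
from functools import reduce

def xor_parity(blocks: list[bytes]) -> bytes:
    """Compute one XOR parity block over equal-sized data blocks."""
    return reduce(lambda a, b: bytes(x ^ y for x, y in zip(a, b)), blocks)

def archive(replica_groups: list[list[bytes]]) -> tuple[list[bytes], bytes]:
    """Erasure-coded archival: keep one copy of each data block,
    drop the extra replicas, and add one parity block instead."""
    data_blocks = [replicas[0] for replicas in replica_groups]  # keep primaries
    parity = xor_parity(data_blocks)                            # new redundancy
    return data_blocks, parity

def recover(surviving: list[bytes], parity: bytes) -> bytes:
    """Rebuild one lost data block from the survivors and the parity."""
    return xor_parity(surviving + [parity])

# Three blocks, each initially stored as two in-memory replicas.
groups = [[b"AAAA", b"AAAA"], [b"BBBB", b"BBBB"], [b"CCCC", b"CCCC"]]
data, parity = archive(groups)   # memory drops from 6 blocks to 4
assert recover([data[0], data[2]], parity) == b"BBBB"
```

The sketch shows why archival matters: the six replicated blocks shrink to four while single-block fault tolerance is retained. It also hints at the traffic problem TEA targets, since computing the parity requires gathering one replica of every data block, and where those replicas sit determines how much of that traffic crosses racks.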
1.2 Challenges and Strategies
We face two challenges in designing an erasure-coded archival scheme for in-memory stores.