Understanding Write Behaviors of Storage Backends in Ceph Object Store

Dong-Yun Lee*, Kisik Jeong*, Sang-Hoon Han*, Jin-Soo Kim*, Joo-Young Hwang+, and Sangyeun Cho+

*Computer Systems Laboratory, Sungkyunkwan University, South Korea
+Memory Business, Samsung Electronics Co., Ltd., South Korea

{dongyun.lee, kisik, shhan}@csl.skku.edu, jinsookim@skku.edu, {jooyoung.hwang, sangyeun.cho}@samsung.com
Abstract—Ceph is a scalable, reliable and high-performance
storage solution that is widely used in the cloud computing
environment. Internally, Ceph provides three different storage
backends: FileStore, KStore and BlueStore. However, little effort
has been devoted to identifying the differences among these storage
backends and their implications for performance. In this paper,
we carry out an extensive analysis with a microbenchmark and
a long-term workload to compare the Ceph storage backends and
understand their write behaviors, focusing on WAF (Write
Amplification Factor). To accurately analyze WAF, we carefully
classify write traffic into several categories for each storage
backend.
We find that writes are amplified by more than 13x, no matter
which Ceph storage backend is used. In FileStore, the overhead
of Ceph write-ahead journaling triples write traffic compared to
the original data size. Also, FileStore has the journaling of journal
problem, generating a relatively large amount of file system
metadata and journal traffic. KStore suffers severe fluctuations
in IOPS (I/O Operations Per Second) and WAF due to large
compaction overheads. BlueStore shows stable performance
on both HDDs and SSDs in terms of IOPS, WAF and latency.
Overall, FileStore performs the best among all storage backends
on SSDs, while BlueStore is also highly promising with good
average and tail latency even on HDDs.
I. INTRODUCTION
In the cloud computing era, a stable, consistent and high-
performance block storage service is essential to run a large
number of virtual machines. Ceph is a storage solution that
meets all these demanding requirements and has attracted
considerable attention over the last decade. Ceph is a scalable,
highly reliable software-defined storage solution that provides multiple
interfaces for object, block and file level storage [1]. Ceph
aims to be a completely distributed storage system with no single
point of failure, providing high fault tolerance without requiring
specialized hardware support. Since Ceph provides strong consistency to clients,
users can access objects, block devices and files without
worrying about consistency. Moreover, because it has a scale-
out structure, Ceph can improve its performance gradually by
adding additional cluster nodes [2].
Internally, all storage services in Ceph are built upon the
Ceph RADOS (Reliable Autonomic Distributed Object Store)
layer [3], which manages fixed-size objects in a scalable,
distributed and reliable manner. Ceph provides three different
storage backends in the RADOS layer: FileStore, KStore and
BlueStore. FileStore and KStore manage objects on top of
traditional file systems and key-value stores (e.g., LevelDB
and RocksDB), respectively. On the other hand, BlueStore is
a new object store architecture that has been actively developed
for the Ceph RADOS layer in recent years. BlueStore writes
object data directly to the raw block device, while managing
the associated metadata in a small key-value store such as RocksDB.
Currently, Ceph can be freely configured to use any of these
storage backends.
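For illustration, the object store backend is typically selected per OSD through the osd objectstore option in ceph.conf. The snippet below is a minimal sketch assuming a Ceph release in which all three backends are selectable; exact option names, accepted values and defaults may differ across releases.

    [osd]
    ; Select the object store backend for this OSD:
    ; one of "filestore", "kstore" or "bluestore".
    osd objectstore = bluestore

Because the option is set per OSD, a single cluster can in principle mix backends, which is how we deploy and compare them under otherwise identical conditions.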
Due to Ceph’s popularity in the cloud computing environ-
ment, several research efforts have been made to find optimal
Ceph configurations under a given Ceph cluster setting [4], [5]
or to tune its performance for fast storage like SSD (Solid-
State Drive) [6]. However, little attention has been paid to
the differences among the storage backends available in Ceph and
their implications for the overall performance. In this paper,
we compare the write behaviors and performance of Ceph
backends with a focus on WAF (Write Amplification Factor).
Studying the WAF of the various storage backends is very
useful for understanding the storage access behaviors of
Ceph for the following reasons. First, WAF has a major impact
not only on the overall performance, but also on device lifetime
when Ceph runs on SSDs. Second, the larger the WAF, the more
limited the effective bandwidth delivered to the underlying storage
device. In particular, HDDs (Hard Disk Drives) exhibit very
low IOPS (I/O Operations Per Second) compared to SSDs, so
it is important to use the raw hardware bandwidth effectively.
Finally, as shown in previous research on SQLite, issues such
as the journaling of journal problem [7] can arise when
distributed storage services are implemented on top of a local
file system.
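Throughout the paper, WAF can be understood with the conventional definition below; the symbols $W_{device}$ and $W_{client}$ are our own notation introduced here for clarity.

\[
\mathrm{WAF} \;=\; \frac{W_{device}}{W_{client}} \;=\; \frac{\text{total bytes written to the underlying storage devices}}{\text{bytes of object data written by clients}}
\]

Under a replication factor of 3, for example, replication alone already contributes a factor of 3 to the WAF, before any journaling, metadata or compaction traffic is taken into account.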
We have used a microbenchmark and a long-term workload
of 4KB random writes to measure write traffic of various
Ceph storage backends on both HDDs and SSDs. Our results
with the long-term workload indicate that Ceph amplifies
the amount of write traffic by more than 13x under a
replication factor of 3, regardless of the storage backend used.
In FileStore, we find that write-ahead journaling with a separate
Ceph journal does not double, but rather triples write traffic