Atomic I/O transactions have been supported both in file
systems and in storage hardware. Generally speaking, application-
level data integrity semantics are not visible at the stor-
age firmware and therefore storage-level transactions [36]
are not suitable for protecting application data integrity. At
the operating system level, Stasis supports transactional I/O
on memory pages through UNDO logging [38]. TxOS [34]
connects its transactional memory support with file system
journaling to enable atomic storage operations. Compared
with these I/O transaction mechanisms, our failure-atomic
msync() presents a simpler, easier-to-use, POSIX-compatible
programming interface. It is worth noting that Microsoft
Windows Vista introduced an atomic file transaction
mechanism (TxF), but the vendor has deprecated and may
discontinue this feature, citing “extremely limited developer
interest ... due to its complexity and various nuances” [28].
Some transactional I/O systems [34, 38] enable atomic I/O
over failures as well as concurrent accesses. Failure-atomic
msync() focuses on failure-atomicity while leaving concur-
rency management to the applications (through mutex locks
or other synchronization means).
Rio Vista [27] was an early effort that supported data
consistency across operating system failures on persistent
memory, but it did not support data consistency across power failures.
RVM [37] is similar in spirit to failure-atomic msync(),
though utilizing a different interface and focusing on virtual
memory support rather than mapped files. With continu-
ing advances in non-volatile memory (NVRAM) hardware
technologies [8, 11], recent studies have proposed a new
NVRAM-based file system design [10], new data access
primitives (including Mnemosyne [44], NV-heaps [9], and
CDDS [43]), as well as fast failure recovery [30]. Unfor-
tunately, today’s NVRAM manufacturing technologies still
suffer from low space density (or high $/GB) and stabil-
ity/durability problems. Until these problems are resolved,
today’s storage hardware (mechanical disks and NAND
Flash-based solid-state drives) and system software (block-
based file systems) are likely to remain. To realize our pri-
mary objectives of ease-of-use and fast adoption, failure-
atomic msync() targets the software/hardware stacks run-
ning in today’s systems.
Supporting a persistent heap between volatile memory
and durable storage is a classic topic. Atkinson et al. pro-
posed PS-algol, a database programming model that allows
programmers to directly manipulate data structures on a
heap [2] while an underlying system properly and promptly
moves data from the heap to persistent storage [3]. O’Toole
et al. presented a replicating garbage collector that cooper-
ates with a transaction manager to provide durable, consis-
tent storage management [32]. Guerra et al. identify a consis-
tent data version in the heap through pointer chasing from a
root data unit and atomically commit each data version [20].
At a lower level of abstraction, our failure-atomic msync()
can easily be used to implement a persistent heap with data
integrity and high efficiency, while also supporting other
programming paradigms on memory-mapped data.
The belief that programmers benefit from the conve-
nience of manipulating durable data via conventional main-
memory data structures and algorithms dates back to MUL-
TICS [4], which inspired today’s memory-mapped file inter-
faces. Failure-atomic msync() retains the ergonomic bene-
fits of memory-mapped files and couples them with strong
new data-integrity guarantees.
Finally, our work is related to data center state manage-
ment systems such as Bigtable [7] and Dynamo [13] but with
different emphases. While centrally managed data centers
can impose a unified data access model and distributed
coordination, failure-atomic msync() requires only a small,
local adjustment to existing operating system support at
individual hosts, which makes it more suitable for the vast
majority of independent application development scenarios.
3. Interface and System Support
Failure-atomic msync() is a simple OS-supported mech-
anism that allows the application programmer to evolve
durable application data atomically, in spite of failures such
as fail-stop kernel panics and power outages. Failure-atomic
msync() guarantees that a memory-mapped file will always
be either in the state it was in immediately after the most
recent msync(), or in the state it was in at the time of
mmap() if msync() has not yet been called.
Because its semantics lie at the high-level interface
between the operating system and applications, failure-
atomic msync() does not fundamentally depend on partic-
ular durable media (whether block device or not)—today’s
hard disks and SSDs and forthcoming non-volatile mem-
ory are compatible. Indeed, failure-atomic msync() seems
to be an ideal interface to novel mechanisms for taking
memory checkpoints almost instantaneously by versioning
multilevel-cell NVRAM [46].
In addition to its flexibility regarding the underlying storage
device, failure-atomic msync() admits multiple implementations:
journaling, shadow copying, and soft updates are all viable
techniques for updating a file system consistently. In this
paper, we describe our journaling-based system support.
3.1 Interface and Semantics
The interface to failure-atomic msync() is simply the fa-
miliar mmap() and msync() system calls. In order to en-
able failure-atomic msync(), the programmer merely needs
to specify a new MAP_ATOMIC flag to mmap() in addition
to any other flags needed. The programmer can access the
memory-mapped region in the customary fashion. When the
application state is deemed consistent by the programmer,
msync(MS_SYNC) is called.
Two POSIX-standardized msync() flags—which are
currently no-ops in Linux—illustrate the fundamental har-