Appears in the Proceedings of the 2001 USENIX Annual Technical Conference (USENIX ’01) 3
higher-priority thread A’s critical section detects an
interference with a lower-priority thread B, A helps
B to finish its critical section first. During helping,
A lends B its priority to ensure that no other,
lower-priority activity can interfere. When B has
finished, A executes its own critical section.
Wait-free object implementations satisfy a stronger
form of block-freedom than lock-free synchroniza-
tion (discussed in the next paragraph), as they guar-
antee freedom from starvation. Many authors there-
fore regard wait-free synchronization as a special
case of lock-free synchronization. How-
ever, wait-free synchronization can also be imple-
mented using locks, albeit with a nonblocking help-
ing scheme. For example, a locking scheme with
priority inheritance can be considered a wait-free
synchronization scheme as long as critical sections
never block.
Lock-free synchronization works completely
without locks. Critical code sections are designed
such that they prepare their results out of line and
then try to commit them to the pool of shared
data using an atomic memory update instruction
like compare-and-swap (CAS). The compare part
of CAS is used to detect conflicts between two
threads that simultaneously try to update the data;
if the compare fails, the whole operation is restarted.
If needed, retries can be delayed with an exponential
backoff to avoid retry contention.²
This synchronization mechanism has some nice
properties: because there are no locks, it avoids
deadlocks; because operations never hold locks on
critical data, it provides better insulation from
crashed threads, resulting in higher robustness and
fault tolerance; moreover, it is automatically
multiprocessing-safe.
Preconditions for using lock-free synchronization
are that primitives for atomic memory modification
are available and that data resides in type-stable
memory. We do not discuss type-stable memory
management in this paper (see [7] for a discussion
of operating-system–related issues); the rest of this
subsection discusses atomic memory modification.
² Backoff is never needed on single-CPU systems.
Atomic memory update. The x86 CPUs have
two kinds of atomic memory-modification opera-
tions: a test-and-set instruction (TAS) and a CAS
instruction. Newer models (Intel Pentium and
later) also have a double-width (8-byte)
compare-and-swap instruction (CASW). However,
these CPUs do not support atomically updating two
independent memory words (two-word compare-
and-swap, CAS2).
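For concreteness, CAS as exposed by C11 can be wrapped as below; on x86, a compiler typically emits `lock cmpxchg` for the 4-byte case and, on 32-bit targets, `lock cmpxchg8b` (the CASW instruction mentioned above) for the 8-byte case. The wrapper names `cas32`/`cas64` are ours, chosen for illustration.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* CAS: if *p equals `expected`, atomically replace it with `desired`
 * and return true; otherwise leave *p unchanged and return false. */
static bool cas32(_Atomic uint32_t *p, uint32_t expected, uint32_t desired)
{
    return atomic_compare_exchange_strong(p, &expected, desired);
}

/* Double-width CAS (CASW): same semantics on an 8-byte word. */
static bool cas64(_Atomic uint64_t *p, uint64_t expected, uint64_t desired)
{
    return atomic_compare_exchange_strong(p, &expected, desired);
}
```

A failed CAS indicates that another thread updated the word in the meantime, which is exactly the conflict-detection property lock-free algorithms rely on.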
A number of data structures can be imple-
mented without locks directly on top of CAS and
CASW (i.e., without the overhead of a software-
implemented multi-word CAS): counters and bit-
fields with widths up to 8 bytes, stacks, and FIFO
queues [21, 18].
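As an example of such a directly CAS-based structure, a lock-free stack push can be written as follows. This is a sketch of the well-known pattern, not code from the cited works; note that a lock-free pop additionally needs a version counter updated together with the head pointer (e.g., via CASW) to avoid the ABA problem, which we omit here.

```c
#include <stdatomic.h>
#include <stddef.h>

/* Lock-free stack push on top of single-word CAS: link the new node
 * to the current head out of line, then try to commit the new head. */
struct node {
    struct node *next;
    int value;
};

static struct node *_Atomic head;

static void push(struct node *n)
{
    struct node *old;
    do {
        old = atomic_load(&head);
        n->next = old;               /* prepare result out of line */
    } while (!atomic_compare_exchange_weak(&head, &old, n));
}
```

If another thread pushes concurrently, the CAS fails, the loop re-reads the head, re-links the node, and retries.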
Valois introduced a lock-free single-linked list de-
sign supporting insertions and deletions anywhere
in a list, as well as several other data structures
[23, 22]. These designs also work with just CAS.
However, Greenwald [6] has criticized them for be-
ing quite complex, difficult to get right, and com-
putationally expensive.
Most of the algorithms for lock-free data-structure
synchronization that have been developed recently
assume availability of a stronger atomic primitive
like CAS2. These data structures include general
single-linked and double-linked lists [6].
A number of techniques exist for implement-
ing lock-free and wait-free general multi-word
compare-and-swap (MWCAS) on top of CAS and
CAS2, enabling nonblocking synchronization for
arbitrarily complex data structures [11, 19, 2, 6].
These techniques have considerable overhead in
both space and runtime complexity, especially
when compared to common lock-based operations,
making them less interesting for kernel design.
The most common technique to implement atomic
multi-word updates on uniprocessors is to prevent
preemption during the update. This is usually done
by disabling interrupt delivery in the CPU. The dis-
advantage of this method is (of course) that it does
not work on multiprocessors.
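The uniprocessor technique amounts to the following sketch. The `irq_save`/`irq_restore` functions are hypothetical placeholders (stubbed out here so the fragment is self-contained); in a real kernel they would disable and restore interrupt delivery, e.g., via x86 `cli`/`sti` or by saving and restoring the interrupt flag.

```c
#include <stdbool.h>

/* Hypothetical interrupt-control stubs; a real kernel would touch the
 * CPU's interrupt flag here.  They are no-ops in this sketch. */
static bool irq_save(void)            { return true; }   /* pretend IRQs on */
static void irq_restore(bool enabled) { (void)enabled; }

struct pair { long a, b; };

/* Multi-word update that is atomic on a uniprocessor: with interrupt
 * delivery disabled, no other activity can run between the two stores.
 * On a multiprocessor this is NOT sufficient. */
static void atomic_pair_update(struct pair *p, long a, long b)
{
    bool flags = irq_save();    /* prevent preemption during the update */
    p->a = a;
    p->b = b;
    irq_restore(flags);
}
```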
Bershad [4] has proposed to implement CAS in
software using an implementation and lock known