ENHANCEMENTS TO THE VOTING ALGORITHM
Sushil Jajodia and David Mutchler
Computer Science and Systems Branch
Code 5590
Naval Research Laboratory
Washington, DC 20375-5000
ABSTRACT
There are several consistency control algorithms for managing replicated files in the face of network partitioning due to site or communication link failures. In this paper, we consider the popular voting scheme along with three enhancements: voting with a primary site, dynamic voting, and dynamic voting with linearly ordered copies. We develop a stochastic model which compares the file availabilities afforded by each of these schemes. We show that in this model dynamic voting with linearly ordered copies provides the greatest availability.
I. INTRODUCTION
There are several consistency control algorithms for managing replicated data in the face of network partitioning due to site or communication link failures [4]. Voting [5,12,15] is the best known example of such a scheme. It has several appealing aspects: its availability is reasonable; its simple statement permits a clear correctness proof; and it is simple to implement. Voting with a primary site is a simple extension of voting. More recently, researchers have introduced two other enhancements to voting, called
dynamic voting [6] (see also [3]) and dynamic voting with linearly ordered copies [7]. These enhancements share all the advantages of the voting scheme; we show that they provide greater availability as well.
Sections II and III give formal statements of the problem and the four algorithms listed above. Section IV provides a stochastic analysis of the availabilities of these algorithms. The model we use assumes that update requests arrive much more frequently than sites fail or are repaired. In the context of our model, we state theorems that compare the availabilities of the four algorithms. Our main result is that dynamic voting with linearly ordered copies provides the greatest availability.
II. FORMAL SPECIFICATION OF PROBLEM
The distributed database (DDB) system consists of a collection of independent computers, called nodes or sites, connected via communication links. We assume that site failures are clean, i.e., nodes stop executing without performing any incorrect actions, and that node crashes are
detectable by other nodes. We do not include Byzantine failures [11], where sites may act in an arbitrary and malicious manner. Site or communication failures may separate the sites into more than one connected component of communicating sites. We call each connected component a partition.
There are several logical files in the DDB, and a physical copy of each logical file is stored at one or more sites. Each site keeps a history of all updates which it performed on a file. We assume that each site runs a concurrency control protocol which ensures that the execution of all transactions within any partition is serializable [8,1]. While serializability of transactions at each site is certainly desirable, it is not sufficient to guarantee that the transactions running in different sites will combine to yield a serializable result; therefore, it is necessary to run a consistency control protocol which correctly manages the replicated data in the presence of failures. (An excellent survey of several of these strategies is given in [4].)
In a pessimistic consistency control protocol, mutual consistency of a replicated file is maintained by making sure that all reads are fresh and that files are updated in at most one partition at any given time. We will call such a partition the majority partition. Different pessimistic protocols use different definitions of a majority partition. When site or communication link recoveries cause partitions to unite, the nodes form a new partition by comparing their histories and obtaining, if necessary, all updates that they have missed. If there does not exist a majority partition, all sites in the system must wait until enough sites and communication links are repaired so that there is once again a majority partition in the system. Since this wait is unavoidable [14], the challenge is to come up with a pessimistic consistency control algorithm which not only preserves mutual consistency of the various copies of a file but also achieves high availability.
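To make the majority-partition test concrete, the sketch below (in Python) checks whether a set of mutually reachable sites forms a majority partition under plain voting, assuming one vote per copy; the function and site names are illustrative choices of ours, not notation from any of the protocols defined in Section III.

    # A minimal sketch of the majority-partition test under plain voting.
    # Assumptions (not from the paper): each site holding a copy carries
    # one vote, and a partition is the majority partition iff it holds
    # strictly more than half of the votes.

    def is_majority_partition(partition, copy_sites):
        """Return True iff `partition` holds a strict majority of the copies."""
        votes = len(set(partition) & set(copy_sites))
        return votes > len(copy_sites) / 2

    copy_sites = {"A", "B", "C", "D", "E"}                     # sites storing the file f
    print(is_majority_partition({"A", "D", "E"}, copy_sites))  # True: 3 of 5 votes
    print(is_majority_partition({"B", "C"}, copy_sites))       # False: 2 of 5 votes

The three enhancements studied in this paper differ precisely in how they define the majority partition, replacing this fixed test with more permissive ones.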
We can pictorially represent the history of the network's failures and recoveries by using the notion of a partition graph [10], defined as follows.
Definition 1. A partition graph for a file f is a directed acyclic graph such that the nodes correspond to the partitions and the edges correspond to either a fragmentation of a partition into two or more subpartitions or a coalescence of two or more partitions into a single partition.
Example 1. An example of a partition graph is shown in Figure 1. The source node is labeled with the names of the sites ABCDE that have copies of the file f, indicating that these sites are all connected and that the copies of f are mutually consistent. The initial partition is fragmented into two partitions ABC and DE. Later B becomes isolated from AC, and subsequently A and C also become isolated. Ultimately, A, D, and E resume communication and form a single partition.
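As a concrete rendering of Definition 1, the sketch below builds the partition graph of Example 1 as a directed acyclic graph; the adjacency-list representation and the helper names split and merge are our own illustrative choices, not notation from the paper.

    # A sketch of the partition graph of Example 1 (Figure 1) as a DAG.
    # Nodes are partitions (frozensets of site names); a directed edge runs
    # from a partition to each partition it fragments into, or from each
    # partition taking part in a coalescence to the resulting partition.

    from collections import defaultdict

    edges = defaultdict(list)

    def split(parent, children):
        # Fragmentation: one partition breaks into two or more subpartitions.
        for child in children:
            edges[frozenset(parent)].append(frozenset(child))

    def merge(parents, child):
        # Coalescence: two or more partitions unite into a single partition.
        for parent in parents:
            edges[frozenset(parent)].append(frozenset(child))

    split("ABCDE", ["ABC", "DE"])  # initial fragmentation
    split("ABC", ["B", "AC"])      # B becomes isolated from AC
    split("AC", ["A", "C"])        # A and C become isolated
    merge(["A", "DE"], "ADE")      # A, D, and E form a single partition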