并发分布式数据库：理论与实践方法综述

需积分: 9 199 浏览量更新于2024-07-21 收藏 3.05MB PDF 举报

并发分布式数据库是现代信息技术领域的一个关键课题，它涉及在多台计算机或节点之间共享和管理数据的同时保持数据的一致性和完整性，尤其是在高并发环境下。这篇论文由Philipp Bernstein和Nathaniel Goodman撰写，发表于某计算机公司，地址位于马萨诸塞州剑桥。他们的研究旨在全面梳理和总结分布式数据库并发控制领域的最新进展。核心概念是将并发控制问题分解为两个主要子问题：读写（Read-Write）同步和写写（Write-Write）同步。这两个子问题分别处理数据的读取过程中可能发生的冲突，以及在写入操作之间的冲突。解决这些问题的关键技术包括一系列同步策略，如锁定、时间戳（timestamps）、死锁预防（deadlock avoidance）、可见性（senahzability）、两阶段提交（Two-phase Commit, 2PC）和两阶段锁协议（Two-phase Locking, 2PL）等。作者详细描述了48种主要的并发控制方法，这些方法涵盖了文献中已有的实用算法，甚至包含了一些新的创新思路。这些方法着重于结构设计和正确性验证，性能优化则被放在次要位置，因为论文更关注基础理论和原则的阐述。并发控制算法的结构至关重要，它决定了如何组织并发操作，确保数据的一致性，防止数据不一致和竞态条件。正确性则是指算法在所有可能的并发执行路径上都能保证最终状态的正确性，避免出现错误结果。例如，通过使用时间戳和顺序化（timestamp ordering）来确定事务的执行顺序，从而避免了数据的不一致性。死锁是并发控制中的一个常见问题，它发生在多个事务因互相等待对方释放资源而无法继续执行。论文中提到的策略旨在通过预防或检测死锁来提升系统的可用性。两阶段提交和两阶段锁协议则是经典的并发控制技术，它们通过协调多节点间的事务，确保分布式环境下的事务完整性。这篇文章提供了深入理解并发分布式数据库并发控制的核心概念和技术框架，对于数据库管理系统的设计者、开发者以及对分布式计算有深入了解的读者来说，是一份宝贵的参考资料。尽管性能是讨论的一部分，但本文更侧重于并发控制算法的设计原理和理论基础，为后续的研究和实践提供了一个坚实的基础。

Concurrency Control in Database Systems

•

191

In a centralized DBMS we assumed that

(1) private workspaces were part of the TM,

and (2) data could freely move between a

transaction and its workspace, and between

a workspace and the DM. These assump-

tions are not appropriate in a DDBMS

because TMs and DMs may run at different

sites and the movement of data between a

TM and a DM can be expensive. To reduce

this cost, many DDBMSs employ query

optimization procedures which regulate

(and, it is hoped, reduce) the flow of data

between sites. For example, in SDD-1 the

private workspace for transaction T is dis-

tributed across all sites at which T accesses

data [BF.RN81]. The details of how T reads

and writes data in these workspaces is a

query optimization problem and has no di-

rect effect on concurrency control.

The problem of atomic commitment is

aggravated in a DDBMS by the possibility

of one site failing while the rest of the

system continues to operate. Suppose T is

updating x, y, z stored at DMx, DMy, DMz,

and suppose T's TM fails after issuing dm-

write(x), but before issuing the dm-writes

for y and z. At this point the database is

incorrect. In a centralized DBMS this phe-

nomenon is not harmful because no trans-

action can access the database until the

TM recovers from the failure. However, in

a DDBMS, other TMs remain operational

and can access the incorrect database.

To avoid this problem, prewrite com-

mands must be modified slightly. In addi-

tion to specifying data items to be copied

onto secure storage, prewrites also specify

which other DMs are involved in the com-

mitment activity. Then if the TM fails dur-

ing the second phase of two-phase commit,

the DMs whose dm-writes were not issued

can recognize the situation and consult the

other DMs involved in the commitment. If

any DM received a dm-write, the remaining

ones act as if they had also received the

command. The details of this procedure are

complex and appear in HAMM80.

As in a centralized DBMS, a transaction

T accesses the system by issuing BEGIN,

READ, WRITE, and END operations. In

a DDBMS these are processed as follows.

BEGIN: The TM creates a private work-

space for T. We leave the location and

organization of this workspace unspecified.

READ(X): The TM checks T's private

workspace to see if a copy of X is present.

If so, that copy's value is made available to

T. Otherwise the TM selects some stored

copy of X, say xi, and issues din-read(x,) to

the DM at which x, is stored. The DM

responds by retrieving the stored value of

x, from the database, placing it in the pri-

vate workspace. The TM returns this value

to T.

WRITE(X, new-value): The value of X in

T's private workspace is updated to new-

value, assuming the workspace contains a

copy of X. Otherwise, a copy of X with the

new value is created in the workspace.

END: Two-phase commit begins. For

each X updated by T, and for each stored

copy x, of X, the TM issues a prewrite (x,)

to the DM that stores x,. The DM responds

by copying the value of X from T's private

workspace onto secure storage internal to

the DM. After all prewrites are processed,

the TM issues dm-writes for all copies of all

logical data items updated by T. A DM

responds to dm-write(x,) by copying the

value of x, from secure storage into the

stored database. After all dm-writes are

installed, T's execution is finished.

2. DECOMPOSITION OF THE CONCUR-

RENCY CONTROL PROBLEM

In this section we review concurrency con-

trol theory with two objectives: to define

"correct executions" in precise terms, and

to decompose the concurrency control

problem into more tractable subproblems.

2.1 Serializability

Let E denote an execution of transactions

T1 ..... T,. E is a serial execution if no

transactions execute concurrently in E; that

is, each transaction is executed to comple-

tion before the next one begins. Every serial

execution is defined to be correct, because

the properties of transactions (see Section

1.1) imply that a serial execution terminates

properly and preserves database consist-

ency. An execution is serializable if it is

computationally equivalent to a serial exe-

cution, that is, if it produces the same out-

put and has the same effect on the database

as some serial execution. Since serial exe-

cutions are correct and every serializable

execution is equivalent to a serial one, every

serializable execution is also correct. The

Computing Surveys, Vol. 13, No. 2, June 1981

192

P. A. Bernstein and N. Goodman

Transachons

Database

T 1

•

BEGIN; i----n ~

READ (X); WRITE(Y); END

T 2 BEGIN;

READ(Y), WRITE(Z); END

T 3 .

BEGIN,

READ(Z), WRITE(X), END

One possible execution of T1, T2, and T3 is represented by the

following logs. (Note. r,[x] denotes the operation din-read(x) issued

by T~; w,[x] denotes a din-write(x) issued by T,.)

Log for DM A:

rl[xl]wl[yl]r2[yl]w3[xl]

Log for DM B:

wl[y2]w2[z2]

Log for DM C.

w2[z3]r3[z3]

Figure 4.

Modeling executions as logs.

• The execution modeled in Figure 4 is serial. Each

log is itself serial; that is, there is no interleaving of

operations from different transactions. At DM A, Ti

precedes T~ precedes T3; at DM B, % precedes T~;

and at DM C, T2 precedes T3. Therefore, TI, T2, T3

is a total order satisfying the definition of serial.

• The following execution is not serial. The logs them-

selves are not serial.

DM A:

rl[xl]r2[ YllW3[Xl]Wl[ yl]

DM B:

w2[z2]wl[y2]

DM C:

w2[z3lr3[z3]

• The following execution is also not serial Although

each log is serial, there is no total order consistent

with all logs.

DM A:

rl[x~]wl[yl]re[yl]w3[x~]

DM B: w2[z2]wl[y2]

DM C:

w2[z3]r3[z3]

Figure 5. Serial and nonserial loops.

goal of database concurrency control is to

ensure that all executions are serializable.

The only operations that access the

stored database are din-read and din-write.

Hence it is sufficient to model an execution

of transactions by the execution of din-

reads and din-writes at the various DMs of

the DDBMS. In this spirit we formally

model an execution of transactions by a set

logs,

each of which indicates the order in

which dm-reads and din-writes are proc-

essed at one DM (see Figure 4). An execu-

tion is

serial if

there is a total order of

transactions such that if T, precedes Tj in

the total order, then all of T,'s operations

precede all of Tfs operations in every log

where both appear (see Figure 5). Intui-

tively, this says that transactions execute

serially and in the same order at all DMs.

Two operations

conflict

if they operate

on the same data item and one of the op-

erations is a dm-write. The order in which

operations execute is computationally sig-

nificant if and only if the operations con-

flict. To illustrate the notion of conflict,

consider a data item x and transactions T,

and Tj. If T, issues dm-read (x) and T~

issues dm-write(x), the value read by T, will

(in general) differ depending on whether

the dm-read precedes or follows the dm-

write. Similarly, if both transactions issue

dm-write(x) operations, the final value of x

depends on which dm-write happens last.

Those conflict situations are called

read-

write (rw) conflicts and write-write (ww)

conflicts,

respectively.

The notion of conflict helps characterize

the equivalence of executions. Two execu-

tions are

computationally equivalent

if (1)

each dm-read operation reads data item

values that were produced by the same dm-

writes in both executions; and (2) the final

dm-write on each data item is the same in

both executions [PAPA77, PAPA79]. Condi-

tion (1) ensures that each transaction reads

the same input in both executions (and

therefore performs the same computation).

Computing Surveys, Vol. 13, No. 2, June 1981

剩余36页未读，继续阅读

AddisionYoung

粉丝: 7
资源: 4

并发分布式数据库：理论与实践方法综述

rqlite：基于SQLite的轻量级分布式关系数据库

pyrqlite:适用于rqlite的Python（DB-API 2.0）客户端，基于SQLite构建的轻量级分布式数据库

Hibatis轻量级高并发分布式数据库框架

分布式数据库分布式数据库.ppt

分布式数据库实践字节跳动分布式数据库实践V2.zip

分布式数据库实践字节跳动分布式数据库实践V2.pdf

分布式数据库并发控制

分布式数据库技术系列概览：分布式数据库核心技术发展趋势.pdf

分布式数据库实践金融分布式数据库在核心系统改造的实践V2.pdf

分布式数据库技术系列简报：云计算场景驱动分布式数据库技术演进.pdf

最新资源