TIMESTAMP, a read query makes a local copy of the tuple to ensure repeatable reads, since it is not protected by locks. When a transaction
is aborted, it is assigned a new timestamp and then restarted. This
corresponds to the “basic T/O” algorithm as described in [3], but
our implementation uses a decentralized scheduler.
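As an illustration, the following C++ sketch shows the read path of such a timestamp-ordering scheme: a transaction whose timestamp is older than the tuple's last write must abort, and a successful read returns a private copy of the tuple. The `Tuple` layout and field names are illustrative rather than taken from our engine, and the latching needed to update the timestamp metadata atomically is omitted.

```cpp
#include <algorithm>
#include <cstdint>
#include <optional>
#include <vector>

// Illustrative tuple layout with the metadata that basic T/O needs.
struct Tuple {
    uint64_t read_ts  = 0;    // largest timestamp that has read this tuple
    uint64_t write_ts = 0;    // timestamp of the transaction that last wrote it
    std::vector<char> data;   // row payload
};

// Read path: if the tuple was already written by a "newer" transaction, the
// reader must abort and restart with a fresh timestamp; otherwise the read is
// recorded and the caller gets a local copy, so later reads of the same tuple
// are repeatable even though no lock is held.
std::optional<std::vector<char>> to_read(Tuple& t, uint64_t txn_ts) {
    if (txn_ts < t.write_ts)
        return std::nullopt;                      // too late: abort and restart
    t.read_ts = std::max(t.read_ts, txn_ts);      // remember the read
    return t.data;                                // private copy for repeatable reads
}
```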
Multi-version Concurrency Control (MVCC): Under MVCC,
every write operation creates a new version of a tuple in the database [4,
5]. Each version is tagged with the timestamp of the transaction
that created it. The DBMS maintains an internal list of the versions
of an element. For a read operation, the DBMS determines which
version in this list the transaction will access. Thus, it ensures a
serializable ordering of all operations. One benefit of MVCC is that
the DBMS does not reject operations that arrive late. That is, the
DBMS does not reject a read operation because the element that it
targets has already been overwritten by another transaction [5].
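The version-chain bookkeeping described above can be sketched in a few lines of C++; the layout below is illustrative and omits write-write conflict detection and garbage collection of old versions.

```cpp
#include <cstdint>
#include <memory>
#include <vector>

// Illustrative version-chain layout: newest version at the head of the list.
struct Version {
    uint64_t begin_ts;                  // timestamp of the transaction that created it
    std::vector<char> data;
    std::unique_ptr<Version> older;     // next-older version
};

// A read is never rejected for arriving "late": it walks the chain to the
// newest version that is visible at the reader's timestamp.
const Version* mvcc_read(const Version* newest, uint64_t txn_ts) {
    for (const Version* v = newest; v != nullptr; v = v->older.get())
        if (v->begin_ts <= txn_ts)
            return v;                   // visible version
    return nullptr;                     // tuple did not exist at txn_ts
}

// A write prepends a new version tagged with the writer's timestamp.
std::unique_ptr<Version> mvcc_write(std::unique_ptr<Version> newest,
                                    uint64_t txn_ts, std::vector<char> data) {
    auto v = std::make_unique<Version>();
    v->begin_ts = txn_ts;
    v->data = std::move(data);
    v->older = std::move(newest);
    return v;
}
```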
Optimistic Concurrency Control (OCC): The DBMS tracks
the read/write sets of each transaction and stores all of their write
operations in their private workspace [28]. When a transaction
commits, the system determines whether that transaction’s read set
overlaps with the write set of any concurrent transactions. If no
overlap exists, then the DBMS applies the changes from the trans-
action’s workspace into the database; otherwise, the transaction is
aborted and restarted. The advantage of this approach for main
memory DBMSs is that transactions write their updates to shared
memory only at commit time, and thus the contention period is
short [42]. Modern implementations of OCC include Silo [42] and
Microsoft’s Hekaton [11, 29]. The OCC algorithm used in this paper is similar to Hekaton in that we parallelize the validation phase, which makes it more scalable than the original algorithm [28].
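The commit-time check described above can be sketched as a simple backward validation pass in C++; the data structures are illustrative, and the parallelized validation used in our implementation is not shown.

```cpp
#include <cstdint>
#include <unordered_map>
#include <unordered_set>
#include <vector>

// Illustrative per-transaction bookkeeping for OCC.
struct OccTxn {
    std::unordered_set<uint64_t> read_set;                    // keys read
    std::unordered_map<uint64_t, std::vector<char>> writes;   // private workspace
};

// Backward validation against the write sets of transactions that committed
// while this one was running. If any such writer touched something we read,
// our reads may be stale, so we abort; otherwise the private workspace can be
// installed into the shared database.
bool occ_validate(const OccTxn& txn,
                  const std::vector<std::unordered_set<uint64_t>>& committed_write_sets) {
    for (const auto& ws : committed_write_sets)
        for (uint64_t key : txn.read_set)
            if (ws.count(key))
                return false;   // conflict: abort and restart
    return true;                // safe to copy txn.writes into shared memory
}
```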
T/O with Partition-level Locking (H-STORE): The database is
divided into disjoint subsets of memory called partitions. Each
partition is protected by a lock and is assigned a single-threaded
execution engine that has exclusive access to that partition. Each
transaction must acquire the locks for all of the partitions that it
needs to access before it is allowed to start running. This requires
the DBMS to know which partitions each transaction will access before it begins [34]. When a transaction request
arrives, the DBMS assigns it a timestamp and then adds it to all
of the lock acquisition queues for its target partitions. The execu-
tion engine for a partition removes a transaction from the queue
and grants it access to that partition if the transaction has the oldest
timestamp in the queue [38]. Smallbase was an early proponent of
this approach [22], and more recent examples include H-Store [27].
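The per-partition lock-acquisition queues described above can be sketched as follows; the data structures are illustrative, and the single-threaded execution engines themselves are not modeled.

```cpp
#include <cstdint>
#include <functional>
#include <queue>
#include <vector>

// One lock-acquisition queue per partition, ordered by transaction timestamp
// (oldest first). Each partition is also assigned a single-threaded execution
// engine, which is not shown here.
struct Partition {
    std::priority_queue<uint64_t, std::vector<uint64_t>, std::greater<uint64_t>> queue;
};

// On arrival, a transaction is timestamped and enqueued at every partition it
// will touch; the target partitions must be known up front.
void enqueue(std::vector<Partition>& parts, const std::vector<int>& targets, uint64_t txn_ts) {
    for (int p : targets)
        parts[p].queue.push(txn_ts);
}

// A partition's engine grants access only to the transaction with the oldest
// timestamp at the head of its queue; a multi-partition transaction may run
// once it is at the head of all of its target queues.
bool can_run(const std::vector<Partition>& parts, const std::vector<int>& targets, uint64_t txn_ts) {
    for (int p : targets)
        if (parts[p].queue.empty() || parts[p].queue.top() != txn_ts)
            return false;
    return true;
}
```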
3. MANY-CORE DBMS TEST-BED
Since many-core chips do not yet exist, we performed our anal-
ysis through Graphite [30], a CPU simulator that can scale up to
1024 cores. For the DBMS, we implemented a main memory OLTP
engine that only contains the functionality needed for our experi-
ments. The motivation for using a custom DBMS is twofold. First, we can ensure that no bottlenecks other than concurrency control exist. This allows us to study the fundamentals of each
scheme in isolation without interference from unrelated features.
Second, using a full-featured DBMS is impractical due to the con-
siderable slowdown of simulators (e.g., Graphite has an average
slowdown of 10,000×). Our engine allows us to limit the experi-
ments to reasonable times. We now describe the simulation infras-
tructure, the DBMS engine, and the workloads used in this study.
3.1 Simulator and Target Architecture
Figure 1: Graphite Simulator Infrastructure – Application threads are mapped to simulated core threads deployed on multiple host machines.

Figure 2: Target Architecture – Tiled chip multi-processor with 64 tiles and a 2D-mesh network-on-chip. Each tile contains a processing core, L1 and L2 caches, and a network switch (SW).

Graphite [30] is a fast CPU simulator for large-scale multi-core systems. Graphite runs off-the-shelf Linux applications by creat-
ing a separate thread for each core in the architecture. As shown
in Fig. 1, each application thread is attached to a simulated core
thread that can then be mapped to different processes on separate
host machines. For additional performance, Graphite relaxes cy-
cle accuracy, using periodic synchronization mechanisms to model
instruction-level granularity. As with other similar CPU simulators,
it only executes the application and does not model the operating
system. For this paper, we deployed Graphite on a 22-node cluster,
each node with dual-socket Intel Xeon E5-2670 CPUs and 64GB of DRAM.
The target architecture is a tiled multi-core CPU, where each tile
contains a low-power, in-order processing core, 32KB L1 instruc-
tion/data cache, a 512KB L2 cache slice, and a network router.
This is similar to other commercial CPUs, such as Tilera’s Tile64
(64 cores), Intel’s SCC (48 cores), and Intel’s Knights Landing (72
cores) [1]. Tiles are interconnected using a high-bandwidth, 2D-
mesh on-chip network, where each hop takes two cycles. Both the
tiles and network are clocked at 1GHz frequency. A schematic of
the architecture for a 64-core machine is depicted in Fig. 2.
We use a shared L2-cache configuration because it is the most
common last-level cache design for commercial multi-cores. In a
comparison experiment between shared and private L2-caches, we
observe that shared caches lead to significantly less memory traffic
and higher performance for OLTP workloads due to their increased aggregate cache capacity (results not shown). Since L2 slices are
distributed among the different tiles, the simulated multi-core sys-
tem is a NUCA (Non-Uniform Cache Access) architecture, where
L2-cache latency increases with distance in the 2D-mesh.
3.2 DBMS
We implemented our own lightweight main memory DBMS based
on pthreads to run in Graphite. It executes as a single process with
the number of worker threads equal to the number of cores, where
each thread is mapped to a different core. All data in our DBMS is
stored in memory in a row-oriented manner. The system supports
basic hash table indexes and a pluggable lock manager that allows
us to swap in the different implementations of the concurrency control schemes described in Section 2. It also allows the indexes and lock manager to be partitioned (as is the case with the H-STORE scheme) or run in a centralized mode.
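As a sketch of what such a pluggable design might look like (the class and method names are illustrative and do not correspond to the actual code of our engine), each scheme could implement a common interface that the worker threads call before accessing a row and at commit or abort:

```cpp
#include <cstdint>

// Hypothetical interface that each concurrency control scheme implements so
// the engine can swap schemes without changing the worker-thread code.
struct Row;

enum class AccessType { READ, WRITE };
enum class RC { OK, ABORT, WAIT };

class ConcurrencyControl {
public:
    virtual ~ConcurrencyControl() = default;
    // Called by a worker thread before a transaction touches a row.
    virtual RC access(uint64_t txn_id, Row* row, AccessType type) = 0;
    // Called at commit or abort to release locks, validate (OCC), or
    // install/discard versions (MVCC).
    virtual RC finish(uint64_t txn_id, bool commit) = 0;
};
```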