An application-specific instruction set for the Tensilica LX5 core: HASHI for faster hashing


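HASHI is an application-specific instruction set extension for hashing, presented in 2014 by Oliver Arnold, Sebastian Haas, Gerhard Fettweis, Benjamin Schlegel, Thomas Kissinger, Tomas Karnagel, and Wolfgang Lehner. The authors work at the Vodafone Chair Mobile Communications Systems and the Database Technology Group of Technische Universität Dresden, and the work targets the performance and energy efficiency of hash operations in database query processing. Hashing is central to modern database systems: core operators such as group-by, selections, and the various join implementations all rely on efficient hash functions. General-purpose processors can become a bottleneck for these operations, particularly on large data volumes or on more complex data types such as strings. To address this, the authors extend the Tensilica Xtensa LX5 core with specialized instructions. They implement a bit extraction hashing scheme for 32-bit integer keys and the CityHash function for string values, identify the parts of each algorithm that need to be optimized, and build a hashing-specific instruction set around them. The reported experiments show hashing up to two orders of magnitude faster than on the base core without extensions, together with a better footprint in energy consumption and chip area. This is relevant to real-time analytics, large-scale data processing, and database applications on mobile devices, where hardware-level optimization of hashing supports overall database system performance.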
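The bit extraction scheme for 32-bit integer keys can be pictured in plain C. The sketch below is only an illustration under an assumption: it takes "bit extraction" to mean gathering the key bits selected by a mask into a dense bucket index. The function name `bit_extract_hash` and the mask value used afterwards are hypothetical and are not taken from the paper or from the Xtensa instruction set.

```c
#include <stdint.h>

/*
 * Software sketch of a bit extraction hash (illustrative assumption, not
 * the paper's instruction): every bit of `key` whose position is set in
 * `mask` is copied, in order, into the low bits of the result.
 */
static uint32_t bit_extract_hash(uint32_t key, uint32_t mask)
{
    uint32_t hash = 0;
    uint32_t out_bit = 1;                      /* next output bit to fill */

    while (mask != 0) {
        uint32_t lowest = mask & (~mask + 1u); /* isolate lowest set mask bit */
        if (key & lowest) {
            hash |= out_bit;
        }
        out_bit <<= 1;
        mask &= mask - 1u;                     /* clear that mask bit */
    }
    return hash;
}
```

With the hypothetical mask `0x00000F0F`, eight key bits are selected, so the result indexes one of 256 buckets; for the key `0x12345678` the function returns `0x68`. The loop runs once per selected bit, which is the kind of per-key work a single dedicated instruction could plausibly collapse.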

HASHI: An Application-Specific Instruction Set Extension for Hashing

Oliver Arnold, Sebastian Haas, Gerhard Fettweis
Vodafone Chair Mobile Communications Systems
Technische Universität Dresden
Dresden, Germany

Benjamin Schlegel*, Thomas Kissinger, Tomas Karnagel, Wolfgang Lehner
Database Technology Group
Technische Universität Dresden
Dresden, Germany

ABSTRACT

Hashing is one of the most relevant operations within query processing. Almost all core database operators like group-by, selections, or different join implementations rely on highly efficient hash implementations. In this paper, we present a way to significantly improve performance and energy efficiency of hash operations using specialized instruction set extensions for the Tensilica Xtensa LX5 core. To show the applicability of instruction set extensions, we implemented a bit extraction hashing scheme for 32-bit integer keys as well as the CityHash function for string values. We identify the individual parts of the algorithms required to be optimized, we describe our hashing-specific instruction set, and finally give a comprehensive experimental evaluation. We observed that the hash implementation using the hashing-specific instruction set (1) is up to two orders of magnitude faster than the basic core without extensions, (2) always exhibits better performance compared to hand-tuned code running on modern high-end general purpose CPUs, and (3) has a significantly better footprint with respect to energy consumption as well as chip area. Especially the third observation has the potential for a higher packing density and therefore a significantly better overall system performance.

1. INTRODUCTION

Database systems can be optimized in many different directions, while the overall challenges are often optimizations of algorithms or adapting the software to given hardware features like multiple cores, SIMD, and memory hierarchies. Algorithms deployed in database systems are therefore highly tuned and very often either reach the processor's peak performance or they are limited by some system characteristics like memory bandwidth or latency, the number of available cores, or down-scaled core frequencies due to power and heat dissipation limitations.

* This author works now at Oracle Labs, Belmont, CA, USA.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Articles from this volume were invited to present their results at ADMS'14, a workshop co-located with The 40th International Conference on Very Large Data Bases, September 1st - 5th 2014, Hangzhou, China.

Therefore, a high performance database system design also depends on hardware specializations to achieve even better performance. Unfortunately, general purpose processors are reaching their limits since single-threaded performance has almost stopped increasing, because the maximum core frequency is limited by physical constraints. Even the current solution, to put more and more homogeneous cores onto a single socket, will also reach physical limitations soon. As the feature size in which processors are manufactured will shrink, the growing number of transistors will increase the occurrence of dark silicon [4, 6]. Dark silicon is caused by thermal problems, since supplying all transistors with energy at the same time would overheat the processor. Having an energy-efficient processor design and introducing specialized instruction sets that are usable on-demand mitigates the impact of dark silicon. Especially the latter idea can be implemented on a fraction of the chip space, where the instruction set extensions can be power-gated whenever needed without compromising the overall general purpose characteristics of the chip itself.

To follow the idea of specialized instruction sets and to push the envelope in database performance, we strive to widen the view towards adjusting processors for today's and future query processing needs. Processors themselves can be extended to allow for a higher query throughput and lower latency by adding specialized instruction sets, optimized for supporting query processing primitives. Special purpose hardware extensions for database operations can improve the performance of the supported algorithms significantly while, at the same time, saving energy in the processor, mitigating the impact of dark silicon and therefore allowing a much higher packing density of individual cores due to good electrical and thermal properties.

In previous work, we proposed an instruction set extension to accelerate set-oriented database primitives [2]. In this paper, we want to go further with this novel way of optimization by providing a specialized instruction set extension for database hashing primitives in combination with a low-energy processor design. Hashing is widely used in modern databases for join implementations, aggregation operators, as well as indexing. To support all these operators with instruction set extensions, we investigate hashing primitives like integer and string hashing, as well as insert and lookup operations. For all these operations, we use well-
