Optimizing Minimal Perfect Hash Functions: The Hash and Displace Technique

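"Hash and Displace - Efficient Evaluation of Minimum Perfect Hash Functions - 1999 (10.1.1.148.7694)" is a computer science research paper on constructing minimal perfect hash functions that are efficient to evaluate. The author, Rasmus Pagh, was affiliated with BRICS at the Department of Computer Science, University of Aarhus, Denmark. A perfect hash function for a given set maps the elements of that set into a table without collisions, so every element receives its own slot; the function is minimal when the table is exactly as large as the set.

The paper proposes a new construction that considerably reduces the overhead of resolving buckets in two-level hashing schemes. Evaluating the hash function requires just one multiplication and a few additions apart from primitive bit operations, and it makes two memory accesses, one of which is to a fixed location. This improves the probe performance of previous minimal perfect hashing schemes and is shown to be optimal. For a set S of size n, the description of the hash function (the "program") occupies O(n) words and can be constructed in expected O(n) time, so both the space and the expected construction time are linear in n.

The introduction explains that the paper studies classes of hash functions that are perfect for the n-subsets of a finite universe U = {0, ..., u-1}: for every such subset S, a perfect class contains a function that has no collisions on S. Such classes can be used to build static dictionaries, that is, data structures that store a fixed set and answer membership queries.

A short version of the paper appeared at WADS 1999 (LNCS 1663, pages 49-54, Springer Verlag). The full version is available online from the author's page at BRICS. The paper is relevant to researchers and practitioners in data structures and algorithms, in particular when a large, fixed set of keys must be hashed without collisions.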

Short version appeared at WADS 1999, LNCS 1663, p. 49–54.  Springer Verlag.

Available on-line at http://www.brics.dk/~pagh/papers/

Hash and Displace: Efficient Evaluation of Minimal Perfect Hash Functions

Rasmus Pagh

BRICS, Department of Computer Science, University of Aarhus, 8000 Aarhus C, Denmark

Email: pagh@brics.dk

Abstract

A new way of constructing (minimal) perfect hash functions is described. The technique considerably reduces the overhead associated with resolving buckets in two-level hashing schemes. Evaluating a hash function requires just one multiplication and a few additions apart from primitive bit operations. The number of accesses to memory is two, one of which is to a fixed location. This improves the probe performance of previous minimal perfect hashing schemes, and is shown to be optimal. The hash function description (“program”) for a set of size n occupies O(n) words, and can be constructed in expected O(n) time.
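As an illustration of the general hash-and-displace idea, here is a minimal Python sketch. It is not the construction from the paper: the hash family `((a*x + b) mod P) mod m`, the choice of n buckets, the greedy first-fit placement, and all names (`make_hash`, `place`, `build_mph`) are assumptions made for the example. One function g selects a bucket, a second function f gives a base position, and a per-bucket displacement d[g(x)] shifts the bucket so that h(x) = (f(x) + d[g(x)]) mod n is 1-1 on the key set.

```python
import random

P = (1 << 61) - 1  # prime modulus; keys are assumed to be distinct ints in [0, P)

def make_hash(rng, m):
    """Stand-in hash x -> ((a*x + b) mod P) mod m (not the paper's family)."""
    a, b = rng.randrange(1, P), rng.randrange(P)
    return lambda x: ((a * x + b) % P) % m

def place(buckets, n):
    """Greedy first fit, largest buckets first; return displacements or None."""
    d, used = [0] * len(buckets), [False] * n
    for i in sorted(range(len(buckets)), key=lambda i: -len(buckets[i])):
        for disp in range(n):
            slots = [(v + disp) % n for v in buckets[i]]
            # The shifted bucket must hit distinct slots that are all still free.
            if len(set(slots)) == len(slots) and not any(used[s] for s in slots):
                for s in slots:
                    used[s] = True
                d[i] = disp
                break
        else:
            return None  # no displacement fits this bucket
    return d

def build_mph(keys, seed=0, max_tries=1000):
    """Retry random (f, g) until a displacement table exists; return (h, d)."""
    keys = list(keys)
    n = len(keys)
    rng = random.Random(seed)
    for _ in range(max_tries):
        f, g = make_hash(rng, n), make_hash(rng, n)  # n buckets: an assumption
        buckets = [[] for _ in range(n)]
        for x in keys:
            buckets[g(x)].append(f(x))
        d = place(buckets, n)
        if d is not None:
            # Evaluation reads a single table entry, d[g(x)].
            return (lambda x, f=f, g=g, d=d: (f(x) + d[g(x)]) % n), d
    raise RuntimeError("no displacement table found")

keys = [3, 17, 42, 1000, 65536, 99, 7, 123456789]
h, d = build_mph(keys)
print(sorted(h(x) for x in keys))  # a permutation of 0..n-1 when h is minimal perfect
```

In this sketch the only data-dependent memory access during evaluation is the lookup `d[g(x)]`; the parameters of f and g play the role of the data kept at a fixed location. Unlike the scheme in the abstract, the sketch uses two multiplications and makes no claim about expected O(n) construction time.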

1 Introduction

This paper deals with classes of hash functions which are perfect for the n-subsets of the finite universe U = {0, . . . , u − 1}. For any S ∈ $\binom{U}{n}$ (the subsets of U of size n) a perfect class contains a function which is 1-1 on S (“perfect” for S). We consider perfect classes with range {0, . . . , a − 1}.

A perfect class of hash functions can be used to construct static dictionaries (data structures storing sets S ∈ $\binom{U}{n}$ and supporting membership queries, “u ∈ S?”): Store a function h which is 1-1 on the set, S, and for each element s ∈ S, store s in entry h(s) of an a-element table. A membership query on s is processed by comparing s to entry h(s) in the table. The attractiveness of using perfect hash functions for this purpose depends on several characteristics of the class.

1. The efficiency of evaluation, in terms of computation and the number of probes into the description.

2. How hard is it to find a perfect function in the class?

3. How close to n can we choose a, the range of the functions?

4. How much space is required to store a function?

It turns out that, for suitable perfect classes of hash functions, the answers to all of these questions are satisfactory in the sense that it is possible to do (more or less) as well as one could hope for. Nevertheless, theoretically optimal schemes have seen limited use in practice, being substituted by heuristics which typically work well. It is thus still of interest to find classes with good properties from a theoretical as well as from a practical point of view.
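The static dictionary construction described in the introduction (store a function h that is 1-1 on S, put each s in entry h(s) of an a-element table, and answer a query by comparing the key to entry h(key)) can be sketched in Python as follows. This is a sketch under stated assumptions, not the scheme of the paper: the class `((c*x + e) mod P) mod a`, the default range a = n², and the names `find_perfect` and `StaticDict` are all hypothetical choices for the example. With a = n² a random function from this class is 1-1 on S with probability at least 1/2, so a perfect function is easy to find, but the range is far from n.

```python
import random

P = (1 << 61) - 1  # prime modulus; keys are assumed to be ints in [0, P)

def find_perfect(S, a, seed=0, max_tries=1000):
    """Draw x -> ((c*x + e) mod P) mod a until it is 1-1 on S."""
    rng = random.Random(seed)
    for _ in range(max_tries):
        c, e = rng.randrange(1, P), rng.randrange(P)
        h = lambda x, c=c, e=e: ((c * x + e) % P) % a
        if len({h(s) for s in S}) == len(S):  # no two keys share an entry
            return h
    raise RuntimeError("no perfect function found")

class StaticDict:
    """Static set with membership queries via one perfect hash function."""

    def __init__(self, S, a=None):
        S = set(S)
        # Range a = n^2 is an assumption that keeps the search trivial.
        self.a = a if a is not None else max(1, len(S) ** 2)
        self.h = find_perfect(S, self.a)
        self.table = [None] * self.a
        for s in S:
            self.table[self.h(s)] = s  # store s in entry h(s)

    def __contains__(self, x):
        # Compare the key to the single entry it hashes to.
        return self.table[self.h(x)] == x

d = StaticDict([3, 17, 42, 1000, 65536])
print(42 in d, 5 in d)  # True False
```

The sketch makes the trade-offs in the list above concrete: evaluation is cheap and a perfect function is found quickly, but the table has n² entries instead of roughly n, which is the gap that minimal perfect hashing schemes close.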

Supported in part by the ESPRIT Long Term Research Programme of the EU under project number 20244 (ALCOM-IT).

BRICS: Basic Research in Computer Science, Centre of the Danish National Research Foundation.
