优化Bloom Filter：降低数量提升效率

70 浏览量更新于2024-08-27 收藏 596KB PDF 举报

"Reducing the number of Bloom Filters" 这篇研究论文探讨了如何减少Bloom Filter的数量，以解决在特定应用中，如数据库、网络管理和计算机通信等领域，Bloom Filter使用过多导致的计算量过大和内存访问频繁的问题。Bloom Filter是一种空间效率极高的概率数据结构，常用于快速查询某个元素是否可能存在于集合中，而不会产生误报（False Positives），但可能会有漏报（False Negatives）。在网络安全领域，特别是在骨干路由器和数据中心交换机的转发表查找中，Bloom Filter被广泛应用。传统的解决方案是构建多个Bloom Filter以实现快速查找。然而，这种方法的主要缺点是需要进行大量的哈希计算和内存访问，这不仅消耗计算资源，还可能导致系统设计复杂度增加，可能降低系统性能。论文作者包括Qingge Gong、Tong Yang、Hongwei Tong、Kai Shi、Jinghui Li和Xianyan Wu，他们分别来自中国不同的军事和教育机构。研究者们提出，不同大小的Bloom Filter增加了系统设计的挑战，并可能对性能产生负面影响。为了克服这个问题，论文可能提出了新的优化策略或数据结构，旨在减少Bloom Filter的数量，同时保持其高效性和准确性。这可能涉及到更有效的哈希函数选择、动态调整Bloom Filter大小、合并多个Bloom Filter或者采用其他创新方法来减少计算和内存开销。通过这些改进，论文可能旨在提高网络设备的处理速度，减少资源消耗，从而提升整体网络性能。由于摘要没有提供具体的技术细节，我们无法深入了解他们是如何实现这一目标的。不过，可以推测，研究可能涉及了对Bloom Filter理论的深入理解，以及对现有系统性能瓶颈的分析，以找出减少Bloom Filter数量的关键点。此外，论文可能还包含了实验结果，证明了所提方法的有效性，并与其他现有方案进行了比较。这篇研究对于理解和优化使用Bloom Filter的系统具有重要意义，特别是对于那些需要处理大量数据并要求高效查找操作的网络环境。通过减少Bloom Filter的数量，系统设计者可以减轻计算和存储负担，提高系统的整体效率。

Reducing the Number of Bloom Filters

Qingge Gong

, Tong Yang

, Hongwei Tong

, Kai Shi

, Jinghui Li

, Xianyan Wu

1. Engineering University of CAPF, Xi’an, China. 2. Information and Navigation College, Air Force Engineering University,

Xi’an, China. 3. Military representative office of certain company, Chengdu, China. 4. PLA military 95880, China.

Email: {gongqingge@sina.com, yangtongemail@gmail.com, tongweihong69@163.com, prideshikai@gmail.com,

lijinghui1020@sina.com, wuxinyan518@163.com}

Abstract

—Bloom Filters have been applied in many fields, in-

cluding data base, network management, computer network, and

computer communication etc., owing to its fast membership que-

ry and memory efficiency. In network field, Bloom Filters were

used to lookup the forwarding tables in backbone routers and

Data Centre switches. These solutions all build many Bloom Fil-

ters to achieve fast lookup. The main shortcoming of these solu-

tion is: too many Bloom Filters require too many hash computa-

tions and memory accesses, and the variety of Bloom Filter sizes

poses challenges to system design and probably degrades the

system performance. To address this issues, we proposed Set

absorption algorithm to reduce the number of Bloom Filters,

while balancing the size of Bloom Filters. Experimental results

showed that after using our algorithm, better performance of the

Bloom Filter-based solutions was obtained.

Keywords—Bloom Filter; set absorption; LPM; IP lookup

I. INTRODUCTION

Bloom Filter was first proposed in 1970 [3], but applied to

computer network field 33 years later [2], then a lot of applica-

tions of Bloom Filter came to the fore. One important applica-

tion is to accelerate lookup, including Longest Prefix Matching

(LPM) and exact matching.

Routing lookup belonging to LMP is a classical issue, vari-

ous solutions have been proposed. Generally speaking, the so-

lutions can be divided into two categories: software-based solu-

tions and hardware-based solutions. Most software-based solu-

tions [1] [17] build a trie

to find the longest matched prefix,

but need multiple memory accesses for one lookup, thus the

lookup speed is relatively slow. Hardware-based solutions use

TCAM [19] [20] or GPU [21] to perform parallel lookup, thus

can achieve high speed, but suffer from high power consump-

tion and high hardware cost.

LMP is usually simplified into Exact Matching [2], and

Bloom Filter is also applied to exact matching. In enterprise

and data center networks, the forwarding tables of switches

could have tens or hundreds of thousands of end-host MAC

addresses. The performance of the Data Centre is determined to

a great extent by the lookup speed of switches. Existing solu-

tions adopts hash function to store the MAC address and its

outgoing link

(next-hop) to a large table. However, 1) large

hash table can hardly be held in fast memory; 2) the worst case

of hash collision cannot be bounded except that the forwarding

table is static

. Unfortunately, the forwarding tables (MAC

This work is supported by NSFC (61202489, 61272486) and CAPF certain

database project.

Trie is a tree-like data structure allowing the organization of prefixes on a

digital basis by using the bits of prefixes to direct the branching [1].

In this paper, outgoing link, port and the next-hop are exchangeable.

If the forwarding table is static, Perfect Hash Function can achieve no hash

collision [22].

tables) changes frequently due to the immigration of virtual

machines [18] and network fault.

To perform fast MAC lookup, Minlan Yu et al. [24] pro-

posed a Bloom Filter-based solution: BUFFALO. Dan Li et al.

[11] proposed multi-class Bloom Filter to improve BUFFALO

in multicast situation. The basic approach of these two algo-

rithms is same: splitting the forwarding table into many small

sets according to the port (outgoing link), then building one

Bloom Filter for each set. When false positive doesn't occur,

only one Bloom Filter will report true for one lookup, then the

next-hop is figured out. The membership query of one Bloom

Filter is fast, but the query time grows linearly with the in-

crease of the number of Bloom Filters. Specifically, suppose k

is the number of hash functions of one Bloom Filter, then one

Bloom Filter query needs k hash computations and k memory

accesses. Both hash functions and memory accesses will be-

come rk for r Bloom Filters. Therefore, the lookup speed will

go down quickly along with the increase of the port number.

This problem occurs in the situation with many Bloom Filters

(including the above three solutions). Unfortunately, one

switch could have a large number of next-hops, thus there will

be many Bloom Filters with different sizes.

In addition, to make the membership query as fast as possi-

ble, the Bloom Filters should be stored in fast memory, such as

SRAM (BUFFALO) and on-chip memory (PBF). When

Bloom Filter is stored in on-chip memory of FPGA [15], fixed

memory size must be assigned for each Bloom Filter. Unfortu-

nately, the size of Bloom Filters varies a lot, thus each Bloom

Filter will be assigned the maximum size in the implementation

for scalability, which is a waste of on-chip memory. To achieve

memory efficiency, the size of Bloom Filters should be bal-

anced. To address these issues, we propose Set absorption al-

gorithm to reduce the number of Bloom Filters, while balanc-

ing the sizes of Bloom Filters. The main idea of is that one

small set will be absorbed by two larger sets. The details are

illustrated in Section IV.

The rest of the paper is organized as follows. Section II in-

troduces the background knowledge, including IP lookup, for-

warding table, and the principle of Bloom Filters. Section III

details the Set absorption algorithm. Performance evaluation is

provided in Section IV, and finally we conclude our paper in

Section V.

II. BACKGROUND

A. Longest Prefix Matching

Bloom Filter which was proposed in [3], has been applied

to many fields. In this paper, we focus on its application in the

lookup of forwarding tables of routers and switches. Note that

the lookup of the forwarding tables of routers (we usually say

'routing lookup' or 'IP lookup') must satisfies Longest Prefix

Matching (LMP) rule, while that of switches only performs

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38547532

粉丝: 5
资源: 962

优化Bloom Filter：降低数量提升效率

Reducing the Number of Gray Levels in an Image

Reducing the Dimensionality of data neural networks

Reducing the Dimensionality of Data with Neural Networks

reducing the dimensionality of data with neural network

Reducing the Dimensionality of Data with Neural Networks.pdf

Copysets Reducing the Frequency of Data Loss in Cloud Storage

Science2006 - Reducing the Dimensionality of Data with Neural Networks

reducing the dimensionality of data with neural networks---hinton

【3】Reducing the dimensionality of data with neural networks.pdf

Apolipoprotein E4 Impairs in vivo Hippocampal Long-Term Synaptic Plasticity by Reducing the Phosphorylation of CaMKIIα and CREB

最新资源