To Project More or to Quantize More: Minimizing Reconstruction Bias for Learning Compact Binary Codes

Zhe Wang^{1,2}, Ling-Yu Duan^1, Junsong Yuan^2, Tiejun Huang^1, Wen Gao^1
^1 Institute of Digital Media, Peking University, Beijing, China
^2 Rapid-Rich Object Search (ROSE) Lab, Nanyang Technological University, Singapore
{zhew,lingyu,tjhuang,wgao}@pku.edu.cn, jsyuan@ntu.edu.sg
Abstract

We present a novel approach called Minimal Reconstruction Bias Hashing (MRH) to learn similarity preserving binary codes that jointly optimizes the projection and quantization stages. Our work tackles the important problem of how to elegantly connect the optimization of projection with the optimization of quantization, so as to maximize the complementary effects of the two stages. Distinct from previous works, MRH can adaptively adjust the projection dimensionality to balance the information loss between projection and quantization. It is formulated as the problem of minimizing the reconstruction bias of compressed signals. Extensive experimental results show that the proposed MRH significantly outperforms a variety of state-of-the-art methods over several widely used benchmarks.
1 Introduction

Approximate nearest neighbour (ANN) search [Gionis et al., 1999a] plays an important role in machine learning, computer vision and information retrieval. Using similarity preserving binary codes to represent the original data points is of particular interest for ANN search [Weiss et al., 2008; Norouzi and Fleet, 2011]. Binary codes bring about low memory cost as well as fast similarity distance computation. This is particularly useful when dealing with large scale databases [Torralba et al., 2008; Gong and Lazebnik, 2011; Weiss et al., 2008; Duan et al., 2016].
A common binary coding approach, often called Hashing, is to develop similarity preserving hashing functions for mapping data points into a Hamming space. As it is NP-hard to directly learn the optimal binary codes [Weiss et al., 2008], hashing methods typically work in a two-stage strategy: projection and quantization [Kong and Li, 2012; Kong et al., 2012]. Specifically, given a data point $x \in \mathbb{R}^d$, they first project $x$ into a low dimensional vector

$$y = [f_1(x), f_2(x), \ldots, f_k(x)] \in \mathbb{R}^k,$$

where the real-valued functions $\{f_i(\cdot)\}_{i=1}^{k}$ are called projection functions. Then they utilize Single Bit Quantization (SBQ) to quantize each projection element $f_i(x)$ into a single bit by thresholding [Kong and Li, 2012; Wang et al., 2016].
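To make the two-stage strategy concrete, below is a minimal NumPy sketch, assuming a simple linear projection $f_i(x) = w_i^\top x$ and zero thresholds; the names `project` and `sbq` are illustrative, not from the paper.

```python
import numpy as np

def project(X, W):
    """Projection stage: map d-dimensional points to k-dimensional
    real-valued vectors y = [f_1(x), ..., f_k(x)].
    A linear projection f_i(x) = w_i^T x is assumed here."""
    return X @ W                      # (n, d) @ (d, k) -> (n, k)

def sbq(Y, thresholds=None):
    """Single Bit Quantization (SBQ): threshold each projected
    element into one bit (zero thresholds by default)."""
    if thresholds is None:
        thresholds = np.zeros(Y.shape[1])
    return (Y > thresholds).astype(np.uint8)

# Toy usage: 5 points in R^64 compressed to 16-bit codes.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 64))
W = rng.standard_normal((64, 16))
codes = sbq(project(X, W))            # shape (5, 16), entries in {0, 1}
```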
Many research efforts have been devoted to the first stage, aiming to learn powerful projections that maintain the similarity structure of the original data points. Locality Sensitive Hashing (LSH) [Andoni and Indyk, 2006] adopts a random projection which is independent of the training data. Similarly, Shift Invariant Kernel Hashing (SIKH) [Raginsky and Lazebnik, 2009] chooses a random projection and applies a shifted cosine function to generate binary codes. Both LSH and SIKH are data independent and flexible, since they do not rely on any training data. However, long codes are often required to achieve satisfactory performance [Gong and Lazebnik, 2011; Raginsky and Lazebnik, 2009].
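As an illustration, both data independent schemes can be written in a few lines. This is a schematic sketch only: kernel bandwidth and scaling constants are omitted, and the distributions of $b$ and $t$ follow Raginsky and Lazebnik [2009].

```python
import numpy as np

rng = np.random.default_rng(0)
d, k = 64, 16                                  # illustrative sizes

# LSH with random hyperplanes: one bit per sign of w_i^T x.
W = rng.standard_normal((d, k))
def lsh_code(x):
    return (x @ W > 0).astype(np.uint8)

# SIKH: shifted cosine of a random projection,
# with b ~ U[0, 2*pi] and t ~ U[-1, 1].
b = rng.uniform(0.0, 2.0 * np.pi, size=k)
t = rng.uniform(-1.0, 1.0, size=k)
def sikh_code(x):
    return (np.cos(x @ W + b) + t > 0).astype(np.uint8)

x = rng.standard_normal(d)
print(lsh_code(x))                             # 16-bit LSH code
print(sikh_code(x))                            # 16-bit SIKH code
```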
To build more effective projections, many promising data dependent methods have been proposed. By learning the projection functions over training data, data dependent methods usually outperform data independent methods at relatively short code lengths [Liu et al., 2010]. Representative methods include Spectral Hashing [Weiss et al., 2008], Binary Reconstructive Embedding Hashing [Kulis and Darrell, 2009], Semi-Supervised Hashing [Wang et al., 2010], Anchor Graph Hashing [Liu et al., 2010], Iterative Quantization [Gong and Lazebnik, 2011], Minimal Loss Hashing [Norouzi and Fleet, 2011], Kernel Supervised Hashing [Liu et al., 2012], Isotropic Hashing [Kong and Li, 2012], K-means Hashing [He et al., 2013], Inductive Hashing on Manifolds [Shen et al., 2013], Harmonious Hashing [Xu et al., 2013], Discrete Graph Hashing [Liu et al., 2014], Sparse Projection Hashing [Xia et al., 2015], etc.
Moreover, recent works have reported the impact of quantization on hashing performance. Single Bit Quantization (SBQ), used in most hashing methods, incurs large quantization errors, which can seriously degrade performance [Kong and Li, 2012; Kong et al., 2012]. Thus, promising Multiple Bits Quantization (MBQ) methods have been proposed. Double Bits Quantization [Kong and Li, 2012] divides each projection dimension into three regions and uses a double-bit code to represent each region. Manhattan Quantization [Kong et al., 2012] proposes natural binary codes (NBC) and adopts the Manhattan distance to compute the distance between NBC codes. Hamming Compatible Quantization [Wang et al., 2015] aims to minimize a distance error function to preserve the capability of the similarity metric between Euclidean space and Hamming space. Overall, MBQ methods do facilitate the reduction of information loss.
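To illustrate the MBQ idea, the sketch below quantizes each projected dimension into $2^b$ regions labeled with natural binary codes and compares codes with the Manhattan distance, in the spirit of Manhattan Quantization. The quantile-based thresholds are an assumption for illustration only and do not reproduce the threshold learning of the original papers.

```python
import numpy as np

def nbc_quantize(Y, n_bits=2):
    """Multiple Bits Quantization sketch: split each projected
    dimension into 2^n_bits regions and label the regions with
    natural binary codes 0, 1, 2, ... (kept as integers).
    Thresholds here are simple quantiles of the training data;
    the original methods learn them instead."""
    n_levels = 2 ** n_bits
    qs = np.linspace(0, 100, n_levels + 1)[1:-1]   # interior quantiles
    thresholds = np.percentile(Y, qs, axis=0)      # (n_levels - 1, k)
    # Region index = number of thresholds each element exceeds.
    return (Y[None, :, :] > thresholds[:, None, :]).sum(axis=0)

def manhattan_distance(c1, c2):
    """Distance between two NBC codes: per-dimension decimal
    differences summed up (Manhattan), not Hamming distance."""
    return int(np.abs(c1.astype(int) - c2.astype(int)).sum())

# Toy usage on random projected values.
rng = np.random.default_rng(0)
Y = rng.standard_normal((100, 8))                  # 100 points, k = 8
codes = nbc_quantize(Y, n_bits=2)                  # entries in {0,...,3}
print(manhattan_distance(codes[0], codes[1]))
```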