大规模视觉模式发现的二值化模态搜索

112 浏览量更新于2024-08-27 收藏 1.16MB PDF 举报

"Binarized Mode Seeking for Scalable Visual Pattern Discovery - 这篇研究论文由 Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo 和 Zhineng Chen 合著，探讨了大规模图像集合中通过二值化模式搜索进行视觉模式发现的方法。文章提出了一种二值均值漂移（bMS）算法，该算法直接在二值空间中寻找频繁模式，并引入了基于二项式核和二值约束的分析方法。此外，还进一步扩展了 bMS，形成了对比二值均值漂移（cbMS），以最大化二值空间中的对比度密度，寻找既有信息性又具有区分性的模式。" 这篇研究论文聚焦于在大规模图像数据集中的视觉模式发现，尤其是在资源有限的情况下。由于二值化的图像编码可以高效存储和计算，因此，研究团队提出了二值化模式寻求的策略。他们首先介绍了“二值均值漂移”(bMS)算法，这是一种在二值空间中直接寻找模式的新方法。bMS 的核心是利用模式寻求的概念，但在此过程中，图像被转换为二进制代码，从而降低了计算复杂性和存储需求。为了适应二值环境，论文引入了两个关键概念：基于二项式的核函数和二值约束。二项式核函数可能用于增强二值特征之间的相似性度量，而二值约束则确保在分析过程中保持二值表示的特性。这些创新使得在二值空间中有效地执行模式分析成为可能。进一步，研究人员扩展了 bMS 算法，发展出“对比二值均值漂移”(cbMS)。cbMS 的目标是不仅找到频繁出现的模式，还要找到那些对分类或识别任务有显著区分性的模式。它通过最大化二值空间中的对比度密度来实现这一目标，从而提高所发现模式的信息价值和判别能力。这个工作对于大规模图像分析、图像检索以及计算机视觉领域的模式识别有着重要贡献，特别是对于需要高效处理和分析大量二值化图像数据的应用场景。cbMS 算法提供了一种优化的解决方案，能够在保持计算效率的同时，提升模式发现的质量和实用性。

Binarized Mode Seeking for Scalable Visual Pattern Discovery

Wei Zhang

, Xiaochun Cao

1,2∗

, Rui Wang

, Yuanfang Guo

, Zhineng Chen

1: SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences

2: School of Cyber Security, University of Chinese Academy of Sciences

3: Institute of Automation, Chinese Academy of Sciences

{wzhang, caoxiaochun, wangrui, guoyuanfang}@iie.ac.cn, zhineng.chen@ia.ac.cn

Abstract

This paper studies visual pattern discovery in large-scale

image collections via binarized mode seeking, where im-

ages can only be represented as binary codes for efﬁcien-

t storage and computation. We address this problem from

the perspective of binary space mode seeking. First, a bi-

nary mean shift (bMS) is proposed to discover frequen-

t patterns via mode seeking directly in binary space. The

binomial-based kernel and binary constraint are introduced

for binarized analysis. Second, we further extend bMS to

a more general form, namely contrastive binary mean shift

(cbMS), which maximizes the contrastive density in bina-

ry space, for ﬁnding informative patterns that are both fre-

quent and discriminative for the dataset. With the binarized

algorithm and optimization, our methods demonstrate sig-

niﬁcant computation (50×) and storage (32×) improvement

compared to standard techniques operating in Euclidean s-

pace, while the performance does not largely degenerate.

Furthermore, cbMS discovers more informative patterns

by suppressing low discriminative modes. We evaluate our

methods on both annotated ILSVRC (1M images) and un-

annotated blind Flickr (10M images) datasets with million

scale images, which demonstrates both the scalability and

effectiveness of our algorithms for discovering frequent and

informative patterns in large scale collection.

1. Introduction

Pattern discovery is one of the most fundamental prob-

lems in computer vision and pattern analysis. Given a large-

scale un-ordered image collection (e.g., images crawled

from an anonymous website), the ﬁrst question is to ask

“What kind of images are in the dataset? What’s the differ-

ence with other ‘common’ datasets?” Visual pattern discov-

ery aims to automatically ﬁnd dominant items in an unsu-

pervised setting. A lot of researchers have studied this prob-

∗

Corresponding author.

10110100001011

10100011101010

00101100110111

00110110101010

00111111010101

10111000110010

00111011101000

......

10101101001010

00101010101000

......

(a) large image sets

(b) binary codes (c) modes

(d) patterns

bMS

cbMS

10110100001011

10100011101010

00101100110111

00110110101010

00111111010101

00111011101000

......

10101101011010

00101010101000

00101011101000

......

targetset

contrastiveset

frequentpatterns

informativepatterns

Figure 1. Given a large collection of images (a), our goal is to

discover patterns (d) by seeking modes (c) in the binary space (b).

Our bMS seeks frequent patterns from the target set, while cbMS

discovers more informative patterns by referencing an additional

contrastive set.

lem due to its wide applications in various vision tasks such

as classiﬁcation [

7], retrieval [27], summarization [32]. In

the context of big data, visual pattern discovery is becoming

even more important, as it provides an efﬁcient way charac-

terizing a large image collection. Particularly, this problem

is even important as the data explodes in photo sharing web-

sites, such as Instagram, Flickr, Imgur.

Among the techniques for pattern discovery, cluster anal-

ysis serves as the core part. By projecting data into the fea-

ture space, patterns are closely related to the high density

regions. Previous techniques are mostly based on cluster-

ing directly in Euclidean space. However, there are still two

fundamental problems seldomly addressed.

One problem is the scalability issue. It is quite inefﬁcien-

t to cluster large datasets in Euclidean space. First, large

amount of real-value arithmetic operations is involved dur-

ing distance evaluation, which makes it impossible for an-

alyzing large dataset. Second, it is also difﬁcult to keep all

the data in memory for large datasets. For example, keeping

one million real value vectors, say R

4,096

- FC7 activation-

s of AlexNet[

11], takes 30GB memory, not to mention the

3864

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38635323

粉丝: 9
资源: 955

大规模视觉模式发现的二值化模态搜索

Progressive Mode-Seeking on Graphs for Sparse Feature Matching 代码

Mean Shift, Mode Seeking, and Clustering

Model for seeking sweet spot

Seeking-K!-The-connection-plus.rar_数学计算_Visual_C++_

Extremum Seeking Based Fault-Tolerant Cooperative Control for Multiagent Systems

Stability Criterion for Nonsmooth Systems and Nash Equilibrium Seeking via Projected Gradient Dynamics

Seeking New-crx插件

Seeking Arrangement Canada-crx插件

Seeking Arrangement Site-crx插件

simplest-treasure-seeking-game

最新资源