Contents lists available at ScienceDirect
Neurocomputing
journal homepage: www.elsevier.com/locate/neucom
Separable vocabulary and feature fusion for image retrieval based on sparse representation
Yanhong Wang a,b, Yigang Cen a,b,⁎, Ruizhen Zhao a,b, Yi Cen c, Shaohai Hu a,b, Viacheslav Voronin d, Hengyou Wang e
a Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China
b Key Laboratory of Advanced Information Science and Network Technology of Beijing, Beijing 100044, China
c School of Information Engineering, Minzu University of China, Beijing 100081, China
d Department of Radio-electronic Systems, Don State Technical University, Shakhty 346500, Russia
e School of Science, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
ARTICLE INFO
Keywords:
Separable vocabulary
Sparse representation
Feature fusion
Image retrieval
ABSTRACT
Visual vocabulary is the core of the Bag-of-visual-words (BOW) model in image retrieval. To ensure retrieval accuracy, traditional methods usually adopt a large vocabulary. However, a large vocabulary leads to low recall. To improve recall, vocabularies of medium size have been proposed, but they in turn lower the accuracy. To address these two problems, we propose a new method for image retrieval based on feature fusion and sparse representation over a separable vocabulary. Firstly, a large vocabulary is generated on the training dataset. Secondly, this vocabulary is separated into a number of medium-sized vocabularies. Thirdly, for a given query image, we adopt sparse representation to select one vocabulary for retrieval. In the proposed method, the large vocabulary guarantees a relatively high accuracy, while the medium-sized vocabularies are responsible for high recall. In addition, to reduce quantization error and further improve recall, a sparse representation scheme is used for visual-word quantization, and both local features and global features are fused. Our proposed method is evaluated on two benchmark datasets, i.e., Coil20 and Holidays. Experiments show that our proposed method achieves good performance.
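The three steps of the proposed pipeline (generate a large vocabulary, split it into medium-sized vocabularies, select one by sparse representation for a query) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the large vocabulary is given as a matrix of visual words, and it approximates the sparse-representation selection with a 1-sparse (nearest-word) code, choosing the sub-vocabulary with the smallest reconstruction residual. All function names are hypothetical.

```python
import numpy as np

def split_vocabulary(vocab, n_parts):
    """Split a large vocabulary (n_words x dim) into medium-sized parts."""
    return np.array_split(vocab, n_parts, axis=0)

def select_vocabulary(query_desc, sub_vocabs):
    """Pick the sub-vocabulary whose visual words best reconstruct the
    query descriptors under a 1-sparse code (nearest-word residual)."""
    best_idx, best_err = 0, np.inf
    for i, words in enumerate(sub_vocabs):
        # squared distance from each descriptor to every word in this part
        d2 = ((query_desc[:, None, :] - words[None, :, :]) ** 2).sum(-1)
        # residual = distance to the nearest word, summed over descriptors
        err = d2.min(axis=1).sum()
        if err < best_err:
            best_idx, best_err = i, err
    return best_idx
```

Retrieval would then proceed against the index built over the selected sub-vocabulary only, which is what keeps the per-query cost at the medium-vocabulary level.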
1. Introduction
In recent years, content-based image retrieval (CBIR) has become a very active research topic in computer vision and multimedia. Although the field has developed rapidly, researchers have not yet standardized the various image retrieval systems [1], and image retrieval remains a challenging problem: retrieval often fails due to occlusion, distortion, corruption and different lighting conditions.
Image retrieval means that, for a given query image, we will retrieve
all similar images from the database. Similar images are defined as images that contain the same objects or scene viewed under different imaging conditions [2]. In the past years, the BOW model [3,4] has achieved great success in the image retrieval area. This model is inspired by
the text retrieval system [3–5]. It contains four major steps: (1) local features are extracted from each image, such as the SIFT descriptor [6], rootSIFT descriptor [7] and SURF descriptor [8]; (2) each local descriptor is quantized to a visual word according to a vocabulary pre-trained by an unsupervised clustering approach; (3) each image is represented by a frequency histogram of visual words; (4) retrieval results are returned according to the similarities between the query image and the images in the dataset.
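Steps (2)–(4) above can be sketched compactly. This is a generic illustration of the BOW pipeline, not code from the paper: it assumes local descriptors and a pre-trained vocabulary are given as NumPy arrays, uses hard nearest-word quantization, and ranks images by cosine similarity between L2-normalised histograms.

```python
import numpy as np

def quantize(descriptors, vocab):
    """Step (2): assign each local descriptor to its nearest visual word."""
    d2 = ((descriptors[:, None, :] - vocab[None, :, :]) ** 2).sum(-1)
    return d2.argmin(axis=1)

def bow_histogram(descriptors, vocab):
    """Step (3): represent an image as an L2-normalised word-frequency histogram."""
    words = quantize(descriptors, vocab)
    hist = np.bincount(words, minlength=len(vocab)).astype(float)
    norm = np.linalg.norm(hist)
    return hist / norm if norm > 0 else hist

def rank_images(query_hist, db_hists):
    """Step (4): rank database images by cosine similarity to the query."""
    sims = db_hists @ query_hist  # histograms are unit-norm, so this is cosine
    return np.argsort(-sims)
```

In practice step (1) would produce the descriptors via SIFT/rootSIFT/SURF, and the vocabulary would come from k-means clustering over descriptors of a training set; the hard `argmin` quantization in step (2) is exactly the source of the quantization error that soft or sparse assignment schemes aim to reduce.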
Vocabulary plays a very important role in the BOW model. For a large number of local features, a large visual vocabulary needs to be trained to ensure retrieval accuracy. But a large visual vocabulary leads to low recall and other issues [9,10]. To improve recall, previous works offer two main types of solutions. Firstly, the size of the vocabulary is changed. For example, in [2], Jegou et al. proposed to use a vocabulary of medium size to improve recall. However, this leads to low accuracy [10]. In [11,12], the authors represented images with the vector of locally aggregated descriptors (VLAD), which can be viewed as a simplification of the Fisher vector (FV) [13] representation. Moreover, the VLAD method only requires a small vocabulary in the retrieval process. Secondly, multiple-vocabulary-based strategies are used. The vocabularies are usually generated from an independent training dataset. In [14], the authors proposed a Bayes merging approach to down-weight the indexed features in the
intersection set. In [15], instead of computing the multiple vocabul-
http://dx.doi.org/10.1016/j.neucom.2016.08.106
Received 27 February 2016; Received in revised form 17 July 2016; Accepted 8 August 2016
⁎ Corresponding author at: Institute of Information Science, Beijing Jiaotong University, Beijing 100044, China.
E-mail address: ygcen@bjtu.edu.cn (Y. Cen).
Neurocomputing 236 (2017) 14–22
Available online 17 November 2016
0925-2312/ © 2016 Elsevier B.V. All rights reserved.