Large Scale Image Retrieval Based on Adaptive
Dense-SIFT
Qiaopeng Han, Li Zhuo, Haixia Long
Signal & Information Processing Laboratory
Beijing University of Technology
Beijing, China
s201302102@emails.bjut.edu.cn
Abstract—In this paper, an adaptive Dense-SIFT feature
extraction method is first proposed, which adaptively
adjusts the size of the local window using the edge information
of the image. Next, a large scale image retrieval method is proposed.
The adaptive Dense-SIFT features are extracted from the
database images. Bag of Word (BoW) model is then adopted to
create the corresponding histograms of visual words frequency to
represent the features. To efficiently describe the image content,
the feature vectors are constructed by combining the visual
word histograms of the Dense-SIFT features with the 72-dimensional
HSV (Hue, Saturation, Value) color feature. In the retrieval process,
the top-h most similar images are returned by computing the
similarity between the feature vector of the query image and
those of the database images. Finally, to further improve the
accuracy, the returned images are re-ranked with context
similarity information. Experimental results on the Corel-5K
and Oxford Buildings datasets show that the proposed method
outperforms existing image retrieval methods.
Keywords—image retrieval; adaptive Dense-SIFT; visual words;
re-ranking
I. INTRODUCTION
Large scale image retrieval has become a hot
research topic in the multimedia retrieval community, in which
Content Based Image Retrieval (CBIR) is the most popular
retrieval method. CBIR utilizes the features to represent the
image content, and determines the similarity between images
by comparing the similarity of the features. The key parts of
CBIR include feature extraction, similarity matching, etc.
Moreover, to further improve the retrieval accuracy, re-ranking
techniques have been proposed.
Scale Invariant Feature Transform (SIFT) has
proven to be the most representative local feature extraction
algorithm, and has been widely used in domains
such as image retrieval and image classification due to its
strong robustness. Timothee [1] uses SIFT as a local feature to
construct an index based on a vocabulary tree, and achieves
image retrieval through an index structure improved with
contextual weighting of the local features. However, SIFT can
only represent the details of
images by using gray scale information, and it contains no
other visual characteristics. To address this problem, a
coupled Multi-Index (c-MI) framework [2] has been proposed
to perform feature fusion at the indexing level, which takes each
of the SIFT and Colour Names features as one dimension of the
multi-index to form the feature vectors, which describe the
image well from both local and global perspectives. This
method achieves better performance in image retrieval. To
obtain more accurate results, the re-ranking technique [3] has
been introduced into the field of image retrieval. Yang [4]
presents a new prototype-based re-ranking method based on
SIFT, which utilizes a re-ranking model as prior knowledge.
This model is learned offline from user-labeled training data.
Although the built system enhances retrieval performance to
some extent, the main restriction is that the accuracy of the
classifier cannot be guaranteed, due to the limited number of
user-labeled images. This shortcoming can be overcome by a
context-sensitive similarity re-ranking method [5], which
returns an initial search result based on SIFT and re-ranks
the returned results using a shortest path method [6]. The
advantage of this method is that there is no need to train a
classifier, so that the retrieval accuracy and efficiency can be
improved.
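The shortest-path re-ranking idea can be illustrated with a small sketch. This is our own simplified illustration, not the exact algorithm of [5][6]: direct feature distances give the initial ranking, and images are then re-ranked by their shortest-path distance to the query through a nearest-neighbour graph, so that chains of mutually similar images pull contextually related results forward. The neighbourhood size (k = 2) and the toy distance matrix below are assumptions for demonstration only.

```python
# Simplified sketch of context-sensitive re-ranking via shortest paths:
# build a k-NN graph over pairwise feature distances, then run Dijkstra
# from the query so images reachable through chains of similar images
# move up the ranking.  (Illustrative only; not the authors' algorithm.)
import heapq

def rerank_shortest_path(dist, query=0):
    """dist: symmetric matrix (list of lists) of pairwise feature distances.
    Returns image indices sorted by shortest-path distance from the query."""
    n = len(dist)
    # Keep each node's 2 nearest neighbours (k = 2 is an assumed value).
    nbrs = {i: sorted(range(n), key=lambda j: dist[i][j])[1:3] for i in range(n)}
    best = {query: 0.0}
    heap = [(0.0, query)]
    while heap:  # Dijkstra from the query over the k-NN graph
        d, u = heapq.heappop(heap)
        if d > best.get(u, float("inf")):
            continue  # stale heap entry
        for v in nbrs[u]:
            nd = d + dist[u][v]
            if nd < best.get(v, float("inf")):
                best[v] = nd
                heapq.heappush(heap, (nd, v))
    return sorted(best, key=best.get)

# Toy example: image 2 is far from the query directly (distance 5) but
# close via the chain 0 -> 1 -> 2 (path length 2).
D = [[0, 1, 5, 4],
     [1, 0, 1, 5],
     [5, 1, 0, 1],
     [4, 5, 1, 0]]
print(sorted(range(4), key=lambda j: D[0][j]))  # direct ranking: [0, 1, 3, 2]
print(rerank_shortest_path(D))                  # re-ranked:      [0, 1, 2, 3]
```

Note how image 2, initially ranked last by direct distance, is promoted above image 3 after re-ranking because it is connected to the query through a path of mutually similar images; this is the "context" the method exploits without training any classifier.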
The aforementioned methods adopt SIFT as the local
feature. Although SIFT shows superior performance in image
retrieval, image classification, and other application areas, its
extraction process has high computational complexity. To
overcome this drawback, a variety of fast SIFT algorithms have
been proposed, such as Speeded-Up Robust Features (SURF),
Dense-SIFT, etc. In the extraction process of Dense-SIFT, a
fixed window size is employed to traverse the image. There is
no key-point detection stage; instead, local feature descriptors are
extracted at every patch. Since the texture information in
different image areas is not the same, a fixed window size leads
to either insufficient extraction in texture-complex areas or
redundant extraction in smooth areas.
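The fixed-window sampling described above can be sketched in a few lines. This is a minimal illustration of the sampling stage only (no descriptor computation); the step size, window size, and the edge-density threshold used for the adaptive rule are assumed values, not the paper's parameters.

```python
# Sketch of the Dense-SIFT sampling stage: a window of constant size is
# slid across the image on a regular grid, so smooth and textured
# regions are sampled identically -- the drawback the adaptive variant
# addresses by shrinking the window where edges are dense.

def dense_grid(width, height, step=8, window=16):
    """Return the top-left corners of the fixed-size local windows."""
    return [(x, y)
            for y in range(0, height - window + 1, step)
            for x in range(0, width - window + 1, step)]

def adaptive_window(edge_density, base=16, small=8):
    """Hypothetical adaptive rule: use a smaller window where the local
    edge density (fraction of edge pixels) is high.  The 0.2 threshold
    is an assumption for illustration."""
    return small if edge_density > 0.2 else base

patches = dense_grid(64, 64, step=8, window=16)
print(len(patches))            # 7 x 7 grid -> 49 patches
print(adaptive_window(0.5))    # texture-complex region -> 8
print(adaptive_window(0.0))    # smooth region -> 16
```

In an adaptive scheme of this kind, `adaptive_window` would be evaluated per region from an edge map (e.g. the output of an edge detector), so texture-rich areas receive smaller, denser windows while smooth areas are sampled coarsely.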
To solve this problem in the Dense-SIFT extraction process,
we propose an adaptive Dense-SIFT feature extraction method,
in which the window size is adjusted adaptively based on the
edge information of the image. Then, a large scale image
retrieval method is proposed. The BoW model is exploited to
represent the local features, which are then combined with the
HSV colour feature to construct feature vectors representing
the image content. In the retrieval process, the similarity
between the feature vectors of the query image and those of the
database images is computed, and then the initial search results
978-1-4673-9088-0/15/$31.00 ©2015 IEEE