Sketch-based Image Retrieval via Shape Words
Changcheng Xiao¹*, Changhu Wang²†, Liqing Zhang¹, Lei Zhang³
¹Shanghai Jiao Tong University, Shanghai, China
²Microsoft Research, Beijing, China
³Microsoft Corporation, Redmond, USA
xchangcheng@gmail.com, chw@microsoft.com, zhang-lq@cs.sjtu.edu.cn, leizhang@microsoft.com
ABSTRACT
The explosive growth of touch screens has provided a good platform for sketch-based image retrieval. However, most previous work has focused on low-level descriptors of shapes and sketches. In this paper, we take a step forward and propose to leverage a shape-word descriptor for sketch-based image retrieval. First, shape words are defined and an efficient algorithm is designed to extract them. Then we generalize the classic Chamfer Matching algorithm to address the shape-word matching problem. Finally, a novel inverted index structure is proposed to make the shape-word representation scalable to large image databases. Experimental results show that our method achieves competitive accuracy while requiring far less memory, e.g., less than 3% of the memory used by MindFinder. Owing to its competitive accuracy and low memory cost, our method can scale up to much larger databases.
Categories and Subject Descriptors
H.3.3 [Information Retrieval]: Search process; Query formulation
Keywords
Sketch-based Image Retrieval; Shape Words
1. INTRODUCTION
Owing to the popularity of digital cameras, millions of new digital images become freely accessible online every day, which brings great opportunities for image retrieval. Users usually search for images with text queries, but the shape and location of an object are hard to formulate with a few keywords. Thus, query-by-example (QBE) was proposed. However, in
*Changcheng Xiao performed this work while an intern at Microsoft Research Asia.
†Corresponding author.
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
ICMR’15, June 23–26, 2015, Shanghai, China.
Copyright is held by the owner/author(s). Publication rights licensed to ACM.
ACM 978-1-4503-3274-3/15/06 ...$15.00.
http://dx.doi.org/10.1145/2671188.2749360.
Figure 1: Example results of our system. Suppose we want to search for a bike. We may draw two circles as its wheels and some line segments as its frame. As we draw these sketches, the system extracts shape words (line segments and circular arcs) and searches for similar images using these shape words.
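To make the idea of extracting line-segment shape words from a drawn stroke concrete, the following minimal sketch approximates a stroke (a polyline of points) by line segments using Ramer-Douglas-Peucker simplification. This is an illustrative assumption on our part, not the paper's extraction algorithm; the function name and the tolerance `epsilon` are hypothetical.

```python
import math

def rdp(points, epsilon):
    """Ramer-Douglas-Peucker: simplify a stroke (list of (x, y) points)
    into the endpoints of its approximating line segments."""
    if len(points) < 3:
        return list(points)
    (x1, y1), (x2, y2) = points[0], points[-1]
    dx, dy = x2 - x1, y2 - y1
    norm = math.hypot(dx, dy) or 1.0
    # Find the interior point farthest from the chord joining the endpoints.
    dmax, idx = 0.0, 0
    for i in range(1, len(points) - 1):
        px, py = points[i]
        d = abs(dy * (px - x1) - dx * (py - y1)) / norm
        if d > dmax:
            dmax, idx = d, i
    if dmax <= epsilon:  # stroke is close enough to a single segment
        return [points[0], points[-1]]
    # Otherwise split at the farthest point and recurse on both halves.
    left = rdp(points[:idx + 1], epsilon)
    right = rdp(points[idx:], epsilon)
    return left[:-1] + right  # merge, dropping the duplicated split point

# A noisy 'L'-shaped stroke collapses to three corners, i.e. two segments.
stroke = [(0, 0), (1, 0.05), (2, -0.04), (3, 0), (3.05, 1), (2.96, 2), (3, 3)]
print(rdp(stroke, epsilon=0.2))  # → [(0, 0), (3, 0), (3, 3)]
```

Circular-arc words would need an additional fitting step (e.g., testing residuals against a circle fit per segment run), which is beyond this sketch.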
QBE, the query has to be an example image, which is usually the very thing the user is searching for. With the popularity of touch-screen devices, searching for images by drawing has become a highly desired feature; it is complementary to, and can be combined with, the query-by-keyword and query-by-example modalities.
Sketch-based image retrieval (SBIR) has been extensively studied since the 1990s, and has stepped into large-scale scenarios in recent years. In 2010, Eitz et al. [5] built an SBIR system based on Tensor descriptors that linearly scans the whole database for each query, which greatly limits its scalability. In 2011, Cao et al. [3] built the MindFinder system on indexable Oriented Chamfer Matching (OCM) to solve the indexing problem of SBIR. In 2012, Zhou et al. [8] proposed a convolution-based descriptor, and Tseng et al. [7] proposed 'HashBits' to compress the Distance Transform descriptor. In 2013, Sun et al. [6] built a billion-scale SBIR system with vector-like Chamfer feature pairs.
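For context on the matching these systems build upon, the directed Chamfer distance between two edge-point sets can be sketched as follows. This is a minimal textbook formulation via brute-force nearest neighbors (real systems precompute a distance transform over the image grid); it is not the indexable OCM variant of [3].

```python
import numpy as np

def chamfer_distance(query_pts, image_pts):
    """Directed Chamfer distance: for each query edge point, the distance
    to its nearest edge point in the database image, averaged."""
    q = np.asarray(query_pts, dtype=float)  # shape (n, 2)
    t = np.asarray(image_pts, dtype=float)  # shape (m, 2)
    # Pairwise Euclidean distances via broadcasting: shape (n, m).
    d = np.linalg.norm(q[:, None, :] - t[None, :, :], axis=2)
    return d.min(axis=1).mean()

# Identical point sets match perfectly; a shifted copy scores worse.
square = [(0, 0), (0, 1), (1, 0), (1, 1)]
shifted = [(x + 0.5, y) for x, y in square]
print(chamfer_distance(square, square))   # → 0.0
print(chamfer_distance(square, shifted))  # → 0.5
```

OCM additionally compares edge orientations at matched points, and the shape-word matching proposed here generalizes the same idea from individual edge pixels to whole segments and arcs.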
However, most of these methods [3, 5, 6, 7, 8] focus on low-level descriptors of sketches, such as local patches or edge pixels, which require a huge amount of memory. In this work, we try to go one step further and view shapes and sketches at a higher level. Unlike MindFinder [4], which indexes and matches (sampled) edge pixels, we propose to use shape words to represent both the query sketch and the database images. As shown in Fig. 1, when asked to draw a